It is more than hundreds of years of evolution of our education system. Thanks to that, today, the growth of the research is astonishing. Now we are making machines learn. And new robust and optimized models are trained day after day: from Neural Network to CNN to ViT. So, if we consider the DL models as the students of the machine education system,one could ask: Is ViT a Ph.D. student? This talk presents an analogy between the human education system and the deep learning system. Furthermore, different techniques dedicated to training transformers on mid-small databases alongside a novel hybrid model of ViT and CNN are presented.