Vision Transformer paper: AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALEVision Transfomer(ViT)는 2021년 Google research에서 발표