加载中...

ViT_PaperRead

发表于2023-12-11|更新于2024-05-04|Advanced_Model

|阅读量:

ViT Paper Read

The paper is titled ‘An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale’.

The paper provide the first evidence of transformer encoder application for image classification.

Figure-1 Structure (source: from paper)

文章作者: Linermao