Transformer 8
- BERT, ELMo, GPT-2 모델 비교
- GPT-1, GPT-2
- [Paper Review] SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
- [Paper Review] A ConvNet for the 2020s
- [Paper Review] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
- [Paper Review] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
- [Paper Review] Attention is All You Need
- [Paper Review] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding