-
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 240 -
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 105 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 24
Collections
Discover the best community collections!
Collections including paper arxiv:2103.00020
-
Transporter Networks: Rearranging the Visual World for Robotic Manipulation
Paper • 2010.14406 • Published -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 24 -
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 1 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 5 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Paper • 2003.08934 • Published • 2 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
Emerging Properties in Self-Supervised Vision Transformers
Paper • 2104.14294 • Published • 4 -
Segment Anything
Paper • 2304.02643 • Published • 5
-
sentence-transformers/all-mpnet-base-v2
Sentence Similarity • 0.1B • Updated • 24.1M • • 1.2k -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 15 -
google-t5/t5-base
Translation • 0.2B • Updated • 2.23M • • 757 -
Attention Is All You Need
Paper • 1706.03762 • Published • 105
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Paper • 2311.15127 • Published • 15 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
U-Net: Convolutional Networks for Biomedical Image Segmentation
Paper • 1505.04597 • Published • 14
-
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 240 -
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 105 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 24
-
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Paper • 2003.08934 • Published • 2 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
Emerging Properties in Self-Supervised Vision Transformers
Paper • 2104.14294 • Published • 4 -
Segment Anything
Paper • 2304.02643 • Published • 5
-
Transporter Networks: Rearranging the Visual World for Robotic Manipulation
Paper • 2010.14406 • Published -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 24 -
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 1 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2
-
sentence-transformers/all-mpnet-base-v2
Sentence Similarity • 0.1B • Updated • 24.1M • • 1.2k -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 15 -
google-t5/t5-base
Translation • 0.2B • Updated • 2.23M • • 757 -
Attention Is All You Need
Paper • 1706.03762 • Published • 105
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 5 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Paper • 2311.15127 • Published • 15 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
U-Net: Convolutional Networks for Biomedical Image Segmentation
Paper • 1505.04597 • Published • 14