Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2103.00020

about 2 hours ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 14 days ago • 240
Neural Machine Translation by Jointly Learning to Align and Translate

Paper • 1409.0473 • Published Sep 1, 2014 • 7
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 105
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 24

Papers Referred by VLA Survey Referred Papers

Transporter Networks: Rearranging the Visual World for Robotic Manipulation

Paper • 2010.14406 • Published Oct 27, 2020
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 15

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022 • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 24
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 2

Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
Running on Zero

442

AICoverGen

🚀

442

Launch a web interface for model interaction

computer vision papers 👓

Rich feature hierarchies for accurate object detection and semantic segmentation

Paper • 1311.2524 • Published Nov 11, 2013 • 1
DeepPose: Human Pose Estimation via Deep Neural Networks

Paper • 1312.4659 • Published Dec 17, 2013 • 1
Generative Adversarial Networks

Paper • 1406.2661 • Published Jun 10, 2014 • 5
scikit-image: Image processing in Python

Paper • 1407.6245 • Published Jul 23, 2014 • 1

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Paper • 2003.08934 • Published Mar 19, 2020 • 2
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
Emerging Properties in Self-Supervised Vision Transformers

Paper • 2104.14294 • Published Apr 29, 2021 • 4
Segment Anything

Paper • 2304.02643 • Published Apr 5, 2023 • 5

Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19

sentence-transformers/all-mpnet-base-v2

Sentence Similarity • 0.1B • Updated Aug 19 • 24.1M • • 1.2k
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper • 1910.10683 • Published Oct 23, 2019 • 15
google-t5/t5-base

Translation • 0.2B • Updated Feb 14, 2024 • 2.23M • • 757
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 105

Transformer-based Models for Computer Vision

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published Sep 26, 2024 • 53
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 15
Going deeper with Image Transformers

Paper • 2103.17239 • Published Mar 31, 2021
Training data-efficient image transformers & distillation through attention

Paper • 2012.12877 • Published Dec 23, 2020 • 2

Applied Machine Learning Papers

Reading List (Mainly Focused of VLM's and Diffusion Models)

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 18
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

Paper • 2311.15127 • Published Nov 25, 2023 • 15
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
U-Net: Convolutional Networks for Biomedical Image Segmentation

Paper • 1505.04597 • Published May 18, 2015 • 14

about 2 hours ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 14 days ago • 240
Neural Machine Translation by Jointly Learning to Align and Translate

Paper • 1409.0473 • Published Sep 1, 2014 • 7
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 105
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 24

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Paper • 2003.08934 • Published Mar 19, 2020 • 2
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
Emerging Properties in Self-Supervised Vision Transformers

Paper • 2104.14294 • Published Apr 29, 2021 • 4
Segment Anything

Paper • 2304.02643 • Published Apr 5, 2023 • 5

Papers Referred by VLA Survey Referred Papers

Transporter Networks: Rearranging the Visual World for Robotic Manipulation

Paper • 2010.14406 • Published Oct 27, 2020
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 15

Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022 • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 24
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 2

sentence-transformers/all-mpnet-base-v2

Sentence Similarity • 0.1B • Updated Aug 19 • 24.1M • • 1.2k
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper • 1910.10683 • Published Oct 23, 2019 • 15
google-t5/t5-base

Translation • 0.2B • Updated Feb 14, 2024 • 2.23M • • 757
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 105

Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
Running on Zero

442

AICoverGen

🚀

442

Launch a web interface for model interaction

Transformer-based Models for Computer Vision

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published Sep 26, 2024 • 53
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 15
Going deeper with Image Transformers

Paper • 2103.17239 • Published Mar 31, 2021
Training data-efficient image transformers & distillation through attention

Paper • 2012.12877 • Published Dec 23, 2020 • 2

computer vision papers 👓

Rich feature hierarchies for accurate object detection and semantic segmentation

Paper • 1311.2524 • Published Nov 11, 2013 • 1
DeepPose: Human Pose Estimation via Deep Neural Networks

Paper • 1312.4659 • Published Dec 17, 2013 • 1
Generative Adversarial Networks

Paper • 1406.2661 • Published Jun 10, 2014 • 5
scikit-image: Image processing in Python

Paper • 1407.6245 • Published Jul 23, 2014 • 1

Applied Machine Learning Papers

Reading List (Mainly Focused of VLM's and Diffusion Models)

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 18
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

Paper • 2311.15127 • Published Nov 25, 2023 • 15
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 19
U-Net: Convolutional Networks for Biomedical Image Segmentation

Paper • 1505.04597 • Published May 18, 2015 • 14

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs