- TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
  Paper • 2510.14972 • Published • 33
- LightMem: Lightweight and Efficient Memory-Augmented Generation
  Paper • 2510.18866 • Published • 110
- Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning
  Paper • 2510.19338 • Published • 114
- The Smol Training Playbook
  📚 2.56k • The secrets to building world-class LLMs
Jonatan Borkowski PRO
j14i
Recent Activity
liked a model 1 day ago: zai-org/AutoGLM-Phone-9B
reacted to sergiopaniego's post with 🔥 3 days ago
Want to get started with fine-tuning but don’t know where to begin? 🤓☝️
We’re expanding our collection of beginner-friendly free Colab notebooks so you can learn and fine-tune models using TRL at no cost.
🔬 Check out the full list of free notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks
🔬 If you want more advanced content, we also have a lot to cover in the community tutorials: https://huggingface.co/docs/trl/community_tutorials
And now the obvious question: what would you like us to add next?