Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 13 • 28
view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Oct 16 • 18
Running 3.55k The Ultra-Scale Playbook 🌌 3.55k The ultimate guide to training LLM on large GPU Clusters
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 Apr 29 • 43
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 Apr 29 • 43
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 Apr 16 • 40
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 Apr 16 • 40
view article Article Benchmarking Language Model Performance on 5th Gen Xeon at GCP +1 Dec 17, 2024 • 7
view article Article Accelerating Protein Language Model ProtST on Intel Gaudi 2 +5 Jul 3, 2024 • 2
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 May 9, 2024 • 12
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 May 9, 2024 • 12
Dynamic-TinyBERT: Boost TinyBERT's Inference Efficiency by Dynamic Sequence Length Paper • 2111.09645 • Published Nov 18, 2021
Intel/distilbert-base-uncased-sparse-90-unstructured-pruneofa Fill-Mask • Updated Apr 11, 2023 • 13 • 2