Ke Ding

kding1

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

published an article about 2 months ago

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

upvoted an article 5 months ago

Fast LoRA inference for Flux with Diffusers and PEFT


Organizations

upvoted a paper about 2 months ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published Oct 13 • 28

published an article about 2 months ago

Article

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

Oct 16

upvoted 2 articles 5 months ago

Article

Fast LoRA inference for Flux with Diffusers and PEFT

Jul 23

Article

Mixture of Experts Explained

Dec 11, 2023 • 993

liked a Space 5 months ago

The Ultra-Scale Playbook 🌌 • 3.55k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 7 months ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Apr 29

published an article 7 months ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Apr 29

upvoted an article 8 months ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

Apr 16

published an article 8 months ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

Apr 16

upvoted an article 8 months ago

Article

🚀 Accelerating LLM Inference with TGI on Intel Gaudi

Mar 28

published an article 8 months ago

Article

🚀 Accelerating LLM Inference with TGI on Intel Gaudi

Mar 28

published an article 12 months ago

Article

Benchmarking Language Model Performance on 5th Gen Xeon at GCP

Dec 17, 2024

published an article over 1 year ago

Article

Accelerating Protein Language Model ProtST on Intel Gaudi 2

Jul 3, 2024

upvoted an article over 1 year ago

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

May 9, 2024

published an article over 1 year ago

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

May 9, 2024

authored a paper over 2 years ago

Dynamic-TinyBERT: Boost TinyBERT's Inference Efficiency by Dynamic Sequence Length

Paper • 2111.09645 • Published Nov 18, 2021

updated a model almost 3 years ago

kding1/dicoo_model_ddp

Text-to-Image • Updated Jan 26, 2023 • 44

updated a model over 3 years ago

kding1/sagemaker-distilbert-emotion

Updated May 29, 2022

liked a model almost 4 years ago

Intel/distilbert-base-uncased-sparse-90-unstructured-pruneofa

Fill-Mask • Updated Apr 11, 2023 • 13 • 2
