Spaces:

RedHatAI
/

README

No application file

robgreenberg3 commited on about 24 hours ago

Commit

aa6953d

verified ·

1 Parent(s): 279892e

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -26,7 +26,7 @@ We believe the future of AI is open. That’s why we’re sharing our latest mod
 🔗 **Explore relevant open-source tools**:
 - [**vLLM**](https://github.com/vllm-project/vllm) – Serve large language models efficiently across GPUs and environments.
 - [**LLM Compressor**](https://github.com/vllm-project/llm-compressor) – Compress and optimize your own models with SOTA quantization and sparsity techniques.
-- [**Speculators**](https://github.com/vllm-project/speculators) – A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM.
 - [**InstructLab**](https://github.com/instructlab) – Fine-tune open models with your data using scalable, community-backed workflows.
 - [**GuideLLM**](https://github.com/neuralmagic/guidellm) – Benchmark, evaluate, and guide your deployments with structured performance and latency insights.

 🔗 **Explore relevant open-source tools**:
 - [**vLLM**](https://github.com/vllm-project/vllm) – Serve large language models efficiently across GPUs and environments.
 - [**LLM Compressor**](https://github.com/vllm-project/llm-compressor) – Compress and optimize your own models with SOTA quantization and sparsity techniques.
+- [**Speculators**](https://github.com/vllm-project/speculators) – Build, evaluate, and store speculative decoding algorithms for LLM inference in vLLM.
 - [**InstructLab**](https://github.com/instructlab) – Fine-tune open models with your data using scalable, community-backed workflows.
 - [**GuideLLM**](https://github.com/neuralmagic/guidellm) – Benchmark, evaluate, and guide your deployments with structured performance and latency insights.