nvidia
/

diar_streaming_sortformer_4spk-v2

Audio Classification

speaker-diarization

speaker-recognition

Model card Files Files and versions

jbalam-nv commited on 7 days ago

Commit

8602f49

·

verified ·

1 Parent(s): d574343

Added NVIDIA documentation links

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -197,6 +197,17 @@ This model is a streaming version of Sortformer diarizer. [Sortformer](https://a
 Sortformer resolves permutation problem in diarization following the arrival-time order of the speech segments from each speaker.
 ## Model Architecture
 Streaming sortformer employs pre-encode layer in the Fast-Conformer to generate speaker-cache. At each step, speaker cache is filtered to only retain the high-quality speaker cache vectors.

 Sortformer resolves permutation problem in diarization following the arrival-time order of the speech segments from each speaker.
+## Discover more from NVIDIA:
+For documentation, deployment guides, enterprise-ready APIs, and the latest open models—including Nemotron and other cutting-edge speech, translation, and generative AI—visit the NVIDIA Developer Portal at [developer.nvidia.com](https://developer.nvidia.com/).
+Join the community to access tools, support, and resources to accelerate your development with NVIDIA’s NeMo, Riva, NIM, and foundation models.<br>
+### Explore more from NVIDIA:  <br>
+What is [Nemotron](https://www.nvidia.com/en-us/ai-data-science/foundation-models/nemotron/)?<br>
+NVIDIA Developer [Nemotron](https://developer.nvidia.com/nemotron)<br>
+[NVIDIA Riva Speech](https://developer.nvidia.com/riva?sortBy=developer_learning_library%2Fsort%2Ffeatured_in.riva%3Adesc%2Ctitle%3Aasc#demos)<br>
+[NeMo Documentation](https://docs.nvidia.com/nemo-framework/user-guide/latest/nemotoolkit/asr/models.html)<br>
 ## Model Architecture
 Streaming sortformer employs pre-encode layer in the Fast-Conformer to generate speaker-cache. At each step, speaker cache is filtered to only retain the high-quality speaker cache vectors.