Speech apps Collection Various applications to help deal with speech better. • 25 items • Updated 5 days ago
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 6 days ago • 159
Datasets Collection Interesting datasets to help train LLMs and beyond • 43 items • Updated 5 days ago
Interpretability tools Collection Opening the hood of Computer Vision model for example ResNets, ConvNext & DETR, multimodal models and NLP models:BERT & GPTs. • 5 items • Updated 9 days ago • 2
Speech apps Collection Various applications to help deal with speech better. • 25 items • Updated 5 days ago
Interpretability tools Collection Opening the hood of Computer Vision model for example ResNets, ConvNext & DETR, multimodal models and NLP models:BERT & GPTs. • 5 items • Updated 9 days ago • 2
Datasets Collection Interesting datasets to help train LLMs and beyond • 43 items • Updated 5 days ago
Running on A100 207 Omnilingual ASR Media Transcription 🌍 207 Transcribe audio or video into text in multiple languages
Running on CPU Upgrade Featured 2.57k The Smol Training Playbook 📚 2.57k The secrets to building world-class LLMs