speech

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

speech multimodal rag edge-ai vector-database vision-transformer llm-inference

Updated May 28, 2024
Python

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated May 28, 2024
Python

balisujohn / tortoise.cpp

A ggml (C++) re-implementation of tortoise-tts. Under construction and seeking contributors.

text-to-speech text speech tts to tortoise-tts ggml

Updated May 28, 2024
C++

sp-nitech / SPTK

A suite of speech signal processing tools

cpp signal-processing dsp speech lpc unix-command mfcc speech-processing audio-processing lsp sptk cepstrum

Updated May 28, 2024
C++

mishra-ankit / modi-speeches

Dataset of Narendra Modi speeches released to encourage research and analysis

politics speech dataset india politicians modi

Updated May 28, 2024
JavaScript

IAHispano / Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.

text-to-speech ai voice speech pytorch rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated May 27, 2024
Python

jacorread / jacorread.github.io

Alejandro Correa

speech linguistics phonetics corpus-linguistics

Updated May 27, 2024
SCSS

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated May 27, 2024
Python

speech

Here are 1,620 public repositories matching this topic...

huggingface / datasets

mobilepadawan / Speakit-JS

HumeAI / hume-python-sdk

speechanddebate / tabroom

nipponjo / tts_arabic

OvidijusParsiunas / deep-chat

sensein / senselab

YasserdahouML / visper

MechatronicBeing / HumanLanguageSpoken

pytorch / audio

bytedance / SALMONN

felixbur / nkululeko

dusty-nv / NanoLLM

