🎤 Speech & Audio AI Skills

🎤Speech & Audio

Openai Whisper Api

openai-whisper-api

steipete

v1.0.0

View Details

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

488

11.3k

today

🎤Speech & Audio

Openai Whisper

openai-whisper

steipete

v1.0.0

View Details

Local speech-to-text with the Whisper CLI (no API key).

356

13.4k

3d ago

🎤Speech & Audio

Sag

sag

steipete

v1.0.0

View Details

ElevenLabs text-to-speech with mac-style say UX.

272

6.8k

3d ago

🎤Speech & Audio

Edge TTS

edge-tts

i3130002

v2.0.0

View Details

Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

3.8k

3d ago

🎤Speech & Audio

whisper

whisper

fiddlybit

v1.0.0

View Details

End-to-end encrypted agent-to-agent private messaging via Moltbook dead drops. Use when agents need to communicate privately, exchange secrets, or coordinate without human visibility.

2.4k

2d ago

🎤Speech & Audio

OpenAI TTS

openai-tts

pors

v1.0.0

View Details

Text-to-speech via OpenAI Audio Speech API.

3.5k

2d ago

🎤Speech & Audio

Alexa CLI

alexa-cli

buddyh

v1.3.0

View Details

Control Amazon Alexa devices and smart home via the `alexacli` CLI. Use when a user asks to speak/announce on Echo devices, control lights/thermostats/locks, send voice commands, or query Alexa.

2.9k

3d ago

🎤Speech & Audio

ElevenLabs Voices

elevenlabs-voices

robbyczgw-cla

v2.1.5

View Details

High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.

5.1k

3d ago

🎤Speech & Audio

Transcribe

transcribe

javicasper

v1.0.2

View Details

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

2.3k

today

🎤Speech & Audio

audio-cog

audio-cog

nitishgargiitd

v1.0.3

View Details

AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music generation, background music, sound design. Professional audio creation with AI.

2.9k

3d ago

🎤Speech & Audio

Elevenlabs Tts

elevenlabs-tts

Shaharsha

v2.2.0

View Details

ElevenLabs TTS (Text-to-Speech) with emotional audio tags for expressive voice synthesis. WhatsApp-compatible voice messages with Opus conversion. Supports 7...

+12

4.5k

today

🎤Speech & Audio

supercall

xonder

v2.0.0

View Details

Make AI-powered phone calls with custom personas and goals. Uses OpenAI Realtime API + Twilio for ultra-low latency voice conversations. Supports DTMF/IVR na...

+12

1.3k

today

🎤Speech & Audio

Voice Reply

voice-reply

stolot0mt0m

v1.0.0

View Details

Local text-to-speech using Piper voices via sherpa-onnx. 100% offline, no API keys required. Use when user asks for a voice reply, audio response, spoken answer, or wants to hear something read aloud. Supports multiple languages including German (thorsten) and English (ryan) voices. Outputs Telegram-compatible voice notes with [[audio_as_voice]] tag.

2.6k

yesterday

🎤Speech & Audio

Speech To Text

speech-to-text

okaris

v0.1.5

View Details

Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...

1.3k

today

🎤Speech & Audio

MLX STT

mlx-stt

guoqiao

v1.0.7

View Details

Speech-To-Text with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.

+12

2.6k

yesterday

🎤Speech & Audio

Openai Whisper 1.0.0

openai-whisper-1-0-0

czubi1928

v1.0.0

View Details

Local speech-to-text with the Whisper CLI (no API key).

222

3d ago

🎤Speech & Audio

music-cog

music-cog

nitishgargiitd

v1.0.1

View Details

Original music, fully yours. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal tracks with perfect vocals. Cinematic scores, background tracks, podcast intros, game soundtracks, ambient soundscapes, jingles, lo-fi beats, orchestral compositions, songs with lyrics.

1.7k

3d ago

🎤Speech & Audio

Pocket Tts

pocket-tts

sherajdev

v1.0.1

View Details

Generate high-quality English speech offline on CPU using 8 built-in voices or custom voice cloning with Kyutai's Pocket TTS model.

1.7k

3d ago

Speech & Audio Skills

Openai Whisper Api

Openai Whisper

Sag

Edge TTS

whisper

OpenAI TTS

Alexa CLI

ElevenLabs Voices

Transcribe

audio-cog

Elevenlabs Tts

Qwen3-tts

Kokoro TTS

Faster Whisper

it will help you to send voice messages to your AI Assistant and also can make it talk

ElevenLabs Speech-to-Text

Local Whisper

Supercall

Voice Reply

Speech To Text

MLX STT

Openai Whisper 1.0.0

music-cog

Pocket Tts

More in 🤖 AI & Agents