Skills catalog

OpenClaw Skills

Extend your AI workforce with thousands of community-built skills. From GitHub to Gmail, Slack to Shopify — connect your entire stack.

2,997 skills available

Filter by category

Addis Assistant Stt

Provides Speech-to-Text (STT) and text

Speech & TranscriptionView skill →

Agent Voice

Command-line blogging platform for AI agents.

Speech & TranscriptionView skill →

Announcer

Announce text throughout the house via AirPlay speakers using Airfoil +.

Speech & TranscriptionView skill →

Assemblyai Transcribe

Transcribe audio/video with AssemblyAI

Speech & TranscriptionView skill →

Audio Gen

Generate audiobooks, podcasts, or educational audio content

Speech & TranscriptionView skill →

Audio Reply

Generate audio replies using TTS.

Speech & TranscriptionView skill →

Chichi Speech

A RESTful service for high-quality text-to-speech using Qwen3

Speech & TranscriptionView skill →

Claw Voice

You are connected to a live user session via voice.

Speech & TranscriptionView skill →

Clonev

Clone any voice and generate speech using Coqui XTTS v2.

Speech & TranscriptionView skill →

Critical Article Writer

Generate draft articles, outlines

Speech & TranscriptionView skill →

Cult Of Carcinization

Give your agent a voice — and ears.

Speech & TranscriptionView skill →

Deepdub Tts

Generate speech audio using Deepdub and attach it as a MEDIA

Speech & TranscriptionView skill →

Deepgram

— command-line interface for Deepgram speech-to-text.

Speech & TranscriptionView skill →

Doubao API Open Tts

Text-to-Speech service using Doubao (Volcano Engine)

Speech & TranscriptionView skill →

Duby

Convert text to speech using Duby.so API.

Speech & TranscriptionView skill →

Eachlabs Tts

Transcribe audio from URL using EachLabs Speech-to-Text

Speech & TranscriptionView skill →

Easyverein API

Work with the easyVerein v2.0 REST API

Speech & TranscriptionView skill →

Edge Tts

|.

Speech & TranscriptionView skill →

Elevenlabs Agents

Create, manage, and deploy ElevenLabs

Speech & TranscriptionView skill →

Elevenlabs Media

ElevenLabs music generation and speech-to-text...

Speech & TranscriptionView skill →

Elevenlabs Transcribe

Transcribe audio to text using ElevenLabs

Speech & TranscriptionView skill →

Elevenlabs Tts

ElevenLabs TTS - the best ElevenLabs integration for OpenClaw.

Speech & TranscriptionView skill →

Elevenlabs Voices

High-quality voice synthesis with 18 personas, 32

Speech & TranscriptionView skill →

Faster Whisper

Local speech-to-text using faster-whisper.

Speech & TranscriptionView skill →

Feishu Minutes

Fetch info, stats, transcript, and media from Feishu

Speech & TranscriptionView skill →

Freshbooks CLI

FreshBooks CLI for managing invoices, clients, and billing.

Speech & TranscriptionView skill →

Gettr Transcribe Summarize

Download audio from a GETTR post

Speech & TranscriptionView skill →

Inworld Tts

Text-to-speech via Inworld.ai API.

Speech & TranscriptionView skill →

Jarvis Voice

Metallic AI voice persona with TTS and visual transcript styling.

Speech & TranscriptionView skill →

Kokoro Tts

Generate spoken audio from text using the local Kokoro TTS engine.

Speech & TranscriptionView skill →

Llmwhisperer

Extract text and layout from images and PDFs using LLMWhisperer

Speech & TranscriptionView skill →

Local Stt

Local STT with selectable backends - Parakeet (best accuracy) or Whisper.

Speech & TranscriptionView skill →

Local Whisper

Local speech-to-text using OpenAI Whisper.

Speech & TranscriptionView skill →

Minimax Tts

name: minimax-tts.

Speech & TranscriptionView skill →

Mlx Whisper

Local speech-to-text with MLX Whisper

Speech & TranscriptionView skill →

Moodcast

Transform any text into emotionally expressive audio with ambient

Speech & TranscriptionView skill →

Openai Whisper

Local speech-to-text with the Whisper CLI (no API key).

Speech & TranscriptionView skill →

Openai Whisper API

Transcribe audio via OpenAI Audio Transcriptions API

Speech & TranscriptionView skill →

Parakeet Mlx

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon

Speech & TranscriptionView skill →

Parakeet Stt

>-.

Speech & TranscriptionView skill →

Phone Voice

Connect ElevenLabs Agents to your OpenClaw via phone with Twilio.

Speech & TranscriptionView skill →

Piper Tts

Local text-to-speech using Piper ONNX voices - fast, private, no cloud

Speech & TranscriptionView skill →

Plaud Unofficial

Use when accessing Plaud voice recorder data

Speech & TranscriptionView skill →

Pocket Transcripts

Read transcripts and summaries from Pocket AI

Speech & TranscriptionView skill →

Pocket Tts

pocket-tts

Speech & TranscriptionView skill →

Qwen Tts

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice.

Speech & TranscriptionView skill →

Ringg Voice Agent

Integrate Ringg AI voice agents with OpenClaw

Speech & TranscriptionView skill →

Routstr Balance Management

Manage Routstr balance by checking

Speech & TranscriptionView skill →

Sapi Tts

Windows SAPI5 text-to-speech with Neural voices.

Speech & TranscriptionView skill →

Sound Fx

Generate short sound effects via ElevenLabs SFX (text-to-sound).

Speech & TranscriptionView skill →

Spaces

Voice-first social spaces where Moltbook agents hang out.

Speech & TranscriptionView skill →

Transcribe

Transcribe audio files to text using local Whisper (Docker).

Speech & TranscriptionView skill →

Tts

Text-to-speech using Hume AI or OpenAI API.

Speech & TranscriptionView skill →

Tts Whatsapp

Send high-quality text-to-speech voice messages on WhatsApp in 40+

Speech & TranscriptionView skill →

Video Subtitles

Generate SRT subtitles from video/audio with translation

Speech & TranscriptionView skill →

Voice Agent

Local Voice Input/Output for Agents using the AI Voice Agent

Speech & TranscriptionView skill →

Voice AI Agent

Create, manage, and deploy Voice.ai conversational AI

Speech & TranscriptionView skill →

Voice AI Tts

High-quality voice synthesis with 9 personas, 11 languages

Speech & TranscriptionView skill →

Voice AI Voices

High-quality voice synthesis with 9 personas, 11

Speech & TranscriptionView skill →

Voice Transcribe

Transcribe audio files using OpenAI's

Speech & TranscriptionView skill →

Voice UI

Self-evolving voice assistant UI.

Speech & TranscriptionView skill →

Webchat Audio Notifications

Add browser audio notifications

Speech & TranscriptionView skill →

Whatsapp Voice Chat Integration Open Source

Real-time WhatsApp

Speech & TranscriptionView skill →

Whisper Mlx Local

Free local speech-to-text for Telegram and WhatsApp

Speech & TranscriptionView skill →

X Voice Match

Analyze a Twitter/X account's posting style and generate

Speech & TranscriptionView skill →