15 results for tag "speech-to-text"
Skill for transcribing audio to text using ElevenLabs Scribe v2, supporting 90+ languages, speaker diarization, and word-level timestamps
AI agent skills for running 150+ AI models via the inference.sh CLI runtime. Supports image generation with Flux, video generation with Veo, LLM calls, web search with Tavily, and more across Claude Code and other coding assistants.
Speech-to-text conversion and audio transcription patterns from the Claude AI Skill Generator template repository
Transcribes audio using Sarvam's Saarika v2.5 speech-to-text model with base64 WebSocket streaming, part of the Sarvam AI skill pack built for Indian-language-first AI applications.
Transcribe audio to text via inference.sh CLI using ElevenLabs Scribe v2 (98%+ accuracy with diarization), Fast Whisper Large V3, and Whisper V3 Large. Supports multi-language transcription, timestamps, speaker diarization, and audio event tagging.
Speech-to-text transcription skill for converting audio recordings to text using various transcription engines.
A speech to text skill from Inference.sh's agent skills for AI development and deployment.
A professional Claude Code skills marketplace featuring 37 production-ready skills for enhanced development workflows.
Speech-to-text plugin for Claude Code that lets you hold a hotkey, speak, and have your words appear in the input field, all processed locally.