skillssh/skills

80 resources in this repository

🎯80Skills

🎯Skills80

Generate AI images with 50+ models (FLUX, Gemini 3 Pro Image, Grok Imagine, Seedream, Reve) via the inference.sh CLI, covering text-to-image, inpainting, LoRA, image editing, and upscaling.

ai-image-generation

🎯ai-video-generation🎯Skill

Generate AI videos (text-to-video, image-to-video, lipsync, upscaling) with 40+ models like Google Veo, Seedance, Wan, and Grok via the inference.sh CLI. Covers social videos, marketing clips, explainers, and AI avatars.

ai-video-generation

🎯agent-tools🎯Skill

Umbrella skill for running 250+ AI apps through the inference.sh CLI — image and video generation, LLM calls, web search, 3D, and Twitter automation — from a single no-GPU-required install.

agent-tools

🎯infsh-cli🎯Skill

Install and authenticate the inference.sh (`infsh`) CLI so that agents can run any of 250+ cloud AI apps — image, video, LLM, search, 3D, and more — without needing a local GPU.

infsh-cli

🎯web-search🎯Skill

Run web search and content extraction through Tavily (Search Assistant, Extract) and Exa (Search, Answer, Extract) via the inference.sh CLI for research, RAG, and fact-checking workflows.

web-search

🎯python-executor🎯Skill

Execute arbitrary Python 3.10 code in a sandboxed inference.sh environment pre-loaded with NumPy, Pandas, Matplotlib, Playwright, MoviePy, Pillow, OpenCV, trimesh, and 100+ other libraries.

python-executor

🎯remotion-render🎯Skill

Render MP4 video from React/Remotion TSX component code via the inference.sh CLI with configurable resolution, FPS, duration, and codec — supports `useCurrentFrame`, `spring`, `interpolate`, `Sequence`, etc.

remotion-render

🎯twitter-automation🎯Skill

Automate Twitter/X through the inference.sh CLI — post tweets with media, like, retweet, delete, send DMs, follow users, and fetch posts/profiles via ready-made `x/*` apps.

twitter-automation

🎯agent-browser🎯Skill

Playwright-backed browser automation for AI agents via inference.sh, using an `@e` ref system to open pages, click, fill forms, screenshot, record video, and re-snapshot between navigations.

agent-browser

🎯landing-page-design🎯Skill

Design high-converting SaaS/product landing pages using an above-the-fold formula, hero section layout, CTA psychology, social proof placement, mobile rules, and F-pattern reading.

landing-page-design

🎯image-to-video🎯Skill

Convert still images into animated videos via inference.sh models (Wan 2.5 i2v, Pruna WAN-I2V, Seedance 1.5 Pro) with guidance on model selection, motion prompts, and camera movement.

image-to-video

🎯youtube-thumbnail-design🎯Skill

Design high-CTR YouTube thumbnails at 1280×720 with AI image generation, covering safe zones, mobile-first 120px readability, face expression psychology, contrast, and A/B testing guidance.

youtube-thumbnail-design

🎯storyboard-creation🎯Skill

Create film/video storyboards with AI image generation using proper shot vocabulary (ECU/CU/MS/MLS), camera angles, continuity and the 180-degree rule, plus a panel-stitching workflow.

storyboard-creation

🎯product-photography🎯Skill

Create commercial-grade product photography with AI (hero shots, studio packshots, lifestyle, e-commerce/Amazon listings) using inference.sh image models and prompt recipes for angle, lighting, and background.

product-photography

🎯competitor-teardown🎯Skill

Run a structured 7-layer competitor teardown — product, pricing, positioning, traction, reviews, content, and team — combining Tavily search and browser screenshots for market research and investor decks.

competitor-teardown

🎯video-ad-specs🎯Skill

Platform-specific cheatsheet for producing video ads — dimensions, duration limits, safe zones, hook windows, and creative rules for TikTok, Instagram Reels, YouTube, Facebook, and LinkedIn.

video-ad-specs

🎯app-store-screenshots🎯Skill

Produce iOS App Store and Google Play screenshots with exact per-device dimensions, localization slots, gallery ordering, and device mockup prompts for ASO-focused store listings.

app-store-screenshots

🎯character-design-sheet🎯Skill

Guide for producing consistent characters across AI-generated images using reference sheets, turnaround views, expression sheets, color palettes, and FLUX LoRA training techniques.

character-design-sheet

🎯product-hunt-launch🎯Skill

Optimize Product Hunt launches — tagline limits, 1270×760 gallery images, description preview window, topic selection, maker comments, and launch-day tactics — with AI for visuals and research.

product-hunt-launch

🎯ai-automation-workflows🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-automation-workflows

🎯ai-social-media-content🎯Skill

A social media content skill from the inference.sh collection, part of social guides covering LinkedIn, Twitter threads, and carousels using 250+ AI models through the inference.sh CLI.

ai-social-media-content

🎯ai-content-pipeline🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-content-pipeline

🎯ai-music-generation🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-music-generation

🎯prompt-engineering🎯Skill

A prompt engineering guide from the inference.sh skills collection, covering techniques for crafting effective prompts across 250+ AI models for image generation, video creation, and language tasks.

prompt-engineering

🎯social-media-carousel🎯Skill

A social media carousel guide from the inference.sh skills collection, part of the social guides covering LinkedIn, Twitter threads, and carousels using AI models through the inference.sh CLI.

social-media-carousel

🎯ai-product-photography🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-product-photography

🎯logo-design-guide🎯Skill

A logo design guide from the inference.sh skills collection, part of the design guides covering landing pages, thumbnails, and logos using AI image generation models via the inference.sh CLI.

logo-design-guide

🎯seo-content-brief🎯Skill

An SEO content brief guide from the inference.sh skills collection, leveraging AI models for search-optimized content as part of the writing and product guides accessible through the inference.sh CLI.

seo-content-brief

🎯technical-blog-writing🎯Skill

A technical blog writing guide from the inference.sh skills collection, part of the writing guides covering blogs, case studies, and newsletters with AI models through the inference.sh CLI.

technical-blog-writing

🎯ai-voice-cloning🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-voice-cloning

🎯email-design🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

email-design

🎯python-sdk🎯Skill

Python SDK skill from the inference.sh collection, providing async support and streaming for building AI applications with 250+ models via the inference.sh CLI.

python-sdk

🎯google-veo🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

google-veo

🎯data-visualization🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

data-visualization

🎯agent-ui🎯Skill

Full agent interface UI component from the inference.sh skills collection, providing a complete interface for interacting with 250+ AI models for image generation, video creation, LLM inference, and web search.

agent-ui

🎯llm-models🎯Skill

An inference.sh skill for working with large language models including Claude, Gemini, Kimi, and GLM through the inference.sh CLI, enabling AI agents to call LLMs via a unified interface.

llm-models

🎯ai-rag-pipeline🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-rag-pipeline

🎯image-upscaling🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

image-upscaling

🎯nano-banana🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

nano-banana

🎯ai-marketing-videos🎯Skill

A marketing video skill from the inference.sh collection, part of the video guides covering storyboards, explainers, and ads using 40+ video generation models through the inference.sh CLI.

ai-marketing-videos

🎯case-study-writing🎯Skill

A case study writing guide from the inference.sh skills collection, part of the writing guides covering blogs, case studies, and newsletters with AI content creation through the inference.sh CLI.

case-study-writing

🎯ai-avatar-video🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

ai-avatar-video

🎯flux-image🎯Skill

Image generation skill from the inference.sh collection focused on FLUX models, part of a suite of 50+ image models including Gemini and Reve accessible through the inference.sh CLI.

flux-image

🎯content-repurposing🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

content-repurposing

🎯talking-head-production🎯Skill

Create talking head videos with AI avatars and lipsync via inference.sh CLI, using tools like OmniHuman and PixVerse. Covers portrait requirements, audio quality, and voiceover for spokesperson videos, course content, and presentations.

talking-head-production

🎯pitch-deck-visuals🎯Skill

Create investor-ready pitch deck visuals via inference.sh CLI with a structured 12-slide framework covering problem, solution, market, traction, and team slides. Includes visual design rules, chart type guidance, and data presentation best practices.

pitch-deck-visuals

🎯nano-banana-2🎯Skill

Generate images with Google Gemini 3.1 Flash Image Preview (Nano Banana 2) via the inference.sh CLI, supporting text-to-image, image editing, multi-image input (up to 14 images), and Google Search grounding.

nano-banana-2

🎯tools-ui🎯Skill

An inference.sh skill that provides UI components for rendering tool call inputs and results in AI agent interfaces, supporting the inference.sh platform with 250+ AI models.

tools-ui

🎯background-removal🎯Skill

Part of the inference.sh skills collection providing access to 250+ AI models via CLI, including image generation, video generation, LLM calls, web search, and social media automation.

background-removal

🎯explainer-video-guide🎯Skill

An explainer video guide from the inference.sh skills collection, part of the video guides covering storyboards, explainers, and ads using 40+ video models through the inference.sh CLI.

explainer-video-guide

🎯javascript-sdk🎯Skill

JavaScript/TypeScript SDK skill from the inference.sh collection, providing streaming support, tool integration, and React components for building AI applications with 250+ models via the inference.sh CLI.

javascript-sdk

🎯elevenlabs-voice-isolator🎯Skill

Remove background noise and isolate vocals from audio recordings using ElevenLabs via inference.sh CLI. Supports WAV and MP3 formats up to 500MB and 1 hour, useful for podcast cleanup, interview audio, and audio restoration.

elevenlabs-voice-isolator

🎯ai-podcast-creation🎯Skill

Create AI-powered podcasts and audio content using text-to-speech tools like Kokoro TTS, DIA TTS, and Chatterbox via inference.sh CLI. Supports multi-voice conversations, background music, intro/outro, and full episode production.

ai-podcast-creation

🎯elevenlabs-voice-changer🎯Skill

Transform any voice into a different voice while preserving speech content and emotion using ElevenLabs speech-to-speech models via inference.sh CLI. Supports 70+ languages with the multilingual model and English-optimized processing.

elevenlabs-voice-changer

🎯qwen-image-2-pro🎯Skill

Generate images with Alibaba Qwen-Image-2.0-Pro via inference.sh CLI, featuring professional text rendering, fine-grained realism, and enhanced semantic adherence. Ideal for posters, banners, and text-heavy designs.

qwen-image-2-pro

🎯og-image-design🎯Skill

Create Open Graph and social sharing images via inference.sh CLI with platform-specific specs for Facebook, Twitter/X, and LinkedIn. Covers text placement, branding guidelines, OG meta tags, and dynamic image generation.

og-image-design

🎯product-changelog🎯Skill

A product changelog guide from the inference.sh skills collection, part of the product guides covering competitor analysis, personas, and launches using AI models through the inference.sh CLI.

product-changelog

🎯speech-to-text🎯Skill

Transcribe audio to text via inference.sh CLI using ElevenLabs Scribe v2 (98%+ accuracy with diarization), Fast Whisper Large V3, and Whisper V3 Large. Supports multi-language transcription, timestamps, speaker diarization, and audio event tagging.

speech-to-text

🎯building-inferencesh-apps🎯Skill

An inference.sh skill for building inference.sh apps. Part of a collection of 250+ AI agent skills providing access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

building-inferencesh-apps

🎯elevenlabs-tts🎯Skill

Premium text-to-speech with 22+ ElevenLabs voices via inference.sh CLI, offering three model tiers: Multilingual v2 (highest quality, 32 languages), Turbo v2.5 (balanced speed), and Flash v2.5 (ultra-fast). Features stability and style control for voiceovers, audiobooks, and video narration.

elevenlabs-tts

🎯p-video🎯Skill

Generate videos with Pruna P-Video and WAN models via inference.sh CLI, supporting text-to-video, image-to-video, and audio integration at 720p/1080p. Pruna optimizes models for faster inference without quality loss.

p-video

🎯chat-ui🎯Skill

Chat UI component from the inference.sh skills collection, providing reusable chat interface components for conversational experiences with 250+ AI models via the inference.sh CLI.

chat-ui

🎯elevenlabs-stt🎯Skill

High-accuracy speech-to-text transcription using ElevenLabs Scribe models (v1/v2) via inference.sh CLI. Features 98%+ accuracy across 90+ languages with speaker diarization, audio event tagging, word-level timestamps, and forced alignment for subtitle generation.

elevenlabs-stt

🎯text-to-speech🎯Skill

Convert text to natural speech via inference.sh CLI using multiple TTS engines including ElevenLabs (22+ voices, 32 languages), DIA TTS for conversational speech, Kokoro TTS, Chatterbox, and more. Supports voice cloning, multi-speaker dialogue, and podcast generation.

text-to-speech

🎯video-prompting-guide🎯Skill

A video prompting guide from the inference.sh skills collection, covering prompt techniques for 40+ video generation models including Veo, Seedance, and Wan through the inference.sh CLI.

video-prompting-guide

🎯related-skill🎯Skill

An inference.sh skill for discovering related skills. Part of a collection of 250+ AI agent skills providing access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

related-skill

🎯elevenlabs-dubbing🎯Skill

Automatically dub audio and video into 29 languages using ElevenLabs via inference.sh CLI, with automatic speaker detection and voice-preserving translation. Ideal for content localization, video translation, and international distribution.

elevenlabs-dubbing

🎯twitter-thread-creation🎯Skill

A Twitter thread creation guide from the inference.sh skills collection, part of the social guides covering LinkedIn, Twitter threads, and carousels with AI content generation through the inference.sh CLI.

twitter-thread-creation

🎯linkedin-content🎯Skill

A LinkedIn content guide from the inference.sh skills collection, part of the social guides covering LinkedIn posts, Twitter threads, and carousel design using AI content generation through the inference.sh CLI.

linkedin-content

🎯book-cover-design🎯Skill

Create genre-appropriate book covers with AI image generation via inference.sh CLI. Covers fiction and non-fiction genre conventions, typography rules, sizing, thumbnail testing, and iteration workflows for self-publishing, ebook, and print covers.

book-cover-design

🎯qwen-image-2🎯Skill

Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI, offering both a fast standard model and a Pro model with professional text rendering. Supports text-to-image generation, multi-image editing, and complex text rendering.

qwen-image-2

🎯elevenlabs-sound-effects🎯Skill

An inference.sh skill for ElevenLabs sound effects generation. Part of a collection of 250+ AI agent skills providing access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

elevenlabs-sound-effects

🎯widgets-ui🎯Skill

Declarative UI widget renderer for React/Next.js that renders rich interactive interfaces from JSON via ui.inference.sh. Supports forms, buttons, cards, layouts, inputs, selects, and checkboxes for agent-generated UIs and dynamic forms.

widgets-ui

🎯customer-persona🎯Skill

A customer persona guide from the inference.sh skills collection, part of the product guides covering competitor analysis, personas, and launches using AI models through the inference.sh CLI.

customer-persona

🎯elevenlabs-music🎯Skill

An inference.sh skill for AI music generation via ElevenLabs. Part of a collection of 250+ AI agent skills that provide access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

elevenlabs-music

🎯newsletter-curation🎯Skill

An inference.sh skill for newsletter curation. Part of a collection of 250+ AI agent skills providing access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

newsletter-curation

🎯elevenlabs-dialogue🎯Skill

An inference.sh skill for ElevenLabs dialogue generation. Part of a collection of 250+ AI agent skills providing access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

elevenlabs-dialogue

🎯press-release-writing🎯Skill

An inference.sh skill for press release writing. Part of a collection of 250+ AI agent skills providing access to image generation, video generation, LLM models, web search, and social media automation through the inference.sh CLI.

press-release-writing

🎯p-image🎯Skill

An inference.sh skill for AI image generation. Part of a collection of 250+ AI agent skills providing access to 50+ image models (FLUX, Gemini, Reve), 40+ video models, LLM models, web search, and social media automation through the inference.sh CLI.

p-image

🎯dialogue-audio🎯Skill

Create realistic multi-speaker dialogue audio using Dia TTS and ElevenLabs via inference.sh CLI. Supports speaker tags for two-voice conversations with emotion control, pacing adjustments, and post-production for podcasts, audiobooks, and character dialogue.

dialogue-audio

Back to Home