π Empower your AI assistant with multimodal generation capabilities - A powerful AI coding assistant skill that enables image, video, speech, sound effect, 3D model, music/song generation, and web search through the
302aicommand-line tool, powered by 302.AI Official Website.
π‘ Cross-Tool Compatible - This Skill uses the standard SKILL.md format, supporting Claude Code, Cursor, and other compatible AI coding tools
Tell your Agent:
Install this skill: https://github.com/302ai/302ai-cli-skillA powerful AI coding skill that lets your AI assistant understand natural language requests like "generate a watercolor image of a cat" and automatically translate them into the correct 302ai CLI commands β sending requests to 302.AI's cloud services and returning results to you.
- π¨ Multimodal generation β Images, videos, speech, sound effects, 3D models, and music
- π Web search β Multiple search providers (Tavily, Bocha, Exa, Metaso)
- π€ Intelligent command translation β Automatically convert natural language into CLI commands
- β‘ Async and sync modes β Flexible workflows for different scenarios
- π 1400+ AI models β Access to 302.AI's comprehensive model ecosystem
- π οΈ Model management β Easily list, set defaults, and query parameters
| Module | Description | Mode | Output Format |
|---|---|---|---|
| π¨ Image | Text-to-image / Image-to-image | Async (recommended) or Sync | PNG / WEBP / JPG URL |
| π¬ Video | Text-to-video / Image-to-video | Async only | MP4 URL |
| π£οΈ TTS | Text-to-speech | Async | MP3 / WAV URL |
| π STT | Speech-to-text (transcription) | Sync | JSON text |
| π SFX | Text-to-sound-effect | Async | MP3 URL |
| π§ 3D | Text-to-3D / Image-to-3D | Async | GLB file URL |
| π΅ Music | AI music/song generation + lyric writing | Async (Suno) or Sync (Minimax/ElevenLabs) | MP3 URL |
| π Search | Web search (multiple providers) | Sync | JSON results |
| Module | Supported Models |
|---|---|
| π¨ Image | nano-banana-2 β’ gpt-image-2 β’ doubao-seedream-5-0-260128 β’ nano-banana-pro β’ nano-banana β’ gpt-image-1.5 β’ wan2.7-image β’ wan2.7-image-pro |
| π¬ Video | happyhorse-1.0 β’ kling-o3 β’ doubao-seedance-2-0-260128 β’ wan2.7 β’ official-kling-v3 β’ runway-gen4 β’ minimaxi-hailuo-02 β’ google-veo3.1 β’ google-veo3.1-pro β’ minimaxi-hailuo-2.3 β’ viduq3-pro |
| π£οΈ TTS | tts-1-hd β’ qwen-tts-latest β’ glm-tts β’ doubao-tts β’ speech-02-hd β’ speech-1.6 β’ dubbingx-tts β’ eleven_multilingual_v2 β’ gemini-2.5-flash-preview-ttsβ’ mureka-tts and 32 more models |
| π STT | whisper-1 β’ gpt-4o-transcribe β’ gpt-4o-mini-transcribe β’ gpt-4o-transcribe-diarize β’ recognize β’ scribe_v1_experimental β’ scribe_v1 β’ sensevoice |
| π SFX | kling-sfx |
| π§ 3D | hyper3d-rodin |
| π΅ Music | suno-v5 (fenix/crow/bluejay/auk) β’ suno-v4 β’ suno-v3.5 β’ minimax-music-2.5+ β’ minimax-music-2.5 β’ minimax-music-2.0 β’ minimax-music-1.5 β’ elevenlabs-music-v1 |
| π Search | tavily β’ search1_search β’ search1_news β’ bocha β’ exa β’ firecrawl β’ metaso β’ unifuncs β’ perplexity |
π‘ Tips:
- The models above are listed for reference. For exact model IDs to use in commands, run
302ai model list <type>- Speed, quality, and price vary across models β choose based on your actual needs. For pricing, visit the official website
You ask:
Generate a watercolor painting of a sunset over mountains
Your AI assistant executes:
# Async mode (recommended) β avoids timeout
302ai image create --prompt "watercolor painting of a sunset over mountains" --model flux-1.1-pro
302ai image fetch <taskid> --shortResult: Returns a URL to the generated image
You ask:
Make a video of a cat playing with yarn
Your AI assistant executes:
# Video generation takes 1β5 minutes, async only
302ai video create --prompt "a cat playing with yarn" --model kling-v1.6-standard
302ai video fetch <taskid> --shortResult: Returns a URL to the generated video
You ask:
Read this text aloud in a female voice: "Welcome to 302.AI"
Your AI assistant executes:
# First-time TTS setup requires cache refresh
302ai tts refresh
# Then create the speech task
302ai tts create --text "Welcome to 302.AI" --provider openai --voice alloy --model tts-1
302ai tts fetch <taskid> --shortResult: Returns a URL to the audio file
β οΈ Important: The TTS module requires running302ai tts refreshbefore first use to cache the provider and voice list.
You ask:
Search for the latest news on AI developments
Your AI assistant executes:
# Default provider is Tavily (best for English content)
302ai search run --query "latest AI developments" --provider tavilyResult: Returns JSON-formatted search results
π‘ Search tips:
- Default provider:
tavily(best for English content)- Chinese content: Use
--provider bochafor better results- Academic search: Use
--provider metaso --category scholaror--provider exa --category "research paper"
You ask:
Create a 3D model of a coffee mug
Your AI assistant executes:
# 3D model generation takes 1β5 minutes
302ai 3d create --prompt "a coffee mug" --model hyper3d-rodin
302ai 3d fetch <taskid> --shortResult: Returns a URL to the .glb file
π¦ 3D file viewing: The
.glbformat can be opened with Blender (free), Three.js online viewer, or any 3D model viewer supporting the glTF format.
You ask:
Create a song about summer vacation
Your AI assistant executes:
# Suno generates 2 songs per task, takes 1β3 minutes
302ai song create --prompt "a cheerful song about summer vacation at the beach" --provider suno
302ai song fetch <taskid> --shortResult: Returns 2 audio URLs (Suno generates 2 variations per request)
π‘ Song generation tips:
- Suno (async): Best quality, generates 2 songs per task, supports custom lyrics
- Minimax (sync): Faster, requires the
--lyricsparameter- ElevenLabs (sync): Requires
--composition-planinstead of lyrics
You ask:
Write lyrics for a rock song about freedom
Your AI assistant executes:
# Generate lyrics (compatible with Suno and Minimax)
302ai song lyrics --prompt "rock song about freedom and breaking chains" --provider sunoResult: Returns formatted lyrics in JSON
π Using lyrics: The generated lyrics can be passed to
song create --lyricsorsong generate --lyricsfor music generation.
You ask:
Generate music for these lyrics: [your lyrics here]
Your AI assistant executes:
# Minimax sync mode β returns the audio URL immediately
302ai song generate --lyrics "Your verse here\nChorus here\n..." --provider minimax --model speech-01-turboResult: Returns a single audio URL
This Skill is suitable for all AI coding tools that support the SKILL.md format.
If your tool supports claude skill commands (such as Claude Code, Cursor):
claude skill install github:302ai/302ai-cli-skill/.claude/skills/302ai-cliFor other tools that support SKILL.md:
-
Clone the 302ai-cli-skill repository or download the 302ai-cli-skill ZIP:
git clone https://github.com/302ai/302ai-cli-skill.git
Or click Code β Download ZIP on the 302ai/302ai-cli-skill repository page on GitHub, then extract the files.
-
Copy the 302ai-cli-skill folder to your tool's
skillsdirectory (check your tool's documentation for the location) -
Restart your tool
When new features are released (such as the Music module), follow these steps:
# For Claude Code
claude skill update github:302ai/302ai-cli-skill/.claude/skills/302ai-cli
# For Cursor or other tools
# Re-download SKILL.md and replace the old onepip install cli_302ai==1.0.2b2 --upgrade
β οΈ Critical: Both steps are required. Updating only the skill will not enable new modules such as 3D, Search, or Song. The CLI package version must match the skill version.
macOS / Linux:
# Add to ~/.bashrc or ~/.zshrc for persistence
export AI302_KEY="your-api-key-here"Windows PowerShell:
# Add to PowerShell profile for persistence
$env:AI302_KEY = "your-api-key-here"Windows CMD:
set AI302_KEY=your-api-key-here302ai image create --prompt "a cat" --api_key "your-api-key-here"ai302-cli/
βββ SKILL.md # Main skill instructions
βββ README.md # This file (English documentation)
βββ README_CN.md # Chinese documentation
βββ references/
βββ image.md # Image generation commands
βββ video.md # Video generation commands
βββ tts.md # Text-to-speech commands
βββ stt.md # Speech-to-text commands
βββ sfx.md # Sound effect commands
βββ 3d.md # 3D model generation
βββ song.md # Music/song generation
βββ search.md # Web search
βββ model.md # Model management
βββ record.md # Billing queries
Your AI assistant can chain multiple operations together:
Generate an image of a sunset, then create an animated video from it, and add background music
The assistant will:
- Generate the image using
302ai image create - Use that image for
302ai video create --image - Generate music using
302ai song create - Return all three URLs
If you do not specify a model, the CLI uses smart defaults:
- Images:
flux-1.1-pro(high quality) - Videos:
kling-v1.6-standard(balanced) - TTS: Provider-specific defaults
- Songs:
suno-v4(Suno) orspeech-01(Minimax)
View available models:
302ai model list t2i # Image models
302ai model list t2v # Video models
302ai model list tts # TTS modelsTrack your usage and costs:
# Get billing information for a specific request
302ai record get <request_id>
# Request IDs are available in the JSON output of generation commandsDetailed command options, flags, and examples for each module are available in the references/ folder:
| File | Contents |
|---|---|
references/image.md |
Image generation β t2i, i2i, sync/async workflows, all options |
references/video.md |
Video generation β t2v, i2v, async-only workflow |
references/tts.md |
Text-to-speech β providers, voices, async workflow |
references/stt.md |
Speech-to-text β sync transcription, supported formats |
references/sfx.md |
Sound effects β async generation, duration options |
references/3d.md |
3D model generation β t23d, i23d, async workflow |
references/song.md |
Music/song generation β Suno async, Minimax/ElevenLabs sync, lyrics |
references/search.md |
Web search β providers (Tavily, Bocha, Exa), category filters |
references/model.md |
Model management β list, set default, get, params |
references/record.md |
Billing queries β look up cost and usage by request ID |
| Version | Date | Changes |
|---|---|---|
| v1.0.2b2 | 2025-06 | Added Song/Music module, package renamed to cli_302ai |
| v1.0.1b2 | 2025-06 | Added 3D module, Search module |
| v1.0.1b1 | 2025-05 | Initial release: Image, Video, TTS, STT, SFX |
MIT
- π§ Email: support@302.ai
- π Full documentation: doc.302.ai
- π‘ Issues and feedback: GitHub Issues
β Star this repo if you find it useful!
Made with β€οΈ for developers and AI enthusiasts across all platforms
Get Started β’ View Examples β’ Read Docs