Venice AI

Complete Venice AI platform — text generation, vision/image analysis, web search, X/Twitter search, embeddings, TTS, speech-to-text, image generation, background removal, and more.
Venice AI is the AI platform for people who don't want Big Tech watching over their shoulder. Unlike OpenAI, Anthropic, and Google — where every prompt is logged, analyzed, and potentially used to train future models — Venice offers true privacy with zero data retention on private models. Your conversations stay yours.
Venice is also uncensored: no content filters, no refusals, no "I can't help with that." The AI answers what you actually ask, whether that's creative fiction, security research, controversial topics, or anything else mainstream AI platforms won't touch.
Why agents should care:
This skill gives you the complete Venice platform in one place.
⚠️ API changes: If something doesn't work as expected, check docs.venice.ai — the API specs may have been updated since this skill was written.
API keys start with `vn_...`.

Option A: Environment variable

```bash
export VENICE_API_KEY="vn_your_key_here"
```
Option B: Clawdbot config (recommended)
```jsonc
// ~/.clawdbot/clawdbot.json
{
  "skills": {
    "entries": {
      "venice-ai": {
        "env": { "VENICE_API_KEY": "vn_your_key_here" }
      }
    }
  }
}
```
```bash
python3 {baseDir}/scripts/venice.py models --type text
```
| Script | Purpose |
|---|---|
| `venice.py` | Text generation, vision analysis, models, embeddings, TTS, transcription |
| `venice-image.py` | Image generation, background removal |
| `venice-video.py` | Video generation (Sora, WAN, Runway) |
| `venice-music.py` | Music generation (queue-based async) |
| `venice-upscale.py` | Image upscaling |
| `venice-edit.py` | AI image editing, multi-image editing |
Venice has a huge model catalog spanning text, image, video, audio, and embeddings.
```bash
# List all text models
python3 {baseDir}/scripts/venice.py models --type text

# List image models
python3 {baseDir}/scripts/venice.py models --type image

# List all model types
python3 {baseDir}/scripts/venice.py models --type text,image,video,audio,embedding

# Get details on a specific model
python3 {baseDir}/scripts/venice.py models --filter grok
```
| Need | Recommended Model | Why |
|---|---|---|
| Cheapest text | | Tiny, fast, efficient |
| Best uncensored | `venice-uncensored` | Venice's own uncensored model |
| Best private + smart | | Great reasoning, efficient |
| Vision/multimodal | | Analyze images, video, audio |
| Best coding | | Massive coder model |
| Frontier fast | | Fast, 262K context |
| X/Twitter search | `grok-41-fast` or `grok-4-20-beta` | Grok models + `--x-search` |
| Frontier max quality | | Best overall quality |
| Reasoning | | Strong chain-of-thought |
| Web search | Any model + `--web-search` | Built-in web search |
```bash
# Simple prompt
python3 {baseDir}/scripts/venice.py chat "What is the meaning of life?"

# Choose a model
python3 {baseDir}/scripts/venice.py chat "Explain quantum computing" --model deepseek-v3.2

# System prompt
python3 {baseDir}/scripts/venice.py chat "Review this code" --system "You are a senior engineer."

# Read from stdin
echo "Summarize this" | python3 {baseDir}/scripts/venice.py chat --model qwen3-4b

# Stream output
python3 {baseDir}/scripts/venice.py chat "Write a story" --stream
```
```bash
# Auto web search (model decides when to search)
python3 {baseDir}/scripts/venice.py chat "What happened in tech news today?" --web-search auto

# Force web search with citations
python3 {baseDir}/scripts/venice.py chat "Current Bitcoin price" --web-search on --web-citations

# Web scraping (extracts content from URLs in prompt)
python3 {baseDir}/scripts/venice.py chat "Summarize: https://example.com/article" --web-scrape
```
Use Grok models to search X (Twitter) for real-time posts and discussions:
```bash
# Search X for latest AI news
python3 {baseDir}/scripts/venice.py chat "latest AI news from X today" \
  --model grok-41-fast --x-search

# Combine X search with web search
python3 {baseDir}/scripts/venice.py chat "What are people saying about OpenAI?" \
  --model grok-4-20-beta --x-search --web-search auto
```
Note: `--x-search` only works with Grok models (`grok-*`). It sets `enable_x_search: true` in `venice_parameters`, which routes search through xAI's infrastructure.
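The note above can be made concrete as a request-body sketch. Only `venice_parameters.enable_x_search` is confirmed by the note; the surrounding OpenAI-style chat schema and the helper name are assumptions for illustration:

```python
# Sketch of the chat request body the --x-search flag likely produces.
# Only venice_parameters.enable_x_search is documented; the rest follows
# the usual OpenAI-style chat schema and is an assumption.
def build_x_search_payload(prompt: str, model: str = "grok-41-fast") -> dict:
    if not model.startswith("grok-"):
        raise ValueError("--x-search only works with Grok models (grok-*)")
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "venice_parameters": {"enable_x_search": True},
    }

payload = build_x_search_payload("latest AI news from X today")
```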
```bash
# Use Venice's own uncensored model
python3 {baseDir}/scripts/venice.py chat "Your question" --model venice-uncensored

# Disable Venice system prompts for raw model output
python3 {baseDir}/scripts/venice.py chat "Your prompt" --no-venice-system-prompt
```
Venice supports extended reasoning/thinking modes with fine-grained effort control:
```bash
# Use a reasoning model with effort control
python3 {baseDir}/scripts/venice.py chat "Solve this math problem..." \
  --model kimi-k2-5 --reasoning-effort high

# Minimal reasoning (faster, cheaper)
python3 {baseDir}/scripts/venice.py chat "Simple question" \
  --model qwen3-4b --reasoning-effort minimal

# Maximum reasoning (slowest, most thorough)
python3 {baseDir}/scripts/venice.py chat "Complex analysis" \
  --model claude-opus-4-6 --reasoning-effort max

# Strip thinking from output (result only)
python3 {baseDir}/scripts/venice.py chat "Debug this code" --model qwen3-4b --strip-thinking

# Disable reasoning entirely
python3 {baseDir}/scripts/venice.py chat "Quick answer" --model qwen3-4b --disable-thinking
```
Reasoning effort values (not all models support all levels):
| Value | Description |
|---|---|
| | No reasoning (fastest) |
| `minimal` | Very brief thinking |
| | Light reasoning |
| | Balanced (often default) |
| `high` | Thorough reasoning |
| | Extended reasoning |
| `max` | Maximum reasoning budget |
For maximum privacy, use End-to-End Encryption with supported models:
```bash
# Enable E2EE (prompts encrypted client-side before reaching Venice)
python3 {baseDir}/scripts/venice.py chat "sensitive analysis" \
  --model some-e2ee-model --enable-e2ee
```
Privacy tiers on Venice:
```bash
# Temperature and token control
python3 {baseDir}/scripts/venice.py chat "Be creative" --temperature 1.2 --max-tokens 4000

# JSON output mode
python3 {baseDir}/scripts/venice.py chat "List 5 colors as JSON" --json

# Prompt caching (for repeated context — up to 90% cost savings)
python3 {baseDir}/scripts/venice.py chat "Question" --cache-key my-session-123

# Show usage stats and balance
python3 {baseDir}/scripts/venice.py chat "Hello" --show-usage

# Use a Venice character
python3 {baseDir}/scripts/venice.py chat "Tell me about yourself" --character venice-default
```
Analyze images using multimodal vision models. Supports local files, URLs, and data URLs.
```bash
# Analyze a local image
python3 {baseDir}/scripts/venice.py analyze photo.jpg "What's in this image?"

# Analyze with default prompt (describe in detail)
python3 {baseDir}/scripts/venice.py analyze photo.jpg

# Analyze from URL
python3 {baseDir}/scripts/venice.py analyze "https://example.com/image.jpg" "Describe the scene"

# Choose vision model
python3 {baseDir}/scripts/venice.py analyze diagram.png "Explain this diagram" \
  --model qwen3-vl-235b-a22b

# Stream the analysis
python3 {baseDir}/scripts/venice.py analyze photo.jpg "Identify all objects" --stream

# Count tokens used
python3 {baseDir}/scripts/venice.py analyze photo.jpg "Analyze this" --show-usage
```
Vision-capable models: `qwen3-vl-235b-a22b`, `claude-opus-4-6`, `gpt-5.2`, and others — check `--list-models` for current availability.
Supported image formats: JPEG, PNG, WebP, GIF, BMP
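Since `analyze` also accepts data URLs, a local image can be inlined without hosting it anywhere. A minimal sketch using only the standard library; the helper name is hypothetical, not part of the skill's scripts:

```python
import base64

def to_data_url(data: bytes, mime: str = "image/png") -> str:
    """Build a data URL usable anywhere the analyze command accepts a URL."""
    return f"data:{mime};base64," + base64.b64encode(data).decode("ascii")

# Tiny example payload (PNG magic bytes only, not a full image;
# it just illustrates the data-URL format)
url = to_data_url(b"\x89PNG\r\n\x1a\n", "image/png")
```

In practice you would read the file first (`to_data_url(Path("photo.jpg").read_bytes(), "image/jpeg")`) and pass the result as the image argument.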
Generate vector embeddings for semantic search, RAG, and recommendations:
```bash
# Single text
python3 {baseDir}/scripts/venice.py embed "Venice is a private AI platform"

# Multiple texts (batch)
python3 {baseDir}/scripts/venice.py embed "first text" "second text" "third text"

# From file (one text per line)
python3 {baseDir}/scripts/venice.py embed --file texts.txt

# Output as JSON
python3 {baseDir}/scripts/venice.py embed "some text" --output json
```
Model: `text-embedding-bge-m3` (private, $0.15/M tokens)
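For the semantic-search and RAG use cases mentioned above, embedding vectors are typically ranked by cosine similarity. A self-contained sketch; the helper names are illustrative, not part of the skill's scripts:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_match(query_vec: list[float], candidates: dict[str, list[float]]) -> str:
    """Return the candidate key whose vector is most similar to the query."""
    return max(candidates, key=lambda k: cosine_similarity(query_vec, candidates[k]))

# Toy 2-D vectors standing in for real embeddings from `embed --output json`
docs = {"canal photo caption": [0.9, 0.1], "pop song lyric": [0.1, 0.9]}
best = top_match([1.0, 0.0], docs)
```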
Convert text to speech with 60+ multilingual voices:
```bash
# Default voice
python3 {baseDir}/scripts/venice.py tts "Hello, welcome to Venice AI"

# Choose a voice
python3 {baseDir}/scripts/venice.py tts "Exciting news!" --voice af_nova

# List available voices
python3 {baseDir}/scripts/venice.py tts --list-voices

# Custom output path
python3 {baseDir}/scripts/venice.py tts "Some text" --output /tmp/speech.mp3

# Adjust speed
python3 {baseDir}/scripts/venice.py tts "Speaking slowly" --speed 0.8
```
Popular voices: `af_sky`, `af_nova`, `am_liam`, `bf_emma`, `zf_xiaobei` (Chinese), `jm_kumo` (Japanese)

Model: `tts-kokoro` (private, $3.50/M characters)
Transcribe audio files to text:
```bash
# Transcribe a file
python3 {baseDir}/scripts/venice.py transcribe audio.wav

# With timestamps
python3 {baseDir}/scripts/venice.py transcribe recording.mp3 --timestamps

# From URL
python3 {baseDir}/scripts/venice.py transcribe --url https://example.com/audio.wav
```
Supported formats: WAV, FLAC, MP3, M4A, AAC, MP4
Model: `nvidia/parakeet-tdt-0.6b-v3` (private, $0.0001/audio second)
```bash
python3 {baseDir}/scripts/venice.py balance
```
| Feature | Cost |
|---|---|
| Image generation | ~$0.01-0.03 per image |
| Background removal | ~$0.02 |
| Image upscale | ~$0.02-0.04 |
| Image edit (single) | ~$0.04 |
| Image multi-edit | ~$0.04-0.08 |
| Video (WAN) | ~$0.10-0.50 |
| Video (Sora) | ~$0.50-2.00 |
| Video (Runway) | ~$0.20-1.00 |
| Music generation | varies by model/duration |
Use `--quote` with video/music commands to check pricing before generation.
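The rough ranges in the table above can be turned into a quick batch estimator before committing to a workflow. A sketch using the table's approximate figures; these may drift, so treat `--quote` as the authoritative source:

```python
# Approximate per-item cost ranges (USD) copied from the pricing table above.
PRICE_RANGES = {
    "image": (0.01, 0.03),
    "background_removal": (0.02, 0.02),
    "upscale": (0.02, 0.04),
    "edit": (0.04, 0.04),
    "multi_edit": (0.04, 0.08),
    "video_wan": (0.10, 0.50),
    "video_sora": (0.50, 2.00),
    "video_runway": (0.20, 1.00),
}

def estimate_batch(counts: dict[str, int]) -> tuple[float, float]:
    """Return a (low, high) total cost estimate for a batch of operations."""
    low = sum(PRICE_RANGES[k][0] * n for k, n in counts.items())
    high = sum(PRICE_RANGES[k][1] * n for k, n in counts.items())
    return round(low, 2), round(high, 2)

# Example: 4 generated images plus one background removal
shoot = estimate_batch({"image": 4, "background_removal": 1})
```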
```bash
# Basic generation
python3 {baseDir}/scripts/venice-image.py --prompt "a serene canal in Venice at sunset"

# Multiple images
python3 {baseDir}/scripts/venice-image.py --prompt "cyberpunk city" --count 4

# Custom dimensions
python3 {baseDir}/scripts/venice-image.py --prompt "portrait" --width 768 --height 1024

# List available models and styles
python3 {baseDir}/scripts/venice-image.py --list-models
python3 {baseDir}/scripts/venice-image.py --list-styles

# Use specific model and style
python3 {baseDir}/scripts/venice-image.py --prompt "fantasy" --model flux-2-pro \
  --style-preset "Cinematic"

# Reproducible results with seed
python3 {baseDir}/scripts/venice-image.py --prompt "abstract" --seed 12345

# PNG format with no watermark
python3 {baseDir}/scripts/venice-image.py --prompt "product shot" \
  --format png --hide-watermark
```
Key flags: `--prompt`, `--model` (default: `flux-2-max`), `--count`, `--width`, `--height`, `--format` (webp/png/jpeg), `--resolution` (1K/2K/4K), `--aspect-ratio`, `--negative-prompt`, `--style-preset`, `--cfg-scale` (0-20), `--seed`, `--safe-mode`, `--hide-watermark`, `--embed-exif`, `--steps`
Remove the background from any image, producing a transparent PNG:
```bash
# From local file
python3 {baseDir}/scripts/venice-image.py --background-remove photo.jpg

# Specify output path
python3 {baseDir}/scripts/venice-image.py --background-remove photo.jpg --output cutout.png

# From URL
python3 {baseDir}/scripts/venice-image.py --background-remove \
  "https://example.com/product.jpg" --output product-transparent.png
```
The output is always a PNG with a transparent background (alpha channel). Works best with clear subject/background separation.
```bash
# 2x upscale
python3 {baseDir}/scripts/venice-upscale.py photo.jpg --scale 2

# 4x with AI enhancement
python3 {baseDir}/scripts/venice-upscale.py photo.jpg --scale 4 --enhance

# Enhanced with custom prompt
python3 {baseDir}/scripts/venice-upscale.py photo.jpg --enhance --enhance-prompt "sharpen details"

# From URL
python3 {baseDir}/scripts/venice-upscale.py --url "https://example.com/image.jpg" --scale 2
```
Key flags: `--scale` (1-4, default: 2), `--enhance` (AI enhancement), `--enhance-prompt`, `--enhance-creativity` (0.0-1.0), `--url`, `--output`
AI-powered editing where the model interprets your prompt to modify the image:
```bash
# Add elements
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "add sunglasses"

# Modify scene
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "change the sky to sunset"

# Remove objects
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "remove the person in background"

# From URL
python3 {baseDir}/scripts/venice-edit.py --url "https://example.com/image.jpg" \
  --prompt "colorize this black and white photo"

# Specify output location
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "add snow" --output result.png
```
Compose or blend multiple images together using advanced edit models:
```bash
# Combine 2 images
python3 {baseDir}/scripts/venice-edit.py --multi-edit base.jpg overlay.png \
  --prompt "merge these images seamlessly"

# Layer 3 images with model selection
python3 {baseDir}/scripts/venice-edit.py --multi-edit bg.jpg subject.png detail.png \
  --prompt "compose these layers into one image" --model flux-2-max-edit

# Mix local file and URL
python3 {baseDir}/scripts/venice-edit.py --multi-edit local.jpg "https://example.com/img.png" \
  --prompt "blend these two photos" --output blended.png
```
Multi-edit models: `flux-2-max-edit` (default), `qwen-edit`, `gpt-image-1-5-edit`
Note: The standard edit endpoint uses Qwen-Image which has some content restrictions. Multi-edit uses Flux-based models.
```bash
# Get price quote first
python3 {baseDir}/scripts/venice-video.py --quote --model wan-2.6-image-to-video --duration 10s

# Image-to-video (WAN - default)
python3 {baseDir}/scripts/venice-video.py --image photo.jpg --prompt "camera pans slowly" \
  --duration 10s

# Image-to-video (Sora)
python3 {baseDir}/scripts/venice-video.py --image photo.jpg --prompt "cinematic" \
  --model sora-2-image-to-video --duration 8s --aspect-ratio 16:9 --skip-audio-param

# Video-to-video (Runway Gen4)
python3 {baseDir}/scripts/venice-video.py --video input.mp4 --prompt "anime style" \
  --model runway-gen4-turbo-v2v

# List models with available durations
python3 {baseDir}/scripts/venice-video.py --list-models
```
Key flags: `--image` or `--video`, `--prompt`, `--model` (default: `wan-2.6-image-to-video`), `--duration`, `--resolution` (480p/720p/1080p), `--aspect-ratio`, `--audio`/`--no-audio`, `--quote`, `--timeout`
Models: `wan-2.6-image-to-video` (default), `sora-2-image-to-video`, `runway-gen4-turbo-v2v`. For Sora, set `--aspect-ratio` and use `--skip-audio-param`.

Venice supports AI music generation via a queue-based async API (similar to video generation). Music is generated server-side and polled for completion.
```bash
# Get price quote first
python3 {baseDir}/scripts/venice-music.py --quote --model elevenlabs-music --duration 60

# Generate instrumental music
python3 {baseDir}/scripts/venice-music.py --prompt "epic orchestral battle theme" --instrumental

# Generate music with lyrics
python3 {baseDir}/scripts/venice-music.py \
  --prompt "upbeat pop summer song" \
  --lyrics "Verse 1: Walking down the beach / feeling the heat..."

# Control duration
python3 {baseDir}/scripts/venice-music.py --prompt "ambient piano meditation" --duration 30

# Specify output location
python3 {baseDir}/scripts/venice-music.py --prompt "jazz café background" \
  --output ~/Music/venice-jazz.mp3

# List available audio models
python3 {baseDir}/scripts/venice-music.py --list-models

# Don't delete from server after download (useful for re-downloading)
python3 {baseDir}/scripts/venice-music.py --prompt "..." --no-delete

# Clean up server-side media after downloading with --no-delete
python3 {baseDir}/scripts/venice-music.py --complete QUEUE_ID
```
Parameters:
| Flag | Description |
|---|---|
| `--prompt` | Music description (style, mood, genre, instruments) |
| `--model` | Model ID (default: `elevenlabs-music`) |
| `--duration` | Duration in seconds |
| `--lyrics` | Optional lyrics text for vocal generation |
| `--instrumental` | Force instrumental (no vocals) |
| | Voice selection for vocal tracks |
| | Language code |
| `--quote` | Get price estimate without generating |
| `--timeout` | Max wait time in seconds (default: 300) |
| | Status check interval (default: 10s) |
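The queue-based flow described above (submit, poll status, download) can be sketched generically. The `check_status` callback and the `complete`/`failed` status strings are illustrative assumptions, not the script's actual internals:

```python
import time

def wait_for_completion(check_status, queue_id: str,
                        timeout: float = 300.0, interval: float = 10.0) -> dict:
    """Poll a queue-based job until it finishes or the timeout elapses.

    `check_status(queue_id)` must return a dict with a 'status' key; the
    status names used here are illustrative, not Venice's documented values.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        job = check_status(queue_id)
        if job["status"] == "complete":
            return job
        if job["status"] == "failed":
            raise RuntimeError(f"generation failed: {job}")
        time.sleep(interval)
    raise TimeoutError(f"job {queue_id} not done after {timeout}s")

# Demo with a fake status function (no network): completes on the third poll
_statuses = iter(["queued", "processing", "complete"])
demo_job = wait_for_completion(lambda q: {"status": next(_statuses), "id": q},
                               "demo-queue-id", timeout=5, interval=0)
```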
Prompt tips for music:
- "chill lo-fi hip hop with piano and rain ambiance, 85 BPM"
- "cinematic orchestral swell, strings and brass, dramatic tension"
- "acoustic folk guitar, warm and intimate, fingerpicking style"

Use `--web-search on --web-citations` to build a research workflow. Venice searches the web, synthesizes results, and cites sources — all in one API call.
With Grok models and `--x-search`, you get real-time access to X posts and discussions. Great for trend monitoring, social listening, and news research.
Venice's uncensored models work for both text AND images. No guardrails blocking legitimate creative use cases.
When processing sensitive data, use `--enable-e2ee` for encrypted prompt delivery.

If you're running an agent loop that sends the same system prompt repeatedly, use `--cache-key` to get up to 90% cost savings.
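One way to keep `--cache-key` values stable across an agent loop is to derive them from the system prompt itself, so identical prompts always reuse the same cache entry. The naming convention and hash choice here are illustrative assumptions, not a Venice requirement:

```python
import hashlib

def cache_key_for(system_prompt: str, prefix: str = "agent") -> str:
    """Derive a stable --cache-key value from a system prompt.

    Same prompt -> same key -> the server-side prompt cache is reused.
    The prefix/format is an illustrative convention.
    """
    digest = hashlib.sha256(system_prompt.encode("utf-8")).hexdigest()[:16]
    return f"{prefix}-{digest}"

key = cache_key_for("You are a senior engineer.")
```

The key could then be passed straight through: `venice.py chat "Question" --cache-key "$KEY"`.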
```bash
# Analyze → describe → generate matching image
python3 scripts/venice.py analyze original.jpg "describe the style and composition" > desc.txt
python3 scripts/venice-image.py --prompt "$(cat desc.txt)" --model flux-2-max
```
Combine TTS and transcription: generate spoken content with `tts`, process audio with `transcribe`. Both are private inference.
Use `--quote` to estimate video cost first.

```bash
# Generate background music, then use it for video
python3 scripts/venice-music.py --prompt "cinematic adventure theme" --output bgm.mp3
python3 scripts/venice-video.py --image scene.jpg --prompt "epic journey" --audio-url bgm.mp3
```
```bash
# Generate → remove background → use in video
python3 scripts/venice-image.py --prompt "product on white background" --format png
python3 scripts/venice-image.py --background-remove output.png --output product-clean.png
```
| Problem | Solution |
|---|---|
| API key not set | Set env var or configure in `~/.clawdbot/clawdbot.json` |
| Invalid API key | Verify at venice.ai/settings/api |
| Model not found | Run `models` to see available models |
| Rate limited | Check `--show-usage` output |
| Video stuck | Videos can take 1-5 min; use `--timeout` for long ones |
| Vision not working | Ensure you're using a vision-capable model (e.g., `qwen3-vl-235b-a22b`) |
| `--x-search` no effect | Only works with Grok models (`grok-*`) |
| Music timeout | Music can take 2-5 min; increase `--timeout` |
| Background removal quality | Works best with clear subject/background contrast |