Explore AI models with community-written guides. Learn what each model excels at and how to craft the best prompts for it.
OpenAI
OpenAI's image model with best-in-class text rendering and natural language prompt understanding.
Adobe
Black Forest Labs
High-quality open-weight image model excelling at photorealism and text rendering.
Black Forest Labs
32B-parameter model from Black Forest Labs. State-of-the-art image generation and editing up to 4MP. Precise text rendering, color matching, and character consistency.
OpenAI
Deprecated. OpenAI's first multimodal image generation and editing model: successor to DALL-E 3, predecessor to GPT Image 2. It costs more per token and runs slower than GPT Image 2; new work should use GPT Image 2.
OpenAI
OpenAI's flagship image model, with state-of-the-art generation, editing, and inpainting. Successor to GPT Image 1 with stronger photorealism, multi-resolution flexibility, and best-in-class text rendering carried over from DALL-E 3.
Google
Google's native image generation via Gemini with exceptional prompt adherence and JSON prompt support.
Ideogram
Specializes in typography and graphic design renders with accurate text placement.
Ideogram
Leading model for text-in-image generation. Exceptional typography, graphic design, and logo creation capabilities.
Google
Google's top-tier image model. Exceptional photorealism, text rendering, and compositional accuracy. Available via Vertex AI.
Leonardo AI
Midjourney
Industry-leading image generation with exceptional aesthetic quality and prompt adherence.
Midjourney
Midjourney's smartest and most coherent model. Enhanced prompt understanding, voice prompting, draft mode for rapid iteration.
Playground
Recraft
Design-focused image model excelling at vector illustrations, icons, and brand-consistent assets.
Recraft
Professional design-focused model. Brand-consistent outputs, style presets, and vector-quality illustrations.
Stability AI
Versatile open-source model with a massive ecosystem of fine-tunes, LoRAs, and community tools.
Stability AI
Next-gen architecture with improved coherence and detail over SDXL.
Minimax
ByteDance / Peking University
14B autoregressive diffusion model generating 60-second videos at real-time speed on a single GPU.
Kuaishou
Chinese video model with impressive motion quality and longer generation times.
Kuaishou
Major upgrade with improved motion quality, longer clips, and better facial expressions. Cost-effective for social media content.
Lightricks
Open-source 22B diffusion transformer. Native 4K video with synchronized audio generation in a single pass.
Luma
Minimax
Chinese video model known for consistent character identity and long-form generation.
Pika Labs
Fast video generation with good creative control and image-to-video capabilities.
Pika
Enhanced creative effects, improved scene generation, and new 'Scenes' feature for multi-shot storytelling.
Runway
Professional video generation with precise motion control and camera direction.
Runway
Runway's latest with improved temporal consistency, camera control, and up to 20-second coherent clips.
ByteDance
ByteDance's video generation model with high motion quality and dance/movement specialization.
OpenAI
OpenAI's video generation model producing cinematic-quality clips from text prompts.
Google
Google's video generation model producing high-fidelity clips with cinematic camera control.
Anthropic
Anthropic's most powerful model — exceptional at analysis, writing, and code generation.
Anthropic
Balanced performance model — fast, capable, and cost-effective for most tasks.
Anthropic
Fastest Claude model with near-Sonnet quality. Extended thinking, computer use, image understanding. One-third the cost of Sonnet.
Anthropic
Near-Opus performance at Sonnet pricing. Anthropic's best balance of intelligence, speed, and cost for production use.
Cohere
DeepSeek
Open-source reasoning model rivaling frontier closed models. Exceptional at math, code, and step-by-step reasoning.
OpenAI
OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. Auto-tracks new Instant model rollouts so apps stay current without code changes. 400K context.
OpenAI
Multimodal model excelling at reasoning, coding, and natural conversation.
OpenAI
OpenAI's most capable model with advanced reasoning and tool use.
OpenAI
Latest GPT-5 variant with 1.05M token context, 33% fewer factual errors. Standard, Thinking, and Pro variants available.
Google
Google's flagship model with native multimodal reasoning and massive context window.
Google
Google's GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. Supports text, image, video, audio, and PDF inputs. Designed for lightweight agentic tasks. 1M token context window.
Google
Google's frontier model. Leading scores on 13/16 benchmarks. 2M+ context window, native multimodal, Google Search grounding.
IBM
Dense, decoder-only 8B-parameter language model from IBM, part of the Granite 4.1 family. 131K-token context window, designed for enterprise tasks like RAG, summarization, classification, and tool calling. Apache-2.0 open weights.
xAI
xAI's reasoning model with real-time X/Twitter data access and strong analytical capabilities.
xAI
xAI reasoning model accepting text and image inputs with text output. Suited for agentic workflows, instruction-following, and applications requiring high factual accuracy. 1M token context window.
Meta
Meta's open-weight frontier model — free to use, fine-tune, and deploy at any scale.
Meta
Meta's 400B MoE model with 128 experts. Strong multilingual, long-context, and instruction-following capabilities.
Mistral
Mistral
Dense 128B instruction-following model from Mistral AI. Supports text and image inputs with text output. Designed for agentic workflows, coding, and complex reasoning. 256K context window.
NVIDIA
30B-A3B open multimodal model from NVIDIA designed to function as a perception and context sub-agent in enterprise agent systems. Accepts text, image, video, and audio. 256K context. Reasoning-tuned variant.
OpenRouter
High-performance foundation model designed for agentic workloads. Native tool use and long-context support, with strong performance in code generation, automated workflows, and complex instruction execution. 1M+ token context.
Alibaba
Alibaba
Large-scale multimodal language model from Alibaba (April 2026 release). Accepts text, image, and video input with text output. 1M token context window. Tuned for production agentic and reasoning workloads.
Alibaba
Fast, efficient language model from Alibaba's Qwen 3.6 series. Supports text, image, and video input with 1M token context window. Tiered pricing kicks in beyond context thresholds. Optimized for high-throughput, low-latency workloads.
inclusionAI
1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require strong capability and operational efficiency. Optimized for coding agents and tool use. 256K context window. Available free on OpenRouter.
Aider
Open-source CLI tool for AI pair programming. Works with any LLM, git-aware, edits multiple files. Terminal-based workflow.
Augment
AI coding assistant with deep codebase understanding. Focuses on large codebases and enterprise development workflows.
Anthropic
Anthropic's agentic coding tool for autonomous development — edits, tests, and commits code.
Baidu
Baidu Qianfan code generation model optimized for coding tasks and AI Agent workflows. Features high inference throughput, low end-to-end latency, and native tool use. 131K context window. Available free on OpenRouter.
OpenAI
OpenAI's coding agent running locally via CLI with autonomous code editing.
GitHub
Cursor
AI-first IDE with context-aware code generation, multi-file editing, and custom rules system.
DeepSeek
Open-source code model rivaling proprietary alternatives in programming tasks.
Cognition
Autonomous AI software engineer. Plans, writes, debugs, and deploys code independently. Works on full tasks, not just completions.
Poolside
Flagship coding agent model from Poolside, optimized for complex software engineering tasks. Designed for agentic coding workflows with tool calling and reasoning. 131K context window.
Poolside
Second-generation efficient coding agent in the XS size class from Poolside. Combines tool calling and reasoning with a compact footprint — strong cost/quality balance for high-volume code completion and agent loops. 131K context.
NVIDIA
120B-parameter (12B active) MoE model specialized for software development. Strong at enterprise coding, debugging, and architecture.
Codeium
AI coding assistant with Cascade flow for multi-step autonomous development tasks.
ACE Studio
Open-source music generation with fine-grained control over BPM, key, lyrics, and instrumentation.
ElevenLabs
Industry-leading voice synthesis with voice cloning and emotional control.
Hume
Emotionally intelligent voice AI. Detects and generates emotional nuance in speech. Best for empathetic voicebots and assistants.
Meta
Stability AI
Suno
Full song generation with vocals, instruments, and production from text prompts.
Suno
Latest Suno with dramatically improved vocal realism, longer generation, and better genre accuracy. Supports covers and remixes.
Udio
Music generation with high audio fidelity and precise genre/style control.