LogoClpo

Explore AI Models

AI Models

Explore our collection of cutting-edge AI models for image and video generation

Flux 2
Popular
image
Black Forest Labs

Black Forest Labs

Flux 2

Black Forest Labs' production-grade image generation model family delivering 4MP photorealistic output, multi-reference consistency across up to 10 images, and reliable text rendering — all in sub-10-second generation speeds.

text-to-imageimage-to-imagephotorealistic

From 3 credits

Nano Banana
Fast
image
Google

Google

Nano Banana

Google's Gemini Flash-powered image generation and editing model that went viral for its speed, real-world knowledge, and AI-assisted editing capabilities.

text-to-imageimage-to-imagefast

From 2 credits

Veo 3.1
Premium
video
Google

Google

Veo 3.1

Google DeepMind's state-of-the-art video generation model featuring native audio synthesis, up to 4K resolution, and cinematic realism with advanced physics simulation.

text-to-videoimage-to-videohigh-quality

From 9 credits

Kling 2.1
New
video
Kling

Kuaishou

Kling 2.1

Kuaishou's cinematic AI video model powered by 3D spatiotemporal attention — delivering industry-leading physics simulation, hyper-realistic facial expressions, and up to 1080p output across Standard, Pro, and Master tiers.

text-to-videoimage-to-videoprofessional

From 11 credits

GPT Image 1.5
Premium
image
OpenAI

OpenAI

GPT Image 1.5

OpenAI's flagship natively multimodal image model with industry-leading instruction following, precise region-aware editing, and best-in-class text rendering — now up to 4x faster than its predecessor.

text-to-imageimage-to-imagehigh-quality

From 10 credits

Sora 2
Popular
video
OpenAI

OpenAI

Sora 2

OpenAI's flagship video-and-audio generation model with advanced physics simulation, native synchronized audio, and multi-shot scene control — released September 30, 2025

text-to-videoimage-to-videocinematic

From 5 credits

Hailuo
video
Hailuo

MiniMax

Hailuo

MiniMax's Hailuo 02 video generation models deliver cinematic-grade physics simulation, expressive character motion, and versatile stylization across text-to-video and image-to-video workflows.

text-to-videoimage-to-videofast

From 13 credits

Imagen 4
image
Google

Google

Imagen 4

Google DeepMind's leading text-to-image model delivering up to 2K resolution, superior text rendering, and diverse art styles — engineered for professional creative work.

text-to-imagehigh-quality

From 2 credits

Qwen Image
image
Qwen

Alibaba

Qwen Image

Alibaba's 20-billion-parameter MMDiT image generation model excelling at precise bilingual text rendering, native high-resolution output up to 3584×3584, and unified generation and editing in a single model.

text-to-imageartistic

From 2 credits

Seedream 4.5
New
image
ByteDance

ByteDance

Seedream 4.5

ByteDance's professional-grade image generation model with class-leading text rendering, 4K output, and multi-reference consistency for commercial creative work.

text-to-imagehigh-qualityaesthetic

From 4 credits

Wan 2.6
video
Qwen

Alibaba

Wan 2.6

Alibaba's Wan2.6 series delivers multi-shot storytelling, native audio-visual synchronization, and reference-to-video generation up to 15 seconds at 1080p.

text-to-videoimage-to-videolong-form

From 30 credits

Grok Imagine
Budget
image
Grok

xAI

Grok Imagine

xAI's Aurora-powered image generation model delivering photorealistic rendering, precise instruction following, and native image editing at the lowest cost per generation

text-to-imageimage-to-imageaffordable

From 1 credits

Seedance 1.5
Native Audio
video
ByteDance

ByteDance

Seedance 1.5

ByteDance's joint audio-video generation model that natively synchronizes dialogue, sound effects, and ambient audio with video at millisecond precision using a 4.5B-parameter Dual-Branch Diffusion Transformer.

text-to-videoimage-to-videoversatile

From 18 credits

Grok Video
Fast & Affordable
video
Grok

xAI

Grok Video

xAI's Aurora-powered video generation model delivering industry-leading speed (~30s generation) and cost ($0.05/sec) with native audio, multiple aspect ratios, and both text-to-video and image-to-video modes.

text-to-videoimage-to-videoaffordable

From 9 credits

LogoClpo

Dream it. Direct it. Clpo creates it. Multi-modal AI video generation platform.

Email
Product
  • Pricing
  • AI Image
  • AI Video
  • AI Models
Resources
    Legal
    • Privacy Policy
    • Terms of Service

    Clpo is an independent product and is not affiliated with, endorsed by, or sponsored by ByteDance or any third-party AI model providers. We provide access to AI models through our custom interface.

    © 2026 Clpo. All Rights Reserved.
    Privacy PolicyTerms of Service