🐱 Creative Studio

Tip: Right-click and "Open in new tab" to run multiple workflows simultaneously without losing progress.

New
Nano Banana 2

Edit + Generate Images

Gemini 3.1 Flash Image — pro-level visual intelligence with Flash-speed efficiency. Edit with up to 14 reference images or generate from text.

Popular
Wan 2.6 Image Generation

Text + References → Image

Generate or edit images with Wan 2.6. Supports mixed text+image input for style/composition references.

Popular LoRA

SVI Pro - Extended Video

Image → 5-30s Video

Create high-quality extended videos up to 30 seconds! Improved WAN 2.2 models for superior quality.

Popular LoRA
Instagirl Aesthetic

Text → Instagram Style Image

Generate images with Instagram-perfect aesthetic. Optimized for portraits with the "Instagirl" style.

Popular LoRA
WAN Realistic Image

Text + LoRA → Realistic Image

Generate highly realistic images with optimized settings and a special realism-focused LoRA.

Popular
Nano Banana Pro + Imagen 4

Edit Images + Generate

Nano Banana Pro for editing, Imagen 4 for generation. Google DeepMind's latest image AI.

Popular

Veo 3.1

AI Video Generation

Google DeepMind's advanced video generation. T2V, I2V, and First/Last Frame modes. $0.10-0.40/sec.

Popular LoRA
Qwen Text to Image

Text + LoRA → High Quality Image (Qwen 2512)

Generate stunning photorealistic images with Qwen-Image-2512, the latest text-to-image model from Alibaba. Features enhanced human realism, finer natural details, and improved text rendering. December 2025 update with significant quality improvements.

Popular LoRA
Qwen Text to Image Realistic

Text + LoRA (Qwen Image) → Ultra Realistic Image

An extremely realistic Qwen text-to-image workflow with a special formula that makes your AI influencer sharp, prompt-coherent, and photoreal.

Popular LoRA

WAN Long Video Enhancer

Long Video → Enhanced Video

Enhance videos up to 30 seconds with smart batch processing and seamless frame blending.

LoRA
Z-Image Text to Image

Text + 3 LoRAs → Stage 1 & 2 Images

Two-stage Z-Image Base generation. Get both base and detailed refined outputs. Up to 3 custom LoRAs. Full control over denoise, steps, and CFG.

LoRA
Z-Image from Reference

Reference Image + 3 LoRAs → Stage 1 & 2 Images

Generate images based on a reference image. The AI analyzes the reference and creates a prompt automatically. Up to 3 custom LoRAs. Full control over denoise, steps, and CFG.

LoRA

WAN Animate

Image + Video → Video

Transfer motion from any video to your image. Perfect for dancing, walking, or complex movements.

WAN First Frame Last Frame

Morph Video

Create smooth video transitions morphing from first to last frame.

LoRA

Wan 2.2 Image to Video (Fast)

Image + LoRA (Wan 2.2) → Video

Fast image-to-video with optional LoRA (Wan 2.2) and frame interpolation for smoother motion.

WAN 2.2 Long Video (Basic)

Image → Long Video (up to 30s)

Create longer videos up to 30 seconds. For better consistency, try SVI WAN 2.2 Extended Video.

Slow

WAN SCAIL Dance Video

Image + Dance Video → Animated Character

Transfer dance moves from a reference video to your character using SCAIL pose detection. Supports 1-10 people. ⚠️ SLOW (30 min - 2h)

LoRA
WAN 2.2 Text to Image

Text + 2x Custom LoRA Slots → Image

Generate high-quality images from text prompts with two optional custom LoRA slots (WAN 2.2 High/Low Noise).

Fast LoRA
Z-Image Turbo

Text + LoRA + Upscaler → Image

Ultra-fast image generation with optional upscale. Results in seconds!

InfiniteTalk - Image to Video

Image + Audio → Talking Head Video

Generate talking head videos from a face image and audio. Max 7 min audio, 1024px image. Powered by Wan 2.1 InfiniteTalk.

InfiniteTalk - Video to Video

Video + Audio → Talking Head Video

Transform a video into a talking head synced to audio. V2V with color matching. Max 7 min audio.

LoRA
Z-Image Base + Turbo

Text + LoRA → Base & Turbo Images

Two-stage Z-Image. Get both base and turbo-refined outputs with LoRA presets and up to 3 custom LoRAs.

FLUX Upscale

Image → HD Image

Upscale images using the FLUX model for high-quality results.

SeedVR2 4K Upscale

Image → 4K Image

Upscale images to 4K resolution using the SeedVR2 model.

SeedVR2 Simple Upscale

Image → Enhanced Image

Quick image upscaling with SeedVR2 for everyday use.

LoRA
WAN Image Enhancer

Image + LoRA → Enhanced Image

Enhance and upscale images with optional custom LoRA for style control.

LoRA
Wan 2.2 Video Enhancer

Video → Enhanced Video

Enhance video quality. Upscale resolution and boost details frame by frame.

SeedVR2 HD Video

Video → HD Video

Upscale videos to HD resolution using the SeedVR2 model.

Qwen Image Edit

Image + Text → Edited Image

Edit images with text instructions using the Qwen AI model.

LoRA
Qwen Change Clothes

Image + LoRA → New Outfit

Change clothes on people in images with a consistent LoRA style.

Film Grain Effect

Image → Film Style Image

Add authentic film grain texture to your images. Adjust intensity and saturation for a vintage look.

Frame Interpolation

Video → 2x FPS

Auto-detect and double your video frame rate using RIFE AI interpolation. Smoother motion!

LTX-2 Studio Text to Video

Text → Video with Audio

Generate high-quality video with synchronized audio from text prompts. LTX-2 is a 19B DiT model with native audio generation.

LTX-2 Studio Image to Video

Image → Video with Audio (up to 15s)

Animate your image into video with synchronized audio. Uses the LTX-2 19B model for high-quality results.

LTX-2 Studio Canny Control

Video → Styled Video with Audio

Transform videos using edge detection (Canny) for precise motion control. Keep the movement, change the style.

LTX-2 Studio Depth Control

Video → Styled Video with Audio

Transform videos using depth maps for 3D-aware motion control. Maintains spatial relationships while changing style.

Wan 2.6 Text-to-Video

Text + Audio → HD Video

Alibaba's latest Wan 2.6 model. Generate 5-15 sec videos from text with optional audio and lip-sync.

Wan 2.6 Image-to-Video

Image + Audio → HD Video

Transform your image into smooth video with Wan 2.6. Add custom audio or auto-dub. 720P/1080P, up to 15 seconds.

Wan 2.6 Reference-to-Video

Image Reference → New Video

Use reference images to maintain character consistency. Perfect for consistent characters across videos.

Multiple Character Angles

Character Image → 8 Angle Images

Generate 8 different camera angles (including close-up, wide, 45°, 90°, aerial, and low angle) from a single character image using Qwen AI.

Qwen Image Relight

Image → Relit Image

Change the lighting of your image using Qwen AI. Create dramatic, warm, cool, or custom lighting effects.

LoRA
Qwen Inpainting

Paint Mask → AI Fill

Paint over areas you want to change, then describe what should replace them. Perfect for object removal, replacement, or adding new elements.

LoRA
Qwen Upscale

Image → HD Upscaled Image

High-definition magnification trained on Qwen-Image-Edit-2511. Losslessly enlarges images to approximately 2K size. Add your own LoRA for custom styles.

Qwen Camera Angle

Image + Angle → New View

Generate different camera angles of your image using interactive 3D controls. Adjust horizontal angle (0-360°), vertical angle (-30° to 60°), and zoom level.

Qwen Multi Camera Angles

Image → 6 Different Views

Generate 6 different camera angles of your image at once. Configure each angle with horizontal, vertical, and zoom controls for comprehensive character sheets or product views.

Premium LoRA
LTX-2 Image + Audio to Video

Image + Audio → Video (up to 15s)

Create video from your image with your own audio track. Upload custom audio up to 15 seconds. LTX-2 19B model with camera control options.

Premium

LTX-2 V2V Controlnet + Audio

Video + Audio → Styled Video (up to 15s)

Transform video with ControlNet pose guidance and custom audio. Upload a reference video for motion and your own audio track of up to 15 seconds.

LoRA
LTX-2 Video Detailer

Video → Enhanced Video

Enhance your video quality with LTX-2 Detailer. Supports up to 15 seconds of input. Add custom LoRAs for style enhancement (LTX-2-compatible LoRAs only).

51 AI Tools
24/7 Available
HD+ Quality Output

Essential Guide

How to Train Your Own LoRA Model: Complete Guide to Creating AI Influencers

Watch the full video tutorial above or follow the step-by-step guide below.

Why LoRA Training Matters for Professional AI Content

Training your own LoRA (Low-Rank Adaptation) model is essential when…

Read Tutorial 8 min read
Must Read
New Feature

🎬 Music Video Creator

Create viral lip-synced music videos with the power of AI! Upload your audio track, generate stunning visuals for each beat, and export a professional music video in minutes.

  • Auto beat detection & smart segmentation
  • AI lip-sync for singing characters
  • One-click merge into final video
  • Perfect for TikTok, YouTube & Reels
Start Creating

Have a Suggestion?

Help us improve! Report bugs, request features, or share your feedback with the team.

Share Feedback

Your Voice Matters

We're constantly improving Kitty AI Studio based on your feedback. Whether it's a bug, a feature request, or just a thank you - we'd love to hear from you!