🐱 Creative Studio
Tip: Right-click and "Open in new tab" to run multiple workflows simultaneously without losing progress.
Nano Banana 2
Gemini 3.1 Flash Image — pro-level visual intelligence with Flash-speed efficiency. Edit with up to 14 reference images or generate from text.
Wan 2.6 Image Generation
Generate or edit images with Wan 2.6. Supports mixed text+image input for style/composition references.
SVI Pro - Extended Video
Create high-quality extended videos up to 30 seconds! Improved WAN 2.2 models for superior quality.
Instagirl Aesthetic
Generate images with Instagram-perfect aesthetic. Optimized for portraits with the "Instagirl" style.
WAN Realistic Image
Generate highly realistic images with optimized settings and special realism-focused LoRA.
Nano Banana Pro + Imagen 4
Nano Banana Pro for editing, Imagen 4 for generation. Google DeepMind's latest image AI.
Veo 3.1
Google DeepMind's advanced video generation. T2V, I2V, and First/Last Frame modes. $0.10-0.40/sec.
Qwen Text to Image
Generate stunning photorealistic images with Qwen-Image-2512, the latest text-to-image model from Alibaba. Features enhanced human realism, finer natural details, and improved text rendering. December 2025 update with significant quality improvements.
Qwen Text to Image Realistic
Absurdly very realistic Qwen text to image workflow with special formula that will make your AI Influencer very sharp, prompt coherent and photoreal.
WAN Long Video Enhancer
Enhance videos up to 30 seconds with smart batch processing and seamless frame blending.
Z-Image Text to Image
Two-stage Z-Image Base generation. Get both base and detailed refined outputs. Up to 3 custom LoRAs. Full control over denoise, steps, and CFG.
Z-Image from Reference
Generate images based on a reference image. AI analyzes the reference and creates prompt automatically. Up to 3 custom LoRAs. Full control over denoise, steps, and CFG.
WAN Animate
Transfer motion from any video to your image. Perfect for dancing, walking, or complex movements.
WAN First Frame Last Frame
Create smooth video transitions morphing from first to last frame.
Wan 2.2 Image to Video (Fast)
Fast image-to-video with optional LoRA (Wan 2.2) and frame interpolation for smoother motion.
WAN 2.2 Long Video (Basic)
Create longer videos up to 30 seconds. For better consistency, try SVI WAN 2.2 Extended Video.
WAN SCAIL Dance Video
Transfer dance moves from a reference video to your character using SCAIL pose detection. Supports 1-10 people. ⚠️ SLOW (30 min - 2h)
WAN 2.2 Text to Image
Generate high-quality images from text prompts with two optional custom LoRA slots (WAN 2.2 High/Low Noise).
Z-Image Turbo
Ultra-fast image generation with optional upscale. Results in seconds!
InfiniteTalk - Image to Video
Generate talking head videos from a face image and audio. Max 7 min audio, 1024px image. Powered by Wan 2.1 InfiniteTalk.
InfiniteTalk - Video to Video
Transform video into talking head synced to audio. V2V with color matching. Max 7 min audio.
Z-Image Base + Turbo
Two-stage Z-Image. Get both base and turbo-refined outputs with LoRA presets and up to 3 custom LoRAs.
FLUX Upscale
Upscale images using FLUX model for high-quality results.
SeedVR2 4K Upscale
Upscale images to 4K resolution using SeedVR2 model.
SeedVR2 Simple Upscale
Quick image upscaling with SeedVR2 for everyday use.
WAN Image Enhancer
Enhance and upscale images with optional custom LoRA for style control.
Wan 2.2 Video Enhancer
Enhance video quality. Upscale resolution and boost details frame by frame.
SeedVR2 HD Video
Upscale videos to HD resolution using SeedVR2 model.
Qwen Image Edit
Edit images with text instructions using Qwen AI model.
Qwen Change Clothes
Change clothes on people in images with consistent LoRA style.
Film Grain Effect
Add authentic film grain texture to your images. Adjust intensity and saturation for vintage look.
Frame Interpolation
Auto-detect and double your video frame rate using RIFE AI interpolation. Smoother motion!
LTX-2 Studio Text to Video
Generate high-quality video with synchronized audio from text prompts. LTX-2 is a 19B DiT model with native audio generation.
LTX-2 Studio Image to Video
Animate your image into video with synchronized audio. Uses LTX-2 19B model for high-quality results.
LTX-2 Studio Canny Control
Transform videos using edge detection (Canny) for precise motion control. Keep the movement, change the style.
LTX-2 Studio Depth Control
Transform videos using depth maps for 3D-aware motion control. Maintains spatial relationships while changing style.
Wan 2.6 Text-to-Video
Alibaba's latest Wan 2.6 model. Generate 5-15 sec videos from text with optional audio and lip-sync.
Wan 2.6 Image-to-Video
Transform your image into smooth video with Wan 2.6. Add custom audio or auto-dub. 720P/1080P, up to 15 seconds.
Wan 2.6 Reference-to-Video
Use reference images to maintain character consistency. Perfect for consistent characters across videos.
Multiple Character Angles
Generate 8 different camera angles (close-up, wide, 45°, 90°, aerial, low angle) from a single character image using Qwen AI.
Qwen Image Relight
Change the lighting of your image using Qwen AI. Create dramatic, warm, cool, or custom lighting effects.
Qwen Inpainting
Paint over areas you want to change, then describe what should replace them. Perfect for object removal, replacement, or adding new elements.
Qwen Upscale
High-definition magnification trained on Qwen-Image-Edit-2511. Losslessly enlarges images to approximately 2K size. Add your own LoRA for custom styles.
Qwen Camera Angle
Generate different camera angles of your image using interactive 3D controls. Adjust horizontal angle (0-360°), vertical angle (-30° to 60°), and zoom level.
Qwen Multi Camera Angles
Generate 6 different camera angles of your image at once. Configure each angle with horizontal, vertical, and zoom controls for comprehensive character sheets or product views.
LTX-2 Image + Audio to Video
Create video from your image with your own audio track. Upload custom audio up to 15 seconds. LTX-2 19B model with camera control options.
LTX-2 V2V Controlnet + Audio
Transform video with ControlNet pose guidance and custom audio. Upload reference video for motion and your own audio track up to 15 seconds.
LTX-2 Video Detailer
Enhance your video quality with LTX-2 Detailer. Supports up to 15 seconds input. Add custom LoRAs for style enhancement. LTX-2 compatible LoRAs only.
How to Train Your Own LoRA Model: Complete Guide to Creating AI Influencers
Watch the full video tutorial above or follow the step-by-step guide below Why LoRA Training Matters for Professional AI Content Training your own LoRA (Low-Rank Adaptation) model is essential when…
🎬 Music Video Creator
Create viral lip-synced music videos with the power of AI! Upload your audio track, generate stunning visuals for each beat, and export a professional music video in minutes.
- Auto beat detection & smart segmentation
- AI lip-sync for singing characters
- One-click merge into final video
- Perfect for TikTok, YouTube & Reels
Your Voice Matters
We're constantly improving Kitty AI Studio based on your feedback. Whether it's a bug, a feature request, or just a thank you - we'd love to hear from you!