🐱 Creative Studio
Tip: Right-click and "Open in new tab" to run multiple workflows simultaneously without losing progress.
Seedance 2.0 Text-to-Video
ByteDance Seedance 2.0 text-to-video. Choose basic (480p, fast) or high (HD, premium). Optional watermark removal.
Seedance 2.0 Image-to-Video
Animate images with Seedance 2.0. Basic (480p) or high quality. Up to 15 seconds.
Seedance 2.0 Video Edit
Edit videos with text instructions. Object replacement, style transfer. Use @image1 in prompt to reference uploaded images.
Seedance 2.0 Omni Reference
Generate character-consistent videos from reference images/videos/audio. Use @image1 in prompt.
Wan 2.7 Text-to-Video
Generate high-quality video with audio from text prompts. Multi-shot narrative, audio-video sync. 720P or 1080P, up to 15 seconds.
Wan 2.7 Image-to-Video
Generate video from a first frame or first+last frames, with audio sync or video continuation. Multi-shot narrative. 720P or 1080P, up to 15 seconds.
Wan 2.7 Reference-to-Video
Create videos with consistent characters from reference images or videos. Up to 5 references, multi-character interaction, voice timbre replication. 720P or 1080P.
Wan 2.7 Video Editing
Edit videos with text instructions. Style transfer, object replacement, scene changes. Optional reference images. 720P or 1080P.
Nano Banana 2
Gemini 3.1 Flash Image — pro-level visual intelligence with Flash-speed efficiency. Edit with up to 14 reference images or generate from text.
Wan 2.7 Image
Generate and edit images with Wan 2.7. Up to 9 input images for editing, fusion, style transfer, and more. Standard model, up to 2K.
Wan 2.7 Pro Image Edit
Professional image editing with Wan 2.7 Pro. Thinking mode for better composition, 4K support, up to 9 input images.
SVI Pro - Extended Video
Create high-quality extended videos up to 30 seconds! Improved WAN 2.2 models for superior quality.
Kling 3.0
Kuaishou Kling 3.0 — HD/1080p video with native audio, multi-shot storyboarding, character consistency. 3-15 seconds.
Kling 3.0 Motion Control
Transfer motion from reference video to character image. Dance, choreography, character animation.
Instagirl Aesthetic
Generate images with an Instagram-perfect aesthetic. Optimized for portraits in the "Instagirl" style.
WAN Realistic Image
Generate highly realistic images with optimized settings and a special realism-focused LoRA.
Nano Banana Pro + Imagen 4
Nano Banana Pro for editing, Imagen 4 for generation. Google DeepMind's latest image AI.
Veo 3.1
Google DeepMind's advanced video generation. T2V, I2V, and First/Last Frame modes. $0.10-0.40/sec.
Qwen Text to Image
Generate stunning photorealistic images with Qwen-Image-2512, the latest text-to-image model from Alibaba. Features enhanced human realism, finer natural details, and improved text rendering. December 2025 update with significant quality improvements.
Qwen Text to Image Realistic
Highly realistic Qwen text-to-image workflow with a special formula that makes your AI influencer images sharp, prompt-coherent, and photoreal.
WAN Long Video Enhancer
Enhance videos up to 30 seconds with smart batch processing and seamless frame blending.
Z-Image Text to Image
Two-stage Z-Image Base generation. Get both base and detailed refined outputs. Up to 3 custom LoRAs. Full control over denoise, steps, and CFG.
Z-Image from Reference
Generate images based on a reference image. AI analyzes the reference and creates prompt automatically. Up to 3 custom LoRAs. Full control over denoise, steps, and CFG.
WAN Animate
Transfer motion from any video to your image. Perfect for dancing, walking, or complex movements.
WAN First Frame Last Frame
Create smooth video transitions that morph from the first frame to the last frame.
Wan 2.2 Image to Video (Fast)
Fast image-to-video with optional LoRA (Wan 2.2) and frame interpolation for smoother motion.
WAN 2.2 Long Video (Basic)
Create longer videos up to 30 seconds. For better consistency, try SVI WAN 2.2 Extended Video.
WAN SCAIL Dance Video
Transfer dance moves from a reference video to your character using SCAIL pose detection. Supports 1-10 people. ⚠️ SLOW (30 min - 2h)
WAN 2.2 Text to Image
Generate high-quality images from text prompts with two optional custom LoRA slots (WAN 2.2 High/Low Noise).
Z-Image Turbo
Ultra-fast image generation with optional upscale. Results in seconds!
InfiniteTalk - Image to Video
Generate talking head videos from a face image and audio. Max 7 min audio, 1024px image. Powered by Wan 2.1 InfiniteTalk.
InfiniteTalk - Video to Video
Transform a video into a talking head synced to audio. V2V with color matching. Max 7 min audio.
Z-Image Base + Turbo
Two-stage Z-Image. Get both base and turbo-refined outputs with LoRA presets and up to 3 custom LoRAs.
FLUX Upscale
Upscale images using the FLUX model for high-quality results.
SeedVR2 4K Upscale
Upscale images to 4K resolution using the SeedVR2 model.
SeedVR2 Simple Upscale
Quick image upscaling with SeedVR2 for everyday use.
WAN Image Enhancer
Enhance and upscale images with optional custom LoRA for style control.
Wan 2.2 Video Enhancer
Enhance video quality. Upscale resolution and boost details frame by frame.
SeedVR2 HD Video
Upscale videos to HD resolution using the SeedVR2 model.
Qwen Image Edit
Edit images with text instructions using the Qwen AI model.
Qwen Change Clothes
Change clothes on people in images with a consistent LoRA style.
Film Grain Effect
Add authentic film grain texture to your images. Adjust intensity and saturation for a vintage look.
Frame Interpolation
Auto-detect and double your video frame rate using RIFE AI interpolation. Smoother motion!
LTX 2.3 Text or Image to Video
Generate high-quality video from text or an image using the LTX 2.3 22B model. Native 24fps, up to 20 seconds. From $0.26.
LTX 2.3 Audio Sync I2V
Create audio-synced video from an image with lip-sync support. Choose between talking or singing mode for realistic mouth movements. Supports custom LoRAs. From $0.29.
FireRed Image Edit 1.1
Open-source text-guided image editing with state-of-the-art identity consistency. Upload 1-3 reference images and describe the edit. Supports clothing changes, style transfer, makeup, photo restoration, virtual try-on, and more. 20B parameter model by Xiaohongshu/RedNote.
Wan 2.6 Text-to-Video
Alibaba's latest Wan 2.6 model. Generate 5-15 sec videos from text with optional audio and lip-sync.
Wan 2.6 Image-to-Video
Transform your image into smooth video with Wan 2.6. Add custom audio or auto-dub. 720P/1080P, up to 15 seconds.
Multiple Character Angles
Generate 8 different camera angles (including close-up, wide, 45°, 90°, aerial, and low angle) from a single character image using Qwen AI.
Qwen Inpainting
Paint over areas you want to change, then describe what should replace them. Perfect for object removal, replacement, or adding new elements.
Qwen Upscale
High-definition upscaling trained on Qwen-Image-Edit-2511. Losslessly enlarges images to approximately 2K size. Add your own LoRA for custom styles.
Qwen Camera Angle
Generate different camera angles of your image using interactive 3D controls. Adjust horizontal angle (0-360°), vertical angle (-30° to 60°), and zoom level.
Qwen Multi Camera Angles
Generate 6 different camera angles of your image at once. Configure each angle with horizontal, vertical, and zoom controls for comprehensive character sheets or product views.
AI Green Screen
Remove the background from any video using AI matting. Outputs a green-screen video with clean edges.
Video Upscale & Detail Restore
FlashVSR-powered video detail restoration. Restores hair, skin, textures while preserving face identity. Optional 2x upscale.
RTX AI Upscale
NVIDIA RTX Video & Image AI Upscaler — powered by RTX Video Super Resolution. Upscale videos up to 4x and images to ultra-high resolution.
Magi-Human
daVinci Magi-Human — audio-driven human animation. Generate realistic talking and dancing humans from text, image, or audio.
LTX 2.3 First & Last Frame
Generate smooth video transitions between two images with LTX 2.3.
How to Train Your Own LoRA Model: Complete Guide to Creating AI Influencers
Watch the full video tutorial above or follow the step-by-step guide below. Why LoRA Training Matters for Professional AI Content: Training your own LoRA (Low-Rank Adaptation) model is essential when…
🎬 Music Video Creator
Create viral lip-synced music videos with the power of AI! Upload your audio track, generate stunning visuals for each beat, and export a professional music video in minutes.
- Auto beat detection & smart segmentation
- AI lip-sync for singing characters
- One-click merge into final video
- Perfect for TikTok, YouTube & Reels
Your Voice Matters
We're constantly improving Kitty AI Studio based on your feedback. Whether it's a bug, a feature request, or just a thank you - we'd love to hear from you!