How to Use LTX Video 2.3: Create AI Videos in Minutes

LTX Video 2.3 turns a text prompt into a polished video clip. You don't need a camera or editing software, and if you use a hosted app you don't need your own GPU either. If you've been curious about AI video but didn't know where to start, this is the guide.

What Is LTX Video 2.3?

LTX Video 2.3 is an open-source AI video model built by Lightricks. It generates video from text, images, audio, or existing video using a 22-billion-parameter diffusion transformer.

Version 2.3 is a meaningful upgrade over its predecessors. Fine details such as hair, textures, and edges render more sharply. Complex prompts are followed more accurately thanks to a rebuilt text connector that's four times larger than before. It also supports native portrait video (9:16), so clips can go straight to TikTok, Reels, and Shorts without cropping.

The easiest way to use it is ltx-23.app. No GPU, no install, no API key setup. Create an account, get free credits, and start generating in under two minutes.

Step 1: Choose Your Generation Mode

LTX Video 2.3 supports four input modes:

Text to Video — describe a scene, get a clip
Image to Video — upload a still photo, animate it
Audio to Video — feed a voice or soundtrack, get synchronized video with lip-sync
Video to Video — transform or restyle an existing clip

Most beginners start with text-to-video. Pick the mode that fits your project.

Step 2: Write a Strong Prompt

This is where most people go wrong. Short, vague prompts produce generic results.

Write 4–6 sentences in present tense. Include the shot type ("close-up," "wide establishing shot"), lighting and atmosphere ("golden hour," "soft fog"), what moves and how, camera movement ("slow dolly in," "handheld track"), and audio if it matters.

LTX 2.3's text connector handles complexity that earlier versions couldn't. Use it. A prompt like "a woman walks" is wasted potential. "A woman in her late 30s walks through a rain-soaked Tokyo street at dusk, handheld camera following from behind, reflections shimmering on wet pavement, faint jazz from a nearby bar" — that produces something worth watching.
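
If you write a lot of prompts, it can help to keep the ingredients separate and assemble them at the end. Here is a minimal Python sketch of that idea. The ShotPrompt class and its field names are made up for illustration. They are not part of LTX Video or ltx-23.app, and the output is just a text string you would paste into the prompt box.

```python
from dataclasses import dataclass


@dataclass
class ShotPrompt:
    """Hypothetical container for the prompt ingredients described above."""
    shot_type: str   # e.g. "Wide establishing shot"
    subject: str     # who or what is on screen, and where
    action: str      # what moves and how, in present tense
    lighting: str    # lighting and atmosphere
    camera: str      # camera movement
    audio: str = ""  # optional; leave empty if sound does not matter

    def render(self) -> str:
        """Join the non-empty parts into one paragraph, one sentence each."""
        opening = f"{self.shot_type} of {self.subject}"
        parts = [opening, self.action, self.lighting, self.camera, self.audio]
        sentences = [p.strip().rstrip(".") + "." for p in parts if p.strip()]
        return " ".join(sentences)


prompt = ShotPrompt(
    shot_type="Medium tracking shot",
    subject="a woman in her late 30s on a rain-soaked Tokyo street at dusk",
    action="She walks away from the camera past glowing shop signs",
    lighting="Reflections shimmer on the wet pavement",
    camera="A handheld camera follows from behind",
    audio="Faint jazz drifts from a nearby bar",
).render()
print(prompt)
```

Keeping each ingredient in its own field also makes it obvious when one is missing, which is the usual cause of a generic result.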

Step 3: Set Your Parameters

On ltx-23.app, you control:

Duration: 4 to 20 seconds per clip
Resolution: 1080p, 1440p, or 4K
Aspect ratio: 16:9 for landscape, 9:16 for vertical social content
Mode: Fast Flow for quick iteration, Pro Flow for final output

Start with Fast Flow. It renders faster, so you can test your prompt direction without burning credits on a bad take.
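
To keep your settings inside the ranges listed above, you could note them down in a small checked structure before you generate. The sketch below is illustrative only: ClipSettings, its field names, and the "fast"/"pro" labels are hypothetical and are not an ltx-23.app API. The pixel sizes assume each resolution label names the short side of a standard frame, which this guide does not confirm.

```python
from dataclasses import dataclass
from typing import Tuple

# Values copied from the parameter list above.
RESOLUTIONS = {"1080p": 1080, "1440p": 1440, "4K": 2160}  # short side, assumed
ASPECT_RATIOS = {"16:9": (16, 9), "9:16": (9, 16)}
MODES = ("fast", "pro")  # stand-ins for Fast Flow and Pro Flow


@dataclass(frozen=True)
class ClipSettings:
    """Hypothetical settings record; rejects values outside the listed options."""
    duration_s: int = 4
    resolution: str = "1080p"
    aspect_ratio: str = "16:9"
    mode: str = "fast"

    def __post_init__(self) -> None:
        if not 4 <= self.duration_s <= 20:
            raise ValueError("duration must be between 4 and 20 seconds")
        if self.resolution not in RESOLUTIONS:
            raise ValueError(f"unknown resolution: {self.resolution}")
        if self.aspect_ratio not in ASPECT_RATIOS:
            raise ValueError(f"unknown aspect ratio: {self.aspect_ratio}")
        if self.mode not in MODES:
            raise ValueError(f"unknown mode: {self.mode}")

    def frame_size(self) -> Tuple[int, int]:
        """Return (width, height) in pixels under the short-side assumption."""
        short_side = RESOLUTIONS[self.resolution]
        w, h = ASPECT_RATIOS[self.aspect_ratio]
        long_side = short_side * max(w, h) // min(w, h)
        return (long_side, short_side) if w > h else (short_side, long_side)


draft = ClipSettings()  # 4 s, 1080p, landscape, fast mode
vertical = ClipSettings(duration_s=8, resolution="4K", aspect_ratio="9:16", mode="pro")
print(draft.frame_size(), vertical.frame_size())
```

The defaults are the shortest clip in fast mode, which matches the advice to test your prompt direction before committing to a final render.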

Step 4: Iterate

Your first generation rarely nails it. That's normal.

Change one variable at a time — tweak the camera angle, adjust the lighting, sharpen the action description. When the direction feels right, switch to Pro Flow for your final render.
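
One way to stay disciplined about changing a single variable is to write out the variations before you generate anything. This is a rough Python sketch of that bookkeeping. The function name and the dictionary keys are invented for the example, and it only builds prompt ingredients. It does not call any service.

```python
def one_change_variants(base: dict, alternatives: dict) -> list:
    """Return copies of `base` that each differ from it in exactly one field."""
    variants = []
    for field, options in alternatives.items():
        if field not in base:
            raise KeyError(f"{field!r} is not a field of the base prompt")
        for option in options:
            if option != base[field]:  # skip options identical to the baseline
                variants.append({**base, field: option})
    return variants


base = {
    "camera": "slow dolly in",
    "lighting": "golden hour",
    "action": "a woman walks through a quiet market",
}
alternatives = {
    "camera": ["handheld track"],
    "lighting": ["soft fog", "golden hour"],
}

variants = one_change_variants(base, alternatives)
for variant in variants:
    changed = [key for key in base if variant[key] != base[key]]
    print(changed, variant)
```

Because every variant differs from the baseline in one field only, you can tell which change caused the difference you see in the result.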

What LTX 2.3 Does Best

Cinematic compositions with intentional lighting
Emotional facial expressions and subtle movement
Atmospheric settings — fog, rain, reflections, golden hour
Stylized aesthetics like noir, analog film, or fashion editorial
Characters speaking, singing, or reacting with lip-sync

What to Avoid

Readable text rarely renders cleanly in generated video. Chaotic group scenes with many characters tend to fall apart. Don't describe internal emotions — describe the physical expression instead. "Furrowed brow, tight jaw" works. "She felt worried" doesn't.
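
If you want a quick check for the last point, a simple word filter can flag sentences that state a feeling instead of showing it. The Python sketch below is a rough heuristic under obvious assumptions: the word list is illustrative and far from complete, the function name is made up, and it will miss plenty of cases and flag some harmless ones.

```python
import re

# Illustrative, non-exhaustive list of verbs that describe an inner state.
INTERNAL_STATE = re.compile(
    r"\b(feels?|felt|feeling|thinks?|thought|wonders?|wondered|hopes?|hoped)\b",
    re.IGNORECASE,
)


def flag_internal_states(prompt: str) -> list:
    """Return the sentences in `prompt` that contain an inner-state verb."""
    sentences = re.split(r"(?<=[.!?])\s+", prompt.strip())
    return [s for s in sentences if INTERNAL_STATE.search(s)]


flagged = flag_internal_states(
    "She felt worried. Her brow is furrowed and her jaw is tight."
)
print(flagged)
```

Rewrite any flagged sentence as something a camera could actually see.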

Get Started

Head to ltx-23.app. You get free credits on signup, and you don't need a GPU of your own. The learning curve is short: most users have their first solid clip within a few iterations.