LTX 2.3

Why LTX 2.3 Is the Sharpest AI Video Generator Right Now

Rebuilt latent space with updated VAE - Fine textures, hair, edge lines, and on-screen text stay sharp through the full generation pipeline.

4x larger text connector - Complex prompts with multiple subjects, spatial layout, and style instructions now resolve accurately, not approximately.

Stronger image-to-video motion - Less freezing, less Ken Burns drift. Input frames anchor the generation so subjects move naturally and stay visually consistent.

Cleaner audio - A new vocoder and filtered training data cut artifacts and sync mismatches across both text-to-video and audio-conditioned workflows.

Native portrait up to 1080×1920 - Trained on vertical data, not cropped from landscape. Ready for TikTok, Reels, and Shorts without post-crop quality loss.

Up to 20-second clips - Enough duration for product demos, short ads, and narrative scenes in a single generation.

Full LTX-2 capability set, upgraded - Every LTX-2 feature carries forward with engine-level improvements in detail, motion, audio, and prompt fidelity.

Sharper Fine Detail - The VAE Rebuild That Changes Everything

LTX 2.3 rebuilds its latent space with an updated Variational Autoencoder (VAE) trained on higher-quality data. The practical difference shows up immediately in outputs: fine hair texture, fabric weave, product surface gloss, and small on-screen text all pass through the generation pipeline without softening or smearing.

Prompt	Output Video
Pixar-style 3D girl with brown braids, white tee, jeans, red sneakers, leaping toward camera, fisheye lens, medieval European timber town, bright sunny day, playful adventurous vibe, 4K

Tighter Prompt Adherence - The 4x Text Connector Upgrade

Most AI video generators struggle with specificity. Mention three subjects in different positions doing different actions, and the model collapses them into something approximate. The LTX 2.3 video generator uses a 4x larger text connector that resolves multi-subject prompts, spatial relationships, and stylistic instructions accurately.

Prompt	Output Video
A 6-second fantasy animation of a multicolored acrylic paint-splash cat (red, blue, yellow, orange, purple, green) walking on a white canvas. The cat leaps and splatters paint droplets (0-2s), spins mid-air and dissolves into swirling color streams (2-4s), then the paint converges and coils into a glossy paint-splash flower (4-6s). Fluid paint physics, vibrant saturated colors, cartoonish style, white background, 8K, whimsical atmosphere.
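The sample prompt above follows a simple pattern: a subject, a run of actions each tagged with a time range, then style notes. A minimal sketch of assembling prompts in that shape is shown here. The `Beat` type and `build_timed_prompt` function are hypothetical helpers written for illustration, not part of any Dzine or LTX interface; the only figure taken from this page is the 20-second clip limit. The result is plain text to paste into the prompt field.

```python
from dataclasses import dataclass

MAX_CLIP_SECONDS = 20  # clip-length ceiling stated on this page


@dataclass(frozen=True)
class Beat:
    """One timed action in the clip (hypothetical helper type)."""
    start: int  # seconds from the start of the clip
    end: int    # seconds from the start of the clip
    action: str


def build_timed_prompt(subject: str, beats: list[Beat], style: str) -> str:
    """Join a subject, ordered timed beats, and style notes into one prompt."""
    if not beats:
        raise ValueError("at least one beat is required")
    previous_end = 0
    for beat in beats:
        # Beats must be contiguous so no stretch of the clip is left unspecified.
        if beat.start != previous_end or beat.end <= beat.start:
            raise ValueError(f"beat {beat.start}-{beat.end}s breaks the timeline")
        previous_end = beat.end
    if previous_end > MAX_CLIP_SECONDS:
        raise ValueError(f"clip runs {previous_end}s, over the {MAX_CLIP_SECONDS}s limit")
    timeline = ", ".join(f"{b.action} ({b.start}-{b.end}s)" for b in beats)
    return f"A {previous_end}-second {subject}. {timeline}. {style}"


prompt = build_timed_prompt(
    subject="fantasy animation of a multicolored acrylic paint-splash cat on a white canvas",
    beats=[
        Beat(0, 2, "The cat leaps and splatters paint droplets"),
        Beat(2, 4, "spins mid-air and dissolves into swirling color streams"),
        Beat(4, 6, "then the paint converges into a glossy paint-splash flower"),
    ],
    style="Fluid paint physics, vibrant saturated colors, cartoonish style.",
)
print(prompt)
```

Checking that the beats are contiguous and fit within the clip limit catches timeline gaps before a generation is spent on them.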

Stronger Image-to-Video - Real Motion, No More Frozen Frames

The image-to-video problem in earlier models came in two forms: subjects that barely moved (the Ken Burns effect), or subjects that moved but lost visual consistency with the source frame. The LTX 2.3 video generator addresses both: input frames anchor the generation, so subjects move naturally and stay visually consistent.

Start Frame	End Frame	Prompt	Output Video
		A 6-second whimsical animation: a glossy acrylic paint-splash cat (red, blue, yellow, orange, purple, green) walks on a white canvas, leaps and splatters paint droplets (0-2s), spins and dissolves into swirling color streams (2-4s), which converge and solidify into a bright paint-splash flower with a green stem (4-6s). Fluid paint physics, vibrant colors, cartoonish style, 8K, soft white background.

Cleaner Audio - New Vocoder, Better Training Data

Audio quality in AI video generation is often the weakest point. Clicks, pops, misaligned dialogue, and inconsistent environmental sound degrade the final output even when visuals are strong. LTX 2.3 addresses this with a new vocoder and filtered training data, which together cut artifacts and sync mismatches in the output.

Prompt	Output Video
A 5-second cinematic video of luxury perfume "CELESTIAL BLOSSOM" on a soft beige background. Blue delphinium petals float gently (0-1s), accelerate and swirl toward the center forming a bottle silhouette (1-3s), then solidify into transparent glass filled with golden liquid, black cap and gold lettering appear (3-4s). Final shot: the complete bottle stands elegantly with a few petals floating around it in warm light (4-5s). Soft focus, golden lighting, magical particle physics, dreamy elegant style, 8K.

Native Portrait Mode - Built for Vertical Video From the Start

The LTX 2.3 video generator produces portrait video at up to 1080×1920 resolution using training data collected in vertical orientation. This is different from cropping a landscape video: composition, subject framing, and motion behavior are all designed for portrait from the beginning.

Prompt	Output Video
A smiling man with curly hair and a woman ride a yellow vintage Vespa scooter through a bustling Asian alleyway, warm golden hour light, hanging red lanterns, fruit stalls, cobblestone street, cinematic bokeh, joyful travel vibe, 8K

FAQ

What is LTX 2.3?

LTX 2.3 is Lightricks' latest AI video generation model, available on Dzine. It features a rebuilt VAE for sharper fine detail, a 4x larger text connector for better prompt adherence, stronger image-to-video motion, cleaner audio with a new vocoder, and native portrait video support up to 1080×1920. It generates continuous clips up to 20 seconds and builds on the full LTX-2 capability set with engine-level improvements.

How does LTX 2.3 differ from LTX-2?

LTX 2.3 upgrades LTX-2 at the engine level. The VAE is rebuilt for sharper texture and edge detail. The text connector is 4x larger, enabling accurate rendering of complex multi-subject prompts. Image-to-video motion is stronger with less freezing and drift. The audio system uses a new vocoder with filtered training data for fewer artifacts and better sync. Portrait mode is now natively supported at up to 1080×1920. All existing LTX-2 capabilities carry forward with these improvements.

Can I use LTX 2.3 for commercial projects?

Yes. All videos generated with the LTX 2.3 video generator on Dzine are watermark-free and cleared for commercial use. This includes advertising campaigns, product demonstrations, social media marketing, client deliverables, and broadcast distribution.

What makes LTX 2.3's image-to-video better than earlier versions?

The LTX 2.3 AI video generator reduces two common image-to-video problems: subjects that barely move (the Ken Burns effect) and subjects that drift away from the source frame's appearance. Input frames now anchor generation, so characters and objects retain their look throughout the clip while moving naturally.

Does LTX 2.3 support portrait video for TikTok and Instagram Reels?

Yes. The LTX 2.3 video generator supports native portrait output at resolutions up to 1080×1920. The model was trained on vertical-orientation data, so portrait videos are not cropped from landscape: composition, subject framing, and motion behavior are all optimized for vertical format from the start.

How long can videos be with LTX 2.3?

LTX 2.3 generates continuous video clips up to 20 seconds in length. This is sufficient for product demos, short social ads, branded content sequences, and narrative scenes that would otherwise require multiple clips stitched together.

How does LTX 2.3's audio compare to other AI video models?

LTX 2.3 uses a new vocoder and filtered training data that significantly reduce audio artifacts, unexpected dropouts, and sync mismatches. Sound effects land on the correct frame, dialogue aligns with lip movements more accurately, and ambient audio transitions smoothly between scene changes. This applies to both text-to-video and audio-conditioned generation workflows.

Start Creating with LTX 2.3 on Dzine Today

LTX 2.3 raises the baseline for what an AI video generator should deliver: sharper detail, accurate prompts, real motion, cleaner audio, and native vertical video in one model. Create production-ready content on Dzine without expensive equipment, post-production work, or technical expertise. Whether you need polished social ads, product showcase clips, or branded story videos, the LTX 2.3 video generator gives you the output quality your projects deserve.

How to Use LTX 2.3 on Dzine

Step 1: Upload Image & Enter Prompt

Step 2: Select LTX 2.3 as the Model

Step 3: Generate & Download Your Video

Why Try LTX 2.3 on Dzine AI?

Advanced AI Models

One Click to Generate

Free Trial

High Quality Results Export

Watermark Free

Online Platform, No Download Needed

What Our Users Said

The Detail Quality on LTX 2.3 Finally Matches Our Product Standards

Prompt Accuracy Means I Actually Get What I Write

Native Portrait Mode Solved My Entire Short-Form Workflow
