|
|
官方提示词优化指南
LTX-2.3 introduces major improvements to detail, motion, prompt understanding, audio reliability, and native portrait support.
LTX-2.3 在细节、动作、提示理解、音频可靠性和原生肖像支持方面带来了重大改进。
This isn’t just a model update. It changes how you should prompt.
这不仅仅是一个模型更新。它改变了你应该如何提示的方式。
Here’s how to get the most out of it.
这是如何充分利用它的方法。
1. Be More Specific. The Engine Can Handle It.
1. 更加具体。引擎可以处理。
LTX-2.3 includes a larger, more capable text connector. It interprets complex prompts more accurately, especially when they include:
LTX-2.3 包含一个更大、更强大的文本连接器。它更准确地解释复杂提示,尤其是在它们包含以下内容时:
Multiple subjects 多个主题
Spatial relationships 空间关系
Stylistic constraints 风格约束
Detailed actions 详细动作
Previously, simplifying prompts improved consistency.
之前,简化提示改善了一致性。
Now, specificity wins.
现在,具体性取胜。
Instead of: 而不是:
A woman in a café
咖啡馆里的女人
Try: 尝试:
A woman in her 30s sits by the window of a small Parisian café. Rain runs down the glass behind her. Warm tungsten interior lighting. She slowly stirs her coffee while glancing at her phone. Background softly out of focus.
一位三十多岁的女士坐在巴黎一家小咖啡馆的窗边。雨珠顺着她身后的玻璃滑落。温暖的钨丝室内照明。她慢慢搅动着咖啡,同时瞥了一眼手机。背景模糊不清。
The creative engine drifts less. Use that.
创意引擎漂移减少。使用那个。
2. Direct the Scene, Don’t Just Describe It
2. 指导场景,不只是描述它
LTX-2.3 is better at respecting spatial layout and relationships.
LTX-2.3 更擅长尊重空间布局和关系。
Be explicit about:
明确说明:
Left vs right 左与右
Foreground vs background
前景与背景
Facing toward vs away
面向与背向
Distance between subjects
主体之间的距离
Instead of: 而不是:
Two people talking outside
两个人在外面交谈
Try: 尝试:
Two people stand facing each other on a quiet suburban sidewalk. The taller man stands on the left, hands in pockets. The woman stands on the right, holding a bicycle. Houses blurred in the background.
两个人站在安静的郊区人行道上,面对面站立。较高的男士站在左边,手插在口袋里。女士站在右边,手持自行车。背景中的房屋模糊不清。
Block the scene like a director.
像导演一样构图这个场景。
3. Describe Texture and Material
3. 描述纹理和材质
With a rebuilt latent space and updated VAE, fine detail is sharper across resolutions.
通过重建的潜在空间和更新的 VAE,不同分辨率下的细节更加清晰。
So describe: 所以描述:
Fabric types 织物类型
Hair texture 发质
Surface finish 表面光泽
Environmental wear 环境磨损
Edge detail 边缘细节
Example: 示例:
Close-up of wind moving through fine, curly hair. Individual strands visible. Soft afternoon backlight catching edge detail.
风穿过细长卷曲的头发时的特写。可见单独的发丝。柔和的午后逆光捕捉到边缘细节。
You should need less compensation in post.
你不需要在后期支付那么多补偿。
4. For Image-to-Video, Use Verbs
4. 对于图像到视频,使用动词
One of the biggest upgrades in 2.3 is reduced freezing and more natural motion.
2.3 版本最大的升级之一是减少了冻结效果,并增加了更自然的动态效果。
But motion still needs clarity.
但动态效果仍然需要清晰度。
Avoid: 避免:
The scene comes alive
场景变得生动
Instead: 改为:
The camera slowly pushes forward as the subject turns their head and begins walking toward the street. Cars pass.
镜头缓缓向前推进,随着主体转动头部开始走向街道。车辆驶过。
Specify: 指定:
Who moves 谁移动
What moves 什么移动
How they move 他们如何移动
What the camera does
相机的作用
Motion is driven by verbs.
运动由动词驱动。
5. Avoid Static, Photo-Like Prompts
5. 避免静态、照片般的提示
If your prompt reads like a still image, the output may behave like one.
如果你的提示读起来像一幅静止的图像,输出可能会表现得像一幅。
Instead of: 而不是:
A dramatic portrait of a man standing
一个男子站立的戏剧性肖像
Try: 尝试:
A man stands on a windy rooftop. His coat flaps in the wind. He adjusts his collar and steps forward as the camera tracks right.
一个男人站在一个多风的屋顶上。他的外套在风中飘动。他调整领子并向前走,而摄像机向右移动。
Action reduces static outputs.
动作减少静态输出。
6. Design for Native Portrait
6. 为原生肖像设计
LTX-2.3 supports native vertical video up to 1080x1920, trained on vertical data.
LTX-2.3 支持原生竖屏视频,最高可达 1080x1920,基于竖屏数据训练。
When generating portrait content, compose for vertical intentionally.
生成肖像内容时,有意采用竖屏构图。
Example: 示例:
Influencer vlogging while on holiday.
度假期间网红进行视频博客。
Don’t treat vertical as cropped landscape. Frame for it.
不要将竖屏视为裁剪的横屏。为其构图。
7. Be Clear About Audio
7. 明确音频要求
The new vocoder improves reliability and alignment.
新的声码器提高了可靠性和一致性。
If you want sound, describe it:
如果你想要声音,请描述它:
Environmental audio 环境音效
Tone and intensity 音调和强度
Dialogue clarity 对话清晰度
Example: 示例:
A low, pulsing energy hum radiates from the glowing orb. A sharp, intermittent alarm blares in the background, metallic and urgent, echoing through the spacecraft interior.
一个发光的球体发出低沉的脉动能量嗡嗡声。背景中尖锐的间歇性警报声金属般急促,在航天器内部回荡。
Specific inputs produce more controlled outputs.
具体的输入能产生更受控的输出。
8. Unlock More Complex Shots
8. 解锁更复杂的镜头
Earlier checkpoints rewarded simplicity.
早期的检查点奖励简洁性。
LTX-2.3 rewards direction.
LTX-2.3 奖励方向性。
With significantly stronger prompt adherence and improved visual quality, you can now design more ambitious scenes with confidence.
随着显著增强的提示遵循能力和改进的视觉质量,你现在可以更有信心地设计更雄心勃勃的场景。
ou can: 你可以:
Layer multiple actions within a single shot
在单次拍摄中叠加多个动作
Combine detailed environments with character performance
将精细的环境与角色表现相结合
Introduce precise stylistic constraints
引入精确的风格限制
Direct camera movement alongside subject motion
与主体运动同步的相机移动
The engine holds structure under complexity. It maintains spatial logic. It respects what you ask for.
引擎在复杂情况下保持结构。它维持空间逻辑。它尊重你的要求。
LTX-2.3 is sharper, more faithful, and more controllable.
LTX-2.3 更加锐利、忠实且可控。 |
|