| This model was trained on 8,000 video pairs, and training is still ongoing for a few thousand more steps. It is still experimental and was not trained with a fully professional production target, and the model may be updated without notice as new checkpoints are released. The current goal is not final polished production quality, but to explore:
The model was trained around four main prompt patterns: Add, Remove, Replace, and Convert / Style.

Workflow URL:

Model URL: ltx23_edit_anything_global_rank128_v1_9000steps_adamw.safetensors · Alissonerdx/LTX-LoRAs at main

One important setting during inference is CFG. A good starting point is a distilled setup with CFG = 1. If the edit feels too weak or the model is not following the prompt well enough, increasing CFG can be the key. In some cases, increasing the distill LoRA strength to around 1.2 can also help.

The workflow is also not fully optimized yet. It still needs more testing to find the best combination of settings.
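For anyone wondering why CFG = 1 is the natural baseline and why raising it strengthens the edit, here is a minimal sketch of the standard classifier-free guidance blend. It is not taken from this workflow: the function name `cfg_combine` and the toy lists standing in for model outputs are assumptions for illustration only, and the actual sampler may apply guidance differently.

```python
# Minimal sketch of the usual classifier-free guidance (CFG) blend.
# Hypothetical helper, not part of the LTX workflow or any library.

def cfg_combine(cond, uncond, scale):
    """Blend prompt-conditioned and unconditioned predictions.

    cond   : model output with the edit prompt (toy list of floats here)
    uncond : model output without the prompt / with the negative prompt
    scale  : the CFG value
    """
    # result = uncond + scale * (cond - uncond), element by element
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


cond = [1.0, 2.0]
uncond = [0.0, 1.0]

# At CFG = 1 the result is exactly the conditioned prediction.
print(cfg_combine(cond, uncond, 1.0))

# At CFG = 2 the result is pushed further away from the unconditioned one.
print(cfg_combine(cond, uncond, 2.0))
```

With a scale of 1 the unconditioned term cancels out, which is why distilled setups are usually run there. A higher scale exaggerates the difference the prompt makes, so it can help when the edit is too weak, usually at the cost of extra compute and a higher risk of artifacts.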
It may also be interesting to combine this model with other models and see what kinds of results emerge. If you can test it, please share your findings. Feedback on prompt behavior, edit strength, consistency, style transfer, and failure cases would be very helpful while training is still in progress.

Example prompts:

Add a small, brown dog dancing in the foreground next to the woman.
Remove the blue car in the background of the scene.
Add a wide, genuine smile to the person’s face.
Replace the person’s clothing with a dark blue hoodie and gray sweatpants.

submitted by /u/Round_Awareness5490 |