置顶推文

I spent $30k and 3 months RL post-training an anime video model.
This is only step 30 out of a planned 1000 step run.
All samples are local text-to-video with no reference image/audio. Since it's based on LTX-2.3, each output takes under a minute on a single GPU.
I'm 19 and a solo researcher. Most of the budget went into ablations, reward design, and trying different configurations before reaching this setup.
The run is still extremely early, but the results already look much better than I expected.
It's compute-limited, not idea-limited.
I'm starting a company to continue scaling this and build frontier stylized video models.
If you're an investor, compute partner, video team, or someone who wants to help build this, DMs are open.
English








