
Qx

@ConvixionAI

Joined December 2025

16 Following · 0 Followers

Qx @ConvixionAI · 16 May

@Michaelzsguo why don’t you tell us lmao it’s your post


Michael Guo @Michaelzsguo · 16 May

Did the agent accomplish anything in those 13 hours? x.com/Michaelzsguo/s…

Michael Guo @Michaelzsguo

Totally fair. The 13 hours wasn’t “one prompt thinking really hard,” it was an autonomous loop doing the unglamorous work:

- set up the local fine-tuning project
- generated and labeled training data (with a locally hosted Qwen model)
- found bad labels and built review/adjudication files
- trained multiple MLX LoRA checkpoints
- ran evals after each one
- diagnosed failure modes like reject→revise confusion and false positives
- built new hard-example datasets from those failures
- kept notes/plans/checkpoints so the work could resume instead of vanish

So the result wasn’t “it solved everything in one shot.” The result was: we went from a vague local-model fine-tuning idea to a working training/eval pipeline, several checkpoints, clear metrics, and a much better understanding of what data the judge needs next.

That’s the kind of long-running agent work I actually want: not magic, but steady progress with receipts.
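The loop described in the post above (train a checkpoint, evaluate it, turn failures into hard examples, persist state so the work can resume) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the author's code: `train_checkpoint`, `evaluate`, and the JSON state layout are hypothetical stand-ins, and no real MLX or LoRA calls are made.

```python
import json
from pathlib import Path


def train_checkpoint(round_idx, hard_examples):
    # Hypothetical stand-in for a LoRA training run; returns a checkpoint name.
    return f"ckpt-{round_idx:03d}"


def evaluate(checkpoint, eval_set):
    # Hypothetical stand-in for an eval run: returns the items still failing.
    # For illustration, each later checkpoint fails on one fewer item.
    round_idx = int(checkpoint.split("-")[1])
    return eval_set[round_idx + 1:]


def load_state(path):
    # Resume from the saved state if it exists, otherwise start fresh.
    if path.exists():
        return json.loads(path.read_text())
    return {"round": 0, "hard_examples": [], "history": []}


def save_state(path, state):
    # Write to a temporary file first, then rename over the real one,
    # so an interrupted run does not leave a half-written state file.
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps(state, indent=2))
    tmp.replace(path)


def run_loop(state_path, eval_set, max_rounds):
    state = load_state(state_path)
    while state["round"] < max_rounds:
        ckpt = train_checkpoint(state["round"], state["hard_examples"])
        failures = evaluate(ckpt, eval_set)
        # Feed new failures back in as hard examples for the next round.
        for item in failures:
            if item not in state["hard_examples"]:
                state["hard_examples"].append(item)
        state["history"].append({"checkpoint": ckpt, "failures": len(failures)})
        state["round"] += 1
        save_state(state_path, state)  # the "receipts": metrics per checkpoint
    return state
```

Calling `run_loop` again with the same state file and a larger `max_rounds` picks up at the saved round rather than starting over, which is the "resume instead of vanish" property the post mentions.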


Michael Guo @Michaelzsguo · 15 May

My DeepSeek V4 Pro agent (inside Codex) has been pursuing a goal for more than 13 hours, burning ~100M tokens, and has only cost me $1.85. Yes, you read that right. Not $185, but 1 dollar and 85 cents.
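Taking the post's figures at face value, the implied rates can be worked out directly. The inputs below are the rounded numbers from the post ("~100M tokens", "more than 13 hours"), so the results are rough estimates. The variable names are illustrative only.

```python
# Figures as quoted in the post; both tokens and hours are approximate.
total_cost_usd = 1.85
total_tokens = 100_000_000
hours = 13

# Effective blended price per million tokens (about $0.0185).
cost_per_million = total_cost_usd / (total_tokens / 1_000_000)

# Average spend per hour of agent runtime.
cost_per_hour = total_cost_usd / hours

# Average token throughput over the whole run.
tokens_per_second = total_tokens / (hours * 3600)

print(f"${cost_per_million:.4f} per million tokens")
print(f"${cost_per_hour:.3f} per hour")
print(f"{tokens_per_second:.0f} tokens per second on average")
```

Because the run lasted more than 13 hours, the per-hour and per-second figures are upper bounds on the true averages.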

