
Nothing special to see
4.8K posts

Nothing special to see
@M_P_net
Don't know why you're bothering to look. Wrong or right, Opinions are my own. Try to see from a different points of view. Challenge my view or change my mind





INCREDIBLE Trinity-Large-Thinking just dropped > The STRONGEST American open model we’ve gotten so far > #2 on PinchBench > ~96% cheaper than Opus > not hype, look at the benchmarks > GPQA-D > 76.3 > beats MiniMax M2.7 !!! > but still behind Opus 4.6 > Tau2-Airline > 88.0 > basically trading blows with the frontier > ahead of most of the pack > Tau2-Telecom > 94.7 > strong > but GLM-5 still spikes higher here > PinchBench (agent check) > 91.9 > #2 overall > sitting right behind Opus 4.6 > and ahead of all other opensource attempts > this is the important one btw > because this is what your agents actually feel like in prod > AIME25 (math / reasoning) > 96.3 > competitive with Kimi and GLM > still under Opus ceiling > MMLU-Pro > 83.4 > solid general intelligence baseline > BCFLv4 > 70.1 > middle of the pack, not its strongest axis > this model is not trying to be the best at everything > it’s optimizing for: > multi-turn coherence > tool use > long-horizon agents > cost > aka the stuff that actually matters when you deploy > not “we beat everyone on X benchmark” > but “we’re close enough where economics flip the decision” > also Apache 2.0 weights > meaning: > you can run it > fine-tune it > break it > own it > American open models finally starting to look real again > took long enough Buy a GPU
































