Ted Spare

2.7K posts

Ted Spare banner
Ted Spare

Ted Spare

@TedSpare

Building @rubriclabs

Montreal, Canada Beigetreten Temmuz 2021
685 Folgt1.1K Follower
Sazabi
Sazabi@sazabi·
Time's up. The new Sazabi website, codenamed "Midnight", is live now. 🔗 Don’t miss it: sazabi.com
English
19
24
102
26.4K
Ted Spare
Ted Spare@TedSpare·
Coding agent feat request: pause + resume
English
0
0
0
106
Josh Pigford
Josh Pigford@Shpigford·
i've been heads down lately working on a new thing: @RumoredAI and today it's available! everyone's familiar with SEO and everyone's becoming more familiar with AEO/GEO (which is optimization for AI). yes it's interesting to know what terms/phrases surface your business, but what nobody has tackled is what to do when AI is getting your business *wrong*. and we found that AI hallucinates business facts for quite literally every brand. rumored.ai surfaces what AI is saying about your brand, what it's getting wrong, how you compare to your competitors and (most importantly) the exact things to do to fix those issues. you get a ridiculously in-depth interactive threat report covering 12 sections: from executive summary and active threats to competitive analysis, schema audit, and a prioritized action plan with copy-paste fix prompts. this isn't a subscription (yet?). it's a one-time purchase of an in-depth audit of your business. launch price is $25. but the price goes up by $25 each time someone purchases. 📈 have been testing this with a lot of companies and the response has nearly universally been 🤯. i think you'll love it.
English
29
2
80
33.3K
Ted Spare
Ted Spare@TedSpare·
Lots of talk about how to power GW-scale datacentres, GE’s backlog on turbines, etc. Is solar not an incredibly obvious choice? Price tokens at a markup on electricity. Demand is highest when the sun shines. The world is already building 600 GW of solar a year, 1+ TW by 2030.
Ted Spare tweet media
English
0
0
0
56
Ted Spare retweetet
Rubric Labs
Rubric Labs@RubricLabs·
New post: Primitives over Pipelines The AI systems most teams are shipping today were designed for dumber models. Now that frontier intelligence can follow instructions, reason, and self-correct the move is to give agents primitives and let them cook. rubriclabs.com/blog/primitive…
English
3
7
20
2.1K
Ted Spare
Ted Spare@TedSpare·
Tech Twitter: the EU can't build The EU:
Ted Spare tweet media
English
0
0
5
199
Ted Spare
Ted Spare@TedSpare·
马上发财
Ted Spare tweet media
中文
0
0
4
122
Ted Spare
Ted Spare@TedSpare·
@maksym_andr So cool. Could you apply a heatmap/colorscale to this table for easy parsing?
English
1
0
0
182
Maksym Andriushchenko
Maksym Andriushchenko@maksym_andr·
Interesting finding from our PostTrainBench: Sonnet 4.5 released ~3 months ago can barely improve the performance of base LLMs. But there's been _a lot_ of progress since then: - Opus 4.5 does perform much better - GPT-5.1 Codex Max outperforms the rest by a wide margin!
Maksym Andriushchenko tweet media
English
6
5
93
5.7K
Ted Spare
Ted Spare@TedSpare·
What is the barrier to major benchmarks auto-updating when a new frontier model drops? Some of my favourites (METR, SWE-bench, τ²-bench) take days or weeks to update
English
0
0
0
143
Ted Spare retweetet
Sarim Malik
Sarim Malik@sarimrmalik·
Introducing → rubric.tv It's a dead simple frontend generator, designed to test gpt-5.2 and its capabilities. It's surprisingly good. For example, check this jet simulator where I can fly over Toronto. We set aside a budget for this, it's free to try, have fun.
Ted Spare@TedSpare

GPT-5.2 from @OpenAI launched yesterday and we wanted to test it. We built a playground to test its coding + UI capabilities and set a $1000 API limit. Go nuts: rubric.tv

English
2
1
4
583
Ted Spare
Ted Spare@TedSpare·
@OpenAI This exercise also just reminds me how incredibly creative you all are Keep it going
English
0
0
2
67
Ted Spare
Ted Spare@TedSpare·
It definitely loves Purple Hell, but then you are what you eat I guess?
Ted Spare tweet mediaTed Spare tweet mediaTed Spare tweet media
English
1
0
2
85
Ted Spare
Ted Spare@TedSpare·
GPT-5.2 from @OpenAI launched yesterday and we wanted to test it. We built a playground to test its coding + UI capabilities and set a $1000 API limit. Go nuts: rubric.tv
Ted Spare tweet mediaTed Spare tweet mediaTed Spare tweet mediaTed Spare tweet media
English
1
2
8
904