Greg Roodt

6.2K posts

Greg Roodt banner
Greg Roodt

Greg Roodt

@groodt

Internet geek. Building AI platforms at scale for Canva. Sometimes I fly kites. Cofounded AirHelp.

Sydney, Australia Katılım Nisan 2009
2.5K Takip Edilen982 Takipçiler
Greg Roodt retweetledi
Ryan Yuan
Ryan Yuan@RainbowYuhui·
We’re excited to launch #MagicLayers globally at Canva 🚀🚀🚀 This is currently the most advanced image-to-layer decomposition model in the world-purpose-built for design-native content. ⚡ 20–200× faster than Qwen-Image-Layer 🎨 Super strong on designs canva.com/newsroom/news/…
English
3
10
30
7.7K
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
NVIDIA MOAT ALERT: The performance of BLACKWELL increased 3.25x in the span of just 4 months.  At iso-interactivity of 95 tok/s/user, b200 deepseek fp4 perf went from 400tok/s/gpu to 1300tok/s/gpu
SemiAnalysis tweet media
English
30
75
816
66.8K
Greg Roodt retweetledi
akira
akira@realmcore_·
Feels like everyone making their own agent stumbles across the same primitives and thinks they solved something Let me save you some time (read this, it's funny and useful): - You're going to make an agent - You're going to run it on benchmarks > It's going to suck - You're going to make a tool to analyze traces - You're going to say this helped you > It wont work - You're going to think about role based agents for solving a single task - You're going to make a workflow for solving a benchmark > Both will work. Neither are generalized - You'll think you made it > It will be nearly unusable by an end user > Back to square one - You're going to realize you're stuck with a for loop - You're going to think about swarms > In swarms single agent usability doesn't matter - But wait you need a task manager - But wait you need a merge queue - But wait you need compression for long jobs > Compression is a foot gun - But wait now you need an agent to manage it all - But wait now you need something that checks to make sure the manager is managing - You're going to go back to single loop agents - Well, subagents seem like the way to do all of this > Bam! Plot twist: subagents are hard to do well - You're going to think "Hmm well subagents isolate context" because said so - You're going to start to look at other agent implementations > How have they all solved compaction, multi-agent, task management, memory etc.? - You're going to realize it's all just tradeoffs, but most of them have only one side people care about - "Oh it's all just context engineering" > Yep. But it has to be good and it has to be general. > Back to the starting loop. Rinse and repeat. Congrats. Keep it simple. Keep it general.
English
61
35
714
55.8K
Hunter Leath
Hunter Leath@jhleath·
about 10 years ago, when I was a fledgling engineer at Amazon EFS, we had a customer who’s entire data was stored as ONE BILLION kilobyte-scale json files in the root of their file system. Didn’t work well on EFS since then, it’s been my mission to make small files functional for users
heiner@HeinrichKuttler

@jhleath POSIX semantics really put some demand on the implementation. the fact that users love to put millions of files into one directory doesn't help

English
15
10
420
212.7K
Greg Roodt
Greg Roodt@groodt·
I remain unconvinced that any companies need “realtime data”. The only place it makes sense is your observability stack, and you’re probably using datadog, honeycomb, or otel for that already. Everything else: use an OLTP database in product and a data warehouse for offline.
TDM (e/λ) (L8 vibe coder 💫)@cto_junior

We used Kafka + Flink + ScyllaDB for a project that emitted 1 record every 5 minutes instead of Spring + Postgres Why? so we could flex our "realtime data stack" and get promoted faster It worked.

English
0
0
2
197
Greg Roodt retweetledi
Greg Roodt retweetledi
pash
pash@pashmerepat·
everyone's constantly posting the meme about having a bunch of different agent rule files while the real nightmare continues to be totally ignored: - openai just dropped responses api that breaks every single existing agent architecture - anthropic format was the universal translator (superset of openai completions), now it's obsolete - every provider has different message shapes, tool calling patterns, reasoning hydration - cline has anthropic baked into disk storage, 30+ providers, core interfaces - migration would total architectural hell who gives a fuck about .cursorrules vs agents md when your reasoning traces disappear between api calls and your entire codebase assumes one message format that's no longer the superset? can we please standardize on a future proof llm API standard?
English
61
58
703
205.3K
Greg Roodt
Greg Roodt@groodt·
Anyone else waiting for Zuck to announce that Meta has hired Soham Parekh?
English
0
0
0
107
Greg Roodt retweetledi
rahulvohra
rahulvohra@rahulvohra·
Superhuman is being acquired by @Grammarly! 💜💚 Together, we will build the AI-native productivity suite of choice 🥇 We will invest even more deeply in AI and email, reimagine chat and collaboration, and build AI agents that unlock a whole new way of working. More below 👇
Superhuman Mail@SuperhumanMail

x.com/i/article/1940…

English
226
84
2.1K
330.9K
Greg Roodt retweetledi
Simon Willison
Simon Willison@simonw·
@mitsuhiko I've also encountered groups who rampage around a company trying to replace all of the the custom, wonky shaped homegrown wheels with "don't reinvent the wheel" external products and libraries... without taking the time to understand why the wonky wheels are that particular shape
English
3
1
37
3K
Greg Roodt
Greg Roodt@groodt·
@pymhq Congrats to everyone one theahnch! Is it too early to ask about performance expectations on Bedrock vs 1P?
English
0
0
1
36
Andy Peng
Andy Peng@pymhq·
Teams are currently having happy hour with the Anthropic offsite crew. Cheers #Claude4 🥂
Andy Peng tweet mediaAndy Peng tweet mediaAndy Peng tweet media
Andy Jassy@ajassy

Anthropic’s most powerful models – Claude Opus 4 and Claude Sonnet 4 – just landed in Amazon Bedrock. Major enhancements across the board, particularly in coding (@AnthropicAI’s benchmarks show Opus 4 is the world’s best coding model) and advanced reasoning. Gonna matter for startups and enterprises looking to save time and accelerate their AI innovation. aboutamazon.com/news/aws/anthr…

English
1
0
0
264
Andy Peng
Andy Peng@pymhq·
Check out my current team’s latest delivery: ~2x model inference acceleration.
Andy Peng tweet media
English
1
0
1
317
Greg Roodt retweetledi
Scott Condron
Scott Condron@_ScottCondron·
New Evals API I’m excited to share a new API for logging evals with W&B Weave. EvaluationLogger - log_prediction - log_score - log_summary Our design goal for this API was to get out of your way and build the most flexible eval API out there, inspired by wandb.log, which our @weights_biases users love. - No hidden logic, you control the eval loop and what you log - Easy to integrate into existing evals with any model or framework - Log and version everything so you don’t accidentally compare incomparable things - Easy to query - Works with our existing comparison UIs We'd love you to try it out and share your thoughts on it. link below
Scott Condron tweet mediaScott Condron tweet media
English
3
8
15
5.1K
Greg Roodt
Greg Roodt@groodt·
@OfficialLoganK Are you going to simplify the cache API? It's the most awkward one to use at the moment compared to competitors.
English
0
0
1
26
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Context caching updates in the Gemini API: - ✅ Added support for 2.0 Flash - ✅ Added support for 2.5 Pro Preview - 📉 Reduced min context size from 32K down to 4K Much more to come still, please send any feedback on the experience!
English
150
65
1.7K
117.6K
Greg Roodt
Greg Roodt@groodt·
@OfficialLoganK Congrats on the release! Gemini is crushing it across the board recently. Do you know if there are any plans to improve the caching semantics? It's more awkward to use than your competitors.
English
0
0
0
63
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
The Gemini 2.5 series is the world’s best model lineup 👍
English
189
76
2.7K
294.9K
Greg Roodt
Greg Roodt@groodt·
Agents are just selecting a DAG of tool calls aren't they? So we're doing query planning, optimization and execution again aren't we? It's SQL isnt it? The answer is SQL. It's always SQL. AQL: Agent Query Language.
English
1
0
2
80
Greg Roodt
Greg Roodt@groodt·
Remember xgboost?
English
1
0
2
99
Greg Roodt
Greg Roodt@groodt·
@pymhq Bedrock is so good! Would you estimate the ETA is days? Weeks? Months?
English
0
0
0
31
Andy Peng
Andy Peng@pymhq·
Llama4 is single node, and Bedrock’s onboard coming soon.
English
1
0
0
146