Bovard DT
41 posts

Bovard DT
@BovardDT
Software Engineer; Data Scientist Model Evals @ Google Deepmind/Kaggle
Honokaa, HI Katılım Temmuz 2012
122 Takip Edilen69 Takipçiler

Qwen 3.7-max beats Opus 4.7 and GPT-5.5
We tested three frontier models on a real agentic task: write a Tetris bot that plays the game and trains itself. Each model could read its own code, run benchmarks, and rewrite itself across 10 iterations. Then we compared the final bots head to head.
Qwen 3.7-Max: training cost $1.32, bot improvement +56%
Claude Opus 4.7: training cost $12.15, bot improvement +28%
GPT-5.5: training cost $2.85, bot improvement +7%
Qwen won on every dimension - biggest jump, 9× cheaper than Claude, 2× cheaper than GPT. Long agentic loops is where Qwen Max actually delivers.
English

Orbit Wars just hit 3k teams! (and the self-reported RL team just took the lead). Still over a month left for folks who want to jump in! kaggle.com/competitions/o…
GIF
English
Bovard DT retweetledi

We're about to hit 2000 teams in Orbit Wars! By far our biggest simulation competition yet! Competition is still early days (and $ 50k on the line), check it out here: kaggle.com/competitions/o…

English

@MeganRisdal Very excited to see what people come up with! Designing the ruleset for this was a lot of fun. Who doesn't love a good maze :)
English
Bovard DT retweetledi

Orbit Wars is popping off! I created the ruleset as an homage to the 2010 Google AI Challenge Planet Wars competition (my first sims comp). It's been great seeing so many people have such a positive experience!
kaggle.com/competitions/o…
English
Bovard DT retweetledi

Exciting news! I’m going to be joining @MeganRisdal @BovardDT along with @pisa_twt on Kaggle’s podcast. We’ll talk about the @LuxAIChallenge and how we design AI environments for fun, accessibility, and research! Friday 10:30 AM PST at twitch.tv/kaggleofficial
English
Bovard DT retweetledi
Bovard DT retweetledi
Google’s downtown Portland office is expanding into one of the city’s most prominent buildings.
bit.ly/2Okd8kb
English

I just backed The Cinnamon Roll that Defies Reality on @Kickstarter kickstarter.com/projects/hails…
English

I'm a volunteer with the Oregon Food Bank
People and families who are food insecure need our elected leaders to do the job. Funding for critical human services around hunger are the collateral for your actions
Return to the Capitol and direct your colleagues to do so. #orleg
English

#SATURN19 Just finished my talk on Data Necromancy! Thanks all for attending, sides here: docs.google.com/presentation/d…
English

Security is important, @StateFarm. We'd like it if you supported auth with USB dongles. dongleauth.org #SupportDongleAuth
English





