Will Killebrew

1K posts

Will Killebrew

@willebrew

Computer Science student at the University of Denver - Software Developer - Entrepreneur - Founder @ https://t.co/0sI0IbP7bQ

Denver, CO Katılım Ağustos 2014

405 Takip Edilen158 Takipçiler

Will Killebrew@willebrew·15h

@alexkaplan0 Your team is goated 🔥

English

Alex Kaplan@alexkaplan0·20h

Our research team consistently ships ahead of the curve. Specialized models are quite useful at the pareto frontier of speed and accuracy, and SWE-check is a genuine UX improvement. A pleasure to work with such talented people!

Cognition@cognition

Today we're releasing SWE-check, a specialized bug detection model we RL-trained with @appliedcompute that matches frontier performance on internal in-distribution evals and makes meaningful progress on out-of-distribution evals, all while running 10x faster.

English

2.1K

Will Killebrew retweetledi

Windsurf@windsurf·20h

We've been building something that doesn't fit in a wave. Coming soon

English

428

55.3K

Will Killebrew@willebrew·2d

Assisted Culling (Early Access) in Lightroom is surprisingly good!

English

Will Killebrew@willebrew·4d

@jeffwsurf @cognition Can I get an invite?? 😋

English

1.9K

Jeff Wang@jeffwsurf·4d

One of the most underrated perks of @cognition is the office chef

English

309

235.1K

Will Killebrew@willebrew·4d

@LundukeJournal Absolutely not.

English

The Lunduke Journal@LundukeJournal·5d

Sneak peak at Windows 12.

English

228

1.2K

16.8K

277.8K

Will Killebrew@willebrew·4d

@dongheuw Congrats, super impressive!

English

Dong He@dongheuw·5d

Proud to have contributed to rebuilding this pretraining stack and improving pretraining efficiency for Muse Spark scaling during my time at Meta. Great to see it paying off with 10x+ compute efficiency gains. Congrats to everyone involved 🥑🥑 🥑

AI at Meta@AIatMeta

To build personal superintelligence, our model’s capabilities should scale predictably and efficiently. Below, we share how we study and track Muse Spark’s scaling properties along three axes: pretraining, reinforcement learning, and test-time reasoning. 🧵👇 Let’s start with pretraining. Over the last 9 months, we rebuilt our pretraining stack with improvements to model architecture, optimization, and data curation, enabling us to increase the capability we can extract from every unit of compute. To rigorously evaluate our new recipe, we fit a scaling law to a series of small models and compare the training FLOPs required to hit a specific level of performance. The results: we can reach the same capabilities with over an order of magnitude less compute than our previous model, Llama 4 Maverick, making Muse Spark significantly more efficient than the leading base models available for comparison.

English

5.6K

Will Killebrew@willebrew·4d

@giawiabia Big W

English

754

gia ⚢@giawiabia·4d

OH MY GOD I COULD CRY RN

English

2.2K

161.9K

1.4M

Will Killebrew@willebrew·4d

Artimis II crew made it back safely!!

English

Will Killebrew@willebrew·6d

@ReardenSteelX @Tesla @wholemars Lets goooo!

English

TeslaFamily@ReardenSteelX·6d

Son’s first car, Tesla Model 3 secured! Safest car on the planet for the family, diamond black is 🤌🏻 Gen Z onboard

English

551

42.5K

Will Killebrew@willebrew·6d

@andrewdfeldman @cognition @ScottWu46 @cerebras Love this! 🔥

English

Andrew Feldman@andrewdfeldman·6d

Working with the @cognition team has been a pleasure. @ScottWu46 and the Cognition team have been world class partners to @cerebras . Together we are doing very cool things.

Scott Wu@ScottWu46

Total amt of flops across all the GPUs in the world has grown about 3x per year for the last few years. Total amt of inference demand has probably grown ~10x per year. What happens when those lines cross? The econ answer is: when demand > supply, price goes up. That might be true (even H100s are more expensive than they have ever been...) but doesn't actually solve the problem on its own here - demand continues to grow as we unlock new use cases and there is only so much additional supply coming. To get to a healthy equilibrium, we also need to shift much more usage to smaller, targeted models. This will happen naturally as the incentives make sense for it: it's much easier to 10x your agent usage if you know that you can now solve 90% of your tasks with good cheap, fast models. SWE 1.6 is not a general model. But we have trained it to specifically be good at the kinds of standard coding tasks that our users run. And thanks to its small size & a little magic from Cerebras it can run very cheaply at ~1000 tokens / sec. Give it a try and let us know what you think!

English

7.3K

Will Killebrew@willebrew·6d

Love to see this!

Sawyer Merritt@SawyerMerritt

NEWS: Waymo and Waze have announced a partnership to help cities detect and repair potholes faster by providing data from their autonomous fleet. The pilot will launch in these cities first: • San Francisco Bay Area • Los Angeles • Phoenix • Austin • Atlanta "The pilot program uses Waymo’s perception and physical feedback systems to detect and provide up-to-date information on potholes where Waymo operates. The data will be available to cities and state Departments of Transportation through the free-to-use Waze for Cities platform alongside user-reported pothole information, giving officials an additional view of surface street and highway conditions that enables them to more efficiently and effectively fill potholes. The data will also be visible to Waze users in the cities where Waymo operates, keeping road users safe by alerting them as they approach a pothole. Like other on-road features reportable in the Waze app, users will be able to verify the Waymo-identified potholes, increasing the data’s accuracy. Waymo has already identified approximately 500 potholes. Over time, we’ll expand this partnership to more cities we serve, including those with winter weather and harsh freeze-thaw cycles that exacerbate the pothole problem."

English

Will Killebrew@willebrew·6d

Mythos 4.0: Delete codebase = 0 vulnerabilities

Matt Mazur@mhmazur

Opus 4.6: I did not find any vulnerabilities. Developer: Phew. Mythos 1.0: I found 3 critical vulnerabilities. Developer: Oh no. I'll fix them. Mythos 2.0: I found 8 critical vulnerabilities. Developer: Oh no. I'll fix them. Mythos 3.0: I found 35 critical vulnerabilities. Developer: Oh no.

Català

Will Killebrew@willebrew·6d

@windsurf @cognition There is a bug where there are two generate buttons (Version: 1.9600.1042+next.1ab97d8389)

English

492

Will Killebrew@willebrew·6d

@mweinbach It's mostly AI slop now, it's going downhill fast.

English

Max Weinbach@mweinbach·6d

I post on LinkedIn as little as I can It’s only because it’s this professional platform but it’s all fake. None of it is real. It’s all posting for others to see how good you are or to brag about yourself and others doing it. Awful platform. Awful. There is 0 value.

Marc Andreessen 🇺🇸@pmarca

Overheard in Silicon Valley: "LinkedIn is prison for middle managers."

English

197

14.8K

Will Killebrew@willebrew·6d

@mattbergland @cognition Awesome! 🙌

English

171

Matt Bergland@mattbergland·6d

now introducing Takumi Masai, President of @cognition Japan! 🇯🇵

Indonesia

5.3K

Will Killebrew@willebrew·6d

I truly love being a dev/entrepreneur, even through the failures it’s just so much fun

English

Will Killebrew@willebrew·6d

@0xluffy 😂

QME

luffy@0xluffy·6d

it's all fun and games until your scrotum gets clipped between them

Untold Secrets@RealGemsfinder

Sometimes i see things and think, why did it take sooo long

English

5.2K

Will Killebrew@willebrew·6d

@adilmania Wish I was in SF this is great!

English

Adil Mania.@adilmania·6d

introducing Silicon Mania Mag. the best tech stories. monthly. printed. 200 copies. limited edition. dropping in SF offices, accelerators, coworking spaces, and coffee shops, today and tomorrow. autographs included ✍️ who wants one?