Alex Gajewski

98 posts

@apagajewski

Building a preschool for robots @pantographPBC. Previously cofounder @sfcompute, @ExaAILabs

San Francisco · Joined June 2014
907 Following · 2.4K Followers
Alex Gajewski retweeted
Standard Intelligence @si_pbc
Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.
[GIF]
Alex Gajewski retweeted
Mox SF @moxspace
Weird opp??? Mox used to be a city records center, so we inherited a pretty legit server room.
- 120A @ 240V
- 5-ton cooling unit
- 100kW diesel genny w/ 1000-gal tank
- 2.5Gb sym fiber
Can get it live in ~1 month. Who needs serious on-prem infra in SF?
[image]
Alex Gajewski retweeted
Pantograph @pantographPBC
Merry Christmas from your favorite robots! 🎄
Alex Gajewski @apagajewski
@gwern What do you imagine such a process being applied to at that level of overhead? Even at 10x overhead I have a hard time coming up with applications
Alex Gajewski @apagajewski
I wonder what you would get if you trained something Cycle-GAN-like between images and music. Probably possible today with the quality of generative models we have!
Alex Gajewski @apagajewski
The new Google image model is quite good, except that it doesn't like to draw physicists:
[two images]
Alex Gajewski @apagajewski
This one seems like a good idea to me; increasingly I think datasets and RL environments are the limiting factor:
Y Combinator @ycombinator

Devtools for AI Agents @dessaigne

AI agents are the next wave: autonomous tools that reason, decide, and amplify human productivity. We're funding startups building devtools for agents, whether you're creating agent builders or building blocks to perform complex tasks.
Alex Gajewski @apagajewski
@distributionat Yeah, things in that direction. I think OpenAI is likely to be a bit too conservative with what they let Operator do.
toucan @distributionat
@apagajewski By computer control do you mean something like Operator?
Alex Gajewski @apagajewski
Feels like a good time to start a computer control startup. The methods are generally known (RL on top of base models), and it probably doesn't require that much compute, just thoughtful environment design. I would probably start with a text-only representation of websites.
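The "text-only representation of websites" idea above can be sketched concretely. Here is a minimal, hypothetical illustration using only the standard library: flatten a page into visible text plus a numbered menu of interactive elements an agent could act on. The class name and observation format are invented for this sketch, not any particular product's scheme.

```python
from html.parser import HTMLParser

class TextObservation(HTMLParser):
    """Flatten HTML into a text observation: visible text, then a
    numbered list of interactive elements (links, inputs, buttons)."""

    def __init__(self):
        super().__init__()
        self.lines, self.actions = [], []

    def handle_starttag(self, tag, attrs):
        if tag in ("a", "button", "input"):
            a = dict(attrs)
            label = a.get("value") or a.get("href", "")
            self.actions.append(f"[{len(self.actions)}] <{tag}> {label}".strip())

    def handle_data(self, data):
        if data.strip():
            self.lines.append(data.strip())

    def observation(self):
        return "\n".join(self.lines + ["-- actions --"] + self.actions)

page = '<h1>Search</h1><input value="query"><button>Go</button><a href="/help">help</a>'
obs = TextObservation()
obs.feed(page)
print(obs.observation())
```

An RL environment would then accept actions like "click [2]" or "type into [0]", keeping the whole loop in text, which is the appeal of the representation.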
Alex Gajewski @apagajewski
I hope that somebody starts a company to make an AI-native smartwatch. It feels to me like the ideal form factor for most of what I want a language model to do.
Alex Gajewski @apagajewski
Very excited for this new cluster. Big enough to train R1, but it's running our combinatorial auction, so the prices should be rational.
evan conrad @evanjconrad

Hey friends, we're excited to announce that an additional 2,000 H100s will be added to @sfcompute's on-demand market. It's the largest* interconnected cluster, from any provider (including hyperscalers), that you can get on a per-hour basis.

You're not locked in with San Francisco Compute. If DeepSeek can compete with OpenAI using 2,000 H800s, you too can train a state-of-the-art RL model without ever having to sign a long-term contract that you can't exit. You could have trained DeepSeek-v3 for $4.5m over 1.5 months on SFC, or $35m if you could only buy a 1-year contract off market.

This was the dream Alex & I have had since our audio model company (Junelark) died because it couldn't procure enough GPUs, and it's what we've been working towards for nearly two years. Long-term contracts are a trap; they make it so only the biggest of the big can compete in AI. They force startup founders to raise at massive valuations pre-revenue, which dilutes founders and employees and sets them up to fail when they can't raise their next round.

This cluster will roll out over the next few weeks as we scale our infrastructure. Soon you'll be able to access it via our managed Kubernetes service or by reaching out to set up a custom solution. We're also exploring other ways of partnering with service providers to let them offer GPU-based services, like workers and inference endpoints, without being forced into a long-term contract with a hyperscaler. You no longer need to bet your company on GPU prices to offer GPU-based services.

* We think! If you know of a larger one, please correct us!
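The $4.5m-vs-$35m comparison in the quoted tweet follows from simple GPU-hour arithmetic. A rough sketch, assuming a ~$2/GPU-hour rate back-solved from the quoted totals (the rate is an assumption, not a published price):

```python
# Reconstruct the quoted cost comparison from GPU-hours.
GPUS = 2_000
RATE_PER_GPU_HOUR = 2.00   # assumed; inferred from the quoted figures
HOURS_PER_MONTH = 730      # average month

# Buy exactly the run you need on an on-demand market (~1.5 months):
on_demand = GPUS * RATE_PER_GPU_HOUR * HOURS_PER_MONTH * 1.5

# Locked into a 1-year contract at the same rate, paying for idle months:
contract = GPUS * RATE_PER_GPU_HOUR * HOURS_PER_MONTH * 12

print(f"on-demand: ${on_demand / 1e6:.1f}M")  # ≈ $4.4M
print(f"1-yr term: ${contract / 1e6:.1f}M")   # ≈ $35.0M
```

The ~8x gap is just the ratio of a 1.5-month run to a 12-month commitment; the per-hour rate cancels out of the comparison.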
Alex Gajewski @apagajewski
very excited the weights of o1 finally arrived
[image]
Alex Gajewski @apagajewski
One part of SF Compute we haven’t talked about very much yet is that post-AGI (presumably soon), the models will want to train more models. (Really, people will ask the first models to train more models, or perhaps to solve tasks that would benefit from, say, some custom RL). It will probably be most natural for those models to buy compute from a liquid market, where they can get precisely the compute they need for each run they need to do.
Alex Gajewski @apagajewski
Has anyone tried “sub-token attention”? Artificially increase the sequence length by including K copies of each token next to each other (say, each linearly projected by a different map), and let the different copies attend to each other. True self-attention :P (And then at the output project back to a single token to combine)
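The "sub-token attention" idea above is concrete enough to sketch: expand each token into K adjacent copies, each through a different linear map, self-attend over the expanded sequence, then pool each token's copies back into one vector. A minimal numpy sketch with random (untrained) projections, purely to illustrate the expand-attend-combine shape bookkeeping; all names here are invented:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def sub_token_attention(x, k=3, seed=0):
    """x: (seq, d) token embeddings. Expand each token into k
    linearly-projected copies placed next to each other, run plain
    softmax self-attention over the k*seq sequence, then mean-pool
    each token's copies back to a single vector."""
    rng = np.random.default_rng(seed)
    seq, d = x.shape
    P = rng.standard_normal((k, d, d)) / np.sqrt(d)  # one map per copy
    copies = np.einsum("sd,kde->ske", x, P)          # (seq, k, d)
    h = copies.reshape(seq * k, d)                   # copies are adjacent
    attn = softmax(h @ h.T / np.sqrt(d))             # copies attend to each other
    out = attn @ h                                   # (seq * k, d)
    return out.reshape(seq, k, d).mean(axis=1)       # combine back to one token

x = np.random.default_rng(1).standard_normal((4, 8))
y = sub_token_attention(x, k=3)
print(y.shape)  # (4, 8)
```

Mean-pooling stands in for the output projection mentioned in the tweet; a learned (k*d, d) map would be the natural trained version.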