Aidarbek Suleimenov (@idarbek) - Twitter Profili

Aidarbek Suleimenov@idarbek·15h

@cloneofsimo But alien intelligence wasn’t trained on vast corpus of human generated data

English

1

0

17

663

Simo Ryu@cloneofsimo·20h

This is pure perception task with heavy cultural prior: you need geometical intuition, some prior on solving mazes, what typical game interaction feels like. Like if alien intelligence that doesnt have visual perception like ours, and doesnt know what nintendo is, how are they supposed to solve this? (And yes, there are animals without visual perception) (Oh and guess what other intelligence dont have visual perception and geometrical prior like ours)

François Chollet@fchollet

ARC-AGI-3 is out now! We've designed the benchmark to evaluate agentic intelligence via interactive reasoning environments. Beating ARC-AGI-3 will be achieved when an AI system matches or exceeds human-level action efficiency on all environments, upon seeing them for the first time. We've done extensive human testing that shows 100% of these environments are solvable by humans, upon first contact, with no prior training and no instructions. Meanwhile, all frontier AI reasoning models do under 1% at this time.

English

17

1

87

26K

Aidarbek Suleimenov@idarbek·15h

@xlr8harder Like % will change ofc, but delta between models and human so big, it won’t make significant difference?

English

1

0

64

Aidarbek Suleimenov@idarbek·15h

@xlr8harder I mean that most of the games don’t contain fog of war mechanism, so removing them won’t significantly affect performance? I played first few and didn’t see it

English

2

1

0

69

xlr8harder@xlr8harder·19h

measuring against the 90th percentile when coinflip mechanics add enormous score variance seems like a poor choice.

Psyho@FakePsyho

AI (or any human) will never get 100% in ARC-AGI-3 Let me introduce you to the worst game mechanic you can find in a puzzle game: fog of war At the start, if you go right instead of bottom, you're wasting many moves. Your score on this level literally depends on a conflip!

English

9

1

127

6K

Aidarbek Suleimenov@idarbek·1d

@fchollet There are a lot of weird parts of the economy where building software suddenly becomes viable

English

0

1

58

François Chollet@fchollet·1d

There's going to be a lot more software, and a lot more demand for software engineers. And a lot more token consumption.

Aaron Levie@levie

Jevons paradox is happening in real time. Companies, especially outside of tech, are realizing that they can now afford to take on software projects that they wouldn’t have been able to tackle before because now AI lets them do so. We’re going to start to use software for all new things in the economy because it’s incrementally cheaper to produce. Marketing teams at big companies will have engineers helping to automate workflows. Engineers in life sciences and healthcare will automate research. Small businesses will hire engineers for the first to build better digital experiences. And as long as AI agents still require a human who understands what to prompt, how to review when an agent goes off the rails, how it guide back, how to maintain the system that was built, how to fix the ongoing bugs, and more, we will still have humans managing these agents. This is why all the advice you get of not going into engineering is wrong. The world is going to increasingly be made up of software, and the people that understand it best will be in a strong economic position. This will happen in other roles as well where output goes up and demand increases.

English

38

57

654

62.1K

Aidarbek Suleimenov@idarbek·1d

@Aquafiber114720 Adilet’s infinite war with microplastics

English

0

1

6

Adilet@Aquafiber114720·1d

Thank you!

sarah@s4rah_dev

Did a deep dive on teabags after the TL told us last week they were killing us all with microplastics. I knew twinnings, stash, and a few others claimed to be plastic free, but wondered about the tech. Turns out they have this machine (worth $300,000 new, from Germany) that makes and fills the teabags and it works by pressing super thin paper together to create a long tube and then filling and filing, then using a string to stitch it back together…. By doing this they use no glue, no staples, and no plastic. Never really stopped to look but a pretty cool marvel of engineering in its simplicity. Thank you for joining me in my special interest today.

English

1

0

3

29

Aidarbek Suleimenov@idarbek·1d

@thenanyu Still miss it 😭

English

0

81

Nan Yu@thenanyu·1d

Heroku fumbled so hard.

Patrick Collison@patrickc

When @karpathy built MenuGen (karpathy.bearblog.dev/vibe-coding-me…), he said: "Vibe coding menugen was exhilarating and fun escapade as a local demo, but a bit of a painful slog as a deployed, real app. Building a modern app is a bit like assembling IKEA future. There are all these services, docs, API keys, configurations, dev/prod deployments, team and security features, rate limits, pricing tiers." We've all run into this issue when building with agents: you have to scurry off to establish accounts, clicking things in the browser as though it's the antediluvian days of 2023, in order to unblock its superintelligent progress. So we decided to build Stripe Projects to help agents instantly provision services from the CLI. For example, simply run: $ stripe projects add posthog/analytics And it'll create a PostHog account, get an API key, and (as needed) set up billing. Projects is launching today as a developer preview. You can register for access (we'll make it available to everyone soon) at projects.dev. We're also rolling out support for many new providers over the coming weeks. (Get in touch if you'd like to make your service available.) projects.dev

English

9

1

58

21.4K

Aidarbek Suleimenov@idarbek·1d

@ClementDelangue @Pinterest @Airbnb @NotionHQ @cursor_ai @eoghan @intercom Plus, @PrimeIntellect @browserbase announcement

English

0

1

93

clem 🤗@ClementDelangue·1d

After @Pinterest @Airbnb @NotionHQ @cursor_ai, today it’s @eoghan @intercom publicly sharing that they’re finding it better, cheaper, faster to use and train open models themselves rather than use APIs for many tasks. And hundreds of other companies are doing the same without sharing. Ultimately, I believe the majority of AI workflows will be in-house based on open-source (vs API). It took much more time than we anticipated but it’s happening now!

English

66

143

1.3K

205.3K

Aidarbek Suleimenov@idarbek·1d

@the_judge1111 Idk, founders quit easily just as employees these days

English

0

55

Judge Holden@the_judge1111·1d

I think this is a bad take & surprised to hear it from William Some perceived career risk is not the only risk "CEO/Founder" is not an obviously employable person In fact they should be generally terrible employees They also take time, financial, identity, reputation risk

Patrick OShaughnessy@patrick_oshag

William on how an early stage employee takes way more risk than a founder: "If I'm making $400-500K at Google or Meta and go to an early stage company to get 1% of this company and make $90,000. I've now changed the trajectory of my life, that's a lot of risk. But as a founder, you're not. It's a much higher likelihood that of the next round, regardless of your company, you'll be able to sell some secondary. If it shuts down, you can get employed at a great company, and you have a CEO on your resume. That first employee, they have first employee at a failed company. That's actually not a great resume line item. So we've de-risked the founder, but we haven't de-risked the early stage employee."

English

5

0

12

6.7K

Aidarbek Suleimenov@idarbek·1d

@m_sirovatka The trick that I learned only later in life from my wife is to not own socks with holes in them

English

1

0

2

116

Matej Sirovatka@m_sirovatka·1d

I can't imagine working at cursor - the thought of having to think about matching my socks and being sure I didn't take ones with hole in them... my cortisol would never recover

Cody Blakeney@code_star

these photos are so much funnier to me knowing that they are not wearing shoes

English

7

2

65

8.8K

Aidarbek Suleimenov@idarbek·1d

@browserbase @PrimeIntellect Any plans on updating benchmarks with some of the RL trained models?👀 stagehand.dev/evals

English

0

69

Browserbase@browserbase·2d

We're excited to announce our partnership with @PrimeIntellect to allow anyone to train browser agents. General-purpose models aren't optimized for your browser workflows, BrowserEnv lets you train one that is. Checkout browserenv.com and train your own custom model in a few hours.

English

34

43

496

171.8K

Aidarbek Suleimenov@idarbek·2d

@JesseTinsley Or maybe the bull case for SaaS is precisely that they can afford to sleep on AI and just swoop in after startups did all the discovery

English

1

0

1

27

Jesse Tinsley@JesseTinsley·2d

@idarbek I’m sure they are. Maybe not publicly disclosed yet. Not claiming to have any insider info just my suspicion

English

1

0

3

92

Jesse Tinsley@JesseTinsley·2d

Orland Bravo (the founder and CEO of Thoma Bravo) and I share the same thesis. Very bullish on SaaS and here’s why… 1) Fundamentals of FCF LTV is cheaper than most SaaS market caps 2) SaaS becomes AI with the right stack building on top of base LLMs like Grok, Gemini, Claude etc. the difference is you short cut customer and revenue distribution. 3) moats exist in many SaaS verticals due to compliance and regulatory constraints. That will take years or decades to unwind. Lots of opportunities to buy vs build right now. Which is the first time in a long time.

The Icahnist@TheIcahnist

BREAKING: Thoma Bravo just released their LP meeting slides. The world's largest software PE firm thinks the market has it completely wrong on software right now. Public markets are panic-selling software based on AI fear. Here's what they're seeing:

English

4

3

18

7K

Aidarbek Suleimenov@idarbek·2d

@PrimeIntellect @browserbase That’s pretty neat, browser agents consume surprisingly more tokens to do even simple actions (compared to coding/cli), so I can see how post-training your own model can make financial sense

English

0

1

32

Prime Intellect@PrimeIntellect·2d

The next step to shipping your own self-improving agents is an agent that actually operates inside your browser. We partnered with @browserbase to build exactly that.

Browserbase@browserbase

We're excited to announce our partnership with @PrimeIntellect to allow anyone to train browser agents. General-purpose models aren't optimized for your browser workflows, BrowserEnv lets you train one that is. Checkout browserenv.com and train your own custom model in a few hours.

English

5

17

157

16.5K

Aidarbek Suleimenov@idarbek·2d

@swyx @aiDotEngineer Applied, sounds exciting!!!

English

0

29

swyx@swyx·2d

Clawfather and friends are coming to @aidotengineer in London in 2 weeks :) some transparency from me - running the first international AIE has been extremely hard on our team - even though booths and tickets are ALL sold out, we're still not profitable*. We'll be ok, all first years are investment years, but support from GDM, OAI, Braintrust and WorkOS has been so invaluable in keeping us afloat. We still have our Afterparty, Leadership Luncheon, and Expo Cafe available for sponsors who'd like to support and attend on short notice. Would deeply appreciate all referrals! *events have superlinear cost and logistical complexity curves, i hate it

AI Engineer@aiDotEngineer

We are excited to welcome @OpenAI to the AIE Expo for the first time as Platinum sponsors for AIE EU! OAI has shipped SO much for AI Engineers this year alone, and this is the best place to catch up: - Meet the team at the Ask OpenAI lounge (bring your hardest tasks and best questions!) - Hear keynotes from @steipete and @lopopolo, and - get hands on with in-depth Codex workshops from @kagigz and @reach_vb! See you April 8-10 in London! AI Engineers💙@OpenAIDevs !

English

20

4

111

15.9K

Aidarbek Suleimenov@idarbek·2d

@corm_h @var_epsilon Fall 2019

English

0

398

corm h@corm_h·2d

@idarbek @var_epsilon how long ago did you intern there? i highly doubt less than 90% of google interns these days are cold applications

English

1

0

1

419

varepsilon@var_epsilon·2d

this was how I got my google internship in 2021 actually. really unique program, having enough search queries like "dependency injection" or "mutex lock" would show you this popup and you had to solve a series of 5 challenges to automatically get an interview excited for whatever the next iteration of this looks like with claude code/codex!

Varunram Ganesh@varunram

In the early-mid 2010s, if your search history was really good, Google would automatically invite you to foo bar and solving that would get you an interview at Google Now, if your agent history is really good on GStack, YC will (soon) automatically fill your YC application and that would get you into YC YC is the agent native YC

English

28

52

2.6K

309.4K

Aidarbek Suleimenov@idarbek·2d

@atelicinvest How weird? My take it’ll lead to proliferation of weirder roles on intersections, that’s happening already to some degree now, e.g. FDE (sales guy who codes), GTM engineer (marketing guy who codes), etc

English

1

0

1

87

Unemployed Capital Allocator@atelicinvest·2d

This is absolutely going to happen but the shape of it will be a lot weirder than any of us imagine.

Aaron Levie@levie

Jevons paradox is happening in real time. Companies, especially outside of tech, are realizing that they can now afford to take on software projects that they wouldn’t have been able to tackle before because now AI lets them do so. We’re going to start to use software for all new things in the economy because it’s incrementally cheaper to produce. Marketing teams at big companies will have engineers helping to automate workflows. Engineers in life sciences and healthcare will automate research. Small businesses will hire engineers for the first to build better digital experiences. And as long as AI agents still require a human who understands what to prompt, how to review when an agent goes off the rails, how it guide back, how to maintain the system that was built, how to fix the ongoing bugs, and more, we will still have humans managing these agents. This is why all the advice you get of not going into engineering is wrong. The world is going to increasingly be made up of software, and the people that understand it best will be in a strong economic position. This will happen in other roles as well where output goes up and demand increases.

English

4

1

38

9.7K

Aidarbek Suleimenov@idarbek·3d

@larsencc The worse part its stretched across several months or years

English

0

1

39