Ayush Kalani ⚡

2.2K posts

Ayush Kalani ⚡

@chill_scenes

Banter on tech, startups, life, and growth.

Hacker Hostel Beigetreten Mart 2013

758 Folgt210 Follower

Angehefteter Tweet

Ayush Kalani ⚡@chill_scenes·1 Eyl

Your story won't be written with a pen. It won't be written through imagination, dreams, nor aspirations. It will be written by you living it. What story will you tell? @techleadhd

English

Ayush Kalani ⚡@chill_scenes·23 Nis

Stop San Francisco From Imposing Fees for Use of Public Tennis and Pickleball Courts - Sign the Petition! chng.it/9WMtjPFgpL via @Change

English

Ayush Kalani ⚡ retweetet

Smillew (certified liar)@Smillew_Rahcuef·27 Mar

@sama

QME

467

13.5K

291.1K

Ayush Kalani ⚡ retweetet

Aiden Bai@aidenybai·13 Mar

you can just clone entire YC startups now

English

207

313

5.3K

698.4K

Ayush Kalani ⚡ retweetet

Atty Eleti@athyuttamre·11 Mar

Introducing the Responses API: the new primitive of the OpenAI API. It is the culmination of 2 years of learnings designing the OpenAI API, and the foundation of our next chapter of building agents. 🧵Here’s the story of how we designed it:

English

116

356

2.8K

Ayush Kalani ⚡ retweetet

ChatGPT@ChatGPTapp·21 Şub

4e8 🫂

QST

274

120

3.5K

688.4K

Ayush Kalani ⚡@chill_scenes·9 Şub

Back to the beginning

English

Ayush Kalani ⚡ retweetet

evan conrad@evanjconrad·29 Oca

Hey friends, we're excited to announce that an additional 2,000 H100s will be added @sfcompute's on-demand market. It's the largest* interconnected cluster, from any provider (including hyperscalers), that you can get on a per hour basis. You're not locked in with San Francisco Compute. If DeepSeek can compete with OpenAI using 2,000 H800s, you too can train a state of the art RL model without ever having to sign a long-term contract that you can't exit. You could have trained DeepSeek-v3 for $4.5m for 1.5mo on SFC or $35m if you could only buy a 1 year contract off market. This was the dream Alex & I had since our audio model company (Junelark) died because it couldn't procure enough GPUs, and it's what we've been working towards for nearly two years. Long-term contracts are a trap; they make it so only the biggest of the big can compete in AI. They force startup founders to raise at massive valuations pre-revenue, which dilutes founders and employees and sets them up to fail when they can't raise their next round. This cluster will roll out over the next few weeks as we scale our infrastructure. Soon you'll be able to access it via our managed Kubernetes service or by reaching out to set up a custom solution. We're also exploring other ways of partnering with service providers to let them offer GPU-based services, like workers and inference endpoints, without being forced into a long-term contract with a hyperscaler. You no longer need to bet your company on GPU prices to offer GPU-based services. * We think! If you know of a larger, please correct us!

English

679

92.5K

Ayush Kalani ⚡@chill_scenes·31 Oca

Major outage at claude, /A. Middle of the work

English

Ayush Kalani ⚡ retweetet

Subhash Choudhary@subhashchy·28 Oca

i am building an open-source foundational model, from India, for the world. Who is in? And what can you contribute?

English

659

266

4.9K

594.9K

Ayush Kalani ⚡@chill_scenes·21 Oca

DeepSeek

English

Ayush Kalani ⚡ retweetet

evan conrad@evanjconrad·9 Oca

This is a little guide to how @sfcompute market works. The primary goal of SFC is to reduce inventory risk for vendors & buyers. Because of scaling laws, customers push to spend their compute budgets as optimally as possible, which makes them price sensitive. That pushes down margins for GPU clouds and increases capex for them at the same time. That makes them very sensitive to swings in the market price, because minor adjustments can make slim margins go negative. To prevent this, vendors push their customers to buy large volumes, which offsets the cloud's risk, but then customers end up over paying for compute they don't necessarily need, to the tune of hundreds of millions of dollars. SFC's market is designed to optimally price the risk of owning any block of compute and increase liquidity for the buyers and sellers. Literally that means that there's always a buyer and always a seller, at SOME price, for any "shape" of compute that you want to buy or sell. Have only one node for a few hours in the evening? That has a price. Have 200 nodes for exactly 2 days? That has a price. By pricing EVERY block, we get rid of gaps in capacity, which means higher margins for vendors, and more options for buyers. That's why SFC's prices are so good. Some corners of the market can be as low as 50c! Other corners can be as high as $2.80! To make this work, SFC assigns a "shape" to every buy and sell. This is an abstract, 2-dimensional object that associates every discrete unit of time, (we call it an "epoch"), with a "quantity" (nodes). In some markets, the things being traded are perfectly fungible. As in, one share of AAPL is the same as one share of AAPL. In ours, there's one additional constraint: packing. Folks doing training care if their nodes are colocated. Having two H100 nodes in one cluster (specifically on the same InfiniBand fabric) isn't the same as having two H100 nodes in two different clusters. To solve for this, the market keeps track of all the clusters available on the system, and represent how much capacity they have in total as a shape. When people buy & sell, they get "contracts", which represent how many nodes they own and for how long. On every transaction, the market guarantees that all the contracts can fit within all the clusters. Once a contract has started, it's pinned to that cluster, but contracts that have not started can move around, which means every transaction plays a massive game of binpacking to optimally place new orders. That means every shape on the market is priced differently and optimally for what you're trying to buy! Just sampling some of the orders, there's currently ~1.4k open orders, with prices as high as $6/hr and low as 99c. We have lots of new things we're shipping this year in the market, and we'd love if you join us! If you're math-inclined and like Rust, please come talk to us! My DMs are open.

English

359

118.1K

Ayush Kalani ⚡ retweetet

Kyle Vogt@kvogt·11 Ara

In case it was unclear before, it is clear now: GM are a bunch of dummies.

English

398

270

4.5K

1.3M

Ayush Kalani ⚡@chill_scenes·31 Eki

"The single greatest task for any founder or CEO is to recruit the right people. No company has a culture; every company is different, and the CEO is the one most responsible for the difference. Hire people who are extremely smart, independent, and had intense personalities."

English

Ayush Kalani ⚡@chill_scenes·7 Eki

impatient with actions, patient with results.

English

Ayush Kalani ⚡ retweetet

Nikita Bier@nikitabier·1 Ağu

FAANG Engineer vs. Startup Founder

Nederlands

524

8.1K

497.9K

Ayush Kalani ⚡ retweetet

Vivek Raju@vivekraju93·1 Ağu

VC funded startup Marwadi with slack, crm, business with aws, analytics... with whatsapp

English

905

11.4K

455K

Ayush Kalani ⚡ retweetet

Tomas Hernando Kofman@tomas_hk·30 Tem

We’re building the “meta-model” of AI. Robust routing infrastructure will be critical to effective AI, and by shifting the future towards networks of specialized models rather using a single giant monolithic model for everything, we can create a much safer world to live in.

English

726

Ayush Kalani ⚡ retweetet

Tomas Hernando Kofman@tomas_hk·30 Tem

Another feature I’m excited about is the ability to jointly-optimize prompts and model routing recommendations. Not Diamond integrates with auto-prompt frameworks like DSPy and SAMMO so you can translate prompts across models and always call the best model with the best prompt.

English

876

Ayush Kalani ⚡ retweetet

Tomas Hernando Kofman@tomas_hk·30 Tem

By default, Not Diamond maximizes quality above all else. But Not Diamond also allows you make optimized tradeoffs that can drastically lower your costs and latency by routing to smaller models when doing so doesn’t impact quality.

English

874

Ayush Kalani ⚡ retweetet

Tomas Hernando Kofman@tomas_hk·30 Tem

Small, specialized models can outperform larger models on narrow domains. Routing gives specialized models the robustness of general ones. This is not only more computationally efficient—we get huge interpretability and safety benefits as a free bonus. This is our trojan horse.

English

762

Entdecken

@Change @sama @sfcompute @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates