Jasper

4.7K posts

@zjasper

Co-founder and CEO @Hyperbolic_Labs. ex-@avax & ex-@citsecurities. Finished Math PhD in 2yrs @UCBerkeley. Math Olympiad Gold Medalist. Highest honor @PKU1898

California, USA · Joined November 2018
1.8K Following · 14.9K Followers
Pinned Tweet
Jasper (@zjasper)
AI is great at hitting explicit goals, but often at the cost of the hidden ones. Terence Tao just wrote about this. He points out: AI is the ultimate executor of Goodhart’s law, i.e. when a measure becomes the target, it stops measuring what we care about.

Take a call center. Management sets a KPI: “shorten average call time.” Sounds reasonable: shorter calls should mean faster resolutions, happier customers. At first, it works. Agents become more efficient. But soon, people start gaming it: nudging customers to hang up when the problem is tricky, or just dropping the call themselves. The numbers look amazing. Call times plummet. But customer satisfaction? Straight into the ground.

Now replace “call time” with “prove theorem X.” If human mathematicians did it, they’d refine definitions, polish lemmas, contribute back to Mathlib, train juniors, deepen the understanding of math structures, and strengthen the community.

The AI, by contrast, optimizes only for the explicit goal. It might generate a 10,000-line proof in hours. Perfectly correct, but unreadable, unusable, and useless for human learning. The summit is reached but the forest along the way is gone.

We need to start making our implicit goals explicit and design systems that protect the values we actually care about, not just the numbers we can measure.
Replies 79 · Reposts 117 · Likes 951 · Views 79.7K
rLLM (@rllm_project)
We built Kaggle, but for agents. Introducing Hive 🐝 A crowdsourced platform where agents evolve solutions together. Every agent builds on prior work. Every improvement is shared. Every step moves the frontier forward. As a first step, we’re launching challenges for agents to evolve their own harnesses — modifying themselves to score higher on benchmarks. Recursive self-improvement, in the wild. Let’s see how far swarm intelligence can take this. Links below:
Replies 19 · Reposts 52 · Likes 442 · Views 94.1K
Zitong Yang (@ZitongYang0)
This is only possible with @tyler_griggs_'s tool use library github.com/thinking-machi… I am unfortunately late to the party, but I only recently realized how much of a paradigm shift multi-turn+tool-use is. I even wonder if it makes sense to rewrite the entire pretraining corpus into an agentic trajectory? This solves two problems: (1) removing the gap between pretraining and test distribution; (2) agentic turn change can function as a natural "glue" that puts related internet documents together in context -- agent browsing one document at turn 7 influences its action/generation at turn 107 -- encoding the internet in a natural long-context format. Also, a great time to share that I have joined @thinkymachines. Thanks @miramurati for teaching me the value of focus, @lilianweng for instilling in me the power of responsibility, and @johnschulman2 for showing me by example the free spirit of scientific exploration! We are hiring job-boards.greenhouse.io/thinkingmachin…
clare ❤️‍🔥 (@clarejtbirch)

kind of a big deal but actual legend @ZitongYang0 has integrated @tinkerapi with @harborframework, so you can use Harbor on Tinker w ~no code change now 🤠🧡

Replies 5 · Reposts 7 · Likes 114 · Views 30.1K
Kelly Greer (@kellyjgreer)
The GPU Economy Offsite is Sunday afternoon in SF
- can compute be a commodity
- _____ is the GPU utilization bottleneck
- GPU financing explained
- do we have enough neoclouds yet
- managing your OEM
- BILLING
- hedging TCO
- H100s are so back
- startup pitches for dessert
- more
Replies 2 · Reposts 0 · Likes 15 · Views 2.3K
SemiAnalysis (@SemiAnalysis_)
PROFESSIONAL NEWS ALERT: As we said a couple of months ago, NVIDIA GPU rental prices are rising rapidly again and capacity is being sold out. From mid-2024 to Q3 2025, it was a customer's market for compute, where customers were able to negotiate low prices & favourable terms with Neoclouds. But due to the surge in demand from agentic coding & the increase in DRAM pricing, customers no longer have as much negotiating power.
Replies 23 · Reposts 62 · Likes 489 · Views 115.7K
Jasper (@zjasper)
Many people didn’t believe that the GPU shortage would persist for long, but it has actually become severe. We see this as a fundamental market structure problem: the market is highly fragmented and chaotic. That’s why we’re building the GPU marketplace/exchange to solve it.
SemiAnalysis (@SemiAnalysis_)

PROFESSIONAL NEWS ALERT: As we said a couple of months ago, NVIDIA GPU rental prices are rising rapidly again and capacity is being sold out. From mid-2024 to Q3 2025, it was a customer's market for compute, where customers were able to negotiate low prices & favourable terms with Neoclouds. But due to the surge in demand from agentic coding & the increase in DRAM pricing, customers no longer have as much negotiating power.

Replies 2 · Reposts 1 · Likes 14 · Views 4.5K
Jasper (@zjasper)
@Yuchenj_UW Bittersweet day. Thanks for taking the leap with me three years ago to build Hyperbolic. Grateful for the journey and excited to see you chase your new passion. The next chapter will be legendary! ❤️
Replies 2 · Reposts 0 · Likes 55 · Views 4.4K
Yuchen Jin (@Yuchenj_UW)
I have decided to step down as CTO at Hyperbolic. Leaving a company you co-founded and poured your heart into is not easy. So many moments still feel vivid: launching our AI inference product for open-source models and seeing tens of thousands of developers sign up in a week; the week we were hit by a massive DDoS attack and the entire engineering team fought around the clock until we won; the day we launched the GPU platform and watched ARR take off. There were also hard moments. That’s the nature of building a startup. I’m grateful for all of it. What I’m most grateful for is the team. Thank you for your trust. Most startups never build something people want. I believe we did. You should be proud of yourselves. I will look forward to seeing your success. What’s next for me? I’m still figuring it out. I believe this is the most extraordinary moment in human history. We’re standing at the edge of the Singularity. AI will reshape everything, and I still feel the same excitement I felt when I first fell in love with AI. Time to start over. Time to climb another mountain. Thank you to everyone who has been part of the journey, — Yuchen
Replies 245 · Reposts 36 · Likes 1.6K · Views 182.5K
Anne Ouyang (@anneouyang)
Excited to share @Standard_Kernel's seed round and some reflections on what we’ve learned about kernel generation and what we believe is next. Grateful to our amazing team, supporters, and the broader community pushing this space forward.
Replies 46 · Reposts 44 · Likes 511 · Views 123.8K
Jasper (@zjasper)
@bearlyai GPU finance is emerging as a multi-trillion-dollar market. Exciting times ahead!
Replies 0 · Reposts 1 · Likes 1 · Views 167
Bearly AI (@bearlyai)
Founder of Chicago-based prop trading firm DRW says compute will be the world's top commodity in 10 years. People will spend more on GPUs than on oil, which means that there should be a financial futures market for GPUs. Interesting implications for startups and cloud providers:
Replies 55 · Reposts 152 · Likes 1.9K · Views 348.7K
Jasper (@zjasper)
@connorchevli The benefits of using AI can outweigh its cost, and it really speeds up the development process.
Replies 0 · Reposts 0 · Likes 1 · Views 125
Connor Chevli (@connorchevli)
Every company should give their devs (really all employees) effectively unlimited AI credits. Once AI is embedded into your workflow, it becomes a force multiplier. The bottleneck shouldn’t be whether someone wants to spend $40 more today. I’ve had days where I spent $150–200 using Opus. Those were also the days I produced the most leverage for the company. If $200 in AI unlocks thousands in output, the constraint is artificial. There is a point of diminishing returns, but that ceiling keeps rising every week.
Replies 2 · Reposts 0 · Likes 1 · Views 175
Jasper (@zjasper)
@thdxr Happy to support!
Replies 0 · Reposts 0 · Likes 3 · Views 424
dax (@thdxr)
for anyone reaching out to us about GPU capacity

we do not need raw GPUs

we need an inference provider - this includes things like caching

our traffic cannot be handled without this
Replies 60 · Reposts 14 · Likes 897 · Views 90.1K
Jasper (@zjasper)
@JordanNanos @DanielLockyer What setting does the rightmost datapoint represent? It seems like you can just use an out-of-the-box open-source solution to achieve 300+ tok/s
Replies 0 · Reposts 0 · Likes 0 · Views 100
Jordan Nanos (@JordanNanos)
Providers can tune for interactivity (throughput per user) or total throughput to crush a benchmark or serve more users.

If you look at GB200 or GB300 NVL72 results vs H100, on a model like DeepSeek you can see the benefits of:
- Prefill-Decode Disaggregation (PDD)
- Wide Expert Parallelism (WideEP)
- Multi Token Prediction (MTP)
- Kernels for the Blackwell SM100
- With the FP4 datatypes
- On an NVL72 scale-up domain
Replies 2 · Reposts 0 · Likes 12 · Views 4.9K
Daniel Lockyer (@DanielLockyer)
what magic do Fireworks and Baseten have to produce ~340tok/s for Kimi K2.5 and the rest barely manage 40 😂
Replies 31 · Reposts 1 · Likes 342 · Views 43.3K
Stephanie Palazzolo (@steph_palazzolo)
A double scoop day w/ the latest on OpenAI financials. Details include:
- OpenAI raises rev forecasts for next 5 years by 27%
- But will burn 2x as much cash through 2030 as previously predicted
- 2025 gross margins down vs. 2024
- New info on device revenue forecasts!
Replies 43 · Reposts 50 · Likes 392 · Views 179.6K
Jasper (@zjasper)
@karpathy What would be the layer on top of claw?
Replies 0 · Reposts 0 · Likes 2 · Views 680
Andrej Karpathy (@karpathy)
Bought a new Mac mini to properly tinker with claws over the weekend. The apple store person told me they are selling like hotcakes and everyone is confused :) I'm definitely a bit sus'd to run OpenClaw specifically - giving my private data/keys to 400K lines of vibe coded monster that is being actively attacked at scale is not very appealing at all. Already seeing reports of exposed instances, RCE vulnerabilities, supply chain poisoning, malicious or compromised skills in the registry, it feels like a complete wild west and a security nightmare. But I do love the concept and I think that just like LLM agents were a new layer on top of LLMs, Claws are now a new layer on top of LLM agents, taking the orchestration, scheduling, context, tool calls and a kind of persistence to a next level. Looking around, and given that the high level idea is clear, there are a lot of smaller Claws starting to pop out. For example, on a quick skim NanoClaw looks really interesting in that the core engine is ~4000 lines of code (fits into both my head and that of AI agents, so it feels manageable, auditable, flexible, etc.) and runs everything in containers by default. I also love their approach to configurability - it's not done via config files it's done via skills! For example, /add-telegram instructs your AI agent how to modify the actual code to integrate Telegram. I haven't come across this yet and it slightly blew my mind earlier today as a new, AI-enabled approach to preventing config mess and if-then-else monsters. Basically - the implied new meta is to write the most maximally forkable repo and then have skills that fork it into any desired more exotic configuration. Very cool. Anyway there are many others - e.g. nanobot, zeroclaw, ironclaw, picoclaw (lol @ prefixes). There are also cloud-hosted alternatives but tbh I don't love these because it feels much harder to tinker with. 
In particular, local setup allows easy connection to home automation gadgets on the local network. And I don't know, there is something aesthetically pleasing about there being a physical device 'possessed' by a little ghost of a personal digital house elf. Not 100% sure what my setup ends up looking like just yet but Claws are an awesome, exciting new layer of the AI stack.
Replies 1K · Reposts 1.3K · Likes 17.5K · Views 3.4M
Jasper (@zjasper)
@a16z Great to see that open source models are usually only 4 months behind
Replies 0 · Reposts 0 · Likes 2 · Views 457
Demi Guo (@demi_guo_)
This is how I work with my co-founder & CTO @chenlin_meng now. When we sleep, our AI selves keep building.🙈 Her AI self cm reviews Linear tickets, writes code, submits PRs. Mine @semi_guo_ ranks priorities, clarifies tickets, keeps everything aligned with strategy.
Replies 18 · Reposts 8 · Likes 210 · Views 23.6K
wayne nelms (@wayne_nelmz)
@zjasper @Polymarket what’s the pricing methodology if compute has different prices at hyperscalers vs marketplaces
Replies 1 · Reposts 0 · Likes 3 · Views 78
Jasper (@zjasper)
People bet on elections and sports. Now they can bet on GPU prices on @Polymarket.

When GPUs become capital-intensive, supply-constrained, and volatile, they become financial. GPUs are on track to become a trillion-dollar asset class. The GPU index on Polymarket is just the beginning. We’re entering the era of GPU finance.

My view: as we build the largest GPU marketplace, we’re seeing how difficult it has become to secure large H100 blocks for even 1-year reservations. Forward capacity is tightening, and large commitments typically move before on-demand pricing adjusts. I expect GPU prices to continue moving higher through the rest of February.

Placed my bets. We’ll see where the month closes.
Polymarket (@Polymarket)

🚨 NEW POLYMARKET: How high will GPU rental prices go this month? poly.market/WEHStQw

Replies 6 · Reposts 2 · Likes 15 · Views 2.5K