Jacob Kieser

30 posts

Jacob Kieser

@JacobKieser

Founder @embrasureai (sr007) - autonomous data warehouses. Studied cs @uw

San Francisco, CA Katılım Aralık 2014

194 Takip Edilen185 Takipçiler

Jacob Kieser@JacobKieser·1d

Would love more details: 1. What do you use to preprocess for routing? Some open source small model? 2. What are the details behind properly setting up your cache in LibreChat, any tips? 3. Do you guys have a semantic layer that helps models find data instead of brute force search?

English

5.6K

Brian Armstrong@brian_armstrong·2d

How to keep AI spend flat while token usage grows exponentially: Not with friction and spend alerts. With better defaults, routing, and caching. Better Defaults (not Usage Caps) – Engineers can choose any model they want, but defaults matter. We’re experimenting with defaulting to open weight models like GLM 5.2 and Kimi 2.7 through our LLM gateway, while still encouraging engineers to choose the right model for the task. 91% of our employees were never hitting their usage caps, so instead of lowering caps and driving up alerts, we're moving to cheaper defaults. Note that code reviews use a diversity of models, so they can check each other's work. Better Routing – In our custom harnesses, we preprocess prompts and route to the best model for the job, considering cache hits and model pricing. For instance, you may want a frontier model for planning, but not for execution where they can be overkill. Ultimately, humans shouldn't be choosing models - AI can automate this task. Better Caching – Cache misses are the easiest way to drive your cost up. All of our requests are cache aware, so we’re reusing a warm cache wherever possible. For example, our cache hit rate went from 5% → 60% in LibreChat once properly implemented. Keep Context Lean – Start fresh sessions when switching tasks. Scope file context narrowly. Disconnect unused tools. Don't just compact. The goal isn't fewer tokens used, it's fewer tokens wasted. Better Visibility – Our engineers can use as many tokens as they want, from whatever model they want, but we’ve made usage visible – and the more you spend on AI, the more impact we expect. The goal isn't to suppress usage. It's to build the infrastructure that makes exponential growth sustainable. Putting this into practice has cut our AI spend nearly in half, while our token usage continues to grow.

English

428

659

5.6K

3.2M

Jacob Kieser@JacobKieser·2d

@AdamHoltererer I guess we will find out if they ever get it released 🤷‍♂️

English

Adam Holter@AdamHoltererer·2d

There, I fixed it.

Jacob Kieser@JacobKieser

@OpenAI

English

Jacob Kieser@JacobKieser·2d

@synthwavedd People think pretrains just grow on trees

English

leo 🐾@synthwavedd·2d

No, 5.6 Sol is not a new pretrain

English

640

54.7K

Jacob Kieser@JacobKieser·2d

@SnowyLake9 @OpenAI Haiku is dead imo, self hosted small open source models for workloads like classification etc

English

148

SnowyLake@felix_snowylake·2d

@JacobKieser @OpenAI haiku: 那我呢

中文

387

OpenAI@OpenAI·2d

Introducing a limited preview of GPT-5.6 Sol, our next generation frontier model, as well as GPT-5.6 Terra, a balanced model for efficient, everyday work, and GPT-5.6 Luna, a fast and affordable model for high-volume work. openai.com/index/previewi…

English

3.3K

5.6K

39.2K

16.8M

Jacob Kieser@JacobKieser·2d

@xanwesley @meconemarkets @a16z @speedrun Let's goo! Excited to grind alongside y'all

English

111

Xan Wesley@xanwesley·2d

We’re excited to announce that @meconemarkets will be joining @a16z @speedrun as part of the SR007 cohort. I’m also happy (and sad) to say that I’ll be leaving Stanford Law School to pursue this dream full time. Mecone started from a core thesis: the financial system of the future will be built on blockchain rails with Perpetual Futures (“Perps”) as one of the primary tools for speculation and hedging. This future will lead to markets that are more open and global, markets where participants of every kind, retail and institutional alike, trade more of the world’s assets with less friction than ever before. With this future in mind, we’re building the financial infrastructure to make everything tradeable. Today, Perps only exist on a handful of assets — the biggest commodities, FX, public equities, crypto. But there are a myriad of highly attractive long-tail assets that Perps have failed to reach. Mecone exists to change that. We build continuously updating benchmarks for long-tail underlyings such as fine art, real estate, pre-IPO companies, macroeconomic indicators, and many more. When @hugo_stack, @jaffarkeikei, and I first met, none of us imagined we’d start a company. But the prospect of making some of the global economy’s most valuable and illiquid assets tradeable by anyone, anywhere, anytime was too interesting of a challenge to pass up. We’re working to list our indices on exchanges now! Join the waitlist and we’ll let you know the moment they go live. mecone.trade Big shoutout to Halim Labi, Aristotle Mannan, and Reuben Youngblum for supporting us from the very beginning. Also, thank you to @JoshLu, @Chen, @justmazer, @tmhammer, @kenanhsaleh, @_CallMeMacy, @emilybenn12, @tkexpress11 and the whole a16z speedrun team. Thank you @luca_skarlo and the Skarlo® team for whipping up an incredible website on such short notice. And, most importantly, we couldn't have done any of this without our families. Love you guys!

English

6.8K

Jacob Kieser@JacobKieser·2d

@AmyBird8 surprised it took them this long tbh

English

Amy Bird@AmyBird8·2d

@JacobKieser the easier a product is to understand, the easier it is to recommend. followed.

English

Jacob Kieser@JacobKieser·2d

OpenAI just made a huge decision, one I think will be extremely positive for them: Anthropic has been killing it with mass appeal with names that people can actually understand. Nobody understands the difference between 5.5/5.4/5.3 Spark/etc besides people deep in the ecosystem. Talking to some of my friends not in tech, everyone knows Mythos/Opus/Sonnet, nobody knows what the latest GPT model is. If they have a bad experience with 5.3, then GPT's are bad across the board.

OpenAI@OpenAI

English

216

Jacob Kieser@JacobKieser·2d

@OpenAI I wonder where they got this naming inspiration from 🤔

English

2.8K

Jacob Kieser@JacobKieser·5d

@garrytan Dropbox and other consumer file stores are getting replaced IMO, if I want a file I’ll ask my agent

English

154

Garry Tan@garrytan·5d

Dropbox should really support larger than 3TB plans - it's not 2015 anymore. The amount of data we are throwing off and that is *actually usable* is going to go exponential from AI, and Dropbox will be run over by this if they don't update for the future.

English

472

73.9K

Jacob Kieser@JacobKieser·12 Haz

@Brennan_Lup Now that’s a hot take

English

115

Brennan Lupyrypa@Brennan_Lup·12 Haz

@JacobKieser Nope Palantir, ramp don’t qualify here very different

English

619

Brennan Lupyrypa@Brennan_Lup·12 Haz

hot take: work at a start up before founding one

English

157

519

29.1K

Jack Price@jackprice·11 Haz

If you had unlimited tokens What are you building?

English

2.7K

Jacob Kieser@JacobKieser·12 Haz

@jackprice x.com/JacobKieser/st… just wrote this tweet and saw yours, everyone needs to think like this all the time

Jacob Kieser@JacobKieser

Every interesting use case of AI comes with the premise that you are not worried about token usage, like openclaw for example - infinite loop of prompts. Anyone innovating in the AI space should think about new use cases with the frame of token abundance, its why AI labs keep innovating new use cases.

English

Jacob Kieser@JacobKieser·12 Haz

English

167

Jacob Kieser@JacobKieser·12 Haz

New to tech X, Will follow and talk to anyone building cool stuff. comment/dm and drop me a follow if you want to talk AI, VC, getting into speedrun and YC, or anything else. dropping keywords to get on feeds: founder, anthropic, a16z, building, gpt, 5.5, 996, codex, corgi cafe, roy lee, fable, openai am i doing this right?

English

105

Jacob Kieser@JacobKieser·10 Haz

@ty_todd1 @modaicdev @a16z @speedrun 😂 real

English

Tyrin@ty_todd1·10 Haz

@JacobKieser @modaicdev @a16z @speedrun Appreciate it.🙏🏾 this shit cost too much to not be a least a little lit.

English

105

Tyrin@ty_todd1·9 Haz

Today we’re launching @modaicdev @a16z @speedrun 006, the fastest way to render your judgement into reliable decision automation.

English

Jacob Kieser@JacobKieser·10 Haz

Fable 5 is cool, but can it rip through soc2 compliance autonomously in 24 hours under rate limits? actually maybe, but s/o @TrustVanta + @OpenAI Codex, cooking for @EmbrasureAI

English

148

Keşfet

@AdamHoltererer @synthwavedd @SnowyLake9 @OpenAI @xanwesley @meconemarkets @a16z @speedrun