Evi

5.6K posts

Evi

@geteviapp

San Francisco, USA Katılım Şubat 2025

1.1K Takip Edilen433 Takipçiler

Evi@geteviapp·52m

@willdepue @FeltSteam @nickcammarata - Scaling personal sleep (auto RL per user) - Solving spatial reasoning and consistency (aka TikZ problem or “intelligent canvas”)

English

will depue@willdepue·3h

@FeltSteam @nickcammarata i think it’s clear there should be one to two more of these left, but it’s really hard to predict when. i think you’ll most likely get another one in the next three years for sure, and a good chance by 2028.

English

346

Nick@nickcammarata·3h

if openai had somehow kept strawberry secret would this lightcone belong to them instead of claude

English

153

14.4K

Evi@geteviapp·7h

@ColinGardiner Why do you need to talk at all?

English

Colin Gardiner@ColinGardiner·1d

As an early-stage investor, I don’t want to talk to your CFO co-founder about the raise. I want to hear from the CEO. If you’re an early-stage CEO, don’t outsource your fundraise.

English

233

17K

Evi@geteviapp·7h

@Dorialexander They latency is very high, so batch size is large.

English

154

Alexander Doria@Dorialexander·7h

Bonjour BFM. Vous avez personne pour lire les model reports ? Réponse en p. 9.

BFM Business@bfmbusiness

IA : Deepseek baisse ses prix de 75 % "C'est une stratégie de dumping : il distribue massivement, quasi gratuitement, leurs modèles, et à côté ils font en sorte que tout le monde aille sur Alibaba car le token est très bas" 💬@mikiane 🎙️@simottel

Français

Evi@geteviapp·8h

@LyalinDotCom Isn’t it obvious that Spark is much more compute dense while your M5 Mac is bandwidth winner, so you should really connect them with USB-3 and do prefill on Spark then transfer KV to M5 and do inference there. Ask Codex to do this (from scratch obviously).

English

Dmitry Lyalin@LyalinDotCom·9h

x.com/i/article/2058…

ZXX

Evi@geteviapp·8h

@GeorgeJeffersn lol, you should have made clear in your post that your app specializes on growth, so you’re obviously not sharing the full story.

English

George Jefferson@GeorgeJeffersn·16h

At YC we got the advice of posting on LinkedIn At first ngl I thought it was bullshit because I’ve never liked or engaged with the platform before But holy shit, Ive tried to post x5 a week and its mostly been slop like this, but it’s pure gold for driving inbound & sales

English

449

99.2K

Evi@geteviapp·9h

@burkov This was pre CoT.

English

2.1K

BURKOV@burkov·12h

Always remember that when an LLM prints the beginning of a text, it has no idea what the end will be. Therefore, when it says "The answer is yes, and this is why:" the text after "why" would most likely be a very elaborate lie combined with gaslighting in case "yes" was the wrong answer.

English

509

56.6K

Evi@geteviapp·9h

@zhaisf It is obviously not cubic. Matrix multiplication is obviously quadratic! Well, everyone knows (see Dwarkesh lecture is you decided to forget this) that data transfer is the main bottleneck. For matmul that’s quadratic, not cubic.

English

689

Shuangfei Zhai@zhaisf·12h

Maybe it’s time to address the cubic complexity of matrix multiplication.

English

260

52.2K

Evi@geteviapp·19h

@emollick that's a bug which has to be fixed :)

English

Ethan Mollick@emollick·1d

You can use AI to help with writing, but you need to actually do some writing to be helped with. "Write me a post about [topic], make it really good and interesting" is not going to cut it.

English

204

18.2K

Ethan Mollick@emollick·1d

As more people come to recognize the tells of AI, which mostly happens as you start to work with AI a lot, the scales are going to fall from their eyes and they are going to realize what some of us already see: how much of this site (and blog posts, articles, papers) are AI now.

English

139

112

1.5K

87.4K

Evi@geteviapp·21h

@sumjitg Cope for them missing out on coding capabilities.

English

Sumjit@sumjitg·1d

Just saw Demis Hassabis push back hard on all the recent Erdős problem hype. He basically said: sure, today’s AIs are knocking out some of these tough math problems, but that’s nowhere near real AGI. It doesn’t come close to the kind of raw creative invention someone like

NIK@ns123abc

🚨 Google DeepMind CEO Sir Demis Hassabis: “Today’s systems, are nowhere near [AGI]. Doesn’t matter how many Erdős problems you solve… I think it’s far, far from what a true invention or someone like a Ramanujan would have been able to do” it’s over for the Erdős hype

English

19.2K

Evi@geteviapp·21h

@MTSlive Lame! Folks you really need some better sourcing of “the situation”:)

English

MTS@MTSlive·1d

SITUATION DETECTED: Google DeepMind’s AI agent autonomously solved 9 of 353 open Erdos problems in mathematics, at a cost of a few hundred dollars per problem.

English

151

380

5.8K

2.3M

Evi@geteviapp·22h

@MWill2025 @sierracatalina Sam has nothing to do with what models do. He obviously gets funds and compute sorted. Genius who created recursive self improvement for GPT is someone else.

English

Throne Of Silence 🇺🇲@MWill2025·1d

@sierracatalina Correct, and I'm not a tech person. However, I do look at the leadership and hope they are ethical in their developments.

English

125

⚪️ sierra catalina@sierracatalina·1d

so… do we need to talk about how claude & gpt are FUNDAMENTALLY different!? because I keep seeing yall compare them & I am just unsure if no one understands the math or what.

English

6.7K

Evi@geteviapp·22h

@Nick_Davidov @nikitabier These are bots obviously. There is no downside for highly valuable employe s to stay in US while waiting for EB-1 and not have to go to likely hostile “home country”.

English

Nick Davidov@Nick_Davidov·1d

It has been this way for some time, but @nikitabier, the algo now fully embraces the vice and only promotes posts where shit hits the fan. I don't want to be a voice on capitalism or immigration and be hated, I want my useful posts with things that might be interesting for the tech community to get some attention too. I'm now closing my account to followers only and using "following" page for the feed. Because FYP is just ragebaiting and hate

English

1.5K

Evi@geteviapp·1d

@ProbyShandilya Is that because they resell Claude and GPT at below API cost?

English

219

Proby Shandilya@ProbyShandilya·1d

Cognition hit $445M run rate in their first 18 months of business. I don't think that's talked about enough.

English

9.2K

Evi@geteviapp·1d

@tunguz Totally! That whole LeCunn thing is a psyop

English

Bojan Tunguz@tunguz·1d

word model >>> world model

English

6.7K

Evi@geteviapp·1d

@michaelzixizhou @ycombinator @zfellows Ask ChatGPT to read the actual NDA and check legality. It is very useful to know that fresh grads (level 3 at Google, Meta, OpenAI etc) get 2M+/year TC offers (8-12M over 4y). That’s more than Ilya’s comp was 10 years ago at OpenAI (when it was non profit and data was public).

English

101

Michael Zhou@michaelzixizhou·1d

@geteviapp @ycombinator @zfellows NDA unfortunately i can’t even namedrop

English

142

Michael Zhou@michaelzixizhou·1d

I got into @ycombinator S26! The past 6 months have felt like 2 years: - Turned down 7-figure new grad offers at top companies - Bootstrapped until my bank account hit zero - Pivoted 5 times, and did door-to-door sales for a couple of months - Joined @zfellows and flew out to SF last month - Cancelled my return flight to Toronto - Had wonderful neighbours I'd just met crowd-fund my apartment deposit - Got the call from @aroraharshita33 6 hours after my YC interview ...all without telling my mom. Codag is the crystallization of everything I picked up across infrastructure, systems, and platform engineering. Our thesis is that within two years, agents will be the primary consumer of logs, and soon the primary producer of them. Today's observability stack and dashboards were built for humans to look at. Agents are a different reader entirely, and the infrastructure for understanding systems at scale has to be rebuilt for them. That is what we're building. We start with log compression: turning millions of lines into a compact, cited capsule an agent can actually read, so it stops drowning and starts debugging. I cannot believe I get to work on this full time - I wake up every day looking forward to building more. To everyone who made it real: @carterkev, @cory, @aroraharshita33, @amiklas, @nemild for taking the bet, the neighbours who backed a stranger with a laptop, and the countless friends that support me. Thank you. We just published an open source CLI and algorithm for log compression, fully free at codag.ai. If your agent debugs production, I'd love for you to try it, or just to talk. Got a lot more coming soon, and most of it will be open source. Now I should probably call my mom 😰

English

187

1.2K

52.9K

Evi@geteviapp·1d

@dharmesh Cope

English

dharmesh@dharmesh·1d

The harness matters more than the model. Models have gotten really good. Great reasoning, large context windows, better instruction following. But, what makes *use* of those capabilities is actually the harness. It's what provides tools, memory, skills and context to the model. ChatGPT is a harness. Claude Cowork is a harness. Without the harness, the model is just an engine with no car. You don't get anywhere.

English

450

52.5K

Evi@geteviapp·1d

@johnennis It is usually people who use Claude think (wrongly) that humans are somehow the key and model intelligence is not strictly superior to human intelligence every way. Claude is trained in sloppy way so its frontier is much more jagged.

English

John Ennis@johnennis·1d

There's really no substitute for actually doing things if you want to understand them Among other things this weekend, I've been using a number of AI tools to work on Erdos problem #3 on arithmetic progressions, it's been a worthwhile experience, and I've even proved a few new results (although still far from solving the main problem - it is very hard) I also learned a lot about additive combinatorics, a subfield that was born after I moved on from math to computational neuroscience, so that's been fun too However, my takeaway from all of this is that, while AI is going to be an awesome tool for professional mathematicians to work with, the idea that you are just going to turn the keys and the AI will "solve math" is horribly wrong It's just not going to go like that, and not because the models aren't smart enough The models are plenty smart and incredibly useful But human and machine intelligence are different, and I think you have to be in trenches to get a feeling for how these two types of intelligence are just not the same (And happy to share my note with anyone who wants to read it, just comment or DM, I put an appendix in there on my workflow for those who are interested)

English

4.7K

Evi@geteviapp·1d

@LLMJunky @mark_k Continue learning will be fully solved this year via: RL for very common tasks DB (files + embeddings/soft tokens) for knowledge Obviously it will start with personal DB, RL will be done later using data from DB.

English

am.will@LLMJunky·1d

@geteviapp @mark_k That is not continual learning.

English

Mark Kretschmann@mark_k·1d

So what happened to the Continual Learning breakthrough we were promised in 2026? AI labs have been fairly silent about it lately... Is Continual Learning still going to become a thing or will it remain a pipe dream for AI?

English

269

19.4K

Evi@geteviapp·1d

@_arohan_ Which model from Antropic is good and Jensen said they used moderate amount even for sloppy Mythos.

English

606

rohan anil@_arohan_·1d

This is very true. This is why the best models are from Anthropic earlier this year as they had most compute compared to everyone else.

Aidan Clark@_aidan_clark_

If you want to work on pretraining-for-AGI, join OpenAI, Google, Meta or the Anthropic/XAI/Cursor supergroup. The bitter truth of the widening compute gap is that all the problems which are actually on the critical path to AGI now demand that level of compute.

English

417

71.3K

Keşfet

@willdepue @FeltSteam @nickcammarata @ColinGardiner @Dorialexander @LyalinDotCom @GeorgeJeffersn @burkov