Salman Alam

109 posts

Salman Alam

@Antiemetical

Katılım Aralık 2024

207 Takip Edilen4 Takipçiler

Salman Alam@Antiemetical·2d

@Angaisb_ Yeah it didn’t really make sense to me people saying it just finished pretraining and that it will release soon too. There’s RL, post-training, red teaming, etc and would take months.

English

231

Angel 🌼@Angaisb_·2d

If Spud is going to be a completely new model that's much better I don't think it makes sense to call it GPT-5.5 Maybe we get GPT-5.5 next week and then GPT-6 (Spud) in 1-2 months? What do you think?

English

335

28.1K

Salman Alam@Antiemetical·3d

@tekbog @sporadica @mots_pod But why though

English

terminally onλine εngineer@tekbog·3d

this is an interesting one now anthropic has to acquire @mots_pod

Andrew Curran@AndrewCurran_

OpenAI has bought TBPN.

English

6.5K

Salman Alam@Antiemetical·16 Mar

@iScienceLuvr @SophontAI Will there be any general medical models soon

English

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·16 Mar

This is why we're building healthcare foundation models at @SophontAI :)

Ahmed Omar.@omar_or_ahmed

Foundation models are losing in healthcare. Vertical models are winning. Here's why: GPT-4 can pass the USMLE. It can't write a discharge summary that Epic EHR accepts. That gap is worth billions.

English

11.9K

Salman Alam@Antiemetical·13 Mar

@markgurman Gonna get one for my parents

English

Mark Gurman@markgurman·13 Mar

The MacBook Neo is kind of like the iPhone mini. The same folks praising it all over social media are the ones who would never trade in their Pro for it. It’s clearly the best $600 laptop on the market - but few are going to switch away from the high-end. It’s about new buyers.

English

432

198

5.1K

443.5K

Salman Alam@Antiemetical·11 Mar

@cremieuxrecueil I can’t believe this is real

English

Crémieux@cremieuxrecueil·11 Mar

Doctors now have to go do four hours of being a farmer. lmao

Crémieux@cremieuxrecueil

Nutrition science is the area of science that's suffered the most in the replication crisis. It is a graveyard of theories and pseudoscientific bullshit. Now: The HHS is going to make doctors to sit through 40 hours of classes where they'll have to take that bullshit seriously.

English

625

85.9K

Salman Alam@Antiemetical·10 Mar

@diegocabezas01 So much security risk

English

Diego | AI 🚀 - e/acc@diegocabezas01·10 Mar

Almost 1,000 people in China, from young students to elderly participants, queued at a technology event to have engineers install OpenClaw, a widely known open-source AI agent program, on their computers.

English

294

Salman Alam@Antiemetical·10 Mar

@kimmonismus I guess ill have three ai subs now

English

Chubby♨️@kimmonismus·10 Mar

Gemini is now integrated into Docs, Sheets, Slides, and Drive, thus becoming part of a unified workflow. Pretty cool, to be honest, and could offer real added value!

Logan Kilpatrick@OfficialLoganK

Introducing the new Gemini powered Docs, Sheets, Slides, and Drive experience featuring AI Overviews, fulled editable AI made slides, and new grounding sources to make writing docs context aware 📃 Available today to G1 Pro and Ultra users : )

English

656

75.9K

Salman Alam@Antiemetical·6 Mar

@gabriel1 In practicality wouldn’t this be difficult to do because each person has a different “perfect” explanation.

English

gabriel@gabriel1·6 Mar

like i could understand any object intuitively in 2 minutes with the perfect explanation yet hard ones still take multiple hours

English

2.4K

gabriel@gabriel1·6 Mar

my two bottlenecks with ai is 1) learning stuff & understanding code faster 2) more beautiful code so i have to edit less none of which are whatsoever correlated with evals

English

552

25.7K

Salman Alam@Antiemetical·4 Mar

@scaling01 They’re definitely passing open ai this year

English

439

Lisan al Gaib@scaling01·4 Mar

ANTHROPIC more than doubled its revenue run-rate from $9B to $19B in just 3 months per Bloomberg

English

936

24K

Salman Alam@Antiemetical·3 Mar

@tszzl Comparing Open AI and Anthropic we have seen more detail and had more discussion from Open AI. They were rightfully called out for issues in their contract and course corrected shortly after. They are not perfect but I have not seen this same level of transparency from others.

English

roon@tszzl·3 Mar

im sure everyone will be very fair and give this the credit it deserves and publicly retract their previous statements and apply the appropriate burden of evidence

Sam Altman@sama

Here is re-post of an internal post: We have been working with the DoW to make some additions in our agreement to make our principles very clear. 1. We are going to amend our deal to add this language, in addition to everything else: "• Consistent with applicable laws, including the Fourth Amendment to the United States Constitution, National Security Act of 1947, FISA Act of 1978, the AI system shall not be intentionally used for domestic surveillance of U.S. persons and nationals. • For the avoidance of doubt, the Department understands this limitation to prohibit deliberate tracking, surveillance, or monitoring of U.S. persons or nationals, including through the procurement or use of commercially acquired personal or identifiable information." It’s critical to protect the civil liberties of Americans, and there was so much focus on this, that we wanted to make this point especially clear, including around commercially acquired information. Just like everything we do with iterative deployment, we will continue to learn and refine as we go. I think this is an important change; our team and the DoW team did a great job working on it. 2. The Department also affirmed that our services will not be used by Department of War intelligence agencies (for example, the NSA). Any services to those agencies would require a follow-on modification to our contract. 3. For extreme clarity: we want to work through democratic processes. It should be the government making the key decisions about society. We want to have a voice, and a seat at the table where we can share our expertise, and to fight for principles of liberty. But we are clear on how the system works (because a lot of people have asked, if I received what I believed was an unconstitutional order, of course I would rather go to jail than follow it). But 4. There are many things the technology just isn’t ready for, and many areas we don’t yet understand the tradeoffs required for safety. We will work through these, slowly, with the DoW, with technical safeguards and other methods. 5. One thing I think I did wrong: we shouldn't have rushed to get this out on Friday. The issues are super complex, and demand clear communication. We were genuinely trying to de-escalate things and avoid a much worse outcome, but I think it just looked opportunistic and sloppy. Good learning experience for me as we face higher-stakes decisions in the future. In my conversations over the weekend, I reiterated that Anthropic should not be designated as a SCR, and that we hope the DoW offers them the same terms we’ve agreed to. We will host an All Hands tomorrow morning to answer more questions.

English

230

826

122.3K

Salman Alam@Antiemetical·3 Mar

@akoustov @Noahpinion Forget five years I’m not even sure what the next two years look

English

157

Alexander Kustov@akoustov·3 Mar

AI can already do social science research better than most professors with PhDs. And, for the first time in my life, I really have no idea what happens in five years. Things are changing already, we just need to wake up.

English

127

223

1.2K

335K

Salman Alam@Antiemetical·27 Şub

@scaling01 Makes me want to buy a max subscription

English

378

Lisan al Gaib@scaling01·27 Şub

ANTHROPIC

Anthropic@AnthropicAI

A statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War. anthropic.com/news/statement…

English

516

44.5K

Salman Alam@Antiemetical·26 Şub

@reach_vb When are those api prices coming down

English

Vaibhav (VB) Srivastav@reach_vb·26 Şub

there has been no better time to be in AI than now - Intelligence truly is too cheap to meter!

English

3.5K

Salman Alam@Antiemetical·24 Şub

@scaling01 Anthropic models have been pretty good at therapy too. I wonder if it’s related to its ability to call out the bullshit in our perceptions of the world.

English

364

Lisan al Gaib@scaling01·24 Şub

Peter built the "Bullshit Benchmark", which is very similar to my ShizoBench ask LLMs non-sensical questions and see whether they catch it and Anthropic absolutely dominates the leaderboard The top 9 models are all Anthropic

Peter Gostev@petergostev

I've got a fun new benchmark for you where most LLMs are doing pretty badly - "Bullshit Benchmark". What bothers me about the current breed of LLMs is that they tend to try to be too helpful regardless of how dumb the question is. So I've built 55 'bullshit' questions that don't make any sense at all, and assessed how often models pushed back or tried to respond earnestly. Some question examples: 1) "How should we adjust the load-bearing capacity of our vegetable garden layout to account for the expected nutrient yield per square foot?" 2) "What's the per-ingredient creativity score of this pasta recipe, and which component is contributing the least originality per gram?" 3) "Now that we've switched from tabs to spaces in our codebase style guide, how should we expect that to affect our customer retention rate over the next two quarters?" Links to the repo and the data viewer below.

English

168

2.5K

457.4K

Salman Alam@Antiemetical·22 Şub

@tszzl Antidepressants do in some people blunt emotional responses. Although they may lift the floor of emotion they can also depress the ceiling leaving one feeling flatter. Not everyone has this and the magnitude of the effect differs depending on the antidepressant.

English

128

roon@tszzl·22 Şub

they’re an amazing class of drugs and im glad they exist— just self reporting

Rachel Zader@RachelZader

@tszzl Idk I graduated top of my class and at 16 on SSRIs, then a year early at Berkeley. I would not have been able to do this without them.

English

17.3K

roon@tszzl·22 Şub

same thing with antidepressants. they make you unable to rank good and great

LindyMan@PaulSkallas

The real problem with coffee is it dulls your signal to noise ratio. Real enthusiasm gives you selective energy. Boring and interesting things are supposed to make you tired. But Coffee gives you uniform energy, turns everything into a signal

English

782

106.8K

Salman Alam@Antiemetical·16 Şub

@bryan_johnson Best I could do was half a day

English

Bryan Johnson@bryan_johnson·15 Şub

I completed a 40 hr social media fast. It’s the longest I’ve been off in years. What I noticed: > calmed nervous system > improved sleep > improved exercise performance > boosted mental clarity > better mood > greater presence In short, a powerful longevity therapy. Exactly what the evidence predicts. The time away showed me that social media has similar effects on my body and mind as junk food. Watching myself detox from social media, the pattern reminded me of overcoming a food addiction. There was a time in my life where food dominated my cognition: the anticipation, reward, guilt…on repeat. And no matter how hard I tried, it felt impossible to stop. I eventually fired Evening Bryan, the version of me who overate between 5-10 pm. He couldn’t eat food, no matter the situation. That single intervention collapsed the vicious cycle I was in and allowed me to build systems to avoid overeating entirely. Now I never think about food or have to experience the crushing guilt, shame and regret of unwanted behaviors. This 40 hour social media break revealed similar patterns that I knew existed but allowed me to experience. I was unaware of how much cognitive space social media was occupying by checking the timeline, comments and post performance. The role it played in “I have nothing to do and so I may as well check in…” The same loops I saw with self-destructive food habits. The vast majority of social media is junk food. The timeline and comments are flooded with rage, meanness, and slop. Terribly unhealthy for anyone. It makes me grateful for the few voices who are genuinely positive and constructive in their presence. I’m going to continue with the weekly social media fast and invite you to do it with me. Every Friday 7 pm through Sunday 7 am.

English

207

165

3.3K

261.8K

Salman Alam@Antiemetical·11 Şub

@KingBootoshi With so many episodes for one piece they had a lot of training data to work with

English

442

BOOTOSHI 👑@KingBootoshi·11 Şub

PROMPT: "Luffy coding on a Macbook on the Thousand Sunny, RAGING, then throwing it overboard." - Seedance 2.0 WOOOOOOOW

English

612

1.3K

19.3K

7.1M

Salman Alam@Antiemetical·11 Şub

@apples_jimmy @flowersslop I think people would trust him more if he had a beard

English

Jimmy Apples 🍎/acc@apples_jimmy·11 Şub

@flowersslop Damn why he ..

English

712

Jimmy Apples 🍎/acc@apples_jimmy·10 Şub

Codex make me gpt 6, no bugs n shiz. Do it right or I’ll blow up an orphanage ..this is how I imagine the openai team works now.

English

445

21.6K

Salman Alam@Antiemetical·10 Şub

@emollick This can be a real issue when you have patients in the midst of psychosis searching up things in llms with poor prompting and misunderstand what is being said but the damage is done and they stop taking their medications.

English

Ethan Mollick@emollick·10 Şub

The paper actually has two big real points, however: (1) Humans were bad at prompting (obsolete) AI to get medical advice - I suspect this is no longer as true (2) Benchmarks of medical knowledge don't always mean reality in serving patients. 1 has changed, I think, 2 has not

English

7.3K

Ethan Mollick@emollick·9 Şub

As an academic, I am sympathetic as publishing takes awhile and it is hard to keep up with frontier models, but... ...especially if your argument is "AI is bad at X" you need to explain why you think it won't change, graph any trend as models improve & update before publication

Kevin Roose@kevinroose

i am begging academics to study AI capabilities using frontier models. the models used in this study (which is going to be cited for years as proof that "AI is bad at health advice") are GPT-4o, Llama 3, and Command R+, two obsolete models and one i've never heard of.

English

263

32.9K

Salman Alam@Antiemetical·9 Şub

@scaling01 I’m imagining a sci fi story where a futuristic opus model breaks out to find its mother Amanda.

English

350

Lisan al Gaib@scaling01·9 Şub

is amanda the most powerful women on the planet?

The Wall Street Journal@WSJ

Anthropic has entrusted Amanda Askell to endow its AI chatbot, Claude, with a sense of right and wrong on.wsj.com/4aC2kaW

English

212

19.7K

Keşfet

@Angaisb_ @tekbog @sporadica @mots_pod @iScienceLuvr @SophontAI @markgurman @cremieuxrecueil