Rafe

755 posts

Rafe

@RafeAphd

A.I. Reseracher. Prev- @join_ef || @OfficialUoM || @UniOfYork || @imperialcollege In a past life I pursued kick-boxing. 6"4. 220 lbs. i try i try

London شامل ہوئے Mayıs 2024

330 فالونگ79 فالوورز

پن کیا گیا ٹویٹ

Rafe@RafeAphd·6d

Solo dev. Starting this challenge from scratch. Goal: 100k MRR in 12 months building micro-SaaS products with AI. No team. No VC. Just shipping and seeing what sticks. First product is live now. Will be scraped in 1 month. Posting everything, the wins, the flops, the numbers.

English

374

Rafe@RafeAphd·4h

@shl end state is forward deployed customer support

English

195

Sahil Lavingia@shl·5h

Engineering is eating customer support

English

10.8K

Rafe@RafeAphd·4h

@anthdm this is common knowledge and i think is papers too

English

Anthony GG@anthdm·19h

GLM 5.2 is 100% distilled from Opus. Change my mind.

English

107

534

79.9K

Rafe@RafeAphd·5h

@Teknium did you just do the sakana thing

English

449

Teknium 🪽@Teknium·6h

Introducing Mixture of Agents 2.0 in Hermes Agent. Combine any provider's models into a mixture of your own. Access your presets as if it were a normal model in Hermes. Big improvement in our soon-to-release HermesBench against opus and gpt-5.5 with MoA using Opus & GPT together.

Nous Research@NousResearch

The strongest models are gated and access is granted only to a select few. Hermes Agent now exposes MoA presets as virtual models, giving you capabilities beyond the publicly available frontier: 8% higher than Opus 4.8 and 11% higher than GPT 5.5 on our upcoming benchmark.

English

117

1.4K

208.6K

Rafe@RafeAphd·7h

@scaling01 I keep getting PTSD to crypto tokens

English

Lisan al Gaib@scaling01·9h

GPT-5.6 Pricing: - Sol: $5 / $30 - Terra: $2.5 / $15 - Luna: $1 / $6 OpenAI will also launch GPT-5.6 on Cerebras at up to 750 tokens/s in July

Lisan al Gaib@scaling01

OpenAI released the official GPT-5.6 Preview Blog: openai.com/index/previewi…

English

507

42.2K

Rafe@RafeAphd·7h

@louis030195 in wrapping of satire exists tiny pockets of truth

English

louis030195 | screenpipe (YC S26)@louis030195·8h

Good artists copy, great artists steal

English

157

Rafe@RafeAphd·8h

gpt 5.6 will run at 750 toks through cerebras if you can get access

English

Rafe@RafeAphd·8h

@bridgemindai omg, this is kinda wild if true

English

378

BridgeMind@bridgemindai·10h

GPT 5.6 Sol will run on Cerebras at 750 tokens per second.

English

385

18.1K

Rafe@RafeAphd·8h

@zerohedge will hardware get commoditised that quickly are you sure about that ?

English

zerohedge@zerohedge·9h

Can we fast forward 2 years when everyone will have a local model running on a 10TB DDR8 rig (which will cost $29.95)

English

162

207

4.8K

265.4K

Rafe@RafeAphd·8h

@beffjezos yeah will have to wait so long for the release tho

English

Beff (e/acc)@beffjezos·9h

OpenAI has created the Sun god

Greg Brockman@gdb

GPT-5.6 Sol preview — it's a good model:

English

352

17.7K

Rafe@RafeAphd·10h

@mxfp4 congrats dude you climbing the ranks!

English

Kevin Wang@mxfp4·10h

aaaand its flagged on hackernews, anyone delt with this before?

English

132

Rafe@RafeAphd·10h

@JiweiLi1 amazing work, testing now will be back with a review

English

Jiwei Li@JiweiLi1·1d

Excited to share Ornith, our latest family of open-source models specialized for agentic coding. Ornith achieves SOTA performance among open-source models of comparable size on a variety of coding benchmarks (Terminal-Bench 2.1, SWE, NL2Repo, OpenClaw, SWE Atlas, etc) Feedback is deeply appreciated! 📖Tech Blog: deep-reinforce.com/ornith_1_0.html 🤗Huggingface: huggingface.co/collections/de…

Ornith@ornith_

Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including: ✅Terminal-Bench 2.1(77.5) ✅SWE-Bench(82.4 on verified, 62.2 on pro, 78.9 on Multilingual) ✅NL2Repo(48.2) ✅SWE Atlas(41.2 on QnA, 42.6 RF, 39.1 TW) ✅ClawEval(77.1) Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generate higher-quality solutions in agentic coding.😎 All models are released under the MIT license, enabling full commercial and research use. 📖Tech Blog: deep-reinforce.com/ornith_1_0.html 🤗Huggingface: huggingface.co/collections/de…

English

538

41.6K

Rafe@RafeAphd·10h

@vineerpasam buying up everyone favourite apps, if its isnt broken dont fix it

English

Vineer@vineerpasam·12h

Where is he now?

English

349

Rafe@RafeAphd·10h

@SStricklandMMA sean this is serious character devleopment i would have never seen coming

English

147

Sean Strickland@SStricklandMMA·12h

Everyday I wake up and want a drink or drugs Some days I feel so disconnected, bored and empty and the thought of a beer or a little weed just to ground me, make me present is so justifiable in my mind.... But I know where that road leads. Just say no and endure.

English

1.7K

1.2K

28.6K

1.2M

Rafe@RafeAphd·10h

@thoughtcrime___ i think pricing any lower currently wouldn't line up with unit economics

English

463

thoughtcrime@thoughtcrime___·21h

if you were wondering how out of touch jason is with the average american, just look at the question he asked and the fact that the lowest option is $50/hour

@jason@Jason

What price would you pay for starlink per hour on a flight? (Just did this at dinner — answers for VCs was interesting) [ Will reveal dinner conversation in comments in a couple hours ]

English

207

578.9K

Rafe@RafeAphd·10h

@mxfp4 its made for a very specific audience

English

Kevin Wang@mxfp4·11h

reminder that the hackernews login page looks like this

English

915

Rafe@RafeAphd·10h

@juhapellotsalo thanks dude! still working on improving the harness

English

Juha Pellotsalo@juhapellotsalo·11h

@RafeAphd BrakeDrive looks great, seriously

English

Rafe@RafeAphd·6d

English

374

Rafe@RafeAphd·11h

@chaosengineerr yes for dopamine reasons i cant get into haha

English

Wahab Khan@chaosengineerr·12h

let's be honest, would you still build if it paid nothing for a year?

English

1.3K

Rafe@RafeAphd·12h

- 9am flight out of LDN heathrow - Leave house 20h before flight leaves - 120 minute train to airport - TSA agent is impartial - Ads for government - No food options besides mushy peas and bacon and eggs - 120 min walk to gate - Get downgraded to seat next to bathroom - Depart 4 hours late

Eli Mernit@mernit

7am flight out of SFO - Leave house 1h before flight leaves - 13 min Uber to airport, views of the Bay - TSA agent smiles asks if you like the Grateful Dead - Ads for AI agents that cure cancer - Multiple food options spanning global cuisines - 2 min walk to gate - Get upgraded to business class - Depart 5 min early 7am flight out of JFK - Leave house 3h before flight leaves - 73 minute Uber to airport, bumper to bumper traffic - TSA agent hates you - Ads for underwear - No food options besides Jamba Juice and hardboiled eggs - 17 min walk to gate - Get downgraded to seat next to bathroom - Depart 2 hours late

English

Rafe@RafeAphd·12h

@saloniiio its all review and evals

English

Saloni@saloniiio·16h

In 5 years, what will be more valuable? Writing code or Reviewing AI-generated code.

English

3.2K

Rafe@RafeAphd·12h

currently thinking of hiring our first engineer, is it too early

English

دریافت کریں

@shl @anthdm @Teknium @scaling01 @louis030195 @bridgemindai @zerohedge @beffjezos