djsh

4.5K posts

@djays

I build AI stacks and models, currently at Groq. (Past: LLMs, Autonomous Checkout, Medical Imaging).

SF Bay Area, CA · Joined April 2008

731 Following · 316 Followers

djsh@djays·20 Apr

@patrickc What's your stack and model of choice? I find most models aren't grounded in the nuance of genetics and can over-index on the impact of a single gene.

277

Patrick Collison@patrickc·17 Apr

I'm lucky enough to have a great doctor and access to excellent Bay Area medical care. I've taken lots of standard screening tests over the years and have tried lots of "health tech" devices and tools. With all this said, by far the most useful preventative medical advice that I've ever received has come from unleashing coding agents on my genome, having them investigate my specific mutations, and having them recommend specific follow-on tests and treatments.

Population averages are population averages, but we ourselves are not averages. For example, it turns out that I probably have a 30x(!) higher-than-average predisposition to melanoma. Fortunately, there are both specific supplements that help counteract the particular mutations I have, and of course I can significantly dial up my screening frequency. So, this is very useful to know. I don't know exactly how much the analysis cost, but probably less than $100. Sequencing my genome cost a few hundred dollars.

(One often sees papers and articles claiming that models aren't very good at medical reasoning. These analyses are usually based on employing several-year-old models, which is a kind of ludicrous malpractice. It is true that you still have to carefully monitor the agents' reasoning, and they do on occasion jump to conclusions or skip steps, requiring some nudging and re-steering. But, overall, they are almost literally infinitely better for this kind of work than what one can otherwise obtain today.)

There are still lots of questions about how this will diffuse and get adopted, but it seems very clear that medical practice is about to improve enormously. Exciting times!

488

642

9.6K

4.1M

djsh@djays·11 Mar

@adelwu_ Would love to attend this

adel 🌟@adelwu_·10 Mar

i want to host an event for people confused about their career path. for the multi-hyphenates who don’t know what to pursue, bc there’s too many things. i left my last job not knowing what was next, only that i didn’t want to keep doing what i was doing. when i pivoted from eng to growth, the most eye opening conversations were those with people 2-3 years ahead of me who went through the same struggle. they could actually resonate with the existential stress i felt navigating the process. who would be interested? i’d love to put together a panel of people with amazing multi-faceted careers and took sabbaticals/breaks to figure things out!

Marc Randolph@marcrandolph

My path to entrepreneurial success was not linear, by any stretch of the imagination. I didn’t start working in tech until I was 32. I didn’t even move to California until I was 30. Before becoming an entrepreneur, I was:
- The worst realtor in the state of New York
- A gofer for the CEO of a sheet music company
- An aspiring brand manager for flea shampoo
Don’t be disillusioned if the path ahead isn’t clear. Relax. Find something that strikes your interest. And don’t be afraid to take a trail just because you can’t see the end.

236

46.4K

djsh reposted

Bo Wang@BoWang87·3 Mar

Prof. Donald Knuth opened his new paper with "Shock! Shock!" Claude Opus 4.6 had just solved an open problem he'd been working on for weeks — a graph decomposition conjecture from The Art of Computer Programming. He named the paper "Claude's Cycles." 31 explorations. ~1 hour. Knuth read the output, wrote the formal proof, and closed with: "It seems I'll have to revise my opinions about generative AI one of these days." The man who wrote the bible of computer science just said that. In a paper named after an AI. Paper: cs.stanford.edu/~knuth/papers/…

150

1.9K

9.1K

1.4M

djsh@djays·17 Oct

@IanAndrewsDC Looks like Austin!

Ian Andrews@IanAndrewsDC·17 Oct

Just landed. This is the first thing I see outside the terminal. Tell me which city.

1.4K

djsh@djays·14 Oct

@akankshanc @Cohere_Labs I had applied a few weeks back but never got a response. 😔

Akanksha@akankshanc·14 Oct

@Cohere_Labs is an open science community with many specialized subgroups to learn and conduct research on various ML/AI subfields, and you can join the community here: cohere.com/research/open-…

1.5K

Akanksha@akankshanc·14 Oct

After building some mathematical foundation for transformers, we are on to tackling the next foundational paper, Toy Models of Superposition at the ML understanding group of @Cohere_Labs. This paper explores how networks pack many features into fewer dimensions, forming beautiful geometric patterns that balance efficiency and interference. PS: I still find these geometric arrangements surreal even after reading this paper so many times! Join us Friday, 10/17!

326

26.7K

djsh@djays·12 Oct

Excited to get started on this! Also, the reader notes *chef's kiss*

djsh reposted

Groq Inc@GroqInc·26 Sep

It’s official: McLaren F1 x Groq Bringing inference speed at a winning cost to the grid and beyond. See you in Singapore. 🧡🏁

116

469

218.5K

djsh reposted

Groq Inc@GroqInc·4 Sep

📣Groq’s first agentic system is ready for production at scale. Already battle tested by 100K+ developers across 5M+ requests. Compound is now GA, available to everyone on GroqCloud. Go Build ⬇️

332

77.4K

djsh reposted

François Chollet@fchollet·15 Aug

We were able to reproduce the strong findings of the HRM paper on ARC-AGI-1. Further, we ran a series of ablation experiments to get to the bottom of what's behind it. Key findings:
1. The HRM model architecture itself (the centerpiece of the paper) is not an important factor.
2. The outer refinement loop (barely mentioned in the paper) is the main driver of performance.
3. Cross-task transfer learning is not very helpful. What matters is training on the tasks you will test on.
4. You can use much fewer data augmentations, especially at inference time.
Findings 2 & 3 mean that this approach is a case of *zero-pretraining test-time training*, similar to the recently published "ARC-AGI without pretraining" paper by Liao et al.

295

2.6K

368.2K

djsh reposted

Groq Inc@GroqInc·5 Aug

OpenAI’s open models are live and already running on Groq. Try gpt-oss-20B and gpt-oss-120B today. Groq delivers 128K context and built-in tools such as code execution and browser search. For the first time, developers and enterprises can deploy open models backed by OpenAI instantly, anywhere, at scale. Start building now. Links in comments.

162

1.8K

1.5M

djsh reposted

Aarush Sah@aarush·31 Jul

Introducing OpenBench 0.1: Open, Reproducible Evals 🧵

408

146.4K

djsh@djays·28 Jul

@paraschopra Instructing Claude to regularly update Claude.md. Also multiple markdowns for bigger codebases. (Essentially a form of long term memory)

834

Paras Chopra@paraschopra·28 Jul

Just tried Claude Code and it’s awesome! There’s something about vibe coding in the terminal that feels more fun than doing it in IDEs! I love how it makes its own todos and checks them off one by one. Any tips on Claude Code that you can give me?

102

888

69.2K

djsh reposted

stochasm@stochasticchasm·19 Jul

thought he only had 2 moments

116

7.7K

djsh@djays·19 Jul

@paraschopra What was a big learning?

139

Paras Chopra@paraschopra·19 Jul

Just finished a summer course at Oxford on Plato’s Republic. What’s inspiring is that my class comprised people of all ages (including one who is 80+). Everyone was there for their hunger for learning, and I hope it stays with all of us until the very end of our lives.

1.3K

46.3K

djsh@djays·18 Jul

@elsewhere_today Can you try again? I adjusted some settings

Elsewhere@elsewhere_today·17 Jul

@djays Can't DM :(

Elsewhere@elsewhere_today·16 Jul

Everyone in SF complains about how every café closes by 5pm and there’s nowhere to work. Introducing Elsewhere: the world’s first late-night café coworking chain for builders, creatives & digital nomads. Launching July 24. Comment your favorite drink & we'll DM you the invite.

276

783

166.1K

djsh reposted

sunny madra@sundeep·15 Jul

Groq 👀👀👀 1) Which inference providers are you using or considering to access models? 2) Which inference providers are you using or considering using to access models? Survey by @ArtificialAnlys

165

227.8K

djsh reposted

Groq Inc@GroqInc·15 Jul

*YOLO Launch* Kimi K2 is now in preview on GroqCloud at 185 tokens/sec. Build fast. Link in comments.

1.3K

269.9K

djsh reposted

sunny madra@sundeep·1 May

Full vertical integration, silicon to cloud. Thanks for the shout out @finkd

Stratechery@stratechery

5-1-2025 An Interview with Meta CEO Mark Zuckerberg About AI and the Evolution of Social Media stratechery.com/2025/an-interv…

126

33.9K

djsh reposted

Gavin@GavinSherry·5 Apr

Proud that @GroqInc was first to market with high performance @AIatMeta Llama 4 at best in market prices. Check it out at console.groq.com or via our partners

OpenRouter@OpenRouter

Llama 4 Scout & Maverick are now available on OpenRouter. Meta's flagship model series achieves a new record 10 million token context length 🚀 @togethercompute and @GroqInc are the first providers. We'll be adding more over the course of the weekend.

8.3K
