Prem Viswanathan

423 posts

@prempv

Building @swift_cx. Adjunct @ CMU. Prev @aws

Pittsburgh, PA · Joined August 2009
2.1K Following · 598 Followers
Nikunj Kothari@nikunj·
TIL - you can spawn subagents for skills in Claude Code. What.. I feel so stupid now. This would have saved me SO much time. Every day, you learn something new.
Prem Viswanathan@prempv·
Claude Opus 4.5 is definitely having some hiccups. Similar quality issues with @WisprFlow this evening. AI quality degradation is the new "stackoverflow / aws is down" moment
Prem Viswanathan retweeted
Graham Neubig@gneubig·
We're hiring! We have positions open for Members of the Technical Staff for Agent R&D, plus many other roles. Think of the best researcher or engineer you know: don't you want them building in the open? Listings below! allhandsai.applytojob.com/apply/
Prem Viswanathan@prempv·
Compression's always relative: today's model capacities were sci-fi 10 yrs ago. For coding, we thought we'd need 100M-token context, but smaller contexts with many parallel explorations that converge are winning. I expect the same here for the end goal: how does one go about reading McKinsey slides?
Dileep George@dileeplearning·
I love Andrej...but this makes no sense to me. I don't see how converting text to image ('pixels') makes it any better for language modeling. What am I missing?
Andrej Karpathy@karpathy

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter.

The more interesting part for me (esp as a computer vision person at heart who is temporarily masquerading as a natural language person) is whether pixels are better inputs to LLMs than text. Whether text tokens are wasteful and just terrible, at the input.

Maybe it makes more sense that all inputs to LLMs should only ever be images. Even if you happen to have pure text input, maybe you'd prefer to render it and then feed that in:

- more information compression (see paper) => shorter context windows, more efficiency
- significantly more general information stream => not just text, but e.g. bold text, colored text, arbitrary images
- input can now be processed with bidirectional attention easily and as default, not autoregressive attention - a lot more powerful
- delete the tokenizer (at the input)!! I already ranted about how much I dislike the tokenizer. Tokenizers are ugly, separate, not an end-to-end stage. It "imports" all the ugliness of Unicode and byte encodings, it inherits a lot of historical baggage and security/jailbreak risk (e.g. continuation bytes). It makes two characters that look identical to the eye look like two completely different tokens internally in the network. A smiling emoji looks like a weird token, not an... actual smiling face, pixels and all, with all the transfer learning that brings along. The tokenizer must go.

OCR is just one of many useful vision -> text tasks. And text -> text tasks can be made to be vision -> text tasks. Not vice versa. So maybe the User message is images, but the decoder (the Assistant response) remains text. It's a lot less obvious how to output pixels realistically... or if you'd want to.

Now I have to also fight the urge to side quest an image-input-only version of nanochat...

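The compression claim in the thread above can be made concrete with some back-of-envelope arithmetic. Everything here is an illustrative assumption, not a measurement from the DeepSeek-OCR paper: ~4 characters per BPE text token is a common rule of thumb, and one vision token per 16×16 pixel patch is a ViT-style convention. The point the sketch makes is that whether pixels "compress" relative to text depends entirely on the render resolution:

```python
# Back-of-envelope sketch of the text-tokens vs. vision-tokens tradeoff.
# All constants are illustrative assumptions, not measurements:
#   CHARS_PER_TEXT_TOKEN: ~4 chars/token is a common BPE rule of thumb
#   PATCH: a ViT-style square patch size, one vision token per patch

CHARS_PER_TEXT_TOKEN = 4   # assumed average for an English BPE tokenizer
PATCH = 16                 # assumed patch size in pixels

def text_tokens(n_chars: int) -> int:
    """Text tokens needed for n_chars of plain text (ceiling division)."""
    return -(-n_chars // CHARS_PER_TEXT_TOKEN)

def vision_tokens(width_px: int, height_px: int) -> int:
    """Vision tokens for a rendered page of the given pixel dimensions."""
    return (width_px // PATCH) * (height_px // PATCH)

# A dense page of ~3000 characters:
t = text_tokens(3000)             # 750 text tokens
hi_res = vision_tokens(640, 640)  # 1600 vision tokens: pixels lose here
lo_res = vision_tokens(432, 432)  # 729 vision tokens: pixels barely win
print(t, hi_res, lo_res)
```

Under these assumed constants, rendering only pays off once resolution drops low enough — which is exactly where the legibility concerns raised in the replies below start to bite.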
Prem Viswanathan@prempv·
@_cartick @dileeplearning We humans read text visually, don't we? With sufficient resolution, surely this isn't a problem. Vision and audio as the two universal input modalities make a ton of sense IMO.
Karthik Ramasamy@_cartick·
@dileeplearning A major issue with the visual approach is that, given how the image gets tokenized, it is harder to differentiate 1,000 vs 1.000 or 10.00. Think of use cases involving finance and medicine, where that would be very brutal.
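The failure mode described above can be shown with a toy: two hand-drawn 4×4 binary "glyphs" (a made-up period and comma — real rendered glyphs would differ) that are distinct at the pixel level but collapse to the same representation after a crude 2×2 average-and-threshold downsampling, the kind of information loss a patch embedding can introduce:

```python
# Toy demo: aggressive downsampling can merge visually similar glyphs.
# The 4x4 "glyphs" below are hypothetical hand-drawn stand-ins for '.'
# and ','; a real renderer and patch embedder differ in detail, but the
# failure mode (a one-pixel tail vanishing) is the same.

PERIOD = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
]
COMMA = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 1],   # tiny tail: the only pixel that differs
]

def downsample_2x2(bitmap):
    """Average each 2x2 block and threshold at 0.5 (a crude patch code)."""
    out = []
    for r in range(0, 4, 2):
        row = []
        for c in range(0, 4, 2):
            block = [bitmap[r + dr][c + dc] for dr in (0, 1) for dc in (0, 1)]
            row.append(1 if sum(block) / 4 >= 0.5 else 0)
        out.append(row)
    return out

print(PERIOD != COMMA)                                   # True: pixels differ
print(downsample_2x2(PERIOD) == downsample_2x2(COMMA))   # True: patches collapse
```

At this point "1,000" and "1.000" become indistinguishable downstream, which is the finance/medicine concern in the reply above: the fix has to come from resolution or patch granularity, not from the language model.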
Prem Viswanathan@prempv·
@arafatkatze @andrew_melby @cline @AmpCode @Cursor When you refer to RAG, you are essentially talking about pure vector search, which is indeed quite problematic. But having it as an optional tool should be fine. Your concern about the overhead of vector search relative to the lift it offers is very valid and quite underrated.
Ara@arafatkatze·
Ara@arafatkatze

> an eval and benchmark would be great

That's a great thing to publish! (And would bolster y'all's position.) I might not have the bandwidth at the moment, but we can share this internally. Although if you really wanna prove your case, you can try tweaking Aider: github.com/Aider-AI/aider/ It has all the standard benchmarks, and based on my understanding they use Tree-sitter and other techniques, not RAG chunks. You can keep everything else exactly the same, so that the search retrieval tactic is the only thing tweaked. Personally I do not trust the SWE benchmark and most benchmarks, as they have been gamed, but if you specifically A/B test between the search mechanisms, that will be very insightful.

Ara@arafatkatze·
In building AI agents @cline, we've identified three mind viruses. Mind viruses are seductive ideas that sound smart but don't work in practice:
1. Multi-Agent Orchestration
2. RAG (Retrieval-Augmented Generation)
3. More Instructions = Better Results
Let's explore why!
Ara tweet media
Graham Neubig@gneubig·
Because it's so easy to write code now, I also think of new ways to do things with code. For instance, I'm creating slides using reveal.js (revealjs.com), sending emails with resend.com, and writing music with strudel (strudel.cc).
Graham Neubig@gneubig·
I'm preparing for a talk on agents and the future of work, so I decided to check the effect of agents on my own work. The attached chart is the number of pull requests I made by month w/ and w/o code by OpenHands agents. A few observations 🧵
Graham Neubig tweet media
Delip Rao e/σ@deliprao·
Anthropic or Anthropic-sponsored safety papers
Delip Rao e/σ tweet media