Matt Robinson

1.8K posts

Matt Robinson

@robinsonmatt

Journalist. Founder @_AI_Street ex @business Follow me on LinkedIn, where I'm more active: https://t.co/00AmQX7F4u

Milan, Lombardy Katılım Ocak 2010

885 Takip Edilen2.2K Takipçiler

Sabitlenmiş Tweet

Matt Robinson@robinsonmatt·20 Şub

@richardcraib runs one of Wall Street’s most unconventional business models: a crowdsourced hedge fund. He also counts JPMorgan as his biggest backer. Now, he's retooling @numerai for AI agents. This was a fun interview. ai-street.co/p/the-hedge-fu…

English

6.4K

Matt Robinson@robinsonmatt·24 Mar

@FundamentEdge Got it. I'm still surprised how much the conversation is still about the the latest capabilities and not about evaluation/ figuring out if its outputs are accurate.

English

Brett Caughran@FundamentEdge·23 Mar

@robinsonmatt it's not there one-shot reliably yet. Requires a thoughtful validation process. But emerging MCPs help a lot in my experience (in past, LLMs wanted to web scrape financial data which is a really bad idea)

English

251

Brett Caughran@FundamentEdge·22 Mar

It's really remarkable how fast AI tools for Excel have evolved. Even three months ago I found them almost completely unusable. Today, I was able to update my Uber model for the last four quarters in a fraction of the time, accurately, even when I consider the time I spent de-bugging and validating the key inputs. The three big unlocks for me were creating my own skills files, which are recipe cards encoding an incredibly detailed dissection of every step of the financial modeling process (put together in an 86 page document then crafted into six distinct modeling skills...unfortunately, I won't be sharing this at this time, but will consider in the future), connecting the Daloopa MCP to Claude in Claude Excel for accurate data, and creating a validation space in Perplexity Computer to do final checks and de-bugging. (I am not sponsored by either Daloopa or Perplexity, or any vendor for that matter) Obviously this AI augmented process is only valuable to the extent that it is 98%+ accurate and 100%+ accurate on critical metrics. Validation has to be a systematic process blending coding tools and human validation checklists (i.e. hand checking key model variables and understanding where in the model there is tolerance for mistakes, and where there isn't). But the ability of new LLMs to read & analyze models (particularly GPT 5.4) and the rise of Agentic Workspaces like Perplexity Computer to route tasks to the right LLMs seems to be resulting in big progress here. Really exciting stuff. I have been a huge skeptic here...Excel-based models are the foundation of institutional decision making, and they are no place for AI slop. With the technology improving, particularly workflows around systematic validation, that skepticism is melting.

English

549

62.3K

Matt Robinson@robinsonmatt·4 Mar

@FundamentEdge This is really the core challenge of combining probabilistic LLMs with deterministic software. A lot of current work is about building scaffolding between those layers, but there’s no settled architecture yet. I think it eventually gets there, but it’s hard to say how quickly.

English

Brett Caughran@FundamentEdge·3 Mar

This is one of the biggest sticking points on AI Excel that I'm trying to understand. 73% accuracy is progress, but is it useful for anything at all? We were on a vendor call last month and the vendor bragged of hitting 65% accuracy in Excel and Andrew Carr and I texted "an analyst who is 65% accurate in Excel is 100% fired". Why is AI Excel only 60-70% accurate? Are these issues fundamental or solvable? > Is MCP fundamentally too brittle to get to 99% accuracy? > Is the data layer clean enough to hit 99% accuracy (i.e. there's a reason why hedge fund analysts don't start their models with a Bloomberg download) > Are the foundation models powerful enough to handle the multi-modal (filings, PRs, investor decks, data supplementals), multi-document, "needle in a haystack" issues for LLMs? Context windows have grown, but they are still not large enough to capture all of the documents and files for one ticker (letalone a coverage universe) > Is the commercial opportunity large enough for foundation labs to build RL environments for public equity modeling, as they are doing on investment banking modeling? Does the "march of 9s" on AI Excel take 6 months or 6 years? Driverless cars took 13 years from DARPA Urban challenge to first Waymo. These are legit questions. I don't know. I also don't really trust public evaluation sets (i.e. LLM's win physics competitions...then you learn the LLM trained on the physics competition test bank lol). The real questions in investment research modeling are out of sample questions (i.e. how to model SAAS retention in a Claude-world...there is no prior on which to rely). So I am building my own evaluation set. 100 use cases ranging from simple (input 3 statements from 10-K to AMZN model) to complex (model GE split/spin). Am I wasting my time? 36 months form now, are we still only at 80% accuracy in AI Excel? These are questions, now answers - love your takes in replies or DM!

Patrick OShaughnessy@patrick_oshag

For all the spreadsheet people out there …

English

113

45.3K

Matt Robinson@robinsonmatt·20 Şub

...quantitative finance. Takeaway: LLMs can be a Swiss Army knife in daily life, but for mathematical analysis, specialized agents may be the sharper tool. ai-street.co/i/1835820

English

100

Matt Robinson@robinsonmatt·20 Şub

The authors argue this setup improves analytical rigor and helps mitigate behavioral biases like overconfidence. While limited in scope and not a full portfolio optimizer, the study suggests specialized, debating agents may prove more reliable than general models for ...

English

121

Matt Robinson@robinsonmatt·20 Şub

𝗕𝗹𝗮𝗰𝗸𝗥𝗼𝗰𝗸 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵𝗲𝗿𝘀 𝗗𝗲𝘃𝗲𝗹𝗼𝗽 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 𝗦𝘆𝘀𝘁𝗲𝗺 𝗳𝗼𝗿 𝗦𝘁𝗼𝗰𝗸 𝗣𝗶𝗰𝗸𝘀 Instead of relying on one frontier model, BlackRock built three AI “agents” that mimic different analyst roles:

English

2.3K

Matt Robinson retweetledi

Andrew Chen@achenfinance·4 May

I'm delighted to be featured in the AI Street newsletter!

English

Matt Robinson retweetledi

Kris@KrisBennatti·3 Mar

Where chatbots fail, @HudsonLabs delivers. Find out why the Co-Analyst is getting rave reviews from institutional investors. Thank you for the feature @robinsonmatt. Highly recommend subscribing. ai-street.co/p/separating-f…

English

402

Matt Robinson retweetledi

Bigdata.com@bigdatadotcom·22 Kas

ai-street.co/p/ravenpack-s-…

ZXX

252

Matt Robinson@robinsonmatt·22 Kas

@ZekeFaux haha gotta have b roll of "the Town" and have bill burr show up for some reason.

English

Zeke Faux@ZekeFaux·22 Kas

@robinsonmatt As long as the thieves are pats fans, I stand ready to tell their story

English

Matt Robinson@robinsonmatt·21 Kas

Oh man I want the tick tock of this cc @ZekeFaux NFL warns of burglary groups targeting players apnews.com/article/nfl-se…

English

422

Matt Robinson@robinsonmatt·17 Kas

Great to catch up @TedMerz!

Ted Merz@TedMerz

View from the Office. I met up with Matt Robinson at the Pain Quotidien at Columbus Circle. I had the small cappuccino because the large ones are ridiculously huge. Matt had coffee and cream. Matt recently started an AI focused newsletter called AI Street after moving to Milan with his wife and daughter. Previously, he spent more than a decade as a reporter at Bloomberg. I know from first-hand experience that Matt knows a lot about newsletters because in 2011 he was hired by Bloomberg to write about structured products for a publication I created. His new title takes a different tack than many of the others I subscribe to which are focused on the large language model companies, such as OpenAI, or prompt engineering. Matt writes about how companies are — or are not — integrating AI into their day-to-day operations. He interviews founders and academics working on practical solutions. For example, for a recent edition he interviewed Alex Kim, a Ph.D. candidate in accounting at the University of Chicago, who has pioneered ways to measure vocal tones using AI. For another article, he spoke with Peter Hafez, chief data scientist at the textual analysis platform RavenPack. He has also written about Siddhant Jayakumar, CEO and founder of Finster AI and a former engineer at Google DeepMind, who is building AI agents. Matt’s goal is to bridge the gap between experts and a wider audience. He said that running his own newsletter company was both familiar and pretty different than running one for Bloomberg. The big change is that now he has to think about distribution and making money through subscriptions or advertising. It was great to re-connect and talk about the state of media and tech. Next time, I’m going to insist on meeting in Milan. Matt can be reached via LinkedIn or DM me for a warm intro.

English

312

Matt Robinson retweetledi

Bigdata.com@bigdatadotcom·13 Kas

"The lines are blurring between quantitative and discretionary investment strategies, with platforms like Bigdata.com bridging the gap."

English

161

Keşfet

@FundamentEdge @HudsonLabs @ZekeFaux @TedMerz @elonmusk @BarackObama @taylorswift13 @cristiano