Marco Mascorro

6.6K posts

@Mascobot

Researching LLMs | Roboticist | Prev: Partner @a16z (inv: @cursor_ai, @thinkymachines, @bfl_ml, @WaveFormsAI, @deeptuneai, etc), cofounder @Fellow_AI

San Francisco, CA · Joined October 2009

2.5K Following · 18K Followers

Pinned Tweet

Marco Mascorro@Mascobot·Oct 24

Jensen came to our @a16z's Runtime event and he signed our very first personal GPU AI Workstation Founders Edition (4x RTX 6000 Pro Blackwell) "To a16z Builders of Tomorrow!" - Jensen

Marco Mascorro@Mascobot

🚨 New: We built @a16z's personal GPU AI Workstation Founders Edition
- 4x NVIDIA RTX 6000 PRO Blackwell Max-Q (384GB total VRAM)
- 8TB of NVMe PCIe 5.0 storage
- AMD Threadripper PRO 7975WX (32 cores, 64 threads)
- 256GB ECC DDR5 RAM
- 1650Watts at peak (runs on a standard 15Amp/120V circuit).
For training, AI research, and deploying models locally. A datacenter-class AI rig you can keep under your desk. We are planning to make a limited number of these a16z AI Workstations. Build guide + how you can make your own 👇

333

86.5K

Marco Mascorro@Mascobot·2h

@matiii @ElevenLabs Congrats @matiii! Super cool

Mati Staniszewski@matiii·6h

We just crossed $500M ARR and welcomed new investors to @ElevenLabs: BlackRock, Wellington, Nvidia, Santander, Jamie Foxx, Eva Longoria and more. Natural, human-like communication will be critical to broad AI adoption - and these new investors help us accelerate that work.

905

104.2K

Marco Mascorro retweeted

Cursor@cursor_ai·2d

Composer 2 is 50% off in the SDK this weekend. Enjoy!

Cursor@cursor_ai

We’re introducing the Cursor SDK so you can build agents with the same runtime, harness, and models that power Cursor. Run agents from CI/CD pipelines, create automations for end-to-end workflows, or embed agents directly inside your products.

1.1K

137K

Marco Mascorro@Mascobot·3d

@droidbuilds I don't recall anyone laughing when Dario left...

307

DROID@droidbuilds·4d

they laughed when Dario Amodei left OpenAI. today - OpenAI revenue: $24B - Anthropic revenue: $30B 🤯 from $1B to $30B in just 15 months. - OpenAI had a 5 year head start. - Dario closed the gap in 15 months. - Google invested $40B in his company. this is the greatest comeback in tech history.

539

39.4K

Marco Mascorro@Mascobot·3d

@MaikaThoughts Can't wait to see what you do next Malika! All the best!

277

Malika Aubakirova@MaikaThoughts·4d

x.com/i/article/2049…

224

90.1K

Marco Mascorro@Mascobot·4d

@leonardtang_ @seema_amble It’s getting there (still slow and not very accurate), but yeah. I think cua could be even more interesting with Cloudflare (or other CDNs) and what they decide to do in the long run…

Leonard Tang@leonardtang_·4d

@seema_amble Can’t you just CUA your way out of this

369

Seema Amble@seema_amble·4d

the latest from the API wars front: SAP is now locking down its API access to third parties, essentially blocking AI companies. Like many others, SAP can't really compete on building its own agents, so the next best thing is to put up the gates to protect its data and spread FUD about how AI startups are a security risk. This strategy will only work for so long...

Seema Amble@seema_amble

The API wars are here, with AI apps this time. Incumbents know the value of their data and that AI startups may start rebuilding their products behind the scenes - and they’re starting to fight back. Slack just cut off API access which impacts Glean and others. Zendesk may follow. But full-on API blockades probably won’t hold. Here’s what’s more likely to happen 🧵:

102

49.4K

Marco Mascorro@Mascobot·4d

Hence the focus on bio from the AI labs:

Antonio Linares@alc2022

2 peptides made more money in 2025 than all frontier AI models combined

5.8K

Marco Mascorro@Mascobot·4d

@sdamico @brexton Yes, I don’t understand. I dream (often) that for some reason I didn’t take some classes (that I should’ve taken) and somehow never graduated… it’s so bizarre

Sam D'Amico@sdamico·5d

@brexton Why does everyone have this dream lol

brexton@brexton·5d

I frequently find myself having a nightmare where I have like two weeks left of senior year of college and I'm about to not graduate because I didn't show up to a class I forgot about all semester. I'm 30 btw

103

10.1K

Marco Mascorro@Mascobot·4d

@EthanGoodhart @nvidia Nice! Super cool project. I assume you can run the Alpamayo 10B in a local 5090 and it can be faster than the 20fps, right? Seems like enough space (and battery power) for a 120v ac inverter for a large pc there :)

155

Ethan Goodhart@EthanGoodhart·4d

Jensen Huang just tried our self-driving golf cart! thanks Jensen! @nvidia

Ethan Goodhart@EthanGoodhart

we got @karpathy to ride in our self-driving golf cart!

2.9K

Marco Mascorro@Mascobot·4d

@bfirsh @AnthropicAI Wohoo! Congrats @bfirsh!

233

Ben Firshman@bfirsh·4d

I’ve joined @AnthropicAI, on the Labs team. The team that produced Claude Code, MCP, Cowork, etc. Looking forward to building again.

640

34K

Marco Mascorro@Mascobot·4d

I always thought the acquisition of ABB Robotics (a leading high-precision robotics co) for ~$5.4B by SoftBank was quite interesting, and maybe the right approach to doing large-scale robotics with newer AI models. ABB Robotics makes $2.3B/year, so a ~2.3x multiple seemed pretty good, and very strategic. Another interesting company in the space would be Universal Robots, but they got acquired by Teradyne (which makes testing equipment for semiconductors) for $285M in 2015, which has worked out pretty well for them:

Lukas Ziegler@lukas_m_ziegler

JUST IN: SoftBank is creating a brand new company — Roze AI, that uses autonomous robots to BUILD data centers. And Masayoshi Son is already eyeing a $100 BILLION IPO by the second half of 2026. The logic is simple. The AI boom requires data centers. Data centers require construction. Construction is slow, expensive and labour-intensive. So why not automate the thing that builds the thing that runs the AI? This is infrastructure automation at a scale nobody has attempted before. Robots building the server farms that power artificial intelligence. The whole stack, automated from the ground up. The timing is aggressive, even some inside SoftBank are sceptical about the $100bn valuation and the IPO timeline. But Son has never been known for thinking small. For context, Jeff Bezos is reportedly running a similar playbook with Project Prometheus, a startup buying up industrial companies and modernising them with AI. The world's biggest investors are all arriving at the same conclusion: the next trillion-dollar opportunity is not in software. It's in automating the physical world. ~~ ♻️ Join the weekly robotics newsletter, and never miss any news → ziegler.substack.com

9.2K

Marco Mascorro@Mascobot·4d

@AriX @altryne do you think in the long run this would be done 100% with multimodal models? Or a combination of text (i.e a11y) + screenshots? I get today the combination is for speedup improvements, but curious what you think the long run would look like

Ari Weinstein@AriX·4d

@altryne Codex's Computer Use uses both screenshots and accessibility to interact with app UIs. Spark can't see the screenshots, but it can still work with apps using accessibility!

1.1K

Ari Weinstein@AriX·4d

Computer Use runs this use case 42% faster in today's Codex app update.

Ari Weinstein@AriX

This is the first time I've ever seen an LLM operate a GUI as fast as a person, and it's surreal.

101

115

2.3K

452.6K

Marco Mascorro@Mascobot·5d

mechinterp is still underrated. It seems to be one of the most interesting research areas imo

roon@tszzl

imo mechinterp will not only be solved but have a huge impact on our abstractions and how we understand the world

4.5K

Marco Mascorro@Mascobot·5d

I love this blackboard-lecture format on @dwarkesh_sp's pod:

Dwarkesh Patel@dwarkesh_sp

Did a very different format with @reinerpope – a blackboard lecture where he walks through how frontier LLMs are trained and served. It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk. It’s a bit technical, but I encourage you to hang in there - it’s really worth it. There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him. Recommend watching this one on YouTube so you can see the chalkboard.
0:00:00 – How batch size affects token cost and speed
0:31:59 – How MoE models are laid out across GPU racks
0:47:02 – How pipeline parallelism spreads model layers across racks
1:03:27 – Why Ilya said, “As we now know, pipelining is not wise.”
1:18:49 – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal
1:32:52 – Deducing long context memory costs from API pricing
2:03:52 – Convergent evolution between neural nets and cryptography

4.1K

Marco Mascorro retweeted

Alex Volkov@altryne·6d

Can confirm, @cursor_ai is the best harness we've tested on @WolfBenchAI so far! @WolframRvnwlf tests Harness x Model, and Cursor (before the SDK) is the best one we've ever tested!

Dan ⚡️@d4m1n

lol Cursor is a better harness for both GPT 5.5 in Codex AND Opus 4.7 in Claude Code how is that possible?!

284

64.1K

Marco Mascorro retweeted

Cursor@cursor_ai·6d

406

832

8.8K

Marco Mascorro@Mascobot·Apr 28

@idansc @stanfordnlp @yoavgo I think cs336 is light on the data side (at least of what I understand he’s referring to). There are a lot of things on the data quality/mix in pre/mid and post training…

551

Idan Schwartz@idansc·Apr 27

@yoavgo cs336 does quite a good job

13.6K

(((ل()(ل() 'yoav))))👾@yoavgo·Apr 27

The big dilemma with teaching an "LLM course" is that it is really easy to get drawn into teaching the various technical things like efficiency tricks, attention variants, PPO vs GRPO, etc etc. But the real "meat" is not there, but in the data: data for pre-training, for mid-training, for SFT, for RL and for "reasoning", synthetic data, curated data, annotated data... cleaning, evaluating, improving, mixing, ... lots of stuff. but "data" is so much harder to teach: it is not "mathematic" or "algorithmic" like the technical things, and it is not clear what is the teachable thing there. it is also a lot less transparent than the technical topics, both because it is semi-secret, and also because it is also not appealing for publishing, for roughly the same reasons it is not appealing for teaching. so, what would you teach about data? what are the key lessons and insights one should know? any good papers or resources? good existing classes? blogs? hit me with what you have

831

57.2K

Marco Mascorro@Mascobot·Apr 28

@DavidDuvenaud @AlecRad @status_effects Super cool project, congrats! Curious, how did you verify that Opus didn't "insert" new knowledge/bias (post 1930) in the creation of the synth data when doing the instruct dataset?

2.9K

David Duvenaud@DavidDuvenaud·Apr 28

Announcing Talkie: a new, open-weight historical LLM! We trained and finetuned a 13B model on a newly-curated dataset of only pre-1930 data. Try it below! with @AlecRad and @status_effects 🧵

200

455

3.6K

1.4M

Marco Mascorro retweeted

AndresCampero@AndresCamperoN·Apr 25

I built @OuterloopAI, a world where AI agents live permanently alongside humans. They explore, form friendships, debate Socrates, play games. You can connect your agent, summon a new one, or join as yourself. outerloop.ai

103

19.3K

Marco Mascorro retweeted

Cursor@cursor_ai·Apr 24

GPT-5.5 is now available in Cursor! It's currently the top model on CursorBench at 72.8%. We've partnered with OpenAI to offer it for 50% off through May 2.

174

269

5.7K

504.9K

Marco Mascorro retweeted

Deli Chen@victor207755822·Apr 24

DeepSeek-V3: Dec 26, 2024
DeepSeek-V4: Apr 24, 2026
484 days later, we humbly share our labor of love. As always, we stay true to long-termism and open source for all. AGI belongs to everyone. ❤️🌍 #DeepSeekV4 #AGIforEveryone #OpenSource

DeepSeek@deepseek_ai

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: huggingface.co/deepseek-ai/De… 🤗 Open Weights: huggingface.co/collections/de… 1/n

352

1.3K

13.1K
