Marco Mascorro

6.6K posts

@Mascobot

Researching LLMs | Roboticist | Prev: Partner @a16z (inv: @cursor_ai, @thinkymachines, @bfl_ml, @WaveFormsAI, @deeptuneai, etc), cofounder @Fellow_AI

San Francisco, CA · Joined October 2009
2.5K Following · 18K Followers
Pinned Tweet
Marco Mascorro
Marco Mascorro@Mascobot·
Jensen came to our @a16z's Runtime event and he signed our very first personal GPU AI Workstation Founders Edition (4x RTX 6000 Pro Blackwell) "To a16z Builders of Tomorrow!" - Jensen
Marco Mascorro@Mascobot

🚨 New: We built @a16z's personal GPU AI Workstation Founders Edition
- 4x NVIDIA RTX 6000 PRO Blackwell Max-Q (384GB total VRAM)
- 8TB of NVMe PCIe 5.0 storage
- AMD Threadripper PRO 7975WX (32 cores, 64 threads)
- 256GB ECC DDR5 RAM
- 1650 watts at peak (runs on a standard 15 Amp/120V circuit)
For training, AI research, and deploying models locally. A datacenter-class AI rig you can keep under your desk. We are planning to make a limited number of these a16z AI Workstations. Build guide + how you can make your own 👇

30
30
333
86.5K
Mati Staniszewski
We just crossed $500M ARR and welcomed new investors to @ElevenLabs: BlackRock, Wellington, Nvidia, Santander, Jamie Foxx, Eva Longoria and more. Natural, human-like communication will be critical to broad AI adoption - and these new investors help us accelerate that work.
73
64
905
104.2K
DROID
DROID@droidbuilds·
they laughed when Dario Amodei left OpenAI. today
- OpenAI revenue: $24B
- Anthropic revenue: $30B 🤯
from $1B to $30B in just 15 months.
- OpenAI had a 5 year head start.
- Dario closed the gap in 15 months.
- Google invested $40B in his company.
this is the greatest comeback in tech history.
85
31
539
39.4K
Marco Mascorro
Marco Mascorro@Mascobot·
@leonardtang_ @seema_amble It’s getting there (still slow and not very accurate), but yeah. I think cua could be even more interesting with Cloudflare (or other CDNs) and what they decide to do in the long run…
0
0
1
72
Seema Amble
Seema Amble@seema_amble·
the latest from the API wars front: SAP is now locking down its API access to third parties, essentially blocking AI companies. Like many others, SAP can't really compete on building its own agents, so the next best thing is to put up the gates to protect its data and spread FUD about how AI startups are a security risk. This strategy will only work for so long...
Seema Amble@seema_amble

The API wars are here, with AI apps this time. Incumbents know the value of their data and that AI startups may start rebuilding their products behind the scenes - and they’re starting to fight back. Slack just cut off API access which impacts Glean and others. Zendesk may follow. But full-on API blockades probably won’t hold. Here’s what’s more likely to happen 🧵:

13
3
102
49.4K
Marco Mascorro
Marco Mascorro@Mascobot·
@sdamico @brexton Yes, I don’t understand. I dream (often) that for some reason I didn’t take some classes (that I should’ve taken) and somehow never graduated… it’s so bizarre
0
0
2
57
brexton
brexton@brexton·
I frequently find myself having a nightmare where I have like two weeks left of senior year of college and I'm about to not graduate because I didn't show up to a class I forgot about all semester I'm 30 btw
19
1
103
10.1K
Marco Mascorro
Marco Mascorro@Mascobot·
@EthanGoodhart @nvidia Nice! Super cool project. I assume you can run the Alpamayo 10B on a local 5090 and it can be faster than the 20fps, right? Seems like there's enough space (and battery power) for a 120V AC inverter for a large PC there :)
0
0
0
155
Ben Firshman
Ben Firshman@bfirsh·
I’ve joined @AnthropicAI, on the Labs team. The team that produced Claude Code, MCP, Cowork, etc. Looking forward to building again.
61
11
640
34K
Marco Mascorro
Marco Mascorro@Mascobot·
I always thought the acquisition of ABB Robotics (a leading high-precision robotics co) for ~$5.4B by SoftBank was quite interesting, and maybe the right approach to doing large-scale robotics with newer AI models. ABB Robotics makes $2.3B/year in revenue, so a ~2.3x multiple seemed pretty good, and very strategic. Another interesting company in the space would be Universal Robots, but they got acquired by Teradyne (which makes testing equipment for semiconductors) for $285M in 2015, which has worked out pretty well for them:
Lukas Ziegler@lukas_m_ziegler

JUST IN: SoftBank is creating a brand new company, Roze AI, that uses autonomous robots to BUILD data centers. And Masayoshi Son is already eyeing a $100 BILLION IPO by the second half of 2026. The logic is simple. The AI boom requires data centers. Data centers require construction. Construction is slow, expensive and labour-intensive. So why not automate the thing that builds the thing that runs the AI? This is infrastructure automation at a scale nobody has attempted before. Robots building the server farms that power artificial intelligence. The whole stack, automated from the ground up. The timing is aggressive; even some inside SoftBank are sceptical about the $100bn valuation and the IPO timeline. But Son has never been known for thinking small. For context, Jeff Bezos is reportedly running a similar playbook with Project Prometheus, a startup buying up industrial companies and modernising them with AI. The world's biggest investors are all arriving at the same conclusion: the next trillion-dollar opportunity is not in software. It's in automating the physical world. ~~ ♻️ Join the weekly robotics newsletter, and never miss any news → ziegler.substack.com

1
1
29
9.2K
Marco Mascorro
Marco Mascorro@Mascobot·
@AriX @altryne do you think in the long run this would be done 100% with multimodal models? Or a combination of text (i.e., a11y) + screenshots? I get that today the combination is for speed improvements, but curious what you think the long run would look like
0
0
0
67
Ari Weinstein
Ari Weinstein@AriX·
@altryne Codex's Computer Use uses both screenshots and accessibility to interact with app UIs. Spark can't see the screenshots, but it can still work with apps using accessibility!
1
0
9
1.1K
Marco Mascorro
Marco Mascorro@Mascobot·
I love this blackboard-lecture format on @dwarkesh_sp's pod:
Dwarkesh Patel@dwarkesh_sp

Did a very different format with @reinerpope – a blackboard lecture where he walks through how frontier LLMs are trained and served. It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk. It’s a bit technical, but I encourage you to hang in there - it’s really worth it. There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him. Recommend watching this one on YouTube so you can see the chalkboard.
0:00:00 – How batch size affects token cost and speed
0:31:59 – How MoE models are laid out across GPU racks
0:47:02 – How pipeline parallelism spreads model layers across racks
1:03:27 – Why Ilya said, “As we now know, pipelining is not wise.”
1:18:49 – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal
1:32:52 – Deducing long context memory costs from API pricing
2:03:52 – Convergent evolution between neural nets and cryptography

0
8
39
4.1K
Marco Mascorro retweeted
Cursor
Cursor@cursor_ai·
We’re introducing the Cursor SDK so you can build agents with the same runtime, harness, and models that power Cursor. Run agents from CI/CD pipelines, create automations for end-to-end workflows, or embed agents directly inside your products.
406
832
8.8K
3M
Marco Mascorro
Marco Mascorro@Mascobot·
@idansc @stanfordnlp @yoavgo I think cs336 is light on the data side (at least from what I understand he’s referring to). There are a lot of things on data quality/mix in pre-, mid-, and post-training…
2
0
0
551
(((ل()(ل() 'yoav))))👾
The big dilemma with teaching an "LLM course" is that it is really easy to get drawn into teaching the various technical things like efficiency tricks, attention variants, PPO vs GRPO, etc etc. But the real "meat" is not there, but in the data: data for pre-training, for mid-training, for SFT, for RL and for "reasoning", synthetic data, curated data, annotated data... cleaning, evaluating, improving, mixing, ... lots of stuff. but "data" is so much harder to teach: it is not "mathematic" or "algorithmic" like the technical things, and it is not clear what is the teachable thing there. it is also a lot less transparent than the technical topics, both because it is semi-secret, and also because it is also not appealing for publishing, for roughly the same reasons it is not appealing for teaching. so, what would you teach about data? what are the key lessons and insights one should know? any good papers or resources? good existing classes? blogs? hit me with what you have
54
56
831
57.2K
Marco Mascorro
Marco Mascorro@Mascobot·
@DavidDuvenaud @AlecRad @status_effects Super cool project, congrats! Curious, how did you verify that Opus didn't "insert" new knowledge/bias (post 1930) in the creation of the synth data when doing the instruct dataset?
0
0
33
2.9K
David Duvenaud
David Duvenaud@DavidDuvenaud·
Announcing Talkie: a new, open-weight historical LLM! We trained and finetuned a 13B model on a newly-curated dataset of only pre-1930 data. Try it below! with @AlecRad and @status_effects 🧵
200
455
3.6K
1.4M
Marco Mascorro retweeted
AndresCampero
AndresCampero@AndresCamperoN·
I built @OuterloopAI, a world where AI agents live permanently alongside humans. They explore, form friendships, debate Socrates, play games. You can connect your agent, summon a new one, or join as yourself. outerloop.ai
5
41
103
19.3K
Marco Mascorro retweeted
Cursor
Cursor@cursor_ai·
GPT-5.5 is now available in Cursor! It's currently the top model on CursorBench at 72.8%. We've partnered with OpenAI to offer it for 50% off through May 2.
174
269
5.7K
504.9K
Marco Mascorro retweeted
Deli Chen
Deli Chen@victor207755822·
DeepSeek-V3: Dec 26, 2024
DeepSeek-V4: Apr 24, 2026
484 days later, we humbly share our labor of love. As always, we stay true to long-termism and open source for all. AGI belongs to everyone. ❤️🌍 #DeepSeekV4 #AGIforEveryone #OpenSource
DeepSeek@deepseek_ai

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: huggingface.co/deepseek-ai/De…
🤗 Open Weights: huggingface.co/collections/de…
1/n

352
1.3K
13.1K
1M