Ashish Khanagwal

6.7K posts

Ashish Khanagwal banner
Ashish Khanagwal

Ashish Khanagwal

@TheAshrex

MLOps • AI • Startups Building @CrooviOfficial ⧽ CrooFx ⚡https://t.co/q4ISN4pxaf ⧽ CroFlux ⚡ https://t.co/obJy3Si2cf ⧽ CroVew ⚡https://t.co/4SSbjwUTa1

Server room Katılım Mayıs 2021
195 Takip Edilen671 Takipçiler
Sabitlenmiş Tweet
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
Building an execution-focused ecosystem at @CrooviOfficial ⚡ • Croofx → AI execution engine that helps developers understand, modify & ship code faster • CroFlux → turns product ideas into clear milestones & daily tasks so builders stay consistent • CroVew → real-time “god view” dashboard to see exactly how users interact with your product Waitlist open for Croofx & CroFlux 👇 ⧽ CrooFx - croovi.com ⧽ CroFlux - croflux.vercel.app CroVew demo + waitlist coming soon 👀
Ashish Khanagwal tweet media
English
1
1
4
309
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@ThePrimeagen if you're not minminning your min while simultaneously maxmaxxing your max, your minimax is just a maxi with extra steps
English
0
0
0
121
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@claudeai 94.2% on GPQA Diamond. 87.6% on SWE-bench Verified. I don't care about benchmarks until I do. And I do, Alright Anthropic, you have my attention. 👀
English
0
0
0
123
Claude
Claude@claudeai·
Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.
Claude tweet media
English
4.7K
10.2K
81K
13.9M
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@ThePrimeagen social distancing from X to slow the spread… on X the AGI is already in the walls bro
English
0
0
0
68
ThePrimeagen
ThePrimeagen@ThePrimeagen·
It's been 0 days since AGI We are currently experiencing a more aggressive strain of AGI, code-named mythos. I would recommend social distancing from platforms like X for the next 24 to 48 hours to slow the spread
ThePrimeagen@ThePrimeagen

its been 0 days since agi

English
90
47
1.6K
49.7K
Anthropic
Anthropic@AnthropicAI·
New on the Engineering Blog: Building Managed Agents—our hosted service for long-running agents—meant solving an old problem in computing: how to design a system for “programs as yet unthought of.” Read more: anthropic.com/engineering/ma…
English
393
458
3.6K
572.4K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@claudeai a new feature every week is cool until you realise nothing from last week is fully stable yet
English
0
0
1
110
Claude
Claude@claudeai·
Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform.
English
2.1K
6K
57K
21.6M
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@Spamfromk honestly i think happiness is just a decision. not easy, but it's always a choice
English
0
0
0
5
白夜 ♡
白夜 ♡@Spamfromk·
But honestly why is our generation is so unhappy??
English
123
13
119
12.5K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@marclou honestly fair. the best tool is the one you actually use, not the one with the best benchmarks
English
0
0
1
229
Marc Lou
Marc Lou@marclou·
I've tried Claude Code and Codex. The upgrade was nowhere near worth the time needed to adapt. I ignore new tools that take more than 1 minute to set up and use. You can build a startup with GPT-3.
Adithya@curiousadithya

@marclou marc i don't understand why you are still using cursor. i a confused, am i missing something. Or is it your personal choice to code inside cursor. can you tell me what made you stick to cursor instead of using claude code or codex directly ??

English
184
20
638
132.1K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@leylndd too dangerous to release = it actually works and everyone's scared
English
0
0
2
48
leyland
leyland@leylndd·
“we cant merge my pr. it’s too dangerous to release”
leyland tweet media
English
5
2
81
2.7K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@shiri_shh meta goes closed source for the first time and stock pumps 9% in a day 💀 wall street has been waiting for this move for a while
English
0
0
0
32
shirish
shirish@shiri_shh·
Meta stock price after dropping their own frontier LLM model.
shirish tweet media
Artificial Analysis@ArtificialAnlys

Meta is back! Muse Spark scores 52 on the Artificial Analysis Intelligence Index, behind only Gemini 3.1 Pro, GPT-5.4, and Claude Opus 4.6. Muse Spark is the first new release since Llama 4 in April 2025 and also Meta's first release that is not open weights Muse Spark is a new model from @Meta evaluated on Artificial Analysis. We were given early access by Meta to independently benchmark the model. It is the first frontier-class model from Meta since Llama 4 Maverick was released in April 2025, and notably the first @AIatMeta model that is not being released as open weights. The release follows Meta's reorganization of its AI efforts under Meta Superintelligence Labs, and signals that Meta is re-entering the frontier race after roughly a year of relative quiet. For context, Llama 4 Maverick and Scout scored 18 and 13 respectively on the Artificial Analysis Intelligence Index as non-reasoning models at the time of their release, while Muse Spark scores 52. Muse Spark essentially closes the gap between to the frontier in a single release. The model is not open source and is not yet accessible via an API but Meta has shared they expect this to come soon. Meta is also integrating Muse Spark into their first party products including their Meta AI chat product, Facebook, Instagram and Threads. Key takeaways from our benchmarks: ➤ Muse Spark scores 52 on the Artificial Analysis Intelligence Index, placing it within the top 5 models we have benchmarked. It sits ahead of Claude Sonnet 4.6, GLM-5.1, MiniMax-M2.7, Grok 4.20 and behind Gemini 3.1 Pro Preview, GPT-5.4 and Claude Opus 4.6 ➤ Muse Spark is notably token efficient for its intelligence level. It used 58M output tokens to run the Intelligence Index, comparable to Gemini 3.1 Pro Preview (57M) and notably lower than Claude Opus 4.6 (Adaptive Reasoning, max effort, 157M), GPT-5.4 (xhigh, 120M) and GLM-5 (110M) ➤ Muse Spark is the second-most capable vision model we have benchmarked. It scores 80.5% on MMMU-Pro, behind only Gemini 3.1 Pro Preview (82.4%) ➤ Muse Spark performs strongly on reasoning and instruction-following evaluations. It scores 39.9% on HLE, trailing only Gemini 3.1 Pro Preview (44.7%) and GPT-5.4 (xhigh, 41.6%). The model also achieved 5th highest in CritPT with a score of 11%, an eval that is focused on difficult physics research questions. This is substantially above above Gemini 3 Flash (9%) and Claude 4.6 Sonnet (3%) ➤ Agentic performance does not stand out. On GDPval-AA, our evalaution focused on real world work tasks, Muse Spark scores 1427, behind both Claude Sonnet 4.6 at 1648 and GPT-5.4 at 1676, but ahead of Gemini 3.1 Pro Preview at 1320. On On TerminalBench Hard, Muse Spark trails Claude Sonnet 4.6, GPT-5.4, and Gemini 3.1 Pro. Muse Spark joins others in achieving a high τ²-Bench Telecom score of 92% Key model details: ➤ Modalities: Multimodal including text and vision input, text output ➤ License: Proprietary, Meta's first frontier model not released as open weights ➤ Availability: No public API at the time of publishing. Meta expects to provide API access soon. Meta has started integration into their first party AI offering Meta AI and inside Facebook, Instagram, and Threads

English
6
0
27
3.2K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@Jashanx_gill interesting idea, how exactly does the job simulation work? like what kind of tasks or scenarios do users go through?
English
1
0
1
13
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@alexabelonix 100% agree, first sale is the hardest unlock once that happens everything becomes a process curious, any marketing advice for early-stage builders trying to get their first users? 👀
English
0
0
0
23
Alexa Web3 (e/acc)
Alexa Web3 (e/acc)@alexabelonix·
Believe in this If you can sell 1 ... You can sell 10 You can sell 100 You can sell 1,000 You can sell 10,000 You can sell 1,000,000
English
19
1
48
949
Rohit
Rohit@rxhit05·
As a founder, What would you focus on the most right now? -growth -product -revenue -users -content
English
63
1
58
2.7K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
Waitlist is now open:- croflux.vercel.app CroFlux turns a rough idea or PDS into a structured execution roadmap with milestones, tasks, and build order, instantly. If you’re building something this year, this will save you weeks of figuring out what to do next.
English
0
0
0
21
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
Today was one of those real builder days. Shipped the core engine of @CrooviOfficial CroFlux: → PDS → AI roadmap generation → Structured milestones, tasks, boss stages → Stored in Supabase → Usage limits + guardrails → Auth-protected API → Loading UX wired → Gemini fallback stack working Basically: idea → execution plan → dashboard Still left: • polish loader UX • roadmap editing layer • gamification layer (XP, bosses, streaks) • dashboard refinement • deployment Getting dangerously close to something usable. Builders: the hardest part isn’t AI… It’s turning outputs into structured execution. CroFlux is solving that.
English
1
1
2
89
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
@SNagarani1419 senior devs still own the architecture, the decisions, the hard debugging. but execution? junior with AI is right there. the gap closed faster than anyone expected
English
0
0
1
11
Nagarani
Nagarani@SNagarani1419·
Be honest: Is a Junior Dev with AI tools more valuable than a Senior Dev without them in 2026?
English
35
1
25
1.5K
Ashish Khanagwal
Ashish Khanagwal@TheAshrex·
Looking to contribute to AI research under strong professors/researchers. My interests: applied AI, LLM systems, AI tooling, developer productivity, and real-world impact. If you know researchers open to motivated collaborators or research interns, I’d really appreciate an intro or direction 🙏 Happy to share portfolio, GitHub, and work. RT appreciated.
English
0
0
1
71
Samantha Simonhoff
Samantha Simonhoff@RealProductGirl·
Every scar is a lesson. Every setback, a setup. Keep going.
English
18
0
44
716