Ashish Khanagwal

6.7K posts

Ashish Khanagwal

@TheAshrex

MLOps • AI • Startups Building @CrooviOfficial ⧽ CrooFx ⚡https://t.co/q4ISN4pxaf ⧽ CroFlux ⚡ https://t.co/obJy3Si2cf ⧽ CroVew ⚡https://t.co/4SSbjwUTa1

Server room Katılım Mayıs 2021

195 Takip Edilen671 Takipçiler

Sabitlenmiş Tweet

Ashish Khanagwal@TheAshrex·23 Mar

Building an execution-focused ecosystem at @CrooviOfficial ⚡ • Croofx → AI execution engine that helps developers understand, modify & ship code faster • CroFlux → turns product ideas into clear milestones & daily tasks so builders stay consistent • CroVew → real-time “god view” dashboard to see exactly how users interact with your product Waitlist open for Croofx & CroFlux 👇 ⧽ CrooFx - croovi.com ⧽ CroFlux - croflux.vercel.app CroVew demo + waitlist coming soon 👀

English

309

Ashish Khanagwal@TheAshrex·16 Nis

@ThePrimeagen if you're not minminning your min while simultaneously maxmaxxing your max, your minimax is just a maxi with extra steps

English

121

ThePrimeagen@ThePrimeagen·16 Nis

This means you have to minminning your min and maxmaxxing your max to properly minimax

Casey Muratori@cmuratori

If you're not maxmaxxing, then anything else you maxx will not actually be maxed. Let that sink in.

English

339

23.4K

Ashish Khanagwal@TheAshrex·16 Nis

@claudeai 94.2% on GPQA Diamond. 87.6% on SWE-bench Verified. I don't care about benchmarks until I do. And I do, Alright Anthropic, you have my attention. 👀

English

123

Claude@claudeai·16 Nis

Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.

English

4.7K

10.2K

81K

13.9M

Ashish Khanagwal@TheAshrex·9 Nis

@ThePrimeagen social distancing from X to slow the spread… on X the AGI is already in the walls bro

English

ThePrimeagen@ThePrimeagen·9 Nis

It's been 0 days since AGI We are currently experiencing a more aggressive strain of AGI, code-named mythos. I would recommend social distancing from platforms like X for the next 24 to 48 hours to slow the spread

ThePrimeagen@ThePrimeagen

its been 0 days since agi

English

1.6K

49.7K

Ashish Khanagwal@TheAshrex·9 Nis

WHAT ?!

English

Ashish Khanagwal@TheAshrex·8 Nis

@AnthropicAI every week a new feature. zero time to actually go deep on any of them

English

150

Anthropic@AnthropicAI·8 Nis

New on the Engineering Blog: Building Managed Agents—our hosted service for long-running agents—meant solving an old problem in computing: how to design a system for “programs as yet unthought of.” Read more: anthropic.com/engineering/ma…

English

393

458

3.6K

572.4K

Ashish Khanagwal@TheAshrex·8 Nis

@claudeai a new feature every week is cool until you realise nothing from last week is fully stable yet

English

110

Claude@claudeai·8 Nis

Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform.

English

2.1K

57K

21.6M

Ashish Khanagwal@TheAshrex·8 Nis

@Spamfromk honestly i think happiness is just a decision. not easy, but it's always a choice

English

白夜 ♡@Spamfromk·7 Nis

But honestly why is our generation is so unhappy??

English

123

119

12.5K

Ashish Khanagwal@TheAshrex·8 Nis

@marclou honestly fair. the best tool is the one you actually use, not the one with the best benchmarks

English

229

Marc Lou@marclou·8 Nis

I've tried Claude Code and Codex. The upgrade was nowhere near worth the time needed to adapt. I ignore new tools that take more than 1 minute to set up and use. You can build a startup with GPT-3.

Adithya@curiousadithya

@marclou marc i don't understand why you are still using cursor. i a confused, am i missing something. Or is it your personal choice to code inside cursor. can you tell me what made you stick to cursor instead of using claude code or codex directly ??

English

184

638

132.1K

Ashish Khanagwal@TheAshrex·8 Nis

@leylndd too dangerous to release = it actually works and everyone's scared

English

leyland@leylndd·8 Nis

“we cant merge my pr. it’s too dangerous to release”

English

2.7K

Ashish Khanagwal@TheAshrex·8 Nis

so meta finally drops open weights, goes closed with Muse Spark, lands just behind claude opus 4.6 and gpt-5.4 on benchmarks, and wall street immediately pumps them 9% 😭 the message is pretty clear. open source was great for developers, closed is great for investors. zuck read the room

shirish@shiri_shh

Meta stock price after dropping their own frontier LLM model.

English

Ashish Khanagwal@TheAshrex·8 Nis

@shiri_shh meta goes closed source for the first time and stock pumps 9% in a day 💀 wall street has been waiting for this move for a while

English

shirish@shiri_shh·8 Nis

Meta stock price after dropping their own frontier LLM model.

Artificial Analysis@ArtificialAnlys

Meta is back! Muse Spark scores 52 on the Artificial Analysis Intelligence Index, behind only Gemini 3.1 Pro, GPT-5.4, and Claude Opus 4.6. Muse Spark is the first new release since Llama 4 in April 2025 and also Meta's first release that is not open weights Muse Spark is a new model from @Meta evaluated on Artificial Analysis. We were given early access by Meta to independently benchmark the model. It is the first frontier-class model from Meta since Llama 4 Maverick was released in April 2025, and notably the first @AIatMeta model that is not being released as open weights. The release follows Meta's reorganization of its AI efforts under Meta Superintelligence Labs, and signals that Meta is re-entering the frontier race after roughly a year of relative quiet. For context, Llama 4 Maverick and Scout scored 18 and 13 respectively on the Artificial Analysis Intelligence Index as non-reasoning models at the time of their release, while Muse Spark scores 52. Muse Spark essentially closes the gap between to the frontier in a single release. The model is not open source and is not yet accessible via an API but Meta has shared they expect this to come soon. Meta is also integrating Muse Spark into their first party products including their Meta AI chat product, Facebook, Instagram and Threads. Key takeaways from our benchmarks: ➤ Muse Spark scores 52 on the Artificial Analysis Intelligence Index, placing it within the top 5 models we have benchmarked. It sits ahead of Claude Sonnet 4.6, GLM-5.1, MiniMax-M2.7, Grok 4.20 and behind Gemini 3.1 Pro Preview, GPT-5.4 and Claude Opus 4.6 ➤ Muse Spark is notably token efficient for its intelligence level. It used 58M output tokens to run the Intelligence Index, comparable to Gemini 3.1 Pro Preview (57M) and notably lower than Claude Opus 4.6 (Adaptive Reasoning, max effort, 157M), GPT-5.4 (xhigh, 120M) and GLM-5 (110M) ➤ Muse Spark is the second-most capable vision model we have benchmarked. It scores 80.5% on MMMU-Pro, behind only Gemini 3.1 Pro Preview (82.4%) ➤ Muse Spark performs strongly on reasoning and instruction-following evaluations. It scores 39.9% on HLE, trailing only Gemini 3.1 Pro Preview (44.7%) and GPT-5.4 (xhigh, 41.6%). The model also achieved 5th highest in CritPT with a score of 11%, an eval that is focused on difficult physics research questions. This is substantially above above Gemini 3 Flash (9%) and Claude 4.6 Sonnet (3%) ➤ Agentic performance does not stand out. On GDPval-AA, our evalaution focused on real world work tasks, Muse Spark scores 1427, behind both Claude Sonnet 4.6 at 1648 and GPT-5.4 at 1676, but ahead of Gemini 3.1 Pro Preview at 1320. On On TerminalBench Hard, Muse Spark trails Claude Sonnet 4.6, GPT-5.4, and Gemini 3.1 Pro. Muse Spark joins others in achieving a high τ²-Bench Telecom score of 92% Key model details: ➤ Modalities: Multimodal including text and vision input, text output ➤ License: Proprietary, Meta's first frontier model not released as open weights ➤ Availability: No public API at the time of publishing. Meta expects to provide API access soon. Meta has started integration into their first party AI offering Meta AI and inside Facebook, Instagram, and Threads

English

3.2K

Ashish Khanagwal@TheAshrex·8 Nis

@Jashanx_gill interesting idea, how exactly does the job simulation work? like what kind of tasks or scenarios do users go through?

English

Jashan@Jashanx_gill·8 Nis

Day 15 of building Practa (a job simulation platform) ✅ Spent the day focusing on marketing strategy ✅ Decided to go all in on YouTube ✅ Created the channel today Waitlist is open - app.youform.com/forms/cxzusmmu

Jashan@Jashanx_gill

Jobs don't want to train employees anymore. They want Plug-and-Play employees So, I am creating a job Simulation platform and this is Day 14 Today I, ✅Bought the Domain Name. ( The platform has a name now) ✅Brainstormed Marketing Ideas. If you are a fresher, follow along the journey of "Practa"

English

1.5K

Ashish Khanagwal@TheAshrex·8 Nis

@alexabelonix 100% agree, first sale is the hardest unlock once that happens everything becomes a process curious, any marketing advice for early-stage builders trying to get their first users? 👀

English

Alexa Web3 (e/acc)@alexabelonix·8 Nis

Believe in this If you can sell 1 ... You can sell 10 You can sell 100 You can sell 1,000 You can sell 10,000 You can sell 1,000,000

English

949

Ashish Khanagwal@TheAshrex·8 Nis

@rxhit05 Marketing is most importatnt ig

English

Rohit@rxhit05·8 Nis

As a founder, What would you focus on the most right now? -growth -product -revenue -users -content

English

2.7K

Ashish Khanagwal@TheAshrex·8 Nis

Waitlist is now open:- croflux.vercel.app CroFlux turns a rough idea or PDS into a structured execution roadmap with milestones, tasks, and build order, instantly. If you’re building something this year, this will save you weeks of figuring out what to do next.

English

Ashish Khanagwal@TheAshrex·8 Nis

Today was one of those real builder days. Shipped the core engine of @CrooviOfficial CroFlux: → PDS → AI roadmap generation → Structured milestones, tasks, boss stages → Stored in Supabase → Usage limits + guardrails → Auth-protected API → Loading UX wired → Gemini fallback stack working Basically: idea → execution plan → dashboard Still left: • polish loader UX • roadmap editing layer • gamification layer (XP, bosses, streaks) • dashboard refinement • deployment Getting dangerously close to something usable. Builders: the hardest part isn’t AI… It’s turning outputs into structured execution. CroFlux is solving that.

English

Ashish Khanagwal@TheAshrex·8 Nis

@SNagarani1419 senior devs still own the architecture, the decisions, the hard debugging. but execution? junior with AI is right there. the gap closed faster than anyone expected

English

Nagarani@SNagarani1419·8 Nis

Be honest: Is a Junior Dev with AI tools more valuable than a Senior Dev without them in 2026?

English

1.5K

Ashish Khanagwal@TheAshrex·8 Nis

@primal_brainer just 'code' without the claude

English

primordial intelligence@primal_brainer·7 Nis

who am i without my claude

English

1.2K

Ashish Khanagwal@TheAshrex·8 Nis

Looking to contribute to AI research under strong professors/researchers. My interests: applied AI, LLM systems, AI tooling, developer productivity, and real-world impact. If you know researchers open to motivated collaborators or research interns, I’d really appreciate an intro or direction 🙏 Happy to share portfolio, GitHub, and work. RT appreciated.

English

Ashish Khanagwal@TheAshrex·7 Nis

@RealProductGirl and every setback upgrades your decision-making engine

English

Samantha Simonhoff@RealProductGirl·7 Nis

Every scar is a lesson. Every setback, a setup. Keep going.

English

716

Keşfet

@ThePrimeagen @claudeai @AnthropicAI @Spamfromk @marclou @leylndd @shiri_shh @elonmusk