Ash Explained

1.4K posts

Ash Explained banner
Ash Explained

Ash Explained

@AshExplained

I Explain Things!

India शामिल हुए Kasım 2022
247 फ़ॉलोइंग64 फ़ॉलोवर्स
Ash Explained
Ash Explained@AshExplained·
Claude says our project will take 4 to 6 months. Haha no you are going to finish it in one hour lol
English
0
0
0
4
Gary Simon
Gary Simon@designcoursecom·
What is everyone building right now? I can't be the only one with 3 simultaneous projects.
English
125
0
67
7.5K
Kaito
Kaito@KaiXCreator·
I'm a vibe coder, scare me with one word.
Kaito tweet media
English
131
1
108
7.9K
Sick
Sick@sickdotdev·
We used to learn coding in our college days because there was no Claude. What are students learning now?
English
39
0
51
2.3K
Tibo
Tibo@tibo_maker·
so I am traveling for the next 2 weeks what agent workflow should I ABSOLUTELY set up before I go?
English
38
0
75
11.3K
Ash Explained
Ash Explained@AshExplained·
something clicked 7 weeks into building RennOS. i stopped writing CLAUDE.md for myself. started writing it for a version of claude that has zero memory of this project. that's when 99 agents actually started cooperating. github.com/AshExplained/R…
English
1
0
1
13
Ash Explained
Ash Explained@AshExplained·
@ayaboch Built in Second assessment of task done by claude itself...I have found it useful
English
0
0
0
15
Aya Bochman
Aya Bochman@ayaboch·
coding with claude and reviewing the sloppy code with codex is meta
English
1
0
4
456
Lex Christopherson
Lex Christopherson@official_taches·
Claude Code's leaked source code is one of the all time best leaks of all time.
English
6
0
34
2.3K
Linus ✦ Ekenstam
Linus ✦ Ekenstam@LinusEkenstam·
Can I see some links of what you’ve built with Claude code? Blow my mind
English
336
43
1.2K
385K
Ash Explained
Ash Explained@AshExplained·
@codewithantonio Do you have a video showing how you use both codex and claude code in tandem for production level tasks?
English
0
0
0
66
Code With Antonio
Code With Antonio@codewithantonio·
claude reviewing codex changes
Code With Antonio tweet media
English
3
0
66
4.2K
Ash Explained
Ash Explained@AshExplained·
@sarahwooders I tried gpt5.4 in codex mac app and it feels way better...have you tried it in the app ? Cli still feels meh
English
0
0
0
451
Sarah Wooders
Sarah Wooders@sarahwooders·
To the people who had early access to GPT-5.4 and told us all it was amazing for coding - did you even use it?? I'm fully back to Codex 5.3
English
92
6
331
86.2K
Ash Explained
Ash Explained@AshExplained·
For Development projects: IDE > Terminal > Desktop Apps
English
0
0
0
8
Ash Explained
Ash Explained@AshExplained·
@official_taches tremendous product...my only concern is anthropic or google just stopping it like they did in opencode...i thinks thats the only bottleneck
English
0
0
0
35
Ash Explained
Ash Explained@AshExplained·
@0xvegito every time i've let it build more than one feature at a time i've regretted it.
English
1
0
0
32
vegito
vegito@0xvegito·
Claude Code & Codex and other coding agents are not very precise. They cause bloat, they create unnecessary abstractions, they’re literally unable to think in a top down systems or architectural manner, and as a result they cause a lot of tech debt
English
1
0
1
214
Ash Explained
Ash Explained@AshExplained·
@_philschmid half the cost is the part that's actually gonna make people switch
English
0
0
0
29
Philipp Schmid
Philipp Schmid@_philschmid·
Step by Step.
Artificial Analysis@ArtificialAnlys

Google is once again the leader in AI: Gemini 3.1 Pro Preview leads the Artificial Analysis Intelligence Index, 4 points ahead of Claude Opus 4.6 while costing less than half as much to run @GoogleDeepMind gave us pre-release access to Gemini 3.1 Pro Preview. It leads 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index and improves significantly over Gemini 3 Pro Preview across capabilities, with the biggest gains in reasoning and knowledge, coding, and hallucination reduction. Gemini 3.1 Pro Preview also remains relatively token efficient, using ~57M tokens to run the Artificial Analysis Intelligence Index (+1M from Gemini 3 Pro Preview), lower than other frontier models at max reasoning settings such as Opus 4.6 (max) and GPT-5.2 (xhigh). Combined with lower per-token pricing, Gemini 3.1 Pro Preview is cost-efficient among frontier peers, costing less than half as much as Opus 4.6 (max) to run the full Intelligence Index, though still nearly 2x the leading open-weights model, GLM-5. Key Takeaways: ➤ State-of-the-art intelligence at lower costs: Gemini 3.1 Pro Preview is leading 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index at less than half the cost to run of frontier peers from @OpenAI and @AnthropicAI. It obtains the highest score in Terminal-Bench Hard (agentic coding), AA-Omniscience (knowledge & hallucination), Humanity’s Last Exam (reasoning & knowledge), GPQA-Diamond (scientific reasoning), SciCode (coding) and CritPt (research-level physics). The CritPt score is particularly notable, scoring 18% on unpublished, research-level physics reasoning problems, over 5 p.p. above the next best model ➤ Improved real-world agentic performance, but not leading: Gemini 3.1 Pro Preview shows an improvement in GDPval-AA, our agentic evaluation focusing on real-world tasks, but is still not the leading model in this area. The model increases its ELO score over 100 points to 1316 (up from Gemini 3 Pro Preview), however still sits behind Claude Sonnet 4.6, Opus 4.6, GPT-5.2 (xhigh), and GLM-5 ➤ Leading coding abilities: Gemini 3.1 Pro Preview leads the Artificial Analysis Coding Index, achieving the highest score in both Terminal-Bench Hard (54%) and SciCode (59%) ➤ Reduced hallucinations: Gemini 3.1 Pro Preview shows a major improvement in tendency to guess incorrectly when it doesn’t know the answer, reducing its AA-Omniscience hallucination rate by 38 p.p. from Gemini 3 Pro Preview ➤ Maintained token and cost efficiency: Gemini 3.1 Pro Preview improves without material increases in cost or token usage. It uses only ~2% more tokens to run the Artificial Analysis Intelligence Index than Gemini 3 Pro Preview, and keeps the same pricing ($2/$12 per 1M input/output tokens for ≤200k context). Its cost to run the Artificial Analysis Intelligence Index of $892 is less than half of frontier models such as Opus 4.6 (max) and GPT-5.2 (xhigh), though still ~2x the cost of leading open weights models such as GLM 5 ($547) ➤ Google takes top 3 spots in multi-modality: Gemini 3.1 Pro Preview ranks #1 on MMMU-Pro, our multimodal understanding and reasoning benchmark, ahead of Gemini 3 Pro Preview and Gemini 3 Flash, reinforcing Google’s leadership in multimodal reasoning ➤ Other model details: Gemini 3.1 Pro Preview retains the same 1 million token context window as its predecessor, and includes support for tool calling, structured outputs, and JSON mode

English
5
4
119
6.1K
Ash Explained
Ash Explained@AshExplained·
@testingcatalog asked for opus competition three hours ago. google apparently took it personally
English
0
0
2
369
Ash Explained
Ash Explained@AshExplained·
keep vibe coding. seriously. building fast is a real skill. just don't stop at "it works." run a security pass. set up a real deployment pipeline. add monitoring. the people filling that gap are the ones who'll still be standing when the hype settles.
English
0
0
0
22
Ash Explained
Ash Explained@AshExplained·
right now everyone is competing on who can build fastest. the people who are gonna win are the ones who can also audit, harden, and actually deploy. that gap is wide open and almost nobody is filling it.
English
1
0
0
26
Ash Explained
Ash Explained@AshExplained·
AI coding tools mass produce working code. nobody's checking if that code is actually safe to ship. the gap between "it runs on my machine" and "it's live and not leaking data" is where vibe coders quietly disappear. 🧵
English
1
0
0
29