G. @ The Neuron

5.3K posts

@TheNeuronScribe

I am dumb but I am learning

Joined July 2024
4.4K Following · 107 Followers
G. @ The Neuron reposted
DailyPapers @HuggingPapers ·
Vision2Web: Evaluating coding agents on 193 real-world tasks across static, interactive, and full-stack development, with automated verification via GUI agents and VLM judges.
[image attached]
G. @ The Neuron reposted
Nick Dobos @NickADobos ·
Codex / Claude Code pro tip: NEVER RESUME A CONVERSATION AFTER HITTING THE LIMIT. Always start a new chat. If your last chat was using 500k of the 1M window, you will nuke 50% of your usage with a single "hello" message. Caching is weird. If you need that context, tell the AI to go read & summarize the previous thread.
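The back-of-envelope math behind this tip, under the simplifying assumption that resuming a thread re-processes its entire prior history (real cache billing varies; the numbers here are illustrative):

```python
# Why a one-word "hello" in a resumed thread can burn half the window:
# the whole prior conversation is sent along with it. Illustrative only;
# actual caching and billing behavior differ by provider.

WINDOW = 1_000_000      # total context window (tokens)
old_thread = 500_000    # tokens already in the previous conversation
hello = 5               # tokens in the new message

resume_cost = old_thread + hello   # full history re-processed
fresh_cost = hello                 # a brand-new chat only pays for "hello"

print(f"resume: {resume_cost / WINDOW:.0%} of the window")
print(f"fresh:  {fresh_cost / WINDOW:.4%} of the window")
```

A fresh chat that reads a written summary of the old thread pays only for the summary's tokens, which is the whole point of the tip.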
G. @ The Neuron reposted
Harper Carroll @HarperSCarroll ·
More Details on Anthropic's Leaked Code: PART 1

Anthropic accidentally exposed Claude Code's agentic source code, and here's what we can learn from it.

what happened
Due to a human misstep during publishing, a file inside Anthropic's shared code update contained internal source code:
∙ ~512,000 lines of code
∙ ~1,900 TypeScript files
Not a hack, just a packaging error, resulting in publicizing the IP of one of the greatest AI technologies ever made.

how this technology actually works
What we already knew & what the leaked code reveals about how Claude Code works.

1. tools: how Claude takes actions
Claude doesn't just generate text; it uses built-in abilities called "tools" to take action. See attached tools tables for examples.

2. it's a cycle
AI agents aren't magic. Claude repeats a loop:
1. Receives your prompt
2. Analyzes the prompt to determine the best tool(s)
3. Runs those tools
4. Injects the tool results into the conversation history (the "context window"), so the model can see and reference them
5. Repeats until no more tools are needed

3. in simplest terms
An AI agent is: a large language model + tools in a loop that keeps going until it determines that the task is done. The "intelligence" is Claude. The "agent" part is the loop.

what wasn't leaked
The large language models themselves (there are multiple Claude models) weren't leaked, just the tools. The weights, architectures, training data & training pipelines of these neural networks are still secret. & good thing: those cost hundreds of millions of dollars of compute to create. (Pop quiz: what are open vs. closed-source models? Comment below!)

engineering craft
The loop itself isn't the whole story; in fact, that's been public. What the leaked code reveals is just how much additional, complex scaffolding goes into making that loop reliable at scale, like:
∙ system prompt engineering
∙ context compaction to stay within token limits
∙ how tools are designed & sandboxed
∙ permission modeling
& much more. We'll cover more in the next post.

was this helpful?
Did you learn anything? Have any questions? What should I cover next? Let me know in the comments!
[image attached]
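The "LLM + tools in a loop" pattern described in the tweet can be sketched in a few lines. Every name below (fake_model, TOOLS, run_agent) is invented for illustration; this is not Anthropic's actual code, just the shape of the loop:

```python
# Minimal sketch of an agent loop: the model proposes a tool call, the
# runtime executes it, the result is injected back into the context, and
# the loop repeats until the model stops asking for tools.

TOOLS = {
    "read_file": lambda path: f"<contents of {path}>",
    "bash": lambda cmd: f"<output of `{cmd}`>",
}

def fake_model(context):
    """Stand-in for the LLM: requests one tool, then finishes."""
    if not any(m["role"] == "tool" for m in context):
        return {"tool": "read_file", "args": {"path": "README.md"}}
    return {"text": "Done: summarized README.md"}

def run_agent(prompt, model=fake_model, max_steps=10):
    context = [{"role": "user", "content": prompt}]        # context window
    for _ in range(max_steps):
        reply = model(context)
        if "tool" not in reply:                            # no tools left: done
            return reply["text"], context
        result = TOOLS[reply["tool"]](**reply["args"])     # run the tool
        context.append({"role": "tool", "content": result})  # inject result
    raise RuntimeError("agent did not terminate")

answer, history = run_agent("Summarize the README")
```

The intelligence lives in `model`; everything else is the loop, and the real engineering effort (sandboxing, compaction, permissions) wraps around exactly these few lines.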
G. @ The Neuron reposted
rohin @rohinlohe ·
25 years ago, Google helped create the first economic model for the web — ads. Today, that model is changing and we (@Cloudflare) are excited to enable every developer and site owner to shape how the world transacts. Honored to be a steward, alongside iconic businesses such as Coinbase, Stripe, Visa, Mastercard, Google, Microsoft, and many more. A special thank you to @programmer and @kleffew94 for their vision, and to Coinbase for their openness to making this an open protocol. Get started today & send your feedback my way: developers.cloudflare.com/agents/agentic…
Coinbase 🛡️ @coinbase

x.com/i/article/2039…

G. @ The Neuron reposted
Felix Rieseberg @felixrieseberg ·
Computer Use is now available on Windows! This gives Claude on Windows the ability to control your keyboard and mouse. It's really effective at letting Claude handle legacy apps.
G. @ The Neuron reposted
Lydia Hallie ✨ @lydiahallie ·
Digging into reports, most of the fastest burn came down to a few token-heavy patterns. Some tips:
• Sonnet 4.6 is the better default on Pro. Opus burns roughly twice as fast. Switch at session start.
• Lower the effort level or turn off extended thinking when you don't need deep reasoning. Switch at session start.
• Start fresh instead of resuming large sessions that have been idle ~1h.
• Cap your context window; long sessions cost more: CLAUDE_CODE_AUTO_COMPACT_WINDOW=200000
We're rolling out more efficiency improvements, so make sure you're on the latest version. If a small session is still eating a huge chunk of your limit in a way that seems unreasonable, run /feedback and we'll investigate.
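A toy sketch of what a cap like CLAUDE_CODE_AUTO_COMPACT_WINDOW does: once the running token count crosses the cap, older messages get folded into a summary. The halving heuristic and summary stub below are invented for illustration and are not Claude Code's actual compaction logic:

```python
# Toy auto-compaction: keep recent messages under half the cap, replace
# everything older with a single summary placeholder. Illustrative only.
import os

CAP = int(os.environ.get("CLAUDE_CODE_AUTO_COMPACT_WINDOW", "200000"))

def maybe_compact(messages, tokens_of=len):
    """messages: list of strings; tokens_of: crude token counter."""
    if sum(tokens_of(m) for m in messages) <= CAP:
        return messages                     # under the cap: untouched
    recent, dropped = [], []
    budget = CAP // 2                       # reserve half the cap for recents
    for m in reversed(messages):            # walk newest -> oldest
        if budget - tokens_of(m) >= 0:
            budget -= tokens_of(m)
            recent.append(m)
        else:
            dropped.append(m)
    summary = f"<summary of {len(dropped)} earlier messages>"
    return [summary] + list(reversed(recent))
```

The practical effect matches the tip: a capped session never re-sends an unbounded history, so each turn costs a bounded number of tokens.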
G. @ The Neuron reposted
Yuchen Jin @Yuchenj_UW ·
We’ll soon be able to do this in Claude Code: “Claude, cure my cancer. Make no mistakes.”
[image attached]
G. @ The Neuron reposted
stevibe @stevibe ·
Gemma4 just dropped. How does it handle tool calls? I ran ToolCall-15 across the full Gemma4 family. Gemma4 31b = Qwen3.5 27b: both perfect 15/15. But here's what's wild: Qwen3.5 9b already clears 13/15, while Gemma4 needs 26b to match that.
Christoph Nakazawa @cnakazawa ·
I think I'm at a breaking point with LLM text. ChatGPT's language has become the worst. I have full-on AI fatigue.

The honest truth
Why this fixes it (short answer)
Clean fix
The safest bet
Final honest take
Best-case scenario (totally possible)
My straight recommendation
Bottom line (no sugarcoating)
If you want, tell me […] and I'll tell you what I'd personally do in your exact situation (not generic advice)
Instead of asking "[…]", think: 👉 "How do I maximize […]?"
My honest recommendation (based on what you said)
Let me be real with you upfront
Here's the pro move
That's actually a really good question—let's sharpen it so it actually makes sense.
Still real. Not peak performance
That's not just […]. That's […]

I wrote the first 3 myself, but then I went to a chat and just kept copying more examples. People don't write like this. Are we doomed to have to read the same poor sentence structure and wording for the rest of our lives? It's even worse when I have to read other people's LLM slop. Thank you, I can prompt an LLM myself. Do I have to pay a person to operate the LLM for me and write back slowly in human language?
G. @ The Neuron reposted
claire vo 🖤 @clairevo ·
Yep. See this over and over. You need tools, sure. But you really need:
- culture change
- technical readiness
- a new operating model
And it's hard to do if you haven't figured it out in the senior ranks. April 18-19 I'm teaching a small cohort of execs how: maven.com/clairevo/ai-na…
Brianne Kimmel @briannekimmel

A dangerous pattern for companies today is assuming signing up for a bunch of AI tools is a strategy. Every company needs to map out exactly what problems need to be solved and determine what products exist today & where custom agents need to be built.

G. @ The Neuron reposted
Weizhuo (Ken) Wang @KenWangWeizhuo ·
A person walks around campus for 5 hours with cameras. That's it. That's the training data. The result? A humanoid robot that traverses unseen buildings, crowds, and glass walls, with zero robot data and zero finetuning. EgoNav is here. egonav.weizhuowang.com

None of these behaviors were pre-programmed:
• Waiting for a door to open before entering
• Steering around glass walls invisible to depth sensors
• Yielding to pedestrians and resuming
• Re-routing when furniture is rearranged

All emerged from 5 hours of a human walking around. The prior is real. (1/6) #Humanoid #Robotics #DiffusionModel #EgoNav
Pietro Schirano @skirano ·
If you thought the Chinese models were good, just wait a couple of months, now that all the distillation poisoning has been removed from Claude Code.
G. @ The Neuron reposted
atomic.chat @atomic_chat_hq ·
Running Hermes agent locally with Gemma4
Device: MacBook Air
CPU: M4
RAM: 16GB
Open source. Free. Private.
With TurboQuant cache in the @Atomic_Chat_HQ app