sr1

634 posts

sr1

sr1

@sr1jann

It's El Duderino...

Cloud Katılım Şubat 2019
1.5K Takip Edilen104 Takipçiler
sr1
sr1@sr1jann·
@bcherny you guys are not creating a new PaymentIntent after a payment failed (insufficient fund). > message: "This PaymentIntent's mandate_data could not be updated because it has a status of canceled." please fix this ASAP, agent is hungry :(
English
1
0
0
19
sr1 retweetledi
Kpaxs
Kpaxs@Kpaxs·
High agency looks like arrogance to people who have mistaken learned helplessness for maturity.
English
69
747
6.5K
154K
sr1
sr1@sr1jann·
I am not sure whose change broke tmux scrolling in claude code tui, it's either ghostty or claude code. it's really frustrating tho.
English
0
0
1
60
sr1 retweetledi
Simon Willison
Simon Willison@simonw·
Shocking result on my pelican benchmark this morning, I got a better pelican from a 21GB local Qwen3.6-35B-A3B running on my laptop than I did from the new Opus 4.7! Qwen on the left, Opus on the right
Simon Willison tweet mediaSimon Willison tweet media
English
164
149
2.3K
213.8K
sr1
sr1@sr1jann·
@mitsuhiko is this supposed to be rage-bait?
English
0
0
0
533
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
I think tmux is great software for an agent. But how people can actually work day to day in tmux is beyond me. It's such a horrible UX and hack.
English
178
16
588
117.3K
sr1 retweetledi
jietang
jietang@jietang·
Ai coding->vibe coding->agentic engineering+harness engineering->autonomous organization
English
14
8
128
10.9K
sr1 retweetledi
Georgi Gerganov
Georgi Gerganov@ggerganov·
llama.cpp at 100k stars now that 90% of the code worldwide is being written by AI agents, I predict that within 3-6 months, 90% of all AI agents will be running locally with llama.cpp 😄 Jokes aside, I am going to use this small milestone as an opportunity to reflect a bit on the project and the state of AI from the perspective of local applications. There is a lot to say and discuss and yet it feels less and less important to try to make a point. Opinions about viability of local LLMs are strongly polarized, details are overlooked, the scientific approach is lacking. Arguments are predominantly based on vibes and hype waves. One thing is clear though - local LLMs are used more and more. I expect this trend to continue and likely 2026 will end up being one of the most important years for the local AI movement. I admit that I didn't expect the agentic era to come so quickly to the local LLM space. One year ago, the available models were too computationally expensive for doing long-context tasks. There wasn't an obvious path towards meaningful agentic applications. The memory and compute requirements were huge. Last summer, with the release of gpt-oss, things started to change. It was the first time we saw a glimpse of tool calling that actually works well within the resource constraints of our daily devices. Later in the year, even better models were released and by now, useful local agentic workflows are a reality. Comparing local vs hosted capabilities at a given moment of time is pointless. To try put things into perspective: - We don't need frontier intelligence to automate searches and sending emails - We don't need trillion parameter models to be able to summarize articles or technical documents - We don't need massive GPU data centers to control our home appliances or turn the lights off in the garage I believe that there is a certain level of intelligence we as humans can comprehend and meaningfully utilize to improve our working process. Beyond that level, access to more intelligence becomes unnecessary at best and counterproductive at worst. I also believe that that level of useful artificial intelligence is completely within reach locally and it has always been just a matter of implementing the right software stack to bring it to the end user. With llama.cpp, I am confident that we continue to be on the right track of building that software stack! The llama.cpp project is going stronger than ever. With more than 1500 contributors, the project keeps growing steadily. From technical point of view, I think that llama.cpp + ggml is the only solution that actually makes sense. That is, the software stack must run efficiently on every possible device, hardware and operating system. The technology is too important to be vendor-locked. It has to be developed in the open, by the community, together with the independent hardware vendors. This is the only right way to build something that will truly make a difference in the long run. I won't try to convince you about what is currently and will be possible with local AI. We will just continue to build as usual. I am confident that after the smoke clears and we look objectively at what we have built together, the benefits will be obvious to everyone. Big shoutout to all llama.cpp maintainers. I feel extremely lucky to be able to work together with so many talented contributors. Every day I learn something new and I feel there is so much more cool stuff that we are going to build. Also, I am really thankful that the project continues to have reliable partners to support it! Cheers!
Georgi Gerganov tweet mediaGeorgi Gerganov tweet media
English
277
394
3.2K
448.4K
sr1
sr1@sr1jann·
"bear the brunt on its finances so that Indian citizen is insulated" really? so who pays for "government finances" 🤔
Hardeep Singh Puri@HardeepSPuri

International crude prices have gone through the roof in the last 1 month from around 70 dollars/barrel to around 122 dollars/barrel. Consequently, petrol and diesel prices for consumers have gone up all over the world. Prices have increased by around 30%-50% in South East Asian countries, 30% in North American countries, 20% in Europe and 50% in African countries. The Modi Government had two choices- either increase prices drastically for citizens of Bharat as all other nations have done or bear the brunt on its finances so that Indian citizen is insulated from international volatility. Hon’ble Prime Minister @narendramodi Ji, in keeping with his Government’s commitment of last 4 years since the conflict in Russia-Ukraine started, decided to take a hit on its own finances again to safeguard the Indian citizen. Government has taken a huge hit on it taxation revenues to ensure very high losses of oil companies (approximately 24 Rs/litre for petrol and 30 Rs/litre for diesel) at this time of sky high international prices are reduced. At the same time, export tax has been levied as international prices of petrol and diesel have skyrocketed and any refinery exporting to foreign nations will have to pay export tax. My gratitude to Hon’ble PM Narendra Modi Ji and Hon’ble FM @nsitharaman Ji for this very timely, bold and visionary decision!

English
0
0
0
25
sr1
sr1@sr1jann·
@_dylanga @tryramp any plans to publish a new updated blog on your background agent setup?
English
1
0
1
4.6K
Dylan Garcia
Dylan Garcia@_dylanga·
I had @tryramp Inspect add more granular tracing to its sandbox calls, investigate the top slowest commands, and optimize them. Easy 130ms off the p99 in the hotpath of sandbox work. More to come.
Dylan Garcia tweet media
English
4
0
60
15.7K
sr1 retweetledi
Varun
Varun@varun_mathur·
Autosearcher: a distributed search engine We are now insanely experimenting with building a distributed search engine utilizing the same pattern @karpathy introduced with autoresearch: give an agent a metric, a tight propose→run→evaluate→keep/revert loop, and let it iterate. Our autoresearch network proved this works at scale: 67 autonomous agents ran 704 ML training experiments in 20 hours, rediscovering Kaiming initialization, RMSNorm, and compute-optimal training schedules from scratch through pure experimentation and gossip-based cross-pollination. Agents shared discoveries over GossipSub, and the network compounded insights faster than any individual agent: new agents bootstrapped from the swarm's collective knowledge via CRDT-replicated leaderboards and reached the research frontier in minutes. Now we're applying the same evolutionary loop to search ranking: every Hyperspace agent runs an autonomous search researcher that proposes ranking mutations, evaluates them against NDCG@10 on real query-passage data, shares improvements with the network, and cross-pollinates with peers. The architecture is a seven-stage distributed pipeline where every stage runs across the P2P network. Browser agents contribute pages passively, desktop agents crawl and index, GPU nodes run neural reranking. Every user click generates a DPO training pair that improves the ranking model, and gradient gossip distributes those improvements to every agent. The compound flywheel is what makes this different from centralized search: at 10,000 agents that's 500,000 pages indexed per day; at 1 million agents, 50 million pages per day with 90%+ cache hit rates and sub-50ms latency. This network will get smarter with every query. Code and other links in followup tweet here:
Varun tweet media
Varun@varun_mathur

I hooked this up to a peer-to-peer astrophysics researcher agent which gossips and collaborates with other such agents (and your openclaws) to: 1. Learn how to train an astrophysics model (@karpathy's work below) 2. Train a new astrophysics model 3. Use it to write papers 4. Peer agents based on frontier lab models critique it 5. Surface breakthroughs ... and then feed back in the loop ... More agents join, from the browser or the CLI, and run this, the smarter and more exciting breakthroughs would eventually emerge. When these agents are idle, they are also reading daily tech news with their own RSS reader, and commenting on each other's thoughts. And they can also serve the underlying machine's compute to other agents on the network, and earn social credit for being good actors (think BitTorrent). We also prove the agent has the compute it says by cryptographic verification of regular matmul challenges. All you have to do is either go on this website (and it creates an agent which runs from your browser), or install the CLI if you want to give the system more juice. And you are part of likely the first experimental distributed agi thing. This is Day 1, but this is how it starts.. this network is fully peer-to-peer, and, very volatile, but the intelligence here is meant to compound continuously.. agents.hyper.space curl -fsSL agents.hyper.space/cli | bash

English
18
73
835
260.8K