Hussein Lezzaik

673 posts

Hussein Lezzaik banner
Hussein Lezzaik

Hussein Lezzaik

@husseinlezzaik

computer use https://t.co/IdfTPm1OeU

Montréal, CA Katılım Aralık 2018
166 Takip Edilen1.9K Takipçiler
Hussein Lezzaik retweetledi
Peter Steinberger 🦞
Peter Steinberger 🦞@steipete·
Built clawsweeper, which runs 50 codex in parallel around the clock, scans issues/prs deep and closes what is already implemented or what makes no sense. Closed around 4000 issues today, a few thousand are in the pipeline. (rate limits are rough) github.com/openclaw/claws…
English
423
574
9.4K
2.1M
Hussein Lezzaik retweetledi
Christos Tzamos
Christos Tzamos@ChristosTzamos·
1/4 LLMs solve research grade math problems but struggle with basic calculations. We bridge this gap by turning them to computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds solving even the hardest Sudokus with 100% accuracy
English
249
812
6.1K
1.8M
Hussein Lezzaik retweetledi
Dev
Dev@DevvMandal·
Today, we're launching the world's largest open-source dataset of computer-use recordings. 10,000+ hours across Salesforce, Blender, Photoshop and more, to automate the next level of white-collar work. Link in the comments :) @markov__ai
English
91
198
1.8K
456.3K
Hussein Lezzaik retweetledi
Standard Intelligence
Standard Intelligence@si_pbc·
Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.
GIF
English
189
403
3.9K
1.2M
Hussein Lezzaik retweetledi
JUNDE WU
JUNDE WU@JundeMorsenWu·
Introducing OneContext. I built it for myself but now I can’t work without it, so it felt wrong not to share. OneContext is an Agent Self-Managed Context Layer across different sessions, devices, and coding agents (Codex / Claude Code). How it works: 1. Open Claude Code/Codex inside OneContext as usual, it automatically manages your context and history into a persistent context layer. 2. Start a new agent under the same context, it remembers everything about your project. 3. Share the context via link, anyone can continue building on the exact same shared context. Install with: npm i -g onecontext-ai And open with: onecontext Give it a try!
English
141
243
3.6K
963K
Mehul
Mehul@mehul·
Announcing $60M for @maticrobots. We didn't ask "What's the most impressive robot we can demo?" We asked "What's the most useful robot we can ship? What comes after Roomba?" Customers answered with their wallets: It's Matic.
English
203
149
1.6K
1.7M
Hussein Lezzaik retweetledi
Kimi.ai
Kimi.ai@Kimi_Moonshot·
🥝 Meet Kimi K2.5, Open-Source Visual Agentic Intelligence. 🔹 Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%) 🔹 Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%) 🔹 Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion. 🔹 Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5× faster compared with single-agent setup. - 🥝 K2.5 is now live on kimi.com in chat mode and agent mode. 🥝 K2.5 Agent Swarm in beta for high-tier users. 🥝 For production-grade coding, you can pair K2.5 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blogs/kimi-k2-… 🔗 Weights & code: huggingface.co/moonshotai/Kim…
Kimi.ai tweet media
English
779
2K
15.9K
7.3M
Hussein Lezzaik retweetledi
Mohammed Alshehri
Mohammed Alshehri@SwishMoe·
My implementation of the Recursive Language Model (RLM) paper by @a1zhang , Kraska, and @lateinteraction . Key insight: "Treat long context as an external environment, not something to stuff into a context window." Applied to video understanding — instead of encoding 38K frames into a prompt, the agent: → Treats video as an environment → Writes code to explore segments → Uses recursive LLM sub-calls for analysis Tested: 20+ min video, 7 steps, $0.002 Paper: arxiv.org/abs/2512.24601 Code: github.com/mohammed840/RL…
English
27
79
854
56.9K
Hussein Lezzaik retweetledi
1X
1X@1x_tech·
NEO’s Starting to Learn on Its Own
English
299
419
3.2K
6.3M
Hussein Lezzaik retweetledi
William Holmberg
William Holmberg@WilliamHolmbe19·
casually driving around on google maps this is also opensource and can be found on my github
English
39
79
871
40.3K
Boris Cherny
Boris Cherny@bcherny·
run this: /mobile
English
195
61
2.1K
701.5K
Hussein Lezzaik
Hussein Lezzaik@husseinlezzaik·
this took around 30mins of training only
GIF
English
0
0
0
58
Hussein Lezzaik
Hussein Lezzaik@husseinlezzaik·
the model produces trajectories in action chunks of detlta_x/delta_y
Hussein Lezzaik tweet media
English
1
0
1
89
Hussein Lezzaik
Hussein Lezzaik@husseinlezzaik·
@trq212 now that CC codes autonomously — what are the odds that a hacker group gets CC to install a key-logger without it knowing it did?
English
0
0
0
1.5K
Thariq
Thariq@trq212·
If you started using Claude Code over the holidays, you might be curious about how AI actually works, the benefits and risks, and where it's headed. Here are some of my favorite papers on alignment, interpretability, and societal impacts 🧵
English
42
123
1.3K
155K
Hussein Lezzaik
Hussein Lezzaik@husseinlezzaik·
@modal serverless infra is a perfect fit for Claude Code. I used to always use Lambda Labs, but given how easy it is to give CC access to a GPU, monitor training runs, benchmarks, run certain inference tests, push code fixes .. all on demand & in the background it's effortless.
Hussein Lezzaik tweet media
English
0
0
2
154
Hussein Lezzaik retweetledi
alex zhang
alex zhang@a1zhang·
Much like the switch in 2025 from language models to reasoning models, we think 2026 will be all about the switch to Recursive Language Models (RLMs). It turns out that models can be far more powerful if you allow them to treat *their own prompts* as an object in an external environment, which they understand and manipulate by writing code that invokes LLMs! Our full paper on RLMs is now available—with much more expansive experiments compared to our initial blogpost from October 2025! arxiv.org/pdf/2512.24601
alex zhang tweet media
English
252
1.1K
7.4K
2M
Hussein Lezzaik
Hussein Lezzaik@husseinlezzaik·
None of the SoTa Computer use models can output a smooth high-frequency cursor path in order to draw a circle. I trained a small DiT action head + Qwen2.5-VL on 10k samples to generate continuous point trajectories using action chunking and flow matching for 30mins on an H100.
GIF
English
0
0
1
105
Hussein Lezzaik
Hussein Lezzaik@husseinlezzaik·
OpenAI defines 5 stages of intelligence w/ level 5 AI running organizations. The best economically human institutions all run on custom UIs with limited API support. Therefore agents wrapping 10k APIs will always be at a disadvantage and limited compared to ones that could.
Hussein Lezzaik tweet media
English
0
0
2
115