Hussein Lezzaik

673 posts

Hussein Lezzaik

@husseinlezzaik

computer use https://t.co/IdfTPm1OeU

Montréal, CA Katılım Aralık 2018

166 Takip Edilen1.9K Takipçiler

Hussein Lezzaik retweetledi

Peter Steinberger 🦞@steipete·25 Nis

Built clawsweeper, which runs 50 codex in parallel around the clock, scans issues/prs deep and closes what is already implemented or what makes no sense. Closed around 4000 issues today, a few thousand are in the pipeline. (rate limits are rough) github.com/openclaw/claws…

English

423

574

9.4K

2.1M

Hussein Lezzaik retweetledi

Christos Tzamos@ChristosTzamos·12 Mar

1/4 LLMs solve research grade math problems but struggle with basic calculations. We bridge this gap by turning them to computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds solving even the hardest Sudokus with 100% accuracy

English

249

812

6.1K

1.8M

Hussein Lezzaik retweetledi

Dev@DevvMandal·12 Mar

Today, we're launching the world's largest open-source dataset of computer-use recordings. 10,000+ hours across Salesforce, Blender, Photoshop and more, to automate the next level of white-collar work. Link in the comments :) @markov__ai

English

198

1.8K

456.3K

Hussein Lezzaik retweetledi

Standard Intelligence@si_pbc·23 Şub

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.

GIF

English

189

403

3.9K

1.2M

Hussein Lezzaik retweetledi

JUNDE WU@JundeMorsenWu·7 Şub

Introducing OneContext. I built it for myself but now I can’t work without it, so it felt wrong not to share. OneContext is an Agent Self-Managed Context Layer across different sessions, devices, and coding agents (Codex / Claude Code). How it works: 1. Open Claude Code/Codex inside OneContext as usual, it automatically manages your context and history into a persistent context layer. 2. Start a new agent under the same context, it remembers everything about your project. 3. Share the context via link, anyone can continue building on the exact same shared context. Install with: npm i -g onecontext-ai And open with: onecontext Give it a try!

English

141

243

3.6K

963K

Hussein Lezzaik@husseinlezzaik·29 Oca

@mehul @maticrobots congrats @mehul well deserved!

English

Mehul@mehul·29 Oca

Announcing $60M for @maticrobots. We didn't ask "What's the most impressive robot we can demo?" We asked "What's the most useful robot we can ship? What comes after Roomba?" Customers answered with their wallets: It's Matic.

English

203

149

1.6K

1.7M

Hussein Lezzaik retweetledi

Kimi.ai@Kimi_Moonshot·27 Oca

🥝 Meet Kimi K2.5, Open-Source Visual Agentic Intelligence. 🔹 Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%) 🔹 Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%) 🔹 Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion. 🔹 Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5× faster compared with single-agent setup. - 🥝 K2.5 is now live on kimi.com in chat mode and agent mode. 🥝 K2.5 Agent Swarm in beta for high-tier users. 🥝 For production-grade coding, you can pair K2.5 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blogs/kimi-k2-… 🔗 Weights & code: huggingface.co/moonshotai/Kim…

English

779

15.9K

7.3M

Hussein Lezzaik retweetledi

Mohammed Alshehri@SwishMoe·12 Oca

My implementation of the Recursive Language Model (RLM) paper by @a1zhang , Kraska, and @lateinteraction . Key insight: "Treat long context as an external environment, not something to stuff into a context window." Applied to video understanding — instead of encoding 38K frames into a prompt, the agent: → Treats video as an environment → Writes code to explore segments → Uses recursive LLM sub-calls for analysis Tested: 20+ min video, 7 steps, $0.002 Paper: arxiv.org/abs/2512.24601 Code: github.com/mohammed840/RL…

English

854

56.9K

Hussein Lezzaik retweetledi

1X@1x_tech·12 Oca

NEO’s Starting to Learn on Its Own

English

299

419

3.2K

6.3M

Hussein Lezzaik retweetledi

William Holmberg@WilliamHolmbe19·3 Oca

casually driving around on google maps this is also opensource and can be found on my github

English

871

40.3K

Hussein Lezzaik@husseinlezzaik·5 Oca

@bcherny why does this happen and how do i fix it? @bcherny

English

Boris Cherny@bcherny·4 Oca

run this: /mobile

English

195

2.1K

701.5K

Hussein Lezzaik@husseinlezzaik·5 Oca

this took around 30mins of training only

GIF

English

Hussein Lezzaik@husseinlezzaik·5 Oca

for more info check: - blog: husseinlezzaik.com/tess/flow-matc… - model: huggingface.co/TESS-Computer/… - code: github.com/HusseinLezzaik… - data: huggingface.co/datasets/TESS-…

English

Hussein Lezzaik@husseinlezzaik·5 Oca

the model produces trajectories in action chunks of detlta_x/delta_y

English

Hussein Lezzaik@husseinlezzaik·4 Oca

@trq212 now that CC codes autonomously — what are the odds that a hacker group gets CC to install a key-logger without it knowing it did?

English

1.5K

Thariq@trq212·4 Oca

If you started using Claude Code over the holidays, you might be curious about how AI actually works, the benefits and risks, and where it's headed. Here are some of my favorite papers on alignment, interpretability, and societal impacts 🧵

English

123

1.3K

155K

Hussein Lezzaik@husseinlezzaik·4 Oca

@modal serverless infra is a perfect fit for Claude Code. I used to always use Lambda Labs, but given how easy it is to give CC access to a GPU, monitor training runs, benchmarks, run certain inference tests, push code fixes .. all on demand & in the background it's effortless.

English

154

Hussein Lezzaik retweetledi

alex zhang@a1zhang·3 Oca

Much like the switch in 2025 from language models to reasoning models, we think 2026 will be all about the switch to Recursive Language Models (RLMs). It turns out that models can be far more powerful if you allow them to treat *their own prompts* as an object in an external environment, which they understand and manipulate by writing code that invokes LLMs! Our full paper on RLMs is now available—with much more expansive experiments compared to our initial blogpost from October 2025! arxiv.org/pdf/2512.24601

English

252

1.1K

7.4K

Hussein Lezzaik@husseinlezzaik·4 Oca

None of the SoTa Computer use models can output a smooth high-frequency cursor path in order to draw a circle. I trained a small DiT action head + Qwen2.5-VL on 10k samples to generate continuous point trajectories using action chunking and flow matching for 30mins on an H100.

GIF

English

105

Hussein Lezzaik@husseinlezzaik·4 Oca

OpenAI defines 5 stages of intelligence w/ level 5 AI running organizations. The best economically human institutions all run on custom UIs with limited API support. Therefore agents wrapping 10k APIs will always be at a disadvantage and limited compared to ones that could.

English

115

Hussein Lezzaik@husseinlezzaik·4 Oca

Figure's Helix VLA to click screens uses the same VLM backbone of computer use models to control it's hands. General computer use is a robotics problem, we need better digital hands!

Brett Adcock@adcock_brett

We now test our humanoid robots by having its onboard neural network play games that challenge its intelligence and fine motor skills

English

Keşfet

@markov__ai @mehul @maticrobots @a1zhang @lateinteraction @bcherny @trq212 @modal