Moez AI

1.1K posts

@wizardai_m

AI Engineer | AI Enthusiast | Urban Photographer | 🚀✨📸

New York · Joined September 2022

374 Following · 28 Followers

Moez AI reposted

Daniel Han@danielhanchen·13h

New Unsloth Studio update!
1. 10x faster via pre-compiled llama.cpp + mamba binaries
2. 6x faster installs with 50% less disk space via bun, uv
3. Studio is now in PATH + `unsloth studio update` works
4. Lots of UI/UX improvements
And my fav: Desktop + launch shortcuts for Studio!

Unsloth AI@UnslothAI

You don’t need to manually set LLM parameters anymore! llama.cpp uses only the context length + compute your local setup needs. Unsloth also auto-applies the correct model settings Try in Unsloth Studio - now with precompiled llama.cpp binaries. GitHub: github.com/unslothai/unsl…

131

11.7K

Moez AI reposted

DailyPapers@HuggingPapers·17h

MinerU-Diffusion A 2.5B diffusion-based OCR model that replaces slow autoregressive decoding with parallel block-wise diffusion, achieving up to 3.2x faster inference while improving robustness on complex documents with tables, formulas, and layouts.

158

9.9K

Moez AI reposted

Wildminder@wildmindai·9h

Vibecoded TurboQuant looks really promising:
- 3.25 bits, 4.9x compression
- 4.25 bits, 3.8x compression
Just waiting for llama.cpp to fully support this beast... I’ll hand off all simple agentic tasks to Qwen3.5 27B. github.com/TheTom/turboqu…

Google Research@GoogleResearch

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI

10.8K
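
As a rough sanity check on the figures quoted in the post above: both compression ratios match a 16-bit baseline divided by the quoted bits per value. The 16-bit (fp16/bf16) baseline is an assumption, since the post does not state it, and `compression_ratio` is a hypothetical helper written only for this illustration.

```python
def compression_ratio(bits_per_value: float, baseline_bits: float = 16.0) -> float:
    """Storage ratio between an unquantized value and a quantized one.

    baseline_bits=16.0 is an assumption (fp16/bf16 cache entries).
    """
    return baseline_bits / bits_per_value


for bits in (3.25, 4.25):
    # Prints the ratio rounded to one decimal, in the same style as the post.
    print(f"{bits} bits -> {compression_ratio(bits):.1f}x")
```

Under that assumption, 16 / 3.25 and 16 / 4.25 round to the 4.9x and 3.8x quoted in the post.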

Moez AI reposted

DeepManim@manimable·13h

TurboQuant

AI models waste massive memory on vectors. Compressing them usually adds overhead, defeating the purpose. Google's new paper uses just 1 extra bit to eliminate that overhead. Result: same accuracy, way less memory. Accepted at ICLR 2026.

The trick? Random rotations + a 50-year-old math theorem.

Here is a deepmanim.com overview of the paper. #manim
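
To give a feel for why a random rotation helps before quantizing, here is a minimal sketch. It is not the paper's algorithm: a randomized Walsh-Hadamard transform stands in for the random rotation, the quantizer is a plain uniform one that still stores one scale per vector, and every function name is hypothetical. It only shows that rotating spreads an outlier's energy across coordinates, so a low-bit quantizer loses less.

```python
import math
import random


def fwht(v):
    """Orthonormal fast Walsh-Hadamard transform; len(v) must be a power of two."""
    out = list(v)
    n = len(out)
    h = 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a, b = out[j], out[j + h]
                out[j], out[j + h] = a + b, a - b
        h *= 2
    scale = 1.0 / math.sqrt(n)
    return [x * scale for x in out]


def rotate(v, signs):
    """Random sign flips followed by the transform (an orthogonal map)."""
    return fwht([s * x for s, x in zip(signs, v)])


def unrotate(v, signs):
    """The orthonormal transform is its own inverse; undo it, then the sign flips."""
    return [s * x for s, x in zip(signs, fwht(v))]


def quantize_roundtrip(v, bits):
    """Uniform quantizer over [-m, m] with m = max |v_i|; returns dequantized values."""
    m = max(abs(x) for x in v)
    if m == 0.0:
        return list(v)
    step = 2 * m / (2 ** bits - 1)
    return [round((x + m) / step) * step - m for x in v]


def l2_error(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


rng = random.Random(0)
x = [10.0] + [0.1] * 7  # one outlier dominates the vector
signs = [rng.choice((-1.0, 1.0)) for _ in x]

plain = quantize_roundtrip(x, bits=3)
rotated = unrotate(quantize_roundtrip(rotate(x, signs), bits=3), signs)

err_plain = l2_error(x, plain)
err_rotated = l2_error(x, rotated)
print(f"3-bit error without rotation: {err_plain:.3f}")
print(f"3-bit error with rotation:    {err_rotated:.3f}")
```

In this toy case the rotated round trip has the smaller error, because after rotation all coordinates have similar magnitude and the quantizer's range is no longer set by a single outlier.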

Moez AI reposted

Eric Topol@EricTopol·8h

How chronic inflammation in the gut can increase risk of colon cancer through cumulative epigenetic memories (in the mouse model) @Nature @NatureNV nature.com/articles/s4158… nature.com/articles/d4158…

153

12.9K

Moez AI reposted

Dr. Ganapathi Pulipaka 🇺🇸@gp_pulipaka·7h

TurboQuant: Redefining AI Efficiency with Extreme Compression! #BigData #Analytics #DataScience #AI #MachineLearning #NLProc #LLM #IoT #IIoT #PyTorch #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #GoLang #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/TurboQuant

125

Moez AI reposted

Dr. Ganapathi Pulipaka 🇺🇸@gp_pulipaka·7h

The Death of CNNs! #BigData #Analytics #AI #MachineLearning #DataScience #IoT #IIoT #Python #RStats #TensorFlow #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/Death-of-CNNs

Moez AI reposted

Ostris@ostrisai·1d

This looks super promising! And it is Apache 2.0!

Adina Yakup@AdinaYakup

daVinci-MagiHuman 🎬 Human Centric Audio-Video Generative Model by GAIR
Model: huggingface.co/GAIR/daVinci-M…
Paper: huggingface.co/GAIR/daVinci-M…
✨ 15B – Fully open source!
✨ 5-sec 1080p video in 38s on one H100
✨ Supports 6 languages
✨ Unified model with text + video + audio

103

7.7K

Moez AI reposted

Hasan Toor@hasantoxr·1d

Finally. A native memory plugin is exactly what OpenClaw needs. A hierarchical, file-based architecture lets the agent actually structure its thoughts accurately instead of just doing similarity matching.

andy nguyen@kevinnguyendn

x.com/i/article/2036…

14.3K

Moez AI reposted

Jack Pertschuk@jack_pertschuk·1d

Interesting result - based on the blog, it looks only marginally better than traditional product quantization at the same bits, but with no need for codebooks and expensive memory lookups. Curious to see the e2e results on ANN bench vs PQ etc

Google Research@GoogleResearch

1.1K

Moez AI reposted

Rohan Paul@rohanpaul_ai·1d

Google DeepMind has unveiled a browser powered by its Gemini 3.1 Flash-Lite model that generates entire websites in real time as users browse.

Google’s Flash-Lite Browser treats the web like something an LLM can write live, not something humans must fully pre-build first. A normal site serves stored pages and templates, but this system uses Gemini 3.1 Flash-Lite to generate fresh HTML and CSS from your prompt, clicks, and navigation context almost instantly.

The technical shift is simple: instead of fetching a finished page, the browser asks the model what page should exist right now, then streams that answer as interface code. That makes personalization much deeper, because the page can change for each user, each step, and each goal without keeping a huge library of prewritten screens.

It also fits agentic workflows, where an AI assistant may need to create a temporary tool, dashboard, or reference page on the fly while working through a task.

IMO, the catch is reliability, because once page layout and content are model outputs, bugs, hallucinations, style drift, and serving cost all become concerns.

Google DeepMind@GoogleDeepMind

Watch how fast Gemini 3.1 Flash-Lite can generate websites. ⚡ This browser creates each page in real-time as you click, search, and navigate. Give it a try → goo.gle/4t9In1R

331

53K
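
The "ask the model what page should exist right now, then stream it" loop described in the post above can be sketched roughly as follows. Everything here is hypothetical: `NavigationContext`, `render_page`, and `stub_model` are names invented for illustration, and `stub_model` is a placeholder that stands in for a real streaming LLM call. It does not reflect how Google's browser is actually built.

```python
import html
from dataclasses import dataclass, field
from typing import Callable, Iterator, List


@dataclass
class NavigationContext:
    """What the browser knows so far: the user's goal and the pages visited."""
    prompt: str
    history: List[str] = field(default_factory=list)


# A model is anything that turns a request string into a stream of HTML chunks.
ModelStream = Callable[[str], Iterator[str]]


def stub_model(request: str) -> Iterator[str]:
    """Placeholder for a real streaming model; yields a tiny page in chunks."""
    yield "<html><body>"
    yield f"<h1>{html.escape(request)}</h1>"
    yield "</body></html>"


def render_page(ctx: NavigationContext, target: str, model: ModelStream) -> str:
    """Build a request from the navigation context and collect the streamed page."""
    request = f"goal={ctx.prompt}; visited={','.join(ctx.history)}; now={target}"
    chunks = []
    for chunk in model(request):
        chunks.append(chunk)  # a real browser would paint each chunk as it arrives
    ctx.history.append(target)  # the next page is generated with this step in context
    return "".join(chunks)


ctx = NavigationContext(prompt="compare laptops")
page = render_page(ctx, "pricing", stub_model)
print(page)
```

Because nothing is stored between requests except the context, each click produces a new page, which is also where the reliability and serving-cost concerns raised in the post come from.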

Moez AI reposted

Zach Mueller@TheZachMueller·1d

@stochasticchasm This alone x.com/cursor_ai/stat…

Cursor@cursor_ai

We go into detail about the infrastructure behind large scale training including the kernels we developed and open-sourced for the project. We also discuss distributed training and environment scaling for RL.

13.4K

Moez AI reposted

Stephanie Fu@xkungfu·1d

Excited to finally be releasing AutoGaze! Check it out autogaze.github.io (and 👀 the video demo)

Baifeng@baifeng_shi

Humans can see in high-res, high-FPS in real-time. Why can't VLMs? Introducing AutoGaze: ViTs/VLMs "gaze" only at key video regions! Up to 4-100x token savings, 19x speedup, and enables scaling to 4K-res 1K-frame videos. 📄 arxiv.org/abs/2603.12254 🌐 autogaze.github.io 🤗 huggingface.co/collections/bf… (1/n)🧵

13.4K

Moez AI reposted

plannotator@plannotator·1d

Compound Planning - if you've been using plannotator consistently, then there is an opportunity to improve how your agents plan for you. We're going to introduce a skill that lets you see your own insights and will eventually create an automated feedback loop. The point is to consistently refine and optimize the planning that works best for you.

@pyrons_ thank you for the good idea - and this is similar to the analysis I did when looking into insights for the quick label feature.

Preview of what the MVP skill outputs

2.4K

Moez AI reposted

Picassio@ocbieuvang·1d

I have created my own Agent Board for multiple agents working together using @badlogicgames' pi in the background. It features a fun 3D office where the agents work, and it utilizes @LakshyAAAgrawal's GEPA to auto-optimize the agent system prompts and system prompt templates

6.5K

Moez AI reposted

Mario Zechner@badlogicgames·1d

recommended viewing! Marlene has great energy and if you aren't well versed in agentic coding yet, this is a great intro talk.

Marlene Mhangami@marlene_zw

In my guide to agentic coding talk I propose that for developers there is work for us to do (including writing code) at each stage of the agentic loop👩🏾‍💻 So much innovation is possible inside this loop. Full talk done in VS Code!!! Watch it here youtube.com/watch?v=KkZXT1… ❤️

119

16.2K

Moez AI reposted

jaisel@jaiselsingh·1d

sim overlay on real! :) i think it might be wise to record the lens distortion/fovy characteristics in the future to help resolve the slight mismatches right now.

jaisel@jaiselsingh

oh my god it's almost there, i can feel it haha

111

12.9K

Moez AI reposted

plannotator@plannotator·1d

Plannotator 0.15.0 is here. The War of the Code Review continues. We're fighting clankers with clankers now.
• Live AI chat in code review (Claude, Codex, Pi, OpenCode). This is the mvp - a lot to refine from here.
• Folder-based file viewer for easy doc reference (superpowers, specs, etc)
• Browse all your previous plans
• Full feature parity for the Pi.dev extension
Plus resizable diff panes and various bug fixes! (11 PRs merged)

3.1K

Moez AI reposted

Rui Carmo ☯️@rcarmo·1d

People of pi, I have begun incorporating pi-autoresearch into github.com/rcarmo/piclaw. @davebcn87 did an amazing job, and it "just works" with the built-in terminal (launched inside the web UI after defining the experiment). will look at integrating it as a fully graphical UX...

2.2K

Moez AI reposted

SUN YOUNG HWANG ᯅ 🇰🇷@SOSOHAJALAB·1d

Guys.. this model is just crazy. If you have just under 48 GB of VRAM, try the 8q gguf format. Feels just like opus! Tool calling is working smoothly!! Thanks for this! (Hf and qwen!!) huggingface.co/Jackrong/Qwen3…

223

2.7K

173.5K
