billyG88

2.2K posts

@billyG881

ML engineer. "I don't need a reason to grind, I grind because I need it" - billyG88

Joined December 2020
428 Following · 110 Followers
billyG88 retweeted
Red Hat AI @RedHat_AI
What compression looks like on @vllm_project. Same Gemma 4 31B. Red Hat AI's quantized version runs at nearly 2x tokens/sec, half the memory, 99%+ accuracy retained. Open source. Quantized with LLM Compressor. Links in comments. 🙏 @_soyr_ for the 2-minute demo.
1 reply · 5 reposts · 31 likes · 1.4K views
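The "half the memory" claim in the retweet above is easy to sanity-check with back-of-the-envelope arithmetic. A minimal sketch in plain Python (no vLLM required; the helper name and numbers are illustrative, not from the demo):

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory needed just for model weights, in GB."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9

# A 31B-parameter model in 16-bit vs. 8-bit weights:
fp16 = weight_memory_gb(31, 16)  # 62.0 GB
int8 = weight_memory_gb(31, 8)   # 31.0 GB, i.e. half the memory
print(fp16, int8)
```

Activations and KV cache add overhead on top, so real savings vary by workload, but weights dominate at this scale, which is why halving bits-per-weight roughly halves the footprint.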
billyG88 retweeted
David Hendrickson @TeksEdge
🚀 New TTS Model: MOSS-TTS-Nano, a real-time TTS that runs fully on CPU! 💡 I'm always looking for good TTS models to run on an average PC (w/o a GPU), and this is one of them. OpenMOSS is a new 0.1B multilingual model:
• No GPU needed, pure CPU inference
• Real-time speech generation + streaming
• 20 languages supported
• 48 kHz stereo quality
• Ultra-lightweight & easy local/web deployment
Huge value for edge devices, local demos, web apps, and lightweight products where a GPU isn't an option. Part of the MOSS-TTS family 👏
1 reply · 7 reposts · 54 likes · 2.2K views
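The "48 kHz stereo" spec above translates directly into the raw-audio throughput a CPU streamer must sustain to count as real-time. A rough sketch (my own helper; 16-bit PCM is an assumption, since the tweet doesn't state the sample format):

```python
def pcm_bytes_per_second(sample_rate_hz: int, channels: int, bits_per_sample: int) -> int:
    """Raw PCM throughput a real-time audio stream must sustain."""
    return sample_rate_hz * channels * bits_per_sample // 8

# 48 kHz stereo, assuming 16-bit samples:
rate = pcm_bytes_per_second(48_000, 2, 16)
print(rate)  # 192000 bytes/s, i.e. ~192 kB per second of generated speech
```

Any generation pipeline whose sustained output falls below this rate will stutter during streaming playback, which is why "real-time on CPU" is a meaningful constraint for a 0.1B model.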
billyG88 retweeted
ModelScope @ModelScope2022
Say hello to MOSS-TTS-Nano 🚀 A 0.1B multilingual TTS from MOSI.AI and OpenMOSS, designed for real-time speech generation without a GPU. It runs directly on CPU, keeping the deployment stack simple enough for local demos, web serving, and lightweight product integration. Part of the MOSS-TTS family alongside the 1.7B and 8B flagship models. 🤖 modelscope.cn/models/openmos… 🌍 modelscope.ai/models/openmos… 💻 github.com/OpenMOSS/MOSS-…
4 replies · 32 reposts · 209 likes · 30.2K views
billyG88 retweeted
Abhimanyu Sharma @0xN1nja
started with a raspberry pi, now i run an entire AWS region at home
390 replies · 670 reposts · 16K likes · 966.3K views
billyG88 retweeted
Masato Ota @ottamm_190
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering arxiv.org/abs/2604.08224
7 replies · 72 reposts · 401 likes · 28.2K views
billyG88 @billyG881
@Hesamation I wonder how many AI avenues could benefit from 1T $$$ in investment…
0 replies · 0 reposts · 0 likes · 68 views
ℏεsam @Hesamation
DHH is in. Karpathy is in. Andrew Ng is in. Terence Tao is in. Linus Torvalds is in. John Carmack is in. Tony with an opinion still believes AI is just a next-token predictor with no real future.
87 replies · 62 reposts · 1.4K likes · 136.4K views
billyG88 retweeted
dealign.ai @dealignai
MiniMax M2.7 at 83 GB. Wow.
12 replies · 10 reposts · 187 likes · 10.8K views
billyG88 retweeted
NVIDIA AI Developer @NVIDIAAIDev
🎉Congratulations to the @MiniMax_AI team on the launch of MiniMax M2.7! MiniMax M2.7 is now available with NVIDIA GPU accelerated endpoints ready to try out with claws including NemoClaw and @OpenClaw. 🦞 📝Get started with our technical guide: developer.nvidia.com/blog/minimax-m… and see how you can begin experimenting for free at build.nvidia.com/minimaxai/mini…. What will you build this weekend? Share in comments. 👇
MiniMax (official) @MiniMax_AI

We're delighted to announce that MiniMax M2.7 is now officially open source, with SOTA performance on SWE-Pro (56.22%) and Terminal Bench 2 (57.0%). You can find it on Hugging Face now. Enjoy! 🤗 Hugging Face: huggingface.co/MiniMaxAI/Mini… Blog: minimax.io/news/minimax-m… MiniMax API: platform.minimax.io

56 replies · 159 reposts · 1.8K likes · 352.5K views
billyG88 retweeted
Roan @RohOnChain
This 2-hour Stanford lecture shows exactly how Stanford trains its engineers to build AI systems. It's more practical than every Claude tutorial & prompting thread you've seen. Bookmark it & give it 2 hours, no matter what. It'll be the most productive thing you do this weekend.
146 replies · 1.8K reposts · 12.9K likes · 1.5M views
billyG88 retweeted
Asuka Groyper 🚬 @asukagrypr
I'm sorry George.
164 replies · 2.8K reposts · 34K likes · 784.7K views
billyG88 @billyG881
@pfau AXAXAXAXAXAXXAAXXAXAXAXAXAXAXAX
0 replies · 0 reposts · 0 likes · 4 views
David Pfau @pfau
At the current growth rate of 3x every quarter, Anthropic's revenue is on track to surpass Google in Q4 this year, Amazon in Q1 next year and the entire United States federal government somewhere in Q2/Q3.
60 replies · 52 reposts · 1.4K likes · 171.3K views
billyG88 retweeted
François Fleuret @francoisfleuret
This is really pure 2019 science fiction.
22 replies · 89 reposts · 2K likes · 230.3K views
billyG88 retweeted
Avid @Av1dlive
This 15-minute talk by the creator of Pydantic on how to correctly use MCPs will teach you more about making your AI tools actually work together than everything you've scrolled past this year. Bookmark this & watch, no matter what. Then read the guide below by @eng_khairallah1
Khairallah AL-Awady @eng_khairallah1

x.com/i/article/2042…

34 replies · 260 reposts · 2K likes · 344.5K views
billyG88 retweeted
@Axiomofmind ⚡ @axiomofmind
training in Unsloth:
- learning which quant size to train
- training parameters based on data size
- merging LoRA
- exporting to gguf / quantized
- merging ggufs
- running on llama.cpp vs. ollama
fun stuff. lots of wasted time but learning by doing is best IMO
2 replies · 2 reposts · 44 likes · 10.8K views
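The "merging LoRA" step in that list is just folding a low-rank update back into the base weights. A toy sketch of the arithmetic with plain Python lists, following the usual W' = W + (alpha/r) * B @ A convention (function and variable names are mine, not Unsloth's API):

```python
def merge_lora(W, A, B, alpha, r):
    """Merge a LoRA update into base weights: W' = W + (alpha/r) * B @ A.

    W: d_out x d_in base matrix, B: d_out x r, A: r x d_in (nested lists).
    """
    scale = alpha / r
    d_out, d_in = len(W), len(W[0])
    return [[W[i][j] + scale * sum(B[i][k] * A[k][j] for k in range(r))
             for j in range(d_in)] for i in range(d_out)]

# 2x2 identity base with a rank-1 update:
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]   # d_out x r
A = [[3.0, 4.0]]     # r x d_in
print(merge_lora(W, A, B, alpha=1.0, r=1))  # [[4.0, 4.0], [6.0, 9.0]]
```

After merging, the adapter matrices can be discarded and the model exported (e.g. to GGUF) as a single dense checkpoint, which is why the merge comes before the export step in the list above.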
billyG88 retweeted
Lotto @LottoLabs
I actually think this is the best subscription out now. I just used $10 of Opus last night, or I could have a month of GO: $10 a month with generous limits on open models. The best of both worlds for Qwen 27B + Hermes.
41 replies · 13 reposts · 366 likes · 25.1K views
billyG88 retweeted
Design Arena @Designarena
BREAKING: Wan2.7-Video by @Alibaba_Wan is now #1 on the Video-to-Video Arena with an Elo of 1337! This establishes a new state of the art among video editing models. Huge congrats to the @Alibaba_Wan team!
18 replies · 31 reposts · 288 likes · 22.9K views
billyG88 retweeted
Fahd Mirza @fahdmirza
💥 MiniMax M2.7 is NOW running locally: a 229B open source model on CPU + GPU, and everyone can do it with llama.cpp
🔹 229B MoE model, 256 experts, 8 active per token
🔹 IQ4_XS quant brings it down to 100GB, no need for multiple GPUs
🔹 Split across H100 VRAM + system RAM via llama.cpp GPU offloading
🔹 OpenAI-compatible API endpoint, a drop-in replacement for any client
🔹 SWE-Bench Pro 56.2, within 2 points of GPT-5.4
🔹 MLE-Bench Lite 66.6% medal rate, #2 only to Opus 4.6 and GPT-5
🔹 Modified MIT license: run it, build on it, ship it
🔥 Watch the full video below: 👇
2 replies · 1 repost · 48 likes · 4.5K views
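The VRAM + system RAM split in the retweet above comes down to how many transformer layers land on the GPU; llama.cpp exposes this as the `-ngl` (`--n-gpu-layers`) flag. A rough way to estimate the split, assuming roughly uniform layer sizes (the helper and the layer count are my own illustration, not MiniMax's actual architecture):

```python
def ngl_estimate(model_gb: float, num_layers: int, vram_budget_gb: float) -> int:
    """Estimate how many layers fit in VRAM (a starting value for llama.cpp -ngl)."""
    per_layer_gb = model_gb / num_layers
    return min(num_layers, int(vram_budget_gb / per_layer_gb))

# A ~100 GB quant with, say, 50 layers against an 80 GB H100,
# keeping ~10 GB of headroom for KV cache and activations:
print(ngl_estimate(100.0, 50, 70.0))  # 35 layers on GPU, the rest in system RAM
```

In practice you would start near this estimate and adjust `-ngl` up or down based on actual VRAM usage, since the KV cache grows with context length and eats into the headroom.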