
Marcos V
Meet Gemma 4: our new family of open models you can run on your own hardware. Built for advanced reasoning and agentic workflows, they're released under an Apache 2.0 license. Here’s what’s new 🧵

I never use plan mode. The main reason this was added to codex is for claude-pilled people who struggle with changing their habits. Just talk with your agent.

me and him after the dumbest fight but we can’t stay mad at each other


I've just released APEX (Adaptive Precision for EXpert Models): a novel MoE quantization technique that outperforms @UnslothAI Dynamic 2.0 on accuracy while being 2x smaller for MoE architectures. Benchmarked on Qwen3.5-35B-A3B, but the method applies to any MoE model. Half the size of Q8. Perplexity comparable to F16. Works with stock @ggml_org's llama.cpp. Open source (of course!), with ❤️ from the @LocalAI_API team. 👇Links to the model, repository and benchmarks below! (+ Bonus TurboQuant benchmarks with @no_stp_on_snek's TQ+! )
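The post names the technique (adaptive precision per expert) but doesn't describe the algorithm. Purely as an illustration of the general idea, here is a minimal, hypothetical sketch of per-expert bit-width assignment: experts the router selects often get more precision, rarely-routed experts get aggressive quantization, so the average bit-width lands well below a uniform Q8. The function name, the tertile heuristic, and the bit choices are all assumptions, not APEX's actual method.

```python
from typing import List

def assign_expert_bits(route_freq: List[float],
                       bit_choices=(3, 4, 8)) -> List[int]:
    """Hypothetical heuristic (not APEX's real algorithm): rank experts
    by routing frequency, then give the hottest third the highest
    precision, the middle third medium, and the coldest third the lowest."""
    n = len(route_freq)
    order = sorted(range(n), key=lambda i: route_freq[i], reverse=True)
    bits = [0] * n
    for rank, idx in enumerate(order):
        if rank < n // 3:
            bits[idx] = bit_choices[2]   # hot experts: keep high precision
        elif rank < 2 * n // 3:
            bits[idx] = bit_choices[1]   # warm experts: medium precision
        else:
            bits[idx] = bit_choices[0]   # cold experts: quantize aggressively
    return bits

# Toy routing statistics for an 8-expert layer (made up for illustration).
freq = [0.30, 0.25, 0.15, 0.12, 0.10, 0.05, 0.02, 0.01]
bits = assign_expert_bits(freq)
print(bits)                    # [8, 8, 4, 4, 4, 3, 3, 3]
print(sum(bits) / len(bits))   # 4.625 — far below uniform 8-bit
```

An average of ~4.6 bits per weight is how a mixed-precision scheme like this could plausibly end up around half the size of a uniform Q8 quant, which is consistent with the size claim in the post; whether APEX works this way is not stated there.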

@ashen_one as a professional Claude maxer do u literally use Claude for everything and not codex at all, or do u think codex is better at implementing Claude’s plans, or vice versa? I'm lost rn and Claude is being so ass

@TheRealMarcosV Post results!
