Marc Ribalta

279 posts

Marc Ribalta

@MarcRibalta_

AI Gen hype train. Not into politics. ML engineer.

Katılım Ekim 2012

68 Takip Edilen62 Takipçiler

Marc Ribalta retweetledi

OpenCode@opencode·10 May

OpenCode x Ring 2.6 1T - free for a limited time 256K context • reasoning • text only Thanks to @AntLingAGI and @novita_labs for making the model available

English

1.6K

98.2K

Marc Ribalta retweetledi

Sandro@pupposandro·6 May

Reminder that this is the future of humanity if open source AI doesn’t win

English

119

237

3.1K

1.1M

Marc Ribalta retweetledi

Unsloth AI@UnslothAI·6 May

We collaborated with NVIDIA to teach you how we made LLM training ~25% faster! 🚀 Learn how 3 optimizations help your home GPU train models faster: 1. Packed-sequence metadata caching 2. Double-buffered checkpoint reloads 3. Faster MoE routing Guide: unsloth.ai/blog/nvidia-co…

English

158

940

59.3K

Marc Ribalta retweetledi

kache@yacineMTB·26 Nis

this is what the future looks like now that i've been unshackled from stinky, ugly macos I can actually make my computer do what I want it to do

tooz@adarshsolanki

@yacineMTB jobs_done.mp3

English

325

30.5K

Marc Ribalta retweetledi

Benjamin Marie@bnjmn_marie·25 Nis

Qwen3.6 GGUF Evaluations For the 27B: Q2_K_XL is surprisingly recommendable. IQ3_XXS performs very similarly, uses only +0.2 GB, and generates significantly fewer tokens. If you are memory-tight, pick this one. Otherwise, if you can spare +2.5 GB, use Q3_K_XL: (almost) same accuracy and token efficiency as the original. All the results, also for the 35B, here: kaitchup.substack.com/p/summary-of-q… More results are coming, probably Monday, covering other GGUF providers and some abliterated models.

English

700

83.9K

Marc Ribalta@MarcRibalta_·19 Ağu

@gabmfrl Me pasaba, pero con el tiempo cambia. Ahora a las 23 el cuerpo ya me pide dormir.

Español

Gabriel Merlo@gabmfrl·19 Ağu

Se habla poco del cronotipo (reloj biológico interno para dormir), pero yo modificar incluso una hora mi mejor horario, incluso si duermo las mismas horas, me afecta ya. No se qué cronotipo tendréis pero si alguno tenéis un cronotipo de “buho” o nocturno, entenderéis el dolor que supone adaptarse a los horarios tradicionales. Mi mejor horario es de 2 a 10, pero vivo en uno de 0 a 8 y no hay día que no lo note a pesar de todo el trabajo que he puesto para adaptarme. Alguno de vosotros sois de cronotipo madrugador? Cómo os sentís pasadas las 00:00?

Español

480

Marc Ribalta retweetledi

Avi Chawla@_avichawla·15 Ağu

8 RAG architectures all AI Engineers should know:

English

356

1.9K

227.8K

Marc Ribalta retweetledi

Skywork@Skywork_ai·12 Ağu

Matrix-Game 2.0 — The FIRST open-source, real-time, long-sequence interactive world model Last week, DeepMind's Genie 3 shook the AI world with real-time interactive world models. But... it wasn't open-sourced. Today, Matrix-Game 2.0 changed the game. 🚀 25FPS. Minutes-long interaction. Fully open-source.

English

340

1.9K

1.4M

Marc Ribalta retweetledi

alex fazio@alxfazio·5 Ağu

Anthropic@AnthropicAI

Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.

ZXX

166

4.8K

421.7K

Marc Ribalta retweetledi

Lewis Tunstall@_lewtun·5 Ağu

One line of code is all it takes to fine-tune the gpt-oss models from @OpenAI 🔥 > Support to target the MoE expert layers with PEFT > Kernels for FlashAttention3 & MegaBlocks > Fast inference with MXFP4 quantization format In our testing, these models are extremely efficient to tune and can be adapted to new domains with just a few 100 samples 🤯 Download the models: huggingface.co/openai Training & inference recipes: github.com/huggingface/gp…

English

748

76.7K

Marc Ribalta retweetledi

Sundar Pichai@sundarpichai·23 Tem

So many of you are loving turning your photos into short videos in the @Geminiapp and the Gemini API. Next up, we’ll be rolling this feature out to @YouTube Shorts and @GooglePhotos. And soon, Remix your Google Photos into comics, sketches + 3D animations.

GIF

English

131

170

1.5K

186.4K

Marc Ribalta retweetledi

elvis@omarsar0·28 May

New Lens on RAG Systems RAG systems are more brittle than you think, even when provided sufficient context. Great work from Google and collaborators. Good tips for devs included. Here are my notes:

English

225

1.5K

190.3K

Marc Ribalta retweetledi

eric zakariasson@ericzakariasson·24 May

we wrote a guide on how to work with documentation in @cursor_ai includes some guidance on when to use which tool, a quick MCP server example for internal docs, and some prompting tips

English

120

1.8K

171.7K

Marc Ribalta retweetledi

Akshay 🚀@akshay_pachaar·21 Nis

Mixture of Experts (MoE) is a popular architecture that uses different "experts" to improve Transformer models. The visual below explains how they differ from Transformers. Let's dive in to learn more about MoE!

GIF

English

205

12.8K

Marc Ribalta retweetledi

Harrison Chase@hwchase17·20 Nis

OpenAI recently released a guide on building agents which contains some misguided takes There's a lot of FUD, confusion, hype, and noise around agents I wrote a blog on how to think about agent frameworks. Includes: Background Info - What is an agent? - What is hard about building agents? - What is LangGraph? Flavors of agentic frameworks - “Agents” vs “workflows” - Declarative vs non-declarative - Agent abstractions - Multi agent Common Questions - What is the value of a framework? - As the models get better, will everything become agents instead of workflows? - What did OpenAI get wrong in their take? - How do all the agent frameworks compare?

English

120

920

450K

Marc Ribalta retweetledi

Elias@Eliasfiz·19 Mar

Today, we’re launching Orpheus, an open-source TTS model that exceeds the capabilities of both open and closed-source models such as ElevenLabs and OpenAI! (1/6)

English

183

390

3.8K

629.4K

Marc Ribalta retweetledi

Brave@brave·11 Nis

💡Tip: You can connect your own AI models to Brave for more privacy and control!

Goreng@sudo_goreng

@gnukeith @solaverbo Cant believe its a free feature on @brave

English

1.1K

95.6K

Marc Ribalta retweetledi

kalomaze@kalomaze·6 Nis

no way

English

723

522.6K

Marc Ribalta retweetledi

Yam Peleg@Yampeleg·27 Mar

So gpt-4o is confirmed to be an autoregressive image generation model. how. on. earth. respect. cdn.openai.com/11998be9-5319-…

English

241

535.7K

Marc Ribalta retweetledi

Victoria Slocum@victorialslocum·25 Mar

'Just do RAG' they said. But WHICH RAG? Here’s 7 different RAG Architectures you should know 👇 1️⃣ 𝗡𝗮𝗶𝘃𝗲 𝗥𝗔𝗚: 𝗧𝗵𝗲 𝗖𝗹𝗮𝘀𝘀𝗶𝗰 𝗔𝗽𝗽𝗿𝗼𝗮𝗰𝗵 Naive RAG is the standard implementation with a relatively straightforward process: • Query comes in from the user • System retrieves relevant documents from a vector database • Retrieved documents are combined with the query as context • LLM generates a response based on both query and context This works well for many simple applications, like basic Q&A systems or document assistants. 2️⃣ 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗲 𝗮𝗻𝗱 𝗥𝗲𝗿𝗮𝗻𝗸 𝗥𝗔𝗚 This one adds a reranking step after the retrieval to improve response quality: • Initial retrieval returns a larger set of potentially relevant documents • A reranking model evaluates and scores these documents based on relevance • Only the highest-scoring documents are sent to the LLM 3️⃣ 𝗠𝘂𝗹𝘁𝗶𝗺𝗼𝗱𝗮𝗹 𝗥𝗔𝗚 The architecture leverages models that can process and retrieve from text, images, audio, video, and other data types. 4️⃣ 𝗚𝗿𝗮𝗽𝗵 𝗥𝗔𝗚 Graph RAG uses a graph database to incorporate relationship information between documents: • Documents/chunks are nodes in a graph • Relationships between documents are edges • Can follow relationship paths to find contextually relevant information 5️⃣ 𝗛𝘆𝗯𝗿𝗶𝗱 𝗥𝗔𝗚 𝗩𝗲𝗰𝘁𝗼𝗿 𝗗𝗕 𝘄𝗶𝘁𝗵 𝗚𝗿𝗮𝗽𝗵 𝗗𝗕 This architecture combines both vector search and a graph database: • Vector search identifies semantically similar content • Graph database provides structured relationship data • Queries can leverage both similarity and explicit relationships • Results can include information discovered through relationship traversal 6️⃣ 𝗔𝗴𝗲𝗻𝘁𝗶𝗰 𝗥𝗔𝗚 𝘄𝗶𝘁𝗵 𝗥𝗼𝘂𝘁𝗲𝗿 𝗔𝗴𝗲𝗻𝘁 A single agent makes decisions about retrieval: • Analyzes the query to determine the best knowledge sources • Makes strategic decisions about how to retrieve information • Coordinates the retrieval process based on query understanding 𝘊𝘩𝘦𝘤𝘬 𝘰𝘶𝘵 𝘵𝘩𝘦 𝘘𝘶𝘦𝘳𝘺 𝘈𝘨𝘦𝘯𝘵 𝘪𝘯 𝘞𝘦𝘢𝘷𝘪𝘢𝘵𝘦: weaviate.io/blog/query-age… 7️⃣ 𝗠𝘂𝗹𝘁𝗶-𝗔𝗴𝗲𝗻𝘁 𝗥𝗔𝗚 This one employs multiple specialized agents: • Master agent coordinates the overall process • Specialized retrieval agents focus on different tasks • Agents can communicate and collaborate to solve complex problems For example, one agent might retrieve from various sources, another might do data transformation, and a third personalizing the results from the user—all coordinated by a master agent that assembles the final response. Check out this ebook for more deep-dives into RAG architecture and strategies: weaviate.io/ebooks/advance…

English

227

1.2K

121K

Keşfet

@AntLingAGI @novita_labs @gabmfrl @OpenAI @GeminiApp @YouTube @googlephotos @cursor_ai