Arun Mohan TP

64 posts

@ArunMohanTP

Interested in programming and building AI tools. Snorkeling, diving.

From India, lives in Chicago · Joined May 2014

400 Following · 20 Followers

Arun Mohan TP@ArunMohanTP·14h

I don't know if the future of AI is entirely local, but I'm definitely sure there is a future for local AI that is free and accessible to everyone.

0xSero@0xSero

As promised! Gemma-4-21B-REAP is out! Results are great: it held up really well and actually gained accuracy on reasoning tasks. MLX & GGUF bros, do your thing! This should fit on as little as 12GB of VRAM with some context, or 16GB with full context huggingface.co/0xSero/gemma-4…

Arun Mohan TP@ArunMohanTP·14h

OK, so if you have a problem with Anthropic or OpenAI, use Qwen 3.6 then.

OpenRouter@OpenRouter

Qwen 3.6 Plus from @Alibaba_Qwen is officially the first model on OpenRouter to break 1 Trillion tokens processed in a single day! At ~1,400,000,000,000 tokens, it’s the strongest full day performance of any new model dropped this year. Congrats to the Qwen team!

Arun Mohan TP reposted

OpenRouter@OpenRouter·2d

146

368

4.3K

447.7K

Arun Mohan TP@ArunMohanTP·16h

@aronprins @steipete Intermittent token fasting

GIF

532

Aron Prins@aronprins·16h

@steipete @ArunMohanTP Got some of those tokens?

GIF

695

Peter Steinberger 🦞@steipete·17h

Anthropic now blocks first-party harness use too 👀 claude -p --append-system-prompt 'A personal assistant running inside OpenClaw.' 'is clawd here?' → 400 Third-party apps now draw from your extra usage, not your plan limits. So yeah: bring your own coin 🪙🦞

435

239

4.6K

1.2M

Arun Mohan TP@ArunMohanTP·16h

@steipete I’m not sure. I used Sonnet 4.6, and now that Anthropic has blocked it, I will try some cheaper models through OpenRouter, like Qwen Coder or Xiaomi MiMo, but I’m not sure whether the Chinese state monitors us ;)

601

Peter Steinberger 🦞@steipete·16h

@ArunMohanTP OpenClaw is Switzerland, we wanna work great for *any* token dealer.

100

10.3K

Arun Mohan TP@ArunMohanTP·2d

@anirudhbv_ce @GoogleResearch This is impressive, great work @anirudhbv_ce

477

anirudh bv@anirudhbv_ce·3d

I implemented @GoogleResearch's TurboQuant as a CUDA-native compression engine on Blackwell B200. 5x KV cache compression on Qwen 2.5-1.5B, near-lossless attention scores, generating live from compressed memory.
5 custom cuTile CUDA kernels ft:
- fused attention (with QJL corrections)
- online softmax
- on-chip cache decompression
- pipelined TMA loads
Try it out: devtechjr.github.io/turboquant_cut…
s/o @blelbach and the cuTile team at @nvidia for lending me Blackwell GPU access :) cc @sundeep @GavinSherry

140

306

3.2K

750.7K

Arun Mohan TP@ArunMohanTP·3d

@steipete Yeah, for the same reason most people like Claude Code. That's an awesome tool.

Peter Steinberger 🦞@steipete·4d

I never use plan mode. The main reason this was added to codex is for claude-pilled people who struggle with changing their habits. just talk with your agent.

Anthony Kroeger@kr0der

slowly starting to use plan mode a LOT less nowadays
i realised whenever i use plan mode, it generates a gigantic plan and then i dont read it and hit build out of laziness
having a meaningful conversation with the AI agent to discuss implementation feels a lot easier 🤔

536

236

4.6K

1.1M

Arun Mohan TP@ArunMohanTP·4d

Whaaat, 28T tokens and just 350M parameters; that’s huge for a smaller model. Let intelligence move from cloud to edge.

Liquid AI@liquidai

Trained on 28T tokens with scaled RL, LFM2.5-350M is a step change from LFM2-350M:
> instruction following: 18.20 → 40.69
> data extraction: 11.67 → 32.45
> tool use: 22.95 → 44.11
These are the capabilities that matter in production.

Arun Mohan TP@ArunMohanTP·4d

Seems like a very promising model! And it’s open source.

Arcee.ai@arcee_ai

Today we're releasing Trinity-Large-Thinking. Available now on the Arcee API, with open weights on Hugging Face under Apache 2.0. We built it for developers and enterprises that want models they can inspect, post-train, host, distill, and own.

Arun Mohan TP@ArunMohanTP·5d

github.com/rivet-dev/agen…

Arun Mohan TP@ArunMohanTP·5d

Here is an OS built for agents that runs inside your app process, so agents start fast, use less memory, and connect directly to backend functions. It is more efficient than full sandboxes, with strong security, optional sandbox support, and easy deployment as an open-source npm package.

Arun Mohan TP@ArunMohanTP·5d

@Prince_Canuma @roboflow Wow, this is great!

149

Prince Canuma@Prince_Canuma·5d

RF-DETR by @roboflow now on MLX
It can do realtime instance segmentation on-device and enable some cool use cases for visual analysis, monitoring and robotics like Reachy Mini. It can also augment VLMs and VLAs by preprocessing images and video with areas of interest.
New release coming soon on mlx-vlm 🚀
For those who can’t wait, you can install mlx-vlm from source.

417

29.6K

Arun Mohan TP@ArunMohanTP·5d

@liquidai congratulations

Liquid AI@liquidai

Today, we release LFM2.5-350M. Agentic loops at 350M parameters. A 350M model trained for reliable data extraction and tool use, where models at this scale typically struggle. <500MB when quantized, built for environments where compute, memory, and latency are constrained. 🧵

Arun Mohan TP@ArunMohanTP·27 Mar

@demishassabis @GeminiApp @GoogleAIStudio Thanks for rolling out to the gemini app as well!

103

Demis Hassabis@demishassabis·26 Mar

Gemini 3.1 Flash Live is our highest quality audio & voice model yet - and a big leap towards building next-gen voice-first agents. Lower latency, better precision, more natural interactions... try it now with Gemini Live in the @GeminiApp or build with it in @GoogleAIStudio!

Google DeepMind@GoogleDeepMind

Say hello to Gemini 3.1 Flash Live. 🗣️ Our latest audio model delivers more natural conversations with improved function calling – making it more useful and informed. Here’s what’s new 🧵

124

139

1.5K

284.2K

Arun Mohan TP@ArunMohanTP·27 Mar

@demishassabis @LuizaJarovsky @elonmusk @sama @DarioAmodei @sundarpichai @tim_cook @satyanadella @JeffBezos @finkd @IsomorphicLabs Thank you @demishassabis

Demis Hassabis@demishassabis·26 Mar

@LuizaJarovsky @elonmusk @sama @DarioAmodei @sundarpichai @tim_cook @satyanadella @JeffBezos @finkd We are already working on this with tools like AlphaFold and the work we are doing at @IsomorphicLabs

3.4K

116.5K

Luiza Jarovsky, PhD@LuizaJarovsky·26 Mar

I would like to invite @elonmusk, @sama, @DarioAmodei, @sundarpichai, @demishassabis, @tim_cook, @satyanadella, @JeffBezos, @finkd, and all the other tech CEOs to join the "AI race to cure cancer." The winner gets humanity's forever appreciation.

Luiza Jarovsky, PhD@LuizaJarovsky

Everybody wants AI to help cure cancer. Why isn't every AI company obsessively focused on that?

178.7K

Arun Mohan TP@ArunMohanTP·26 Mar

This is incredible. Is a perfect prediction of a human response the same as reconstructing a human?

AI at Meta@AIatMeta

Today we're introducing TRIBE v2 (Trimodal Brain Encoder), a foundation model trained to predict how the human brain responds to almost any sight or sound. Building on our Algonauts 2025 award-winning architecture, TRIBE v2 draws on 500+ hours of fMRI recordings from 700+ people to create a digital twin of neural activity and enable zero-shot predictions for new subjects, languages, and tasks. Try the demo and learn more here: go.meta.me/tribe2

Arun Mohan TP@ArunMohanTP·26 Mar

@sawyerhood Wooowww… this is incredible. My go-to was the Playwright CLI, but this looks great

375

Sawyer Hood@sawyerhood·25 Mar

fittingly we just hit 4k stars on github! Check it out today! github.com/SawyerHood/dev…

230

25.5K

Sawyer Hood@sawyerhood·25 Mar

Introducing the new dev-browser cli. The fastest way for an agent to use a browser is to let it write code. Just `npm i -g dev-browser` and tell your agent to "use dev-browser"

151

292

845K

Arun Mohan TP@ArunMohanTP·26 Mar

Now we can create 3-minute songs. Let me compose a cinematic Bollywood × Hollywood × Sanskrit × Arabic rock DNA. 😂

Google DeepMind@GoogleDeepMind

You can now create longer tracks with Lyria 3 Pro. 🎶 Map out intros, verses, choruses, and bridges to build high-fidelity compositions up to 3 minutes long. 🎹

Arun Mohan TP@ArunMohanTP·26 Mar

@demishassabis @GeminiApp @GoogleAIStudio Hahah… let me compose some unique Bollywood Sanskrit Arabic music 🎶

106

Demis Hassabis@demishassabis·25 Mar

Perfect background music for flow state at 2am - made with the new Lyria 3 Pro. Google AI subscribers can try it in the @GeminiApp and developers can build with the API in @GoogleAIStudio - have fun!!

Google DeepMind@GoogleDeepMind

You can now create longer tracks with Lyria 3 Pro. 🎶 Map out intros, verses, choruses, and bridges to build high-fidelity compositions up to 3 minutes long. 🎹

128

1.5K

153.8K
