Arun Mohan TP

64 posts

@ArunMohanTP

Interested in programming and building AI tools. Snorkeling, diving.

From India, lives in Chicago · Joined May 2014
400 Following · 20 Followers
Arun Mohan TP @ArunMohanTP
OK, so if you have a problem with Anthropic or OpenAI, use Qwen 3.6 then.
OpenRouter @OpenRouter

Qwen 3.6 Plus from @Alibaba_Qwen is officially the first model on OpenRouter to break 1 Trillion tokens processed in a single day! At ~1,400,000,000,000 tokens, it’s the strongest full day performance of any new model dropped this year. Congrats to the Qwen team!

0 replies · 0 reposts · 0 likes · 30 views
Arun Mohan TP reposted
OpenRouter @OpenRouter
Qwen 3.6 Plus from @Alibaba_Qwen is officially the first model on OpenRouter to break 1 Trillion tokens processed in a single day! At ~1,400,000,000,000 tokens, it’s the strongest full day performance of any new model dropped this year. Congrats to the Qwen team!
146 replies · 368 reposts · 4.3K likes · 447.7K views
Peter Steinberger 🦞
Anthropic now blocks first-party harness use too 👀
claude -p --append-system-prompt 'A personal assistant running inside OpenClaw.' 'is clawd here?' → 400
Third-party apps now draw from your extra usage, not your plan limits.
So yeah: bring your own coin 🪙🦞
435 replies · 239 reposts · 4.6K likes · 1.2M views
Arun Mohan TP @ArunMohanTP
@steipete I’m not sure. I used Sonnet 4.6, and now that Anthropic has blocked it, I will try some cheaper models through OpenRouter, like Qwen Coder or Xiaomi MiMo, but I’m not sure whether the Chinese state monitors us ;)
0 replies · 0 reposts · 0 likes · 601 views
anirudh bv @anirudhbv_ce
I implemented @GoogleResearch's TurboQuant as a CUDA-native compression engine on Blackwell B200. 5x KV cache compression on Qwen 2.5-1.5B, near-lossless attention scores, generating live from compressed memory.
5 custom cuTile CUDA kernels ft:
- fused attention (with QJL corrections)
- online softmax
- on-chip cache decompression
- pipelined TMA loads
Try it out: devtechjr.github.io/turboquant_cut…
s/o @blelbach and the cuTile team at @nvidia for lending me Blackwell GPU access :)
cc @sundeep @GavinSherry
140 replies · 306 reposts · 3.2K likes · 750.7K views
Arun Mohan TP @ArunMohanTP
@steipete Yeah, for the same reason most people like Claude Code. That's an awesome tool.
0 replies · 0 reposts · 0 likes · 24 views
Arun Mohan TP @ArunMohanTP
Here is an OS built for agents that runs inside your app process, so agents start fast, use less memory, and connect directly to backend functions. It is more efficient than full sandboxes, with strong security, optional sandbox support, and easy deployment as an open-source npm package.
1 reply · 0 reposts · 0 likes · 13 views
Prince Canuma @Prince_Canuma
RF-DETR by @roboflow now on MLX.
It can do realtime instance segmentation on-device and enable some cool use cases for visual analysis, monitoring and robotics, like Reachy Mini. It can also augment VLMs and VLAs by preprocessing images and video with areas of interest.
New release coming soon on mlx-vlm 🚀
For those who can’t wait, you can install mlx-vlm from source.
17 replies · 41 reposts · 417 likes · 29.6K views
Demis Hassabis @demishassabis
Gemini 3.1 Flash Live is our highest quality audio & voice model yet - and a big leap towards building next-gen voice-first agents. Lower latency, better precision, more natural interactions... try it now with Gemini Live in the @GeminiApp or build with it in @GoogleAIStudio!
Google DeepMind @GoogleDeepMind

Say hello to Gemini 3.1 Flash Live. 🗣️ Our latest audio model delivers more natural conversations with improved function calling – making it more useful and informed. Here’s what’s new 🧵

124 replies · 139 reposts · 1.5K likes · 284.2K views
Arun Mohan TP @ArunMohanTP
@sawyerhood Wooowww… this is incredible. My go-to was the Playwright CLI, but this looks great.
0 replies · 0 reposts · 0 likes · 375 views
Sawyer Hood @sawyerhood
Introducing the new dev-browser CLI. The fastest way for an agent to use a browser is to let it write code. Just `npm i -g dev-browser` and tell your agent to "use dev-browser".
151 replies · 292 reposts · 3K likes · 845K views