Tanmay Patil

328 posts

@TanmayPatil79

Building ML and AI things | Open to new opportunities | GitHub: https://t.co/VFmAjsLtPA

Joined August 2021

387 Following · 97 Followers

Pinned Tweet

Tanmay Patil@TanmayPatil79·2h

🚀 Just released: Krea-2 Depth ControlNet-LoRA. It keeps near-perfect 3D structure while letting you completely change the image with any prompt. Works great with Krea-2 Turbo too. huggingface.co/Patil/Krea-2-d… Big thanks to @edwixxxx and @Shauray7, who helped test and refine it along the way. Thanks @krea_ai for the great model 🤗.

Tanmay Patil retweeted

subho ghosh@SubhoGhosh02·22h

FA4's SingleTileLPTScheduler exploits that causal attention work grows with block index, so it just visits blocks in reverse (block = num_block - 1 - block). So why not try something similar on grouped gemm! In grouped GEMM the analog is that a tile's mainloop time is proportional to its group's K, and StaticPersistentGroupTileScheduler visits tiles in group-metadata order. So LPT = order groups by descending K. Result is 1.74x speedup in grouped gemm, just by sorting the scheduling path.
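The ordering idea in the post above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the FA4 or CUTLASS scheduler code: the helper names (`lpt_order`, `tiles_for`, `makespan`) are hypothetical, a tile's cost is assumed to equal its group's K, and tiles are assumed to be handed to whichever worker frees up first. It does not attempt to reproduce the reported 1.74x figure.

```python
import heapq


def lpt_order(group_ks):
    """Group indices sorted by descending K (longest-processing-time first)."""
    return sorted(range(len(group_ks)), key=lambda g: group_ks[g], reverse=True)


def tiles_for(order, group_ks, tiles_per_group):
    """Flatten groups into a per-tile cost list, visiting groups in `order`.

    Assumption: every tile in a group costs that group's K.
    """
    return [group_ks[g] for g in order for _ in range(tiles_per_group[g])]


def makespan(tile_costs, num_workers):
    """Finish time when each tile goes to the worker that frees up first."""
    workers = [0] * num_workers
    heapq.heapify(workers)
    for cost in tile_costs:
        start = heapq.heappop(workers)          # earliest-free worker
        heapq.heappush(workers, start + cost)   # busy until start + cost
    return max(workers)


# Toy problem: two small-K groups listed before one large-K group.
group_ks = [1, 1, 8]
tiles_per_group = [2, 2, 1]
metadata_order = list(range(len(group_ks)))

baseline = makespan(tiles_for(metadata_order, group_ks, tiles_per_group), 2)
sorted_k = makespan(tiles_for(lpt_order(group_ks), group_ks, tiles_per_group), 2)
print(baseline, sorted_k)  # 10 8
```

In this toy case the large-K tile is started last in metadata order and leaves one worker idle at the tail; visiting it first lets the small tiles fill in around it.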

Tanmay Patil@TanmayPatil79·23h

@Kshitijjkapoor Lmao

kshitij@Kshitijjkapoor·1d

not sleeping just in case they revoke fable by the time i wake up

Tanmay Patil@TanmayPatil79·23h

@thsottiaux I am ready with 2 usage resets.

Tibo@thsottiaux·1d

Can't wait to see what people will do with GPT-5.6 Sol Ultra. Stash your hardest prompts somewhere.

Tanmay Patil@TanmayPatil79·23h

Cooking something... let's see how it goes 🥴

Tanmay Patil@TanmayPatil79·1d

@NielsRogge @aiDotEngineer Hey, can we have dark mode on Papers with Code? 🤗 BTW, nice work.

Niels Rogge@NielsRogge·1d

A bit busy this week for @aiDotEngineer, but here are the top 3 trending papers of this week: - "Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent" - "Introducing LongCat-2.0" - "Scalable GANs with Transformers" Explore the papers and evals on Papers with Code!

Tanmay Patil retweeted

Together AI@togethercompute·2d

Multi-GPU kernels are the real test for coding models. Today at @aiDotEngineer, @simran_s_arora shared ParallelKernelBench, an open-source benchmark for evaluating whether LLMs can write fast CUDA kernels for real communication-heavy workloads. Proud to see this work from the Together AI Frontier Performance team.

Tanmay Patil@TanmayPatil79·1d

@Zai_org Nice 🫡

Z.ai@Zai_org·1d

Introducing ZCode, the official development environment for GLM-5.2 - GLM Coding Plan subscribers: now 1.5x usage quota in ZCode - BYOK supported: works with your existing subscriptions and APIs - Available on macOS, Windows, and Linux Download now: zcode.z.ai/en

Tanmay Patil retweeted

Anthropic@AnthropicAI·2d

We’ve received notice that the Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5. We'll begin restoring access tomorrow, and will share an update soon. We’re grateful to our users for their patience, and to everyone who worked with us on redeploying the models.

Tanmay Patil@TanmayPatil79·2d

@fraserpricee @AnthropicAI time to cancel subscription

Fraser Price@fraserpricee·2d

My car isn't getting washed up but @AnthropicAI certainly is 🫵😂 Top is DeepSeek-V4-Flash-DSpark running @ 250TPS on 2 local GPUs Bottom is Sonnet 5, which tbf should definitely be regulated because it is going to ruin a LOT of codebases

Tanmay Patil@TanmayPatil79·2d

GIF

Pop Base@PopBase

Google is shutting down GIF provider Tenor’s public API at some point today, June 30th. Apps that rely on the public API to source GIFs will lose direct access to Tenor’s library.

Tanmay Patil@TanmayPatil79·2d

@huggingface Buckets are awesome

Tanmay Patil retweeted

Claude@claudeai·2d

Introducing Claude Sonnet 5, our most agentic Sonnet yet. It makes plans, uses tools like browsers and terminals, and runs autonomously at a level that just a few months ago required larger and more expensive models.

Tanmay Patil retweeted

Vivek Galatage@vivekgalatage·4d

It's super interesting to know the system architecture of the TPUs. henryhmko.github.io/posts/tpu/tpu.…

Vivek Galatage@vivekgalatage

Looking forward to Dive into Deep Learning - "Interactive deep learning book with code, math, and discussions". d2l.ai

Tanmay Patil retweeted

vLLM@vllm_project·3d

👀 vLLM community is working non-stop to get @deepseek_ai's new DSpark spec decode algorithm for vLLM! Faster inference for everyone! github.com/vllm-project/v…

Tanmay Patil retweeted

Vidit Gujrathi@viditchess·3d

Chess engines tell you the best move. But grandmasters are human, they don’t always play it. So I built "Kibitz": a human move predictor for chess broadcasts. I trained this model on my Nvidia RTX 5080. Then I made it run as a business by itself. A channel buys the overlay, Hermes onboards them, charges via @stripe test mode, runs the broadcast, narrates with @NVIDIAAI Nemotron, tracks inference cost, and books its own P&L. I build. Hermes operates. This is my demo and entry for the @NousResearch × @NVIDIAAI × @stripe Hermes Agent Accelerated Business Hackathon.

Tanmay Patil@TanmayPatil79·4d

Type of shit I was doing in 2013 instead of learning CUDA
