Dev Patel

244 posts

Dev Patel banner
Dev Patel

Dev Patel

@devpatelio

making magic machines @ucberkeley | @novaskyai • @amd • @healthenginecal

SF, CA Bergabung Haziran 2022
1.1K Mengikuti554 Pengikut
Dev Patel me-retweet
Neuralink
Neuralink@neuralink·
ALS has gradually taken away Kenneth’s ability to speak. Through Neuralink’s VOICE clinical trial, he’s exploring how a brain-computer interface designed to translate thought to speech could help restore autonomy in his daily life. Watch to learn more:
English
1.3K
4K
21.9K
42M
Hanson Wen
Hanson Wen@_hansonw·
Crazy win man
Hanson Wen tweet mediaHanson Wen tweet media
English
4
1
7
242
krupa
krupa@krupaad·
Had a fun time building SimSafe for the @nvidiaomniverse Cosmos Cookoff!! AV companies generate millions of synthetic training clips, but bad data with broken physics and unrealistic footage, degrades model performance. SimSafe uses Cosmos Reason 2 to automatically detect physically implausible synthetic AV training data from dashcam footage (by looking at shadow consistency, vehicle dynamics, road texture realism).
English
8
12
27
1.6K
Dev Patel me-retweet
Sheel Mohnot
Sheel Mohnot@pitdesi·
Luckin Coffee (Chinese Starbucks) acquired Blue Bottle retail operations for $400M from Nestle, who had bought a majority at a ~$700M valuation back in 2017. Blue Bottle had raised from True, Index and GV. Nestle will keep the FMCG brand. Luckin was delisted from the Nasdaq in 2020 when it came out that they had fabricated $310M of sales, and paid a $180M fine. They seem to have turned it around and might get relisted after 5 years in the penalty box. It’s trading at ~$10B on OTC markets en.sedaily.com/international/…
English
49
75
907
245.1K
Dev Patel me-retweet
NovaSky
NovaSky@NovaSkyAI·
We’ve been consistently surprised lately by how capable frontier models are at handling complex kernel implementation and system optimization. Check out this work as a step toward automating AI infrastructure building!
Shiyi Cao@shiyi_c98

Introducing our new work K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model — a new paradigm for automated GPU kernel generation, achieving SoTA results. 🔍 Big insight: Traditional methods treat LLMs as stochastic code generators inside heuristic loops — but this misses a key point: LLMs are powerful planners with rich domain priors. 🧠 Core idea: K-Search uses the LLM itself as a co-evolving world model — one that plans + updates beliefs + guides search decisions based on experience. 📌 This decouples high-level strategy (intent) from low-level code implementation, allowing the optimizer to pursue multi-step transformations even when intermediate implementations don’t immediately improve performance. 📈 Key results: 🔥 Our discovered kernels are ~2.10× average speedup vs state-of-the-art evolutionary search across 4 FlashInfer kernels on H100/B200. 🔥 Up to 14.3× gain on complex Mixture-of-Experts (MoE) kernels. 🔥 State-of-the-art performance on GPUMode TriMul (H100) task — beating both automated and human solutions. 🙏 Acknowledgements This work is developed in @BerkeleySky, w/ the amazing @ziming_mao, @profjoeyg, and @istoica05. We thank @DachengLi177, @MayankMish98, @randwalk0, @pgasawa, @fangz_zzu, and @tian_xia_ for helpful discussion and feedback. We also thank the generous compute support from @databricks, @awscloud, @anyscalecompute, @nvidia, @Google, @LambdaAPI, and @MayfieldFund. 👨‍💻 GitHub: github.com/caoshiyi/K-Sea… 📄 arXiv: arxiv.org/pdf/2602.19128…

English
0
3
20
2.7K
Dev Patel me-retweet
baby keem
baby keem@babykeem·
how do u fix openclaw internal reasoning leaking
English
659
1.8K
18.7K
3.6M
Dev Patel
Dev Patel@devpatelio·
@willccbb thank u for ur good taste in music will brown
English
0
0
2
103
will brown
will brown@willccbb·
now listening
will brown tweet media
English
18
1
74
3.6K
Dev Patel me-retweet
Hao Kang
Hao Kang@GT_HaoKang·
@NovaSkyAI Personally, SkyRL is the best code repo for agentic RL. We have tested lots of other RL codebase and their examples can not even run correctly. Respect!
English
0
8
49
5.8K
Dev Patel me-retweet
Hao Kang
Hao Kang@GT_HaoKang·
🔥Modifying 2 lines of code and get your agentic serving/rollout up to 3.9x faster losslessly! ⚡️Say hello to ThunderAgent, a fast, simple, and program-aware agentic Inference System. 🥇 We propose a program abstraction to schedule all GPU and CPU resources, the first principled approach for distributed agentic inference and rollout. 🌐 Blog: thunderagent.ai 💻 Code: github.com/ThunderAgent-o… 📜 Paper: arxiv.org/pdf/2602.13692 #AI #ThunderAgent #LLMAgent #Mlsys 1/n
English
3
23
103
27.9K
Dev Patel me-retweet
saksham
saksham@sakshambatraa·
i spent the last few months building microMLC, a machine learning compiler without any prior experience. the result is an interactive educational resource that documents my journey and findings. here's a breakdown of what i learned!
English
29
23
178
25.8K
surya
surya@suryasure05·
if you haven’t tried using codex or claude code to generate model architecture diagrams (with tensor size annotations too), you should seriously give it a shot
English
2
0
22
1.5K
Dev Patel me-retweet
Techmeme
Techmeme@Techmeme·
Source: Benchmark's 2020 fund is now worth 10x+ and 2024 fund is 3x what investors put in, based on cash distributions and the paper value of its investments (@nmasc_ / Bloomberg) bloomberg.com/news/articles/… #a260217p46" target="_blank" rel="nofollow noopener">techmeme.com/260217/p46#a26… 📥 Send tips! techmeme.com/contact
English
5
10
103
149.1K
Dev Patel me-retweet
vLLM
vLLM@vllm_project·
🔥Excited to see SkyRL bringing Tinker to local GPUs. Standardizing training APIs lowers the barrier for research and infrastructure innovation. vLLM is proud to power the inference layer behind high-throughput RL training. 🚀
Tyler Griggs@tyler_griggs_

SkyRL now implements the Tinker API. Now, training scripts written for Tinker can run on your own GPUs with zero code changes using SkyRL's FSDP2, Megatron, and vLLM backends. Blog: novasky-ai.notion.site/skyrl-tinker 🧵

English
7
11
85
8.4K
Fazal
Fazal@fazalmittu_·
recently been interested in Meta's JEPA, an architecture for predictive models to be able to extract/understand higher level details from any modality of data. walkthrough on my website: fazalmittu.com/reports/ijepa going to be doing more of these going forward!
Fazal tweet media
English
3
0
9
615
Dev Patel me-retweet
NovaSky
NovaSky@NovaSkyAI·
We are excited to announce that SkyRL now implements the Tinker API. Run Tinker training scripts on your own hardware with zero code changes. Try it out today: novasky-ai.notion.site/skyrl-tinker
Tyler Griggs@tyler_griggs_

SkyRL now implements the Tinker API. Now, training scripts written for Tinker can run on your own GPUs with zero code changes using SkyRL's FSDP2, Megatron, and vLLM backends. Blog: novasky-ai.notion.site/skyrl-tinker 🧵

English
0
4
28
1.9K