Rahul Atlury

53 posts

Rahul Atlury

@atlury

I am an electronics engineer with interests in preserving the knowledge of yesteryears for future generations

Katılım Ağustos 2009

1.4K Takip Edilen41 Takipçiler

Rahul Atlury@atlury·4d

@Prince_Canuma They say int4 per-group affine quantization is more robust across architectures, while angular methods like PolarQuant can fail badly when attention scaling is non-standard. It reports PolarQuant works on Llama 70B but fails on Gemma 31B...FYI

English

Prince Canuma@Prince_Canuma·4d

✅

Prince Canuma@Prince_Canuma

Congrats to the @ZyphraAI on the launch! 🚀 Zaya1-VL comes with day-0 support on mlx-vlm Install from source for now if you want to try it out github.com/Blaizzy/mlx-vl…

ART

1.2K

Prince Canuma@Prince_Canuma·5d

Coming to MLX 🚀

Zyphra@ZyphraAI

Today we're releasing ZAYA1-VL-8B, our first vision-language model. ZAYA1-VL-8B is a 700M active / 8B total MoE built on our ZAYA1-8B base trained on @AMD. We achieve strong performance for our size resulting in leading intelligence density and inference efficiency.

English

4.8K

Rahul Atlury@atlury·4d

@Prince_Canuma @Prince_Canuma no intention of hijacking this thread but have you looked at Open-TQ-Metal arxiv.org/abs/2604.16957 They apparently compute attention directly from compressed cache.

English

Rahul Atlury@atlury·4 May

@jimkxa @tenstorrent Please do consider releasing them standalone, achieving scale via massive, centralized data centers is just unsustainable for nature. Local AI allows for distributed, sustainable growth, thats the scale moat companies are missing...

English

124

Jim Keller@jimkxa·25 Nis

@atlury @tenstorrent We need it the P300 cards in the QuietBox2, rethinking releasing stand alone

English

765

Jim Keller@jimkxa·25 Nis

Don’t worry, we’ll make this fast. About 3 weeks. Lots in the pipeline. @tenstorrent

NVIDIA AI@NVIDIAAI

✨ DeepSeek-V4 is here — a million-token context, 1.6T parameter powerhouse optimized for agentic workflows. Out of the box, on DeepSeek-V4-Pro, NVIDIA Blackwell Ultra delivers over 150 TPS/user interactivity for agentic workflows. And we’re just getting started. Expect these performance figures to climb higher as we implement Dynamo, NVFP4, and advanced parallelization techniques. Start building today with @lmsysorg and @vllm_project

English

426

48.1K

Rahul Atlury@atlury·4 May

@DeepComputingio @jimkxa @tenstorrent This memory crunch is killing local AI from taking off, what we have been doing is going to back writing specialized simd kernels in assembly, testing in old generation DDR rams. Aah painful but needed at the place I live...waiting for better

English

DeepComputing@DeepComputingio·26 Nis

@jimkxa @atlury @tenstorrent Soon RISCV HOST QB2

English

Rahul Atlury@atlury·4 May

@brk0v I think the baggage of having to maintain compatibility, especially with the weird use cases is the biggest inertia for lightweight Linux systems to take off...more modernization is needed...

English

213

Viacheslav Biriukov@brk0v·4 May

🦀Rust: Rewriting GNU coreutils in Ubuntu. FOSDEM talk: Memory safety matters, but Rust is not magic. The hard part is compatibility, weird Unix edge cases, test suites, distro integration, rollback paths, and users finding real bugs. fosdem.org/2026/schedule/… #rust #rustlang

English

121

6.3K

Rahul Atlury@atlury·24 Nis

@Prince_Canuma In terms of accuracy how close was it? usable?

English

255

Prince Canuma@Prince_Canuma·24 Nis

You can now run DeepSeek4-Flash on 256GB Mac. Next up speed 🚀 PR: github.com/ml-explore/mlx…

Prince Canuma@Prince_Canuma

Ported DeepSeek-V4 to MLX 🔥 There still lots to optimize but it’s work well

English

138

78.2K

Rahul Atlury@atlury·25 Mar

@jimkxa @austinlyons @tenstorrent Love the work Tenstorrent is doing, I wish they existed in M.2 form factor as well for edge devices, and I wish we had access to your hardware, especially in this era of unbelievable level of vibe coding, we could just translate the whole hugging repos to TT. :-)

English

358

Jim Keller@jimkxa·25 Mar

@austinlyons Jensen said Groq runs 25% of agents AI. I doubt specialized hardware is going to win. AI is too diverse and changing too fast. @Tenstorrent runs 100%. Hardcore benchmarks coming.

English

8.9K

Austin Lyons@austinsemis·23 Mar

Is Groq the only inference chip for agentic AI? Or just the first of many? Same for Vera and CPUs.

Chipstrat@chipstrat

The Multi-Silicon Era Is Here. Disagg is out of the bag. What it means for @Nvidia, CPUs, XPUs, startups, and more. chipstrat.com/p/the-multi-si… $NVDA $AMD

English

12.2K

Rahul Atlury@atlury·2 Mar

@steeve @sluongng This is very interesting, i recently bootstrapped a non-systemd, openrc, non gcc, glibc minimal os from scratch using nothing but zig with zig based package manager i wrote, the traditional LFS stages collapsed to just 3, zig is wonderful, libgcc shims n all...

English

Steeve Morin@steeve·1 Mar

Coming from @sluongng, it means a whole lot

English

2.9K

Rahul Atlury@atlury·27 Kas

@anil_nal @AAI_Official @RamMNK Sad!

Anil Nallamotu@anil_nal·26 Kas

An immigration officer slapped a passenger (Eerlam Anjaiah) who was arguing after being sent to the end of the Immigration clearance line at 8 pm on 24 Nov at Hyderabad Airport. Written complaint lodged to check CCTV cameras. Please take action @AAI_Official @RamMNK

English

245

Rahul Atlury@atlury·16 Eki

@Prince_Canuma Indeed, sad to see this behavior.

English

Prince Canuma@Prince_Canuma·16 Eki

I feel sorry for people like him! I lived in Gujarat for 5 half a decade, and I loved it because of how amazing, kind and respectful people are. Don’t know what’s happening in his life for him to speak to others this way.

Aryan Agarwal@AryanA9019

@Prince_Canuma yeah I am talking about the dgx spark and the nvidia 6000 cards they all have the software u idiot

English

4.3K

Rahul Atlury@atlury·16 Ağu

@m7server your support@monibuca.com is bouncing. Are you still working on commercial version?

English

Monibuca@m7server·28 Nis

New Interactive special effects

English

Rahul Atlury@atlury·15 May

@Prince_Canuma aaah nice!

English

Prince Canuma@Prince_Canuma·14 May

@atlury sure I will take you up on it too Btw, I used to live in India till 2022 🇮🇳

English

Prince Canuma@Prince_Canuma·14 May

Work so hard the community wants to feed you😎🙌🏽

Joe Burnett@Joe_A_Burnett

@Prince_Canuma @FastAPI I swear, if you ever travel to Texas where I live, I’m going to take you to eat all-you-can-eat Texas barbecue! Great work!

English

1.2K

Rahul Atlury@atlury·26 Mar

@skalskip92 How did you convert to a vector image?

English

177

SkalskiP@skalskip92·26 Mar

okey... I jumped on the bandwagon

English

104

7.6K

Rahul Atlury@atlury·16 Ara

@sarahcstock this is very painful but correct picture. Its the same within India as well, I feel the same within India itself. Everyone within here just wants to build acres and acres of concrete jungle. They have just destroyed nature. I hope you guys rebuild and survive.

English

Rahul Atlury@atlury·23 Kas

If next generation Intel GPUs (Battlemage) can ship in 24gb and upwards then its game on. Can easily combine 4 or 6 of them and run 70b models. :-D

English

312

Rahul Atlury@atlury·23 Kas

@JustinWolfers excellent talk! amazing....

English

138

Justin Wolfers@JustinWolfers·19 Kas

Here's a overview talk I recently gave on how AI will change the world of work. My goal in this talk was to put together the existing literature, hopefully in a way that makes the whole more interesting than the sum of the parts: youtube.com/watch?v=jIL5pV…

YouTube

English

20.6K

Rahul Atlury@atlury·9 Kas

@Prince_Canuma Congrats! We are getting there….

English

113

Prince Canuma@Prince_Canuma·9 Kas

Florence-2 is next level good! 🔥 On-device OCR will never be the same. Btw, I will be speaking about MLX at Data Science Summit in a couple weeks. Date: 22 of November 2024 Location: PGE Narodowy Stadium, Warsaw Poland 🇵🇱 20% Discount Code: DSS24SP20 See you there!

Prince Canuma@Prince_Canuma

Finally, Florence-2 port to MLX complete ✅🚀 I finally figured out a 3 day old bug that has been bugging me on the vision encoder. (Pun intended) Thanks for the wine @malgamves!

English

5.5K

Rahul Atlury@atlury·7 Kas

@ElecNotes This is excellent vintage stuff, please give references, it will help us explore further. And thank you!!!

English

ElectronicsNotes by Ian Poole@ElecNotes·6 Kas

Valve / Vacuum Tube Reflex Amplifier One of the techniques widely used in early radios was that of what was termed a reflex amplifier. The curious typical of one used for this technique. The IF signal is applied to the control grid. Then the amplified signal is passed to the diode rectifier elements of the valve from the IF transformer T2. On passing through the filter formed from C1, R1, and C2, the AF signal appears across the volume control R2. It is then passed back through the secondary winding of T1 which has no effect at audio and into the control grid again. The audio is amplified by the valve and this time appears across the anode load resistor R5, after which it is passed onto to further audio stages.

ElectronicsNotes by Ian Poole tweet media

English

182

6.7K

Keşfet

@Prince_Canuma @jimkxa @tenstorrent @DeepComputingio @brk0v @austinlyons @Tenstorrent @steeve