
mark
2.5K posts

mark
@thisisnotmark_
AI/ML Research Lead @ NASA | Founder @ Mosaic Voice AAC. Views my own not my employer’s.





pip install turboquant-gpu 5.02x KV cache compression for ANY GPU (RTX, H100, A100, B200) - works over @huggingface transformers - dead-simple API: compress + generate in 3 lines - 3-bit Lloyd-Max fused KV compression (0.98 cosine similarity) - outperforms MXFP4 (3.76x) and NVFP4 (3.56x) on compression Ran Mistral-7B: 1,408 KB → 275 KB KV cache (5.02x) Quickstart: github.com/DevTechJr/turb… Written in cuTile (CUDA 12, 13) with PyTorch fallbacks


We're going farther than ever before 🚀 Today, the Artemis II crew will break the record for how far humans have traveled from Earth as they fly around the far side of the Moon. Coverage begins at 1 p.m. EDT (1700 UTC). Watch Artemis II make history: nasa.gov/ways-to-watch/





One idiot we are happy not to hear from anymore













