Chandan

1.1K posts

Chandan

Chandan

@dchandan22

Technology Expert @ Intel | Wharton MBA | U of Michigan Engineer | Thoughts are my own

Katılım Ocak 2015
575 Takip Edilen368 Takipçiler
Chandan retweetledi
LMSYS Org
LMSYS Org@lmsysorg·
🚀Summer Fest Day 3: Cost-Effective MoE Inference on CPU from Intel PyTorch team Deploying 671B DeepSeek R1 with zero GPUs? SGLang now supports high-performance CPU-only inference on Intel Xeon 6—enabling billion-scale MoE models like DeepSeek to run on commodity CPU servers. Key highlights: 1. Full CPU backend for SGLang with Intel AMX 2. Native BF16 / INT8 / FP8 support for both Dense and Sparse FFNs 3. 6–14× TTFT and 2–4× TPOT speedup vs. llama.cpp 4. 85%+ memory bandwidth efficiency with optimized MoE kernels 5. Flash Attention V2 + MLA + MoE all optimized for CPU 6. Multi-NUMA parallelism mapped from GPU-style Tensor Parallelism This work is now fully upstreamed to SGLang main—read how we achieved it, and how far you can go without a GPU 👇 #LLMInfra #ModelServing #MoE #Xeon6 #SGLang #FP8 #INT8 #CPUInference
LMSYS Org tweet media
English
6
16
39
19K
Chandan retweetledi
Haihao Shen
Haihao Shen@HaihaoShen·
💡Intel Neural Compressor v3.4 is released, supporting more quantization recipes, e.g., W4A8 (FP8). In the past weeks, we've contributed the algorithm AutoRound to HF Transformers and vLLM, and now we are making the contribution to SGLang. Stay tuned.😀 🎯github.com/intel/neural-c…
English
1
22
83
3.6K
Chandan retweetledi
Intel Software
Intel Software@IntelSoftware·
Ever tried searching for “more ladybugs than flowers”? We did. And the AI nailed it. Fine-tuned LLMs can really deliver when trained on the right datasets. This demo shows what happens when #Qwen3 models are optimized and deployed on Intel hardware. Read the full article: intel.ly/3GUmVuI
English
1
4
8
884
Chandan retweetledi
Intel Software
Intel Software@IntelSoftware·
Automated prompt engineering on-device—no fine-tuning, no RAG. This new guide shows how to use #DSPy with Intel #oneAPI and llama.cpp to boost task accuracy from 📉 35% → 📈 78% Run LLMs locally, optimize efficiently. Read the guide → intel.ly/4lBzYRw
Intel Software tweet media
English
1
5
10
780
Chandan retweetledi
PyTorch
PyTorch@PyTorch·
Learn how #PyTorch 2.7 and @Intel GPUs can accelerate your #AI workloads on Windows and Linux.💡 Read the latest blog from the Intel PyTorch team: hubs.la/Q03jWrRf0
PyTorch tweet media
English
1
18
72
10.8K
Chandan retweetledi
the tiny corp
the tiny corp@__tinygrad__·
Turns out Intel does make big GPUs! 832 BF16 TFLOPS, 128GB, 3.28 TB/s. Each.
the tiny corp tweet media
English
63
105
1.7K
134.2K
Chandan retweetledi
the tiny corp
the tiny corp@__tinygrad__·
Its little brother is already up and working. Will take work to use XMX and make really fast, but already everything is correct and usable. PyTorch -> tinygrad -> OpenCL -> i915
the tiny corp tweet media
English
26
56
976
59.2K
Chandan retweetledi
PyTorch
PyTorch@PyTorch·
Curious about the latest optimizations in #PyTorch 2.6 for @intel Corporation platforms? The Intel PyTorch Team has detailed key improvements for Intel x86 CPUs and GPUs, enhancing performance and efficiency. 👉 Explore the latest advancements and how they can improve your PyTorch workloads: hubs.la/Q036jGpH0 #pytorch #machinelearning #cpus #gpus
PyTorch tweet media
English
7
16
87
13.8K
Chandan retweetledi
Intel Software
Intel Software@IntelSoftware·
Dive into language identification with #PyTorch and Intel. Follow along and learn to develop a solution using the Hugging Face SpeechBrain toolkit optimized for Intel hardware to identify up to 133 languages. intel.ly/40QVwQD
Intel Software tweet media
English
0
7
16
772
Chandan retweetledi
Intel News
Intel News@intelnews·
Exciting news from @Argonne — the Aurora supercomputer (powered by @Intel and @HPE!) is live, bringing exascale speed and power to accelerate cancer research, materials discovery, energy technologies and more. intel.ly/4jtU3Zj
English
4
25
126
7.2K
Chandan retweetledi
PyTorch
PyTorch@PyTorch·
Intel + PyTorch: Powering Generative AI on Intel Arc GPUs ⬇️ 📖 Read the case study: hubs.la/Q033-4Zb0 Intel’s new AI Playground showcases advanced GenAI capabilities, including image generation and chatbots, powered by PyTorch. By leveraging Intel® Extension for PyTorch, Intel optimized performance and accelerated development, delivering an open source solution for AI PCs. #pytorch #genai #ai #opensource
PyTorch tweet media
English
1
8
70
11.4K
Chandan retweetledi
Intel Software
Intel Software@IntelSoftware·
Explore #GeneticAlgorithms with this AI code sample! These techniques are based on an analogy of biological evolution and can be applied to various NP-hard optimization platforms. Read the guide to get started today: intel.ly/3VQ1Sy5
Intel Software tweet media
English
0
5
7
714