Red Hat Community

14.4K posts

Red Hat Community banner
Red Hat Community

Red Hat Community

@redhatopen

Helping our open source projects and standards be wildly successful

Katılım Mart 2013
1.1K Takip Edilen39.6K Takipçiler
Red Hat Community
Red Hat Community@redhatopen·
How does a Python tensor op become a lightning-fast C++ kernel? Christopher Leonard traces the complete PyTorch call stack. Get ready for #PyTorch Conference Europe with a deep dive! red.ht/47sp9vR
English
0
0
0
253
Red Hat Community retweetledi
Cloud Native Now
Cloud Native Now@ContainerJrnl·
At KubeCon Europe, the CNCF and Red Hat contributed the llm‑d framework to the consortium, and the Kubernetes AI Conformance Program tightened its requirements. Stable in‑place pod resizing and workload‑aware scheduling are now mandatory, both critical for running inference engines without restarts and avoiding deadlocks in distributed training. A new Verify Conformance Bot will move beyond self‑assessments, and 31 platforms are already KAR certified. See what the stricter standards mean for production AI: zpr.io/qLGMxdzn9LZc #KubeCon #Kubernetes #AIInference
English
0
1
0
228
Red Hat Community
Red Hat Community@redhatopen·
Triton kernels: Hand-tuned vs. TorchInductor vs. Helion vs. LLM-generated. Our reproducible benchmark reveals the trade-offs in performance, portability, and tuning cost. Check it out! #LLM #GPUkernels #Triton red.ht/4aeQKm7
English
0
0
1
353
Red Hat Community
Red Hat Community@redhatopen·
Secure your high-performance Triton GPU kernels! 📷 New cryptographic signature support for the JIT cache prevents corruption and denial of service attacks in production. red.ht/46q6KiH
English
1
0
2
287
Red Hat Community
Red Hat Community@redhatopen·
Skip the JITters!  Cut AI model cold-start latency by 2x. We're exploring Model Cache Vault (MCV) and OCI image support for Triton/vLLM kernel caches, secured with Sigstore Cosign for trusted, portable performance. #AI #MLOps #DevSecOps #Triton #vLLM  red.ht/4tdo3gW
English
0
0
0
418
Red Hat Community
Red Hat Community@redhatopen·
Moving AI from chatbots to autonomous Cloud-Native Ambient Agents requires a new architecture. This article covers the AgentOps lifecycle on OpenShift, w/event-driven logic, observability, and Zero Trust identity. Read more! #CloudNative #AI #AgenticAI red.ht/4pOq0NC
English
0
0
2
352
Red Hat Community
Red Hat Community@redhatopen·
.@leighgriffin and Ray Carroll just put this article on Spec Driven Development together and is now live on infoq.com/articles/spec-… -- this is a result of our investigations into the technology in our Emerging Technologies team
English
0
1
1
440
Red Hat Community
Red Hat Community@redhatopen·
Check out how to simplify Edge AI Builds with Verified GitHub Actions Patterns! Vance Raiti and Nick Cao show us how their Edge AI Image Pipelines project tackles hardware enablement friction for RHEL bootc images on NVIDIA Jetson. red.ht/3MxP5yy
English
0
1
2
506
Red Hat Community
Red Hat Community@redhatopen·
How do you stop tool sprawl from overwhelming your AI agents? A larger context window isn't the answer—a smarter retrieval process is. Check out Eoghan O'Connor and Kevin Cogan's implementation of Tool2Vec for scalable enterprise AI! red.ht/4iO3i6E
English
0
0
5
423
Red Hat Community
Red Hat Community@redhatopen·
Triton kernel profiling with NVIDIA's Nsight tools? Joseph Groenenboom and Craig Magina show us how! red.ht/49mLFIl
English
0
0
2
519
Red Hat Community
Red Hat Community@redhatopen·
Inferencing at scale: Ron Haberman, Huamin Chen, and other Red Hat OCTO team members share their contributions to the vLLM Semantic Router. red.ht/3WLK87c
English
0
0
3
441
Red Hat Community
Red Hat Community@redhatopen·
Ivan Font and Donald Hunter do a deep dive on a confidential computing approach to AI inference security - have a look! red.ht/3JmLxxT
English
0
1
6
552
Red Hat Community
Red Hat Community@redhatopen·
Guest author Steven Pousty has been hacking on PyTorch, containers, and NVIDIA, and he has some advice for other interested developers - check it out! red.ht/45Dlhrm
English
1
0
5
691