Alluxio

2.6K posts

Alluxio banner
Alluxio

Alluxio

@Alluxio

Open Source Data Orchestration for Analytics and Machine Learning in the Cloud @TachyonProject is now @Alluxio! [email protected]

Katılım Ekim 2015
197 Takip Edilen1.3K Takipçiler
Alluxio
Alluxio@Alluxio·
Object storage is durable, but AI workloads need: • fast reads • efficient metadata ops • better access semantics Keep object storage as the source of truth. Add a data layer near compute. 👉na2.hubs.ly/H04PhqF0 #AIInfrastructure #MLOps
Alluxio tweet media
English
0
0
0
47
Alluxio
Alluxio@Alluxio·
Scaling GenAI to 300TB/day requires a fast data path. Security leader @Uptycs moved beyond traditional caching for better scale. Results: ⚡ Sub-second responses 📉 90% CPU reduction See how: na2.hubs.ly/H04N12T0 #GenAI #BigData
Alluxio tweet media
English
0
0
0
40
Alluxio
Alluxio@Alluxio·
GPUs often sit idle when data is locked in a different region. This guide shows how to optimize your data path to maximize throughput and cut cross-region egress costs. 👉: na2.hubs.ly/H04LXL20 #AI #GPU #DataPath
Alluxio tweet media
English
0
0
0
37
Alluxio
Alluxio@Alluxio·
@Coupang solved the "Data Wait" in ML training. By using a distributed cache across hybrid GPU clusters, they hit: ⚡ Instant job starts (no manual copying) 🚀 40% faster I/O than parallel file systems 🌐 Total code portability Full story: na2.hubs.ly/H04KC3T0 #ML #AI #GPU
Alluxio tweet media
English
0
0
0
47
Alluxio
Alluxio@Alluxio·
Parquet on object storage can get expensive for retrieval-heavy workloads. Co-authored with @salesforce engineers, this white paper looks at reducing round trips for RAG, feature retrieval, and similar access patterns. 🔗 na2.hubs.ly/H04GQMH0 #AIInfrastructure #RAG
Alluxio tweet media
English
0
0
1
34
Alluxio
Alluxio@Alluxio·
A lot of AI infra friction comes from the data path: slow dataset access, too much data movement, and GPUs sitting idle waiting on data. This white paper breaks down the problem and the architecture behind it:na2.hubs.ly/H04D1ZH0 #AIInfrastructure #MLOps
Alluxio tweet media
English
0
0
1
38
Alluxio
Alluxio@Alluxio·
PyTorch performance tuning is bigger than the training loop. Data access, GPU efficiency, and distributed execution all affect throughput. This guide covers practical ways to improve training efficiency. 👇 na2.hubs.ly/H04v70l0 #PyTorch #MLOps #AIInfrastructure
Alluxio tweet media
English
0
0
1
24
Alluxio
Alluxio@Alluxio·
ML scale is not just a compute problem. It is also a data access problem. Blackout Power Trading used Alluxio to scale from 5K to 100K+ models in the same 15-minute window. 👇 na2.hubs.ly/H04sHwQ0 #AIInfrastructure #MLOps
Alluxio tweet media
English
0
0
0
23
Alluxio
Alluxio@Alluxio·
AI workloads do not scale on object storage alone. Add a data layer that removes the I/O bottleneck to get: ☑️ Low latency ☑️ High throughput ☑️ Less data movement ☑️ Better data access for GPUs 👉na2.hubs.ly/H04rcrW0 #AIInfrastructure #GPUComputing
Alluxio tweet media
English
0
0
0
20