Nvidia has a structured data enablement strategy: it provides libraries, software, and hardware to index and search data faster. This is the $120 billion ecosystem Nvidia identified. Indexing and retrieval are 10-40X faster in most cases.
How cuDF + cuVS Work Together (The GTC 2026 “Ground Truth” Pipeline)
Unstructured data (PDFs, videos, logs, emails, sensor streams) → embedding model (NeMo, etc.).
cuVS indexes the embeddings (CAGRA/IVF-PQ) → semantic retrieval.
Retrieved context + facts → fed into cuDF DataFrames for cleaning, joining, aggregating, and turning into structured tables.
Result: hundreds of zettabytes of “dark data” become queryable, trustworthy, structured ground truth.
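A minimal CPU-only sketch of the three steps above, in plain Python with no NVIDIA libraries. Everything in it is a hypothetical stand-in: the bag-of-words `embed` replaces a real embedding model, the brute-force `search` replaces a cuVS index such as CAGRA or IVF-PQ, and the small `pump`/`status` table replaces cuDF DataFrames. It only shows the shape of the data flow, not the real APIs or their speed.

```python
import math
from collections import defaultdict

# Toy "unstructured" records (hypothetical data, not from the post).
DOCS = [
    "pump A pressure alarm at plant north",
    "pump B pressure alarm at plant south",
    "invoice paid by customer acme",
    "pump A pressure normal at plant north",
]

# One vector dimension per distinct word in the corpus.
VOCAB = {w: i for i, w in enumerate(sorted({w for d in DOCS for w in d.lower().split()}))}


def embed(text):
    """Toy embedding: L2-normalised bag of words (stand-in for a real model)."""
    vec = [0.0] * len(VOCAB)
    for word in text.lower().split():
        if word in VOCAB:  # words outside the corpus vocabulary are ignored
            vec[VOCAB[word]] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def search(index, query, k):
    """Exact brute-force cosine search; an ANN index would approximate this."""
    q = embed(query)
    scored = [(sum(a * b for a, b in zip(q, vec)), i) for i, vec in enumerate(index)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # best score first, ties by id
    return [i for _, i in scored[:k]]


def to_rows(doc_ids):
    """Turn retrieved text into structured rows (hypothetical schema)."""
    rows = []
    for i in doc_ids:
        words = DOCS[i].split()
        if "pump" in words and "pressure" in words:
            rows.append({
                "pump": words[words.index("pump") + 1],
                "status": words[words.index("pressure") + 1],
            })
    return rows


def count_by(rows, key):
    """Group-by-and-count, the kind of aggregation a DataFrame would do."""
    counts = defaultdict(int)
    for row in rows:
        counts[row[key]] += 1
    return dict(counts)


index = [embed(d) for d in DOCS]                      # step 1: embed
hits = search(index, "pump pressure alarm", k=3)      # step 2: retrieve
table = to_rows(hits)                                 # step 3: structure
alarms_per_pump = count_by([r for r in table if r["status"] == "alarm"], "pump")
print(table)
print(alarms_per_pump)
```

The invoice record shares no words with the query, so it is never retrieved, and only the three pump records reach the structured table.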
NVIDIA supplies GPU-accelerated software (cuDF + cuVS) and AI agents (NemoClaw) so enterprises, CSPs, and data platforms can securely transform their own unstructured data into structured, queryable ground truth inside their existing ecosystems.
@herbertong @nvidia #GTC






















