LanceDB
@lancedb
Developer-friendly, open source AI-Native Multimodal Lakehouse https://t.co/wXn4tw5ySn
San Francisco, CA · Joined April 2023
62 Following · 4.1K Followers

@huggingface @openclaw 6/7 🗓️ Upcoming Events
- Data Engineering Open Forum (SF) — Apr 16
- TokioConf (Portland) — Apr 20–22


Bottom line: @lancedb's JSONB gives AI builders the same storage efficiency as Variant, with more flexibility, and no vendor lock-in.
3/3
Read the full blog post for details 👇🏽
lancedb.com/blog/lance-jso…

Do you really need Variant for your AI data?
We benchmarked Lance JSONB vs Parquet Variant on real-world JSON workloads.
On text-heavy data with mixed schemas, Variant and @lancedb's JSONB are within 0-8% of one another in storage size, essentially equal. Variant's 2-4x storage advantage only appears when documents share the same structure with short, repetitive fields.
1/3

@LlamaIndex @itsclelia 4/4 Check out the repo: github.com/run-llama/llam…
Shoutout to @itsclelia for collaborating on this piece! Great work pushing on parsing + retrieval together 🤝

@LlamaIndex @itsclelia 3/4 This avoids splitting modalities across systems and losing context between stages.
The agent can retrieve what it needs, in the form it needs.
On our eval dataset, this setup reaches near-perfect accuracy on complex QA.
Full breakdown:
lancedb.com/blog/smart-par…

1/4 If your agent fails on PDFs with tables or charts, it’s usually not the model.
It’s the data pipeline. 🧵
We worked with @LlamaIndex and @itsclelia to build a structure-aware QA pipeline using LiteParse.
lancedb.com/blog/smart-par…

3/3 Compression is applied without breaking the access path.
Less data read → faster batch fetch → higher GPU utilization.
Benchmarks: lancedb.com/blog/lance-for…

3/3 Because Lance integrates both the file format and table format, fragment metadata tracks which blob objects are referenced by each dataset version.
That enables version-aware garbage collection and compaction without rewriting large blobs.
Full design: lancedb.com/blog/lance-blo…
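
To make the idea concrete, here is a toy sketch of version-aware garbage collection as a set difference. It is not Lance's API or on-disk layout: the function name `collect_unreferenced_blobs`, the `version_refs` mapping, and the blob IDs are all hypothetical, chosen only to show how per-version reference tracking lets a cleaner find blobs that no retained version still points to.

```python
from typing import Dict, Set


def collect_unreferenced_blobs(
    version_refs: Dict[int, Set[str]],
    retained_versions: Set[int],
    stored_blobs: Set[str],
) -> Set[str]:
    """Return blob IDs that no retained dataset version references."""
    live: Set[str] = set()
    for version in retained_versions:
        # Union in every blob referenced by a version we intend to keep.
        live |= version_refs.get(version, set())
    # Anything stored but not live is safe to delete in this toy model.
    return stored_blobs - live


# Hypothetical metadata: dataset version -> blob IDs referenced by its fragments.
version_refs = {
    1: {"blob-a", "blob-b"},
    2: {"blob-b", "blob-c"},
    3: {"blob-c", "blob-d"},
}
stored_blobs = {"blob-a", "blob-b", "blob-c", "blob-d"}

# Keep only versions 2 and 3; blob-a is referenced solely by version 1.
garbage = collect_unreferenced_blobs(version_refs, {2, 3}, stored_blobs)
print(sorted(garbage))
```

Because the decision uses only reference metadata, the large blobs themselves are never read or rewritten in this sketch; only the unreferenced ones would be deleted.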


