LanceDB

878 posts

LanceDB banner
LanceDB

LanceDB

@lancedb

Developer-friendly, open source AI-Native Multimodal Lakehouse https://t.co/wXn4tw5ySn

San Francisco, CA Katılım Nisan 2023
62 Takip Edilen3.9K Takipçiler
LanceDB
LanceDB@lancedb·
3/3 Talks from Lei Xu (CTO @ LanceDB), Goutam Venkatramanan (Software Engineer @ @anyscalecompute), and Hubert Yuan (Software Engineer @ @ExaAILabs) If you’re building AI search or large-scale retrieval systems, this is a concrete look at how it runs in production. Register: events.builder.aws.com/d/73q3y1
English
0
0
1
95
LanceDB
LanceDB@lancedb·
@anyscalecompute @ExaAILabs 2/3 How the system is built: - Lance for storing embeddings + raw data with fast random access - Ray for distributed ingestion and embedding pipelines - Continuous updates to search signals without rebuilding the index
English
1
0
1
95
LanceDB
LanceDB@lancedb·
1/3 What does it take to run search over billions of documents *and* keep it continuously updated? Join LanceDB, @anyscalecompute, and @ExaAILabs on March 31 in SF for a deep dive into the infrastructure behind Exa’s web-scale AI search engine.
LanceDB tweet media
English
1
0
4
205
LanceDB
LanceDB@lancedb·
1/3 LanceDB, @openclaw, and Seed 2.0 let you go from a few screenshots to a working frontend without writing specs. The bottleneck isn’t generation—it’s retaining design intent across iterations.
LanceDB tweet media
English
2
1
8
348
LanceDB
LanceDB@lancedb·
2/3 Screenshots capture appearance, but not the reasoning behind decisions. This system treats that workflow as data: - OpenClaw captures and updates reference designs - LanceDB stores screenshots, notes, code, and revisions - Seed 2.0 generates code directly from that context
English
0
1
1
54
LanceDB
LanceDB@lancedb·
In other words: unstructured data → embeddings → searchable knowledge That’s exactly the workflow many teams are building on LanceDB today — storing and querying multimodal data without stitching together multiple systems. Super awesome to see Lance & LanceDB included here, and shoutout to our partners on the @nvidia cuVS team!
English
0
0
1
263
LanceDB
LanceDB@lancedb·
~90% of the world’s data is unstructured — PDFs, videos, images, audio, documents. “Just as AI was able to solve multimodality problems — perception and understanding — you can use that same technology… to read a PDF, understand its meaning, and embed it into a structure we can search and query.”
English
1
0
1
299
LanceDB
LanceDB@lancedb·
Did you spot Lance & LanceDB in Jensen Huang's keynote at @nvidia GTC today? 👀 Jensen highlighted one of the biggest shifts in AI infrastructure: unlocking unstructured data.
LanceDB tweet media
English
2
3
38
1.7K
LanceDB
LanceDB@lancedb·
2/3 Why this pairing works well architecturally: - Runs as an embedded library, not a separate database service - Embeddings, metadata, and indexes stored together in the same table - Vector + full-text + structured filtering over the same memory dataset - Append-friendly storage that matches how agents write memories
English
1
0
0
271
LanceDB
LanceDB@lancedb·
1/3 @OpenClaw agents persist memory across sessions. Preferences, facts, and prior decisions can be written during one interaction and retrieved the next time the agent runs. In the #OpenClaw ecosystem, @lancedb is quickly becoming the default memory layer.
English
2
2
7
2.5K
LanceDB
LanceDB@lancedb·
Taken from last year's team offsite! So it only makes sense that LanceDB is the default memory layer for @openclaw 🦞
LanceDB tweet media
English
0
1
8
373
LanceDB
LanceDB@lancedb·
Cool to see people building persistent memory for AI agents with LanceDB! This @OpenClaw tutorial by @tonbistudio shows how the community plugin memory-lancedb-pro adds hybrid retrieval, reranking, and session distillation so agents can actually remember across sessions.
English
1
0
6
472
LanceDB
LanceDB@lancedb·
@dlthub and @huggingface 🤗 just shipped a clean way to ingest Hugging Face datasets into LanceDB 🚀 Query datasets over hf:// with DuckDB, stream them in batches, and load them into LanceDB with embeddings generated during ingest. The result is a simple Python path from Hub dataset to a searchable, explorable table.
LanceDB tweet media
English
2
6
23
854
LanceDB
LanceDB@lancedb·
6/6 All four layouts share the same on-disk descriptor: (kind, position, size, blob_id, blob_uri) kind determines the storage semantic while the logical column type stays the same. Read more about the mixed blob access problem and the format design behind Blob V2: lancedb.com/blog/lance-blo…
English
0
0
0
182
LanceDB
LanceDB@lancedb·
5/6 External (URI): Stores only a URI (optionally with offset/size). Allows existing object storage assets to be referenced directly without copying.
English
1
0
0
188
LanceDB
LanceDB@lancedb·
4/6 Dedicated (> 4 MB): Each blob stored in its own file. Isolated from compaction and supports direct object store range reads. Typical for large videos, PDFs, and raw assets.
English
1
0
0
196