📄 Lance Blob V2, 🤗 Upload Lance Datasets to HF Hub, 🦞 LanceDB as OpenClaw Default Memory
March Newsletter • April 8, 2026
Highlights
📄 Lance Blob V2: Making Multimodal Data a First-Class Citizen in the Lakehouse
Lance Blob V2 introduces four storage semantics (inline, packed, dedicated, external) that automatically optimize layout by data size and access pattern, improving small-read efficiency while avoiding costly rewrites for large blobs.
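As a rough illustration of size-based layout selection, here is a minimal Python sketch. It is not the Lance implementation: the `choose_layout` function, the two byte thresholds, and the idea that an external blob is identified by a caller-supplied URI are all assumptions made for this example. Only the four layout names come from the announcement.

```python
from enum import Enum
from typing import Optional


class BlobLayout(Enum):
    """The four storage semantics named in the Blob V2 announcement."""
    INLINE = "inline"
    PACKED = "packed"
    DEDICATED = "dedicated"
    EXTERNAL = "external"


# Hypothetical thresholds, chosen only for illustration.
# The real cutoffs used by Lance are not stated in this newsletter.
INLINE_MAX_BYTES = 64 * 1024          # assumed limit for inline blobs
PACKED_MAX_BYTES = 4 * 1024 * 1024    # assumed limit for packed blobs


def choose_layout(size_bytes: int, external_uri: Optional[str] = None) -> BlobLayout:
    """Pick a layout from the blob size (hypothetical helper, not a Lance API)."""
    if external_uri is not None:
        # Assumption: data that already lives elsewhere is referenced, not copied.
        return BlobLayout.EXTERNAL
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    if size_bytes <= INLINE_MAX_BYTES:
        return BlobLayout.INLINE
    if size_bytes <= PACKED_MAX_BYTES:
        return BlobLayout.PACKED
    return BlobLayout.DEDICATED


if __name__ == "__main__":
    for size in (2_000, 500_000, 50_000_000):
        print(size, choose_layout(size).value)
    print(choose_layout(0, external_uri="s3://bucket/video.mp4").value)
```

The point of the sketch is only that the decision can be made per value at write time, so small and large blobs in the same column need not share one layout.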
🤗 A Guide to Uploading Lance Datasets on the Hugging Face Hub
You can now upload a Lance dataset—including data, indexes, and versions—to Hugging Face and query it via hf:// without downloading, with support for vector search, full-text search, SQL, and nested filtering.
🦞 Why LanceDB Is the Most Natural Memory Layer for OpenClaw
OpenClaw agents persist memory across sessions, and LanceDB is emerging as the default storage layer for that memory. It runs embedded (no service required) and stores embeddings, metadata, and indexes together in a single table.
Jack Ye and Pablo Delgado (ML Engineer at Netflix) will present on multimodal feature engineering at scale at Netflix, covering how LanceDB supports large-scale storage, retrieval, and dataset workflows.
Weston Pace & Lu Qiu will share a deep dive into optimizing a Rust-native search database, focusing on I/O scheduling, async profiling, and achieving storage-level performance.
⚡ Faster indexing and search: ~50% faster FTS with WAND, SIMD-optimized vector search, and reduced indexing time and memory
⚡ Up to 3× faster scans in format v2.2 with lower memory usage during index training
📄 Blob V2 introduces improved storage layout with external blobs and better access patterns
🔗 Vector-first ANN integration in Cypher enables hybrid graph + vector reranking
🔍 Expanded query capabilities with parameterized queries and node-return support
⚡ Improved query execution with index-aware planning, deferred materialization, and filter pushdown
📦 Full SQL surface including MERGE INTO and dataset versioning support
A huge thank you to contributors from Uber, ByteDance, Netflix, Twitter, Huawei, and more!
Read the full newsletter for more updates around lance-spark.
March's Lance Community Syncs focused on the Lance 3.0 and upcoming 4.0 releases, including file format adoption, indexing improvements, and growing ecosystem integrations such as DuckDB and PrestoDB, along with early work on faster distributed vector indexing.
The next Lance Community Sync will take place on April 9, 2026.