https://github.com/lancedb/lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

https://github.com/lancedb/lance

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 91 committers (2.2%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.7%) to scientific vocabulary

Keywords

apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust

Keywords from Contributors

arrow parquet serving llm-serving llm-inference distributed deployment hyperparameter-optimization hyperparameter-search large-language-models
Last synced: 6 months ago · JSON representation

Repository

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

Basic Info
Statistics
  • Stars: 5,305
  • Watchers: 51
  • Forks: 450
  • Open Issues: 808
  • Releases: 330
Topics
apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust
Created over 3 years ago · Last pushed 6 months ago
Metadata Files
Readme License

README.md

Lance Logo **Modern columnar data format for ML. Convert from Parquet in 2-lines of code for 100x faster random access, zero-cost schema evolution, rich secondary indices, versioning, and more.
** **Compatible with Pandas, DuckDB, Polars, Pyarrow, and Ray with more integrations on the way.** DocumentationBlogDiscordX [![CI Badge]][CI] [![Docs Badge]][Docs] [![crates.io badge]][crates.io] [![Python versions badge]][Python versions]


Lance is a modern columnar data format that is optimized for ML workflows and datasets. Lance is perfect for:

  1. Building search engines and feature stores.
  2. Large-scale ML training requiring high performance IO and shuffles.
  3. Storing, querying, and inspecting deeply nested data for robotics or large blobs like images, point clouds, and more.

The key features of Lance include:

  • High-performance random access: 100x faster than Parquet without sacrificing scan performance.

  • Vector search: find nearest neighbors in milliseconds and combine OLAP-queries with vector search.

  • Zero-copy, automatic versioning: manage versions of your data without needing extra infrastructure.

  • Ecosystem integrations: Apache Arrow, Pandas, Polars, DuckDB, Ray, Spark and more on the way.

[!TIP] Lance is in active development and we welcome contributions. Please see our contributing guide for more information.

Quick Start

Installation

shell pip install pylance

To install a preview release:

shell pip install --pre --extra-index-url https://pypi.fury.io/lancedb/ pylance

[!TIP] Preview releases are released more often than full releases and contain the latest features and bug fixes. They receive the same level of testing as full releases. We guarantee they will remain published and available for download for at least 6 months. When you want to pin to a specific version, prefer a stable release.

Converting to Lance

```python import lance

import pandas as pd import pyarrow as pa import pyarrow.dataset

df = pd.DataFrame({"a": [5], "b": [10]}) uri = "/tmp/test.parquet" tbl = pa.Table.frompandas(df) pa.dataset.writedataset(tbl, uri, format='parquet')

parquet = pa.dataset.dataset(uri, format='parquet') lance.write_dataset(parquet, "/tmp/test.lance") ```

Reading Lance data python dataset = lance.dataset("/tmp/test.lance") assert isinstance(dataset, pa.dataset.Dataset)

Pandas python df = dataset.to_table().to_pandas() df

DuckDB ```python import duckdb

If this segfaults, make sure you have duckdb v0.7+ installed

duckdb.query("SELECT * FROM dataset LIMIT 10").to_df() ```

Vector search

Download the sift1m subset

shell wget ftp://ftp.irisa.fr/local/texmex/corpus/sift.tar.gz tar -xzf sift.tar.gz

Convert it to Lance

```python import lance from lance.vector import vectotable import numpy as np import struct

nvecs = 1000000 ndims = 128 with open("sift/sift_base.fvecs", mode="rb") as fobj: buf = fobj.read() data = np.array(struct.unpack("<128000000f", buf[4 : 4 + 4 * nvecs * ndims])).reshape((nvecs, ndims)) dd = dict(zip(range(nvecs), data))

table = vectotable(dd) uri = "vecdata.lance" sift1m = lance.writedataset(table, uri, maxrowspergroup=8192, maxrowsperfile=1024*1024) ```

Build the index

python sift1m.create_index("vector", index_type="IVF_PQ", num_partitions=256, # IVF num_sub_vectors=16) # PQ

Search the dataset

```python

Get top 10 similar vectors

import duckdb

dataset = lance.dataset(uri)

Sample 100 query vectors. If this segfaults, make sure you have duckdb v0.7+ installed

sample = duckdb.query("SELECT vector FROM dataset USING SAMPLE 100").todf() queryvectors = np.array([np.array(x) for x in sample.vector])

Get nearest neighbors for all of them

rs = [dataset.totable(nearest={"column": "vector", "k": 10, "q": q}) for q in queryvectors] ```

Directory structure

| Directory | Description | |--------------------|--------------------------| | rust | Core Rust implementation | | python | Python bindings (PyO3) | | java | Java bindings (JNI) | | docs | Documentation source |

What makes Lance different

Here we will highlight a few aspects of Lance’s design. For more details, see the full Lance design document.

Vector index: Vector index for similarity search over embedding space. Support both CPUs (x86_64 and arm) and GPU (Nvidia (cuda) and Apple Silicon (mps)).

Encodings: To achieve both fast columnar scan and sub-linear point queries, Lance uses custom encodings and layouts.

Nested fields: Lance stores each subfield as a separate column to support efficient filters like “find images where detected objects include cats”.

Versioning: A Manifest can be used to record snapshots. Currently we support creating new versions automatically via appends, overwrites, and index creation.

Fast updates (ROADMAP): Updates will be supported via write-ahead logs.

Rich secondary indices: Support BTree, Bitmap, Full text search, Label list, NGrams, and more.

Benchmarks

Vector search

We used the SIFT dataset to benchmark our results with 1M vectors of 128D

  1. For 100 randomly sampled query vectors, we get <1ms average response time (on a 2023 m2 MacBook Air)

avg_latency.png

  1. ANNs are always a trade-off between recall and performance

avg_latency.png

Vs. parquet

We create a Lance dataset using the Oxford Pet dataset to do some preliminary performance testing of Lance as compared to Parquet and raw image/XMLs. For analytics queries, Lance is 50-100x better than reading the raw metadata. For batched random access, Lance is 100x better than both parquet and raw files.

Why are you building yet another data format?!

The machine learning development cycle involves the steps:

mermaid graph LR A[Collection] --> B[Exploration]; B --> C[Analytics]; C --> D[Feature Engineer]; D --> E[Training]; E --> F[Evaluation]; F --> C; E --> G[Deployment]; G --> H[Monitoring]; H --> A;

People use different data representations to varying stages for the performance or limited by the tooling available. Academia mainly uses XML / JSON for annotations and zipped images/sensors data for deep learning, which is difficult to integrate into data infrastructure and slow to train over cloud storage. While industry uses data lakes (Parquet-based techniques, i.e., Delta Lake, Iceberg) or data warehouses (AWS Redshift or Google BigQuery) to collect and analyze data, they have to convert the data into training-friendly formats, such as Rikai/Petastorm or TFRecord. Multiple single-purpose data transforms, as well as syncing copies between cloud storage to local training instances have become a common practice.

While each of the existing data formats excels at the workload it was originally designed for, we need a new data format tailored for multistage ML development cycles to reduce and data silos.

A comparison of different data formats in each stage of ML development cycle.

| | Lance | Parquet & ORC | JSON & XML | TFRecord | Database | Warehouse | |---------------------|-------|---------------|------------|----------|----------|-----------| | Analytics | Fast | Fast | Slow | Slow | Decent | Fast | | Feature Engineering | Fast | Fast | Decent | Slow | Decent | Good | | Training | Fast | Decent | Slow | Fast | N/A | N/A | | Exploration | Fast | Slow | Fast | Slow | Fast | Decent | | Infra Support | Rich | Rich | Decent | Limited | Rich | Rich |

Community Highlights

Lance is currently used in production by: * LanceDB, a serverless, low-latency vector database for ML applications * LanceDB Enterprise, hyperscale LanceDB with enterprise SLA. * Leading multimodal Gen AI companies for training over petabyte-scale multimodal data. * Self-driving car company for large-scale storage, retrieval and processing of multi-modal data. * E-commerce company for billion-scale+ vector personalized search. * and more.

Presentations, Blogs and Talks

Owner

  • Name: Lance DB
  • Login: lancedb
  • Kind: organization
  • Location: United States of America

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 2,364
  • Total Committers: 91
  • Avg Commits per committer: 25.978
  • Development Distribution Score (DDS): 0.711
Past Year
  • Commits: 829
  • Committers: 51
  • Avg Commits per committer: 16.255
  • Development Distribution Score (DDS): 0.73
Top Committers
Name Email Commits
Lei Xu l****i@l****m 683
Weston Pace w****e@g****m 363
Will Jones w****7@g****m 273
Chang She 7****n 212
BubbleCal b****l@o****m 191
Rob Meng r****g@g****m 127
Lance Release l****v@l****m 103
Lance Release l****v@e****i 37
LuQQiu l****b@g****m 32
broccoliSpicy 9****y 30
gsilvestrin g****n 29
Bert a****t@l****m 23
vinoyang y****7@g****m 21
huangzhaowei c****x@g****m 21
Chongchen Chen c****y@q****m 16
Rok Mihevc r****k@m****g 15
Jai Chopra j****a@g****m 14
Raunak Shah r****0@g****m 14
jay j****n@b****m 12
Jiacheng Yang 9****b 9
jacketsj j****j 9
dsgibbons g****t@g****o 7
Tanay Mehta h****y@g****m 6
Yue n****m@g****m 6
Ishan Anand a****n@o****m 5
Jack Ye y****n@g****m 5
Wyatt Alt w****t@g****m 5
Xin Hao h****t@g****m 5
Utkarsh Gautam u****7@g****m 4
universalmind303 c****d@g****m 4
and 61 more...

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 1,186
  • Total pull requests: 3,473
  • Average time to close issues: about 2 months
  • Average time to close pull requests: 8 days
  • Total issue authors: 151
  • Total pull request authors: 131
  • Average comments per issue: 0.79
  • Average comments per pull request: 1.17
  • Merged pull requests: 2,631
  • Bot issues: 0
  • Bot pull requests: 2
Past Year
  • Issues: 607
  • Pull requests: 1,674
  • Average time to close issues: 18 days
  • Average time to close pull requests: 5 days
  • Issue authors: 92
  • Pull request authors: 88
  • Average comments per issue: 0.63
  • Average comments per pull request: 1.34
  • Merged pull requests: 1,194
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • wjones127 (294)
  • westonpace (220)
  • eddyxu (94)
  • jackye1995 (66)
  • chebbyChefNEQ (47)
  • yanghua (46)
  • broccoliSpicy (36)
  • changhiskhan (29)
  • BubbleCal (19)
  • Jay-ju (17)
  • majin1102 (16)
  • Xuanwo (16)
  • jacketsj (13)
  • SaintBacchus (12)
  • tonyf (12)
Pull Request Authors
  • westonpace (732)
  • eddyxu (497)
  • wjones127 (475)
  • BubbleCal (464)
  • chebbyChefNEQ (198)
  • jackye1995 (96)
  • yanghua (89)
  • LuQQiu (84)
  • Jay-ju (73)
  • broccoliSpicy (69)
  • SaintBacchus (54)
  • Xuanwo (48)
  • albertlockett (43)
  • majin1102 (37)
  • chenkovsky (36)
Top Labels
Issue Labels
enhancement (145) rust (114) bug (103) good first issue (56) python (55) performance (41) help wanted (38) vector (20) documentation (20) arrow (17) benchmark (16) question (15) ci (14) java (14) priority: high (10) epic (10) PyTorch (10) c++ (3) duckdb (3) flaky-test (3) windows (2) macOS (2) feature (2) format (2) chore (1) Tensorflow (1) x86 (1)
Pull Request Labels
python (795) enhancement (786) bug (536) chore (340) java (233) performance (140) ci (133) documentation (95) breaking-change (78) rust (24) vector (19) experimental (11) benchmark (11) donotreview (11) donotmerge (11) WIP (5) PyTorch (5) arrow (4) feature (3) dependencies (2) storage (1) good first issue (1) wasm (1) Tensorflow (1)

Packages

  • Total packages: 23
  • Total downloads:
    • cargo 2,795,723 total
  • Total dependent packages: 52
    (may contain duplicates)
  • Total dependent repositories: 1
    (may contain duplicates)
  • Total versions: 1,699
  • Total maintainers: 6
proxy.golang.org: github.com/lancedb/lance
  • Versions: 332
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 5.6%
Average: 5.8%
Dependent repos count: 6.0%
Last synced: 6 months ago
crates.io: lance

A columnar data format that is 100x faster than Parquet for random access.

  • Versions: 156
  • Dependent Packages: 4
  • Dependent Repositories: 1
  • Downloads: 302,221 Total
Rankings
Stargazers count: 2.8%
Forks count: 6.7%
Average: 12.0%
Dependent packages count: 15.7%
Dependent repos count: 17.1%
Downloads: 17.8%
Last synced: 6 months ago
repo1.maven.org: com.lancedb:lance-core

Lance Format Java API

  • Versions: 23
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Stargazers count: 4.5%
Forks count: 9.3%
Average: 23.0%
Dependent repos count: 32.2%
Dependent packages count: 46.1%
Last synced: 6 months ago
repo1.maven.org: com.lancedb:lance-parent

Lance Format Java API

  • Versions: 23
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Stargazers count: 4.5%
Forks count: 9.3%
Average: 23.0%
Dependent repos count: 32.2%
Dependent packages count: 46.1%
Last synced: 6 months ago
crates.io: lance-tools

Tools for interacting with Lance files and tables

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 0 Total
Rankings
Stargazers count: 1.8%
Forks count: 2.4%
Dependent repos count: 20.1%
Dependent packages count: 26.7%
Average: 29.1%
Downloads: 94.5%
Maintainers (1)
Last synced: 6 months ago
crates.io: lance-arrow

Arrow Extension for Lance

  • Versions: 104
  • Dependent Packages: 10
  • Dependent Repositories: 0
  • Downloads: 239,912 Total
Rankings
Stargazers count: 2.8%
Forks count: 6.7%
Dependent repos count: 30.2%
Dependent packages count: 31.2%
Average: 32.8%
Downloads: 93.2%
Maintainers (1)
Last synced: 6 months ago
crates.io: lance-linalg

A columnar data format that is 100x faster than Parquet for random access.

  • Versions: 103
  • Dependent Packages: 4
  • Dependent Repositories: 0
  • Downloads: 236,093 Total
Rankings
Stargazers count: 2.8%
Forks count: 6.7%
Dependent repos count: 30.2%
Dependent packages count: 31.2%
Average: 32.9%
Downloads: 93.5%
Last synced: 6 months ago
crates.io: lance-testing

A columnar data format that is 100x faster than Parquet for random access.

  • Versions: 104
  • Dependent Packages: 5
  • Dependent Repositories: 0
  • Downloads: 220,392 Total
Rankings
Stargazers count: 2.8%
Forks count: 6.7%
Dependent repos count: 30.2%
Dependent packages count: 31.2%
Average: 33.7%
Downloads: 97.5%
Last synced: 6 months ago
crates.io: lance-index

Lance indices implementation

  • Versions: 97
  • Dependent Packages: 3
  • Dependent Repositories: 0
  • Downloads: 224,466 Total
Rankings
Stargazers count: 2.6%
Forks count: 6.7%
Dependent repos count: 30.5%
Dependent packages count: 30.7%
Average: 33.8%
Downloads: 98.3%
Maintainers (1)
Last synced: 6 months ago
crates.io: lance-datagen

A columnar data format that is 100x faster than Parquet for random access.

  • Versions: 93
  • Dependent Packages: 5
  • Dependent Repositories: 0
  • Downloads: 139,819 Total
Rankings
Stargazers count: 2.6%
Forks count: 6.6%
Dependent repos count: 30.7%
Average: 34.9%
Dependent packages count: 36.2%
Downloads: 98.4%
Maintainers (1)
Last synced: 6 months ago
crates.io: vercel_blob

A rust client for the Vercel Blob Storage API

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 1,755 Total
Rankings
Stargazers count: 2.6%
Forks count: 6.7%
Dependent repos count: 30.6%
Average: 34.9%
Dependent packages count: 36.3%
Downloads: 98.4%
Maintainers (1)
Last synced: 6 months ago
crates.io: fsst

FSST string compression for Lance

  • Versions: 34
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 119,252 Total
Rankings
Dependent repos count: 26.5%
Dependent packages count: 35.1%
Average: 44.3%
Downloads: 71.2%
Last synced: 6 months ago
crates.io: lance-bitpacking

Vendored copy of https://github.com/spiraldb/fastlanes for use in Lance

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 188 Total
Rankings
Dependent repos count: 20.3%
Dependent packages count: 26.8%
Average: 47.2%
Downloads: 94.6%
Maintainers (1)
Last synced: 6 months ago
crates.io: lance-examples

Lance examples in Rust

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 1,507 Total
Rankings
Dependent repos count: 21.2%
Dependent packages count: 28.1%
Average: 48.0%
Downloads: 94.8%
Maintainers (2)
Last synced: 6 months ago
crates.io: lance-encoding

Encoders and decoders for the Lance file format

  • Versions: 56
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Downloads: 157,462 Total
Rankings
Dependent repos count: 28.3%
Dependent packages count: 33.4%
Average: 52.7%
Downloads: 96.6%
Last synced: 6 months ago
crates.io: lance-jni

JNI bindings for Lance Columnar format

  • Versions: 27
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 16,978 Total
Rankings
Dependent repos count: 27.0%
Dependent packages count: 35.8%
Average: 53.1%
Downloads: 96.5%
Last synced: 6 months ago
crates.io: lance-core

Lance Columnar Format -- Core Library

  • Versions: 96
  • Dependent Packages: 8
  • Dependent Repositories: 0
  • Downloads: 232,237 Total
Rankings
Dependent repos count: 30.5%
Dependent packages count: 30.7%
Average: 53.2%
Downloads: 98.4%
Last synced: 6 months ago
crates.io: lance-test-macros

A columnar data format that is 100x faster than Parquet for random access.

  • Versions: 100
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Downloads: 94,476 Total
Rankings
Dependent repos count: 30.3%
Dependent packages count: 31.0%
Average: 53.2%
Downloads: 98.3%
Maintainers (1)
Last synced: 6 months ago
crates.io: lance-encoding-datafusion

Encoders and decoders for the Lance file format that rely on datafusion

  • Versions: 36
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 25,192 Total
Rankings
Dependent repos count: 27.3%
Dependent packages count: 36.2%
Average: 53.3%
Downloads: 96.4%
Last synced: 6 months ago
crates.io: lance-io

I/O utilities for Lance

  • Versions: 73
  • Dependent Packages: 4
  • Dependent Repositories: 0
  • Downloads: 191,048 Total
Rankings
Dependent repos count: 29.8%
Dependent packages count: 35.1%
Average: 53.8%
Downloads: 96.5%
Last synced: 6 months ago
crates.io: lance-table

Utilities for the Lance table format

  • Versions: 73
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Downloads: 189,972 Total
Rankings
Dependent repos count: 29.8%
Dependent packages count: 35.1%
Average: 53.8%
Downloads: 96.5%
Last synced: 6 months ago
crates.io: lance-file

Utilities for the Lance file format

  • Versions: 72
  • Dependent Packages: 3
  • Dependent Repositories: 0
  • Downloads: 189,505 Total
Rankings
Dependent repos count: 29.8%
Dependent packages count: 35.1%
Average: 53.8%
Downloads: 96.5%
Last synced: 6 months ago
crates.io: lance-datafusion

Internal utilities used by other lance modules to simplify working with datafusion

  • Versions: 87
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Downloads: 213,248 Total
Rankings
Dependent repos count: 30.8%
Dependent packages count: 36.1%
Average: 55.1%
Downloads: 98.4%
Last synced: 6 months ago

Dependencies

.github/workflows/benchmarks.yml actions
  • ./.github/workflows/build_linux_wheel * composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • actions/upload-artifact v3 composite
.github/workflows/build_linux_wheel/action.yml actions
  • PyO3/maturin-action v1 composite
.github/workflows/build_mac_wheel/action.yml actions
  • PyO3/maturin-action v1 composite
.github/workflows/build_windows_wheel/action.yml actions
  • PyO3/maturin-action v1 composite
  • actions/upload-artifact v3 composite
.github/workflows/bump-version/action.yml actions
.github/workflows/cargo-publish.yml actions
  • Swatinem/rust-cache v2 composite
  • actions/checkout v3 composite
  • katyo/publish-crates v2 composite
.github/workflows/docs.yml actions
  • ./.github/workflows/build_linux_wheel * composite
  • actions/checkout v3 composite
  • actions/configure-pages v2 composite
  • actions/deploy-pages v1 composite
  • actions/setup-python v4 composite
  • actions/upload-pages-artifact v1 composite
.github/workflows/duckdb.yml actions
  • actions/checkout v3 composite
.github/workflows/install_windows_dependencies/action.yml actions
.github/workflows/make-release-commit.yml actions
  • ./.github/workflows/bump-version * composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • ad-m/github-push-action master composite
.github/workflows/notebook.yml actions
  • ./.github/workflows/build_linux_wheel * composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
.github/workflows/pr-title.yml actions
  • actions/github-script v6 composite
  • actions/setup-node v3 composite
.github/workflows/pypi-publish.yml actions
  • ./.github/workflows/build_linux_wheel * composite
  • ./.github/workflows/build_mac_wheel * composite
  • ./.github/workflows/build_windows_wheel * composite
  • ./.github/workflows/upload_wheel * composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
.github/workflows/python.yml actions
  • ./.github/workflows/build_linux_wheel * composite
  • ./.github/workflows/build_mac_wheel * composite
  • ./.github/workflows/build_windows_wheel * composite
  • ./.github/workflows/run_integtests * composite
  • ./.github/workflows/run_tests * composite
  • Swatinem/rust-cache v2 composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • amazon/dynamodb-local * docker
  • lazybit/minio * docker
.github/workflows/run_integtests/action.yml actions
.github/workflows/run_tests/action.yml actions
.github/workflows/rust.yml actions
  • Swatinem/rust-cache v2 composite
  • actions/checkout v3 composite
.github/workflows/upload_wheel/action.yml actions
integration/duckdb_lance/Cargo.toml cargo
  • libduckdb-sys 0.8.1 development
  • arrow-array 43.0.0
  • arrow-schema 43.0.0
  • futures 0.3
  • lazy_static 1.4.0
  • num-traits 0.2
  • tokio 1.23
integration/duckdb_lance/duckdb-ext/Cargo.toml cargo
python/Cargo.toml cargo
rust/Cargo.toml cargo
rust/lance/Cargo.toml cargo
  • all_asserts 2.3.1 development
  • approx 0.5.1 development
  • clap 4.1.1 development
  • dirs 5.0.0 development
  • mock_instant 0.3.1 development
  • arrow 43.0.0
  • arrow-ipc 43.0
  • async-recursion 1.0
  • async-trait 0.1.60
  • aws-config 0.56
  • aws-credential-types 0.56
  • aws-sdk-dynamodb 0.30.0
  • byteorder 1.4.3
  • bytes 1.3
  • cblas 0.4.0
  • chrono 0.4.23
  • clap 4.1.1
  • dashmap 5
  • datafusion 28.0.0
  • half 2.2.1
  • http 0.2.9
  • lapack 0.19.0
  • lru_time_cache 0.11
  • moka 0.11.3
  • num-traits 0.2
  • num_cpus 1.0
  • ordered-float 3.6.0
  • path-absolutize 3.0.14
  • pin-project 1.0
  • prost 0.10
  • prost-types 0.10
  • rand 0.8.3
  • roaring 0.10.1
  • shellexpand 3.0.0
  • tfrecord 0.14.0
  • url 2.3
  • uuid 1.2
rust/lance-arrow/Cargo.toml cargo
rust/lance-linalg/Cargo.toml cargo
rust/lance-testing/Cargo.toml cargo
.devcontainer/Dockerfile docker
  • ubuntu 22.04 build
docs/requirements.txt pypi
  • breathe *
  • cython *
  • duckdb >=0.8
  • fastai *
  • jupyterlab *
  • pandas *
  • piccolo-theme *
  • pyarrow *
  • sphinx ==7.1.2
  • tensorflow *
  • xmltodict *
python/pyproject.toml pypi
  • numpy >=1.22
  • pyarrow >=10