Rust implementation of @Qdrant/fastembed.

Overview

πŸ• Features

  • Supports synchronous usage. No dependency on Tokio.
  • Uses @huggingface/tokenizers for blazing-fast encodings.
  • Supports batch embeddings with parallelism using Rayon.

The default embedding supports "query" and "passage" prefixes for the input text. The default model is Flag Embedding, which is at the top of the MTEB leaderboard.

πŸ” Not looking for Rust?

πŸ€– Models

πŸš€ Installation

Run the following Cargo command in your project directory:

cargo add fastembed

Or add the following line to your Cargo.toml:

fastembed = "1"
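
In a full Cargo.toml, this line goes under the [dependencies] table:

[dependencies]
fastembed = "1"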

πŸ“– Usage

use fastembed::{FlagEmbedding, InitOptions, EmbeddingModel, EmbeddingBase};

// With default InitOptions
let model: FlagEmbedding = FlagEmbedding::try_new(Default::default())?;

// With custom InitOptions
let model: FlagEmbedding = FlagEmbedding::try_new(InitOptions {
    model_name: EmbeddingModel::BGEBaseEN,
    show_download_message: true,
    ..Default::default()
})?;

let documents = vec![
    "passage: Hello, World!",
    "query: Hello, World!",
    "passage: This is an example passage.",
    // You can leave out the prefix but it's recommended
    "fastembed-rs is licensed under MIT",
];

// Generate embeddings with the default batch size, 256
let embeddings = model.embed(documents, None)?;

println!("Embeddings length: {}", embeddings.len()); // -> Embeddings length: 4
println!("Embedding dimension: {}", embeddings[0].len()); // -> Embedding dimension: 768

Supports passage and query embeddings for more accurate results

// Generate embeddings for the passages
// The texts are prefixed with "passage" for better results
// The batch size is set to 1 for demonstration purposes
let passages = vec![
    "This is the first passage. It provides more context for retrieval.",
    "Here's the second passage, which is longer than the first one. It includes additional information.",
    "And this is the third passage, the longest of all. It contains several sentences and is meant for more extensive testing.",
];

let embeddings = model.passage_embed(passages, Some(1))?;

println!("Passage embeddings length: {}", embeddings.len()); // -> Passage embeddings length: 3
println!("Passage embedding dimension: {}", embeddings[0].len()); // -> Passage embedding dimension: 768

// Generate embeddings for the query
// The text is prefixed with "query" for better retrieval
let query = "What is the answer to this generic question?";

let query_embedding = model.query_embed(query)?;

println!("Query embedding dimension: {}", query_embedding.len()); // -> Query embedding dimension: 768

πŸš’ Under the hood

Why fast?

It's important we justify the "fast" in FastEmbed. FastEmbed is fast because:

  1. Quantized model weights
  2. ONNX Runtime, which allows for inference on CPU, GPU, and other dedicated runtimes

Why light?

  1. No hidden dependencies via Huggingface Transformers

Why accurate?

  1. Better than OpenAI Ada-002
  2. Top of the Embedding leaderboards e.g. MTEB

πŸ“„ LICENSE

MIT Β© 2023

Comments
  • fix: MLE5 large url transform

    The MLE5Large model URL doesn't follow the same naming convention as the other models, so "fast-multilingual-e5-large" is transformed to "intfloat-multilingual-e5-large" in the download URL. The model directory name in the GCS storage is "fast-multilingual-e5-large", like the others.
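
    A rough illustration of the rename being described (the function below is illustrative, not the crate's actual code):

    // Illustrative only: map the "fast-" model name used for the GCS directory to the
    // "intfloat-" name expected in the download URL.
    fn to_download_name(model_dir: &str) -> String {
        if model_dir == "fast-multilingual-e5-large" {
            model_dir.replacen("fast-", "intfloat-", 1)
        } else {
            model_dir.to_string()
        }
    }

    fn main() {
        assert_eq!(
            to_download_name("fast-multilingual-e5-large"),
            "intfloat-multilingual-e5-large"
        );
    }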

    released 
    opened by Anush008 1
  • Wrong onnxruntime.dll Picked Up When Running Integration Tests

    Error

    ort 1.16 is not compatible with the ONNX Runtime binary found at `onnxruntime.dll`; expected GetVersionString to return '1.16.x', but got '1.10.0'
    

    How To Replicate

    On Windows, fastembed does not place onnxruntime.dll in target/debug/deps (which is used by integration tests). Therefore, if an onnxruntime.dll exists in C:/Windows/System32/, that version is used instead, causing the issue above.

    To replicate, ensure a version of onnxruntime.dll exists in C:/Windows/System32/ that is not the version fastembed expects (currently 1.16.x). Then add a fastembed model to a library, like:

    use fastembed::{EmbeddingModel, FlagEmbedding, InitOptions};
    use lazy_static::lazy_static;

    lazy_static! {
        pub static ref EMBEDDING_MODEL: FlagEmbedding = FlagEmbedding::try_new(InitOptions {
            model_name: EmbeddingModel::BGEBaseEN,
            show_download_message: true,
            ..Default::default()
        })
        .unwrap();
    }
    

    Then importing that module into an integration test and attempting to use it will cause the error above.

    How To fix by hand

    Add onnxruntime.dll to target/debug/deps
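
    One way to automate that manual step is a build script in the consuming crate. The sketch below is illustrative only (not something fastembed ships) and assumes the path to the desired 1.16.x DLL is supplied via a hypothetical ONNXRUNTIME_DLL_PATH environment variable.

    // build.rs (illustrative sketch): copy a known-good onnxruntime.dll next to the
    // test binaries so integration tests do not fall back to C:/Windows/System32/.
    use std::{env, fs, path::PathBuf};

    fn main() {
        // ONNXRUNTIME_DLL_PATH is a hypothetical variable pointing at the 1.16.x DLL.
        if let Ok(src) = env::var("ONNXRUNTIME_DLL_PATH") {
            // OUT_DIR is target/<profile>/build/<crate>-<hash>/out; three levels up is target/<profile>.
            let out_dir = PathBuf::from(env::var("OUT_DIR").unwrap());
            if let Some(profile_dir) = out_dir.ancestors().nth(3) {
                let dest = profile_dir.join("deps").join("onnxruntime.dll");
                let _ = fs::copy(&src, dest);
            }
            println!("cargo:rerun-if-env-changed=ONNXRUNTIME_DLL_PATH");
        }
    }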

    Expected Behavior

    If fastembed runs correctly in a binary or library crate, it should also run correctly in its integration tests. (Note: I have not tested release mode or regular unit tests with fastembed yet.)

    Possible solution

    ort has a feature that could be used, but I have not tested it: https://github.com/pykeio/ort/blob/4ab57859caa9490473bac3dfcd043dbb1b89d9a5/Cargo.toml#L44

    opened by mcmah309 2