# Sentence Transformers in Burn
This library provides an implementation of the Sentence Transformers framework for computing text representations as vector embeddings in Rust. Specifically, it uses the Burn deep learning library to implement the BERT model. Through Burn, the model can be paired with any supported backend for fast, efficient, cross-platform inference on CPUs and GPUs. ST-Burn supports any model that implements the BERT architecture.

The library is currently inference-only.
## Features
- Import models via `safetensors` (using Candle). 📦
- Code structure replicates the official Huggingface `BertModel` implementation. 🚀
- Flexible inference backend using Burn. 🔧
## Installation
`sentence-transformers-burn` can be installed from source:

```sh
cargo add --git https://github.com/tvergho/sentence-transformers-burn.git sentence_transformers
```
Run `cargo build` to make sure everything builds correctly:

```sh
cargo build
```
Note that building the `burn-tch` dependency may require manually linking LibTorch. After installing it via pip:

```sh
export LIBTORCH=$(python3 -c 'import torch; from pathlib import Path; print(Path(torch.__file__).parent)')
# /path/to/torch
export DYLD_LIBRARY_PATH=/path/to/torch/lib
```
Python dependencies (for running the scripts in `scripts/`) should also be installed:

```sh
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```
## Usage
A `BertModel` can be loaded and initialized from a file, as in the example below:

```rust
use sentence_transformers::bert_loader::{load_model_from_safetensors, load_config_from_json};
use sentence_transformers::model::{
    bert_embeddings::BertEmbeddingsInferenceBatch,
    bert_model::BertModel,
};
use burn_tch::{TchBackend, TchDevice};
use burn::tensor::Tensor;

const BATCH_SIZE: u64 = 64;

let device = TchDevice::Cpu;

// Load the model configuration, then the weights exported to safetensors.
let config = load_config_from_json("model/bert_config.json");
let model: BertModel<_> = load_model_from_safetensors::<TchBackend<f32>>(
    "model/bert_model.safetensors",
    &device,
    config,
);

// Build a dummy batch: token ids and an attention mask of shape [BATCH_SIZE, 256].
let batch = BertEmbeddingsInferenceBatch {
    tokens: Tensor::zeros(vec![BATCH_SIZE, 256]).to_device(&device),
    mask_attn: Some(Tensor::ones(vec![BATCH_SIZE, 256]).to_device(&device)),
};

model.forward(batch); // [batch_size, seq_len, n_dims]
```
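The forward pass yields one embedding per token. To obtain a single fixed-size vector per sentence, pool over the sequence dimension. Below is a minimal sketch using an unweighted mean over Burn's standard tensor ops (`dims`, `mean_dim`, `reshape`); note that the reference Sentence Transformers pooling weights tokens by the attention mask, so treat this as a simplification:

```rust
// Continuing from the example above, binding the forward output this time.
// Unweighted mean pooling: average the token embeddings of each sentence.
let output = model.forward(batch); // [batch_size, seq_len, n_dims]
let [b, _s, d] = output.dims();

// [batch_size, seq_len, n_dims] -> [batch_size, 1, n_dims] -> [batch_size, n_dims]
let sentence_embeddings = output.mean_dim(1).reshape([b, d]);
```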
`sentence-transformers-burn` also comes with a built-in inference server. To start it, simply run:

```sh
cargo run --release --bin server -- path/to/model/dir
```
The model directory should contain a `bert_model.safetensors` and a `bert_config.json` file. Once the server is running, inference can be initiated via POST request:

```
POST http://localhost:3030/embed

{
  "input_ids": [[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]],
  "attention_mask": [[1, 1, 1, 1, 1, 1, 1, 1, 1, 1]]
}
```
This returns a 3D array of floats with shape `[batch_size, seq_len, n_dims]`.
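As an illustration, the same request can be issued from Rust. The sketch below assumes the `reqwest` crate (with its `blocking` and `json` features) and `serde_json`, neither of which this library depends on:

```rust
use serde_json::json;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Same payload as the POST example above.
    let body = json!({
        "input_ids": [[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]],
        "attention_mask": [[1, 1, 1, 1, 1, 1, 1, 1, 1, 1]]
    });

    let embeddings: serde_json::Value = reqwest::blocking::Client::new()
        .post("http://localhost:3030/embed")
        .json(&body)
        .send()?
        .json()?;

    // The response is a 3D array: [batch_size, seq_len, n_dims].
    println!("{embeddings}");
    Ok(())
}
```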
## Testing
Tests can be run to verify that the Rust model output matches a comparable Huggingface model. To save a model for use during testing, run `python scripts/prepare_test.py`. Then, simply:

```sh
cargo test
```
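For a rough idea of what such a test looks like, here is a minimal shape check. Paths, the test name, and the input shape are illustrative; the crate's actual tests go further by comparing output values against the Huggingface reference:

```rust
use sentence_transformers::bert_loader::{load_model_from_safetensors, load_config_from_json};
use sentence_transformers::model::{
    bert_embeddings::BertEmbeddingsInferenceBatch,
    bert_model::BertModel,
};
use burn_tch::{TchBackend, TchDevice};
use burn::tensor::Tensor;

#[test]
fn forward_output_has_expected_shape() {
    let device = TchDevice::Cpu;
    let config = load_config_from_json("model/bert_config.json");
    let model: BertModel<_> = load_model_from_safetensors::<TchBackend<f32>>(
        "model/bert_model.safetensors",
        &device,
        config,
    );

    // A single ten-token input, mirroring the server example.
    let batch = BertEmbeddingsInferenceBatch {
        tokens: Tensor::zeros(vec![1, 10]).to_device(&device),
        mask_attn: Some(Tensor::ones(vec![1, 10]).to_device(&device)),
    };

    let output = model.forward(batch); // [batch_size, seq_len, n_dims]
    let [b, s, _d] = output.dims();
    assert_eq!((b, s), (1, 10));
}
```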
## To Do
- Cleaner model import (directly from safetensors/config.json)
- Proper documentation and more testing
- More model usage options (e.g. classification, NER, question answering heads)
- GGML backend/quantization