13 Repositories
Rust sentence-bert Libraries
SHA256 sentence: discover a SHA256 checksum that matches a sentence's description of hex digit words.
SHA256 sentence "The SHA256 for this sentence begins with: one, eight, two, a, seven, c and nine." Inspired by @lauriewired post Inspired by @humbleha
Implementation of sentence embeddings with BERT in Rust, using the Burn library.
Sentence Transformers in Burn This library provides an implementation of the Sentence Transformers framework for computing text representations as vec
A rule based sentence segmentation library.
cutters A rule based sentence segmentation library. π§ This library is experimental. π§ Features Full UTF-8 support. Robust parsing. Language specific
Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
SyntaxDot Introduction SyntaxDot is a sequence labeler and dependency parser using Transformer networks. SyntaxDot models can be trained from scratch
A Demo server serving Bert through ONNX with GPU written in Rust with 3
Demo BERT ONNX server written in rust This demo showcase the use of onnxruntime-rs on BERT with a GPU on CUDA 11 served by actix-web and tokenized wit
High-level non-blocking Deno bindings to the rust-bert machine learning crate.
bertml High-level non-blocking Deno bindings to the rust-bert machine learning crate. Guide Introduction The ModelManager class manages the FFI bindin
Parse BNF grammar definitions
bnf A library for parsing BackusβNaur form context-free grammars. What does a parsable BNF grammar look like? The following grammar from the Wikipedia
π¦ A Rust implementation of a RoBERTa classification model for the SNLI dataset
RustBERTa-SNLI A Rust implementation of a RoBERTa classification model for the SNLI dataset, with support for fine-tuning, predicting, and serving. Th
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
Rust SBert Rust port of sentence-transformers using rust-bert and tch-rs. Supports both rust-tokenizers and Hugging Face's tokenizers. Supported model
Semantic text segmentation. For sentence boundary detection, compound splitting and more.
NNSplit A tool to split text using a neural network. The main application is sentence boundary detection, but e. g. compound splitting for German is a
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
rust-bert Rust native Transformer-based models implementation. Port of Hugging Face's Transformers library, using the tch-rs crate and pre-processing
π₯ Fast State-of-the-Art Tokenizers optimized for Research and Production
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility. Main features: Train new vocabularies and tok
π₯ Fast State-of-the-Art Tokenizers optimized for Research and Production
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility. Main features: Train new vocabularies and tok