218 Repositories
Rust text-generation Libraries
Nextest is a next-generation test runner for Rust.
nextest Nextest is a next-generation test runner for Rust. For more, check out the website. This repository contains the source code for: cargo-nextes
A rule based sentence segmentation library.
cutters A rule based sentence segmentation library. 🚧 This library is experimental. 🚧 Features Full UTF-8 support. Robust parsing. Language specific
Tantivy is a full text search engine library written in Rust.
Tantivy is a full text search engine library written in Rust. It is closer to Apache Lucene than to Elasticsearch or Apache Solr in the sense it is no
hj is a command line tool to convert HTTP/1-style text into JSON
hj hj is a command line tool to convert HTTP/1-style text into JSON. This command is inspired by yusukebe/rj, which is a standalone HTTP client that s
Terminal text styling via ANSI escape sequences.
Iridescent Features iridescent is a library for styling terminal text easily. It supports basic ANSI sequences, Xterm-256 colors, and RGB. You can ope
Ultra-fast, spookily accurate text summarizer that works on any language
pithy 0.1.0 - an absurdly fast, strangely accurate, summariser Quick example: pithy -f your_file_here.txt --sentences 4 --help: Print this help messa
A Rust wrapper for the Text synthesization service TextSynth API
A Rust wrapper for the Text synthesization service TextSynth API
Turns lines of text into SVG files.
Sentences 2 svg Does what it says on the tin. This takes in a file with some sentences and outputs numbered svgs. There are 3 arguments to keep in min
Deduplicating Training Data Makes Language Models Better
Deduplicating Training Data Makes Language Models Better This repository contains code to deduplicate language model datasets as descrbed in the paper
Rust library for of graph ensembles
Rust library for random graph ensembles Minimal Rust version: 1.55.0 Implements simple sampling and monte carlo (or rather markov-) steps, that can be
A lightweight full-text search library that provides full control over the scoring calculations
probly-search · A full-text search library, optimized for insertion speed, that provides full control over the scoring calculations. This start initia
Making Postgres and Elasticsearch work together like it's 2021
Making Postgres and Elasticsearch work together like it's 2021 Readme ZomboDB brings powerful text-search and analytics features to Postgres by using
A Text User Interface library for the Rust programming language
Cursive Cursive is a TUI (Text User Interface) library for rust. It uses ncurses by default, but other backends are available. It allows you to build
UNIC: Unicode and Internationalization Crates for Rust
UNIC: Unicode and Internationalization Crates for Rust https://github.com/open-i18n/rust-unic UNIC is a project to develop components for the Rust pro
Ginkgo is a text editor built entirely in Rust
Ginkgo is a text editor built entirely in Rust. It supports cursor movements, CTRL commands, select vim commands, insert vs. normal modes, and more. Ginkgo is based on my text editor JED, which itself was based on the popular online editor Kilo.
IntelliJ version of the Afterglow Sublime Text theme
Afterglow IntelliJ This theme for IntelliJ is based on the the Afterglow Sublime Text theme, and replaces the default sidebar icons and colour of Inte
A tool that generates a Sublime Text project file that helps you get started using Scoggle.
README A tool that generates a Sublime Text project file that helps you get started using Scoggle. While Scoggle-Gen may not find every single source
🎨 Procedurally generate 2D sprites
sprite (Executable) Run On Linux you need the gtk-rs dependencies to compile: cargo install sprite sprite This should produce the following window: s
Cryptographic Primitive Code Generation by Fiat
Fiat-Crypto: Synthesizing Correct-by-Construction Code for Cryptographic Primitives Building This repository requires Coq 8.11 or later. Note that if
dye is a tool to easily color text in shell.
Dye dye is a tool to easily color text in shell. Usage See the gif below to see these commands in action. echo $(dye --red WARN) This tool will knock
A Rust library for random number generation.
A Rust library for random number generation.
Cloud backup service for the Rite text editor
rite-cloud A cloud backup solution for the rite text editor. Written in Rust using Tide. Usage You can try out rite-cloud using the instance hosted he
My own personal code editor built with Rust + OpenGL
Glyph This is my personal code editor that I am building for fun and to get more familiar with OpenGL. Glyph currently supports Vim keybinds, syntax h
Makdown-like text parser.
Makdown-like text parser.
WriteForAll is a text file style checker, that compares text documents with editorial tips to make text better.
WriteForAll: tips to make text better WriteForAll is a text file style checker, that compares text documents with editorial tips to make text better.
Subtext is a text-based, block-oriented hypertext format.
Subtext: markup for note-taking Subtext is a text-based, block-oriented hypertext format. It is designed with note-taking in mind. It has a simple, pe
New generation decentralized data warehouse and streaming data pipeline
World's first decentralized real-time data warehouse, on your laptop Docs | Demo | Tutorials | Examples | FAQ | Chat Get Started Watch this introducto
Searching for plain-text files for lines that match a given string. Built with Rust.
Getting Started This is a minimal grep command-line utility built on Rust. It provides searching for plain-text files for lines that match a given str
Text-based to-do management CLI & language server
☑️ Todome (日本語版はこちら) Todome is a notation developed for making and editing to-do lists. It is inspired by Todo.txt, and in fact some of the todome not
Generate QR code easily for free - QR Code Generation as a Service.
QRcode.show Generate QR code easily for free - QR Code Generation as a Service. INPUT: curl qrcode.show/INPUT curl qrcode.show -d INPUT curl qrcode.sh
Next-GEN Confguration Template Generation Language
Sap lang yet another configuration oriented language name comes from Sapphire which is the birthstone of september Language Feature the last expr of t
lingua-rs Python binding. An accurate natural language detection library, suitable for long and short text alike.
lingua-py lingua-rs Python binding. An accurate natural language detection library, suitable for long and short text alike. Installation pip install l
Procedural-generation in Rust
Procedural Generation This is a crate for for procedurally generating maps written in Rust. It's very elegant to use and creates nice results, see the
Font independent text analysis support for shaping and layout.
lipi Lipi (Sanskrit for 'writing, letters, alphabet') is a pure Rust crate that provides font independent text analysis support for shaping and layout
Using the powers of Rust, Go and Dragonfly to make a vanilla-like world generation.
df-rs-gen — Dragonfly Rust Generator Using the powers of Rust, Go and Dragonfly to make a vanilla-like world generation. How to use Clone the repo. gi
Easy to use Rust i18n library based on code generation
rosetta-i18n rosetta-i18n is an easy-to-use and opinionated Rust internationalization (i18n) library powered by code generation. rosetta_i18n::include
Simplified glue code generation for Deno FFI libraries written in Rust.
deno_bindgen This tool aims to simplify glue code generation for Deno FFI libraries written in Rust. Quickstart # install CLI deno install -Afq -n den
🐎 Daac Horse: Double-Array Aho-Corasick in Rust
🐎 daachorse Daac Horse: Double-Array Aho-Corasick Overview A fast implementation of the Aho-Corasick algorithm using Double-Array Trie. Examples use
🔎 A simple in-memory search for collections and key-value stores.
Indicium Search 🔎 A simple in-memory search for collections (Vec, HashMap, BTreeMap, etc) and key-value stores. Features autocompletion. There are ma
Next-GEN Confguration Template Generation Language
Sap lang yet another configuration oriented language name comes from Sapphire which is the birthstone of september Language Feature the last expr of t
Blazingly fast data generation & seeding for MongoDB
Planter Blazingly fast and simple data generation & seeding for MongoDB Installation Use the package manager cargo to install planter. Add the followi
Generator of Firestore rules and type safe client code.
Generator of Firestore rules and type safe client code. Usage [WIP] Install from npm or curl. $ npm install -g firegen Setting your yml. # firegen.yml
A next-generation music player and manager
ouverture A next-generation music player and manager Very much at Work-In-Planning stage now Planned features GUI (localizable) Backend (something lik
CLI toolkit for GTD framework.
GTDF_Crabby CLI toolkit for GTD framework. How to use crabby 0. Parameters Crabby is a CLI toolkit and gets parameters as input. All the main options
colorStyle is a library of styles for command-line text write in Rust.
Colorstyle colorStyle is a library of styles for command-line text. Inspired by flylog/colorstyle (golang) Example let text = colorstyle::green("gre
better tools for text parsing
nom-text Goal: a library that extends nom to provide better tools for text formats (programming languages, configuration files). current needs Recogni
Swash is a pure Rust, cross-platform crate that provides font introspection, complex text shaping and glyph rendering.
Swash is a pure Rust, cross-platform crate that provides font introspection, complex text shaping and glyph rendering. Goals This crate aims to
full text search manpages
buke full text search manpages cargo run --release -- --build builds an sqlite3 database out of all manpages in your $MANPATH cargo run --release -- "
Source text parsing, lexing, and AST related functionality for Deno
Source text parsing, lexing, and AST related functionality for Deno.
The fastest way to identify any mysterious text or analyze strings from a file, just ask `lemmeknow` !
The fastest way to identify anything lemmeknow ⚡ Identify any mysterious text or analyze strings from a file, just ask lemmeknow. lemmeknow can be use
mdzk is a plain text Zettelkasten system that is based on the mdBook API.
mdzk A lovingly designed system and static publishing tool for your plain text Zettelkasten mdzk is a plain text Zettelkasten system that is based on
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
Untanglr Untanglr takes in a some mangled words and makes sense out of them so you dont have to. It goes through the input and splits it probabilistic
Vim-fork focused on extensibility and usability
Documentation | Chat | Twitter Neovim is a project that seeks to aggressively refactor Vim in order to: Simplify maintenance and encourage contributio
Arkworks bindings to Circom's R1CS, for Groth16 Proof and Witness generation in Rust.
ark-circom Arkworks bindings to Circom's R1CS, for Groth16 Proof and Witness generation in Rust.
A library that creates a terminal-like window with feature-packed drawing of text and easy input handling. MIRROR.
BearLibTerminal provides a pseudoterminal window with a grid of character cells and a simple yet powerful API for flexible textual output and uncompli
Termbox is a library that provides minimalistic API which allows the programmer to write text-based user interfaces.
Termbox is a library that provides minimalistic API which allows the programmer to write text-based user interfaces.
Sorta Text Format in UTF-8
STFU-8: Sorta Text Format in UTF-8 STFU-8 is a hacky text encoding/decoding protocol for data that might be not quite UTF-8 but is still mostly UTF-8.
Fast and easy random number generation.
alea A zero-dependency crate for fast number generation, with a focus on ease of use (no more passing &mut rng everywhere!). The implementation is bas
Fegeya Gretea (aka green tea), new generation programming language.
Fegeya Gretea Gretea (aka green tea), new generation programming language. A taste of Gretea's syntax: import tea.green.fmt module hello { fn hel
hck is a shortening of hack, a rougher form of cut.
🪓 hck A sharp cut(1) clone. hck is a shortening of hack, a rougher form of cut. A close to drop in replacement for cut that can use a regex delimiter
Experimental playground for wiktionary data
wikt Experimental playground for wiktionary data. This document might not update as often as the code does. Set up You'll want a minimum of 10 GB free
🔍TinySearch is a lightweight, fast, full-text search engine. It is designed for static websites.
tinysearch TinySearch is a lightweight, fast, full-text search engine. It is designed for static websites. TinySearch is written in Rust, and then com
A full-text search engine in rust
Toshi A Full-Text Search Engine in Rust Please note that this is far from production ready, also Toshi is still under active development, I'm just slo
A full-text search and indexing server written in Rust.
Bayard Bayard is a full-text search and indexing server written in Rust built on top of Tantivy that implements Raft Consensus Algorithm and gRPC. Ach
ReVi is a cross-platform terminal based Vim inspired text editor.
ReVi Table Of Contents: About Usage Install Clone && Installing Development Q&A KeyBindings Roadmap Changelog About ReVi is a cross-platform terminal
Kaolinite - A crate to assist in the creation of TUI text editors.
Kaolinite - A crate to assist in the creation of TUI text editors.
A crate using DeepSpeech bindings to convert mic audio from speech to text
DS-TRANSCRIBER Need an Offline Speech To Text converter? Records your mic, and returns a String containing what was said. Features Begins transcriptio
MASQ combines the benefits of VPN and Tor technology to create a superior next-generation privacy software, where users are rewarded for supporting an uncensored global web. Users gain privacy and anonymity online, while helping promote Internet Freedom.
MASQ Node Source The MASQ project was forked from Substratum's Node project in order to carry on development after Substratum ceased operations in Oct
Amp: A text editor for your terminal.
Amp: A text editor for your terminal. Heavily inspired by Vi/Vim. Amp aims to take the core interaction model of Vim, simplify it, and bundle in the e
Intuitive find & replace CLI (sed alternative)
sd - s[earch] & d[isplace] sd is an intuitive find & replace CLI. The Pitch Why use it over any existing tools? Painless regular expressions sd uses r
Pitch-perfect copy of map generation algorithm from Slay the Spire
sts_map_oracle Pitch-perfect copy of map generation algorithm from Slay the Spire Usage Prints out map layouts in console for given seed: sts_map_orac
syntect is a syntax highlighting library for Rust that uses Sublime Text syntax definitions.
syntect is a syntax highlighting library for Rust that uses Sublime Text syntax definitions. It aims to be a good solution for any Rust project that needs syntax highlighting, including deep integration with text editors written in Rust.
Text Renderer written in Rust using HarfBuzz for shaping, FreeType for rasterization and OpenGL for rendering.
Provok Text Renderer written in Rust using HarfBuzz for shaping, FreeType for rasterization and OpenGL for rendering. Input Provok is fed with a JSON
Rust read/write support for well-known text (WKT)
wkt Rust read/write support for well-known text (WKT). License Licensed under either of Apache License, Version 2.0 (LICENSE-APACHE or http://www.apac
Rust read/write support for well-known text (WKT)
wkt Rust read/write support for well-known text (WKT). License Licensed under either of Apache License, Version 2.0 (LICENSE-APACHE or http://www.apac
discord bot for uwuifying text
uwubot This crate defines a discord bot using serenity for uwuifying text via uwuify. Installation You can install uwubot from source or from crates.i
A tray icon to uwuify your text
uwu-tray fastest uwuifier in the west... now in your system tray for even faster uwufication! FAQ Which platforms are supported? Windows Linux MacOS H
fastest text uwuifier in the west
uwuify fastest text uwuifier in the west transforms Hey... I think I really love you. Do you want a headpat? into hey... i think i w-weawwy wuv you.
A fast implementation of Aho-Corasick in Rust.
aho-corasick A library for finding occurrences of many patterns at once with SIMD acceleration in some cases. This library provides multiple pattern s
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
nlprule A fast, low-resource Natural Language Processing and Error Correction library written in Rust. nlprule implements a rule- and lookup-based app
Semantic text segmentation. For sentence boundary detection, compound splitting and more.
NNSplit A tool to split text using a neural network. The main application is sentence boundary detection, but e. g. compound splitting for German is a
👄 The most accurate natural language detection library in the Rust ecosystem, suitable for long and short text alike
Table of Contents What does this library do? Why does this library exist? Which languages are supported? How good is it? Why is it better than other l
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
rust-bert Rust native Transformer-based models implementation. Port of Hugging Face's Transformers library, using the tch-rs crate and pre-processing
🦀 Example of serving deep learning models in Rust with batched prediction
rust-dl-webserver This project provides an example of serving a deep learning model with batched prediction using Rust. In particular it runs a GPT2 m
High-performance runtime for data analytics applications
Weld Documentation Weld is a language and runtime for improving the performance of data-intensive applications. It optimizes across libraries and func
Rust version of the Haskell ERD tool. Translates a plain text description of a relational database schema to dot files representing an entity relation diagram.
erd-rs Rust CLI tool for creating entity-relationship diagrams from plain text markup. Based on erd (uses the same input format and output rendering).
Text Expression Runner – Readable and easy to use text expressions
ter - Text Expression Runner ter is a cli to run text expressions and perform basic text operations such as filtering, ignoring and replacing on the c
A (self hosted) pastebin for easily sharing text right from the terminal
termpad termpad allows you to easily host a pastebin server for saving and viewing text right from the terminal, or the browser. Client Usage Assuming
bottom encodes UTF-8 text into a sequence comprised of bottom emoji
bottom encodes UTF-8 text into a sequence comprised of bottom emoji (with , sprinkled in for good measure) followed by 👉👈. It can encode any valid UTF-8 - being a bottom transcends language, after all - and decode back into UTF-8.
Web base text editor written in rust
Ultron Ultron is a web based monospace text-editor with syntax highlighting, completely written in rust. I wrote this code editor for my very specific
A small discord bot to archive the messages in a discord text channel.
discord-channel-archiver A small discord bot to archive the messages in a discord text channel. This is still WIP. The HTML and JSON modes are vaguely
Aggregatable Distributed Key Generation
Aggregatable DKG and VUF WARNING: this code should not be used in production! Implementation of Aggregatable Distributed Key Generation, a distributed
Rust library for hardware accelerated drawing of 2D shapes, images, and text, with an easy to use API.
Speedy2D Hardware-accelerated drawing of shapes, images, and text, with an easy to use API. Speedy2D aims to be: The simplest Rust API for creating a
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
Tantivy is a full text search engine library written in Rust. It is closer to Apache Lucene than to Elasticsearch or Apache Solr in the sense it is no
Multilingual implementation of RAKE algorithm for Rust
RAKE.rs The library provides a multilingual implementation of Rapid Automatic Keyword Extraction (RAKE) algorithm for Rust. How to Use Append rake to
Natural language detection library for Rust. Try demo online: https://www.greyblake.com/whatlang/
Whatlang Natural language detection for Rust with focus on simplicity and performance. Content Features Get started Documentation Supported languages
An efficient and powerful Rust library for word wrapping text.
Textwrap Textwrap is a library for wrapping and indenting text. It is most often used by command-line programs to format dynamic output nicely so it l
Generate PDF files with JavaScript and WASM (WebAssembly)
WASM-PDF Generates PDF files directly in the browser with JavaScript and WASM (WebAssembly). Idea here is to push all the work involved in creating a
An easy-to-use library for writing PDF in Rust
printpdf printpdf is a library designed for creating printable PDF documents. Crates.io | Documentation [dependencies] printpdf = "0.3.2" Features Cur
A complete harfbuzz's shaping algorithm port to Rust
rustybuzz rustybuzz is a complete harfbuzz's shaping algorithm port to Rust. Matches harfbuzz v2.7.0 Why? Because you can add rustybuzz = "*" to your