370 Repositories
Rust synthetic-data Libraries
Dynamically get the suggested clusters in the data for unsupervised learning.
Python implementation of the Gap Statistic Purpose Dynamically identify the suggested number of clusters in a data-set using the gap statistic. Full e
Barnes-Hut t-SNE implementation written in Rust.
bhtsne Barnes-Hut implementation of t-SNE written in Rust. The algorithm is described with fine detail in this paper by Laurens van der Maaten. Instal
Orkhon: ML Inference Framework and Server Runtime
Orkhon: ML Inference Framework and Server Runtime What is it? Orkhon is a Rust framework for Machin
A Modern Real-Time Data Processing & Analytics DBMS with Cloud-Native Architecture, written in Rust
Datafuse Modern Real-Time Data Processing & Analytics DBMS with Cloud-Native Architecture Datafuse is a Real-Time Data Processing & Analytics DBMS wit
Simple but powerful graph library for Rust
Graphlib Graphlib is a simple and powerful Rust graph library. This library attempts to provide a generic api for building, mutating and iterating ove
Graph data structure library for Rust.
petgraph Graph data structure library. Supports Rust 1.41 and later. Please read the API documentation here Crate feature flags: graphmap (default) en
GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba
A One-Stop Large-Scale Graph Computing System from Alibaba GraphScope is a unified distributed graph computing platform that provides a one-stop envir
DataFrame / Series data processing in Rust
black-jack While PRs are welcome, the approach taken only allows for concrete types (String, f64, i64, ...) I'm not sure this is the way to go. I want
SegVec data structure for rust. Similar to Vec, but allocates memory in chunks of increasing size.
segvec This crate provides the SegVec data structure. It is similar to Vec, but allocates memory in chunks of increasing size, referred to as "segment
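The idea of growing in segments rather than reallocating one buffer can be sketched with the standard library alone. The following is a minimal illustration, not the segvec crate's API or implementation: the `SegmentedVec` type, its method names, and the doubling rule (segment `i` holds `2^i` elements) are all assumptions made for this example.

```rust
/// Hypothetical, simplified segmented vector (not the `segvec` crate's API).
/// Segment `i` is allocated once with capacity 2^i and is never reallocated,
/// so elements that were already pushed stay where they are as the
/// collection grows.
pub struct SegmentedVec<T> {
    segments: Vec<Vec<T>>,
    len: usize,
}

impl<T> SegmentedVec<T> {
    pub fn new() -> Self {
        Self { segments: Vec::new(), len: 0 }
    }

    pub fn len(&self) -> usize {
        self.len
    }

    /// Number of segments allocated so far.
    pub fn segment_count(&self) -> usize {
        self.segments.len()
    }

    /// Maps a flat index to (segment, offset within that segment).
    /// Segments 0..s hold 2^s - 1 elements in total, so the segment of
    /// `index` is floor(log2(index + 1)).
    fn locate(index: usize) -> (usize, usize) {
        let n = index + 1;
        let seg = (usize::BITS - 1 - n.leading_zeros()) as usize;
        (seg, n - (1 << seg))
    }

    pub fn push(&mut self, value: T) {
        let (seg, _) = Self::locate(self.len);
        if seg == self.segments.len() {
            // Allocate the next, larger segment; older segments are untouched.
            self.segments.push(Vec::with_capacity(1 << seg));
        }
        self.segments[seg].push(value);
        self.len += 1;
    }

    pub fn get(&self, index: usize) -> Option<&T> {
        if index >= self.len {
            return None;
        }
        let (seg, off) = Self::locate(index);
        self.segments[seg].get(off)
    }
}
```

Under this doubling rule, pushing ten elements allocates four segments with capacities 1, 2, 4 and 8. The real crate may choose segment sizes differently.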
the file filesystem: mount semi-structured data (like JSON) as a Unix filesystem
ffs: the file filesystem ffs, the file filesystem, lets you mount semi-structured data as a filesystem---a tree structure you already know how to work
🐸Slippi DB ingests Slippi replays and puts the data into a SQLite database for easier parsing.
The primary goal of this project is to make it easier to analyze large amounts of Slippi data. Its end goal is to create something similar to Ballchasing.com but for Melee.
Lossless compressor and decompressor for numerical data using quantiles
This Rust library compresses and decompresses sequences of numerical data very well. It currently supports the following data types: i32, i64, u32, u64, f32, f64. Smaller data types like i16 can be efficiently compressed by casting to i32. Timestamp support may come soon.
Data structures and algorithms for 3D geometric modeling.
geom3d Data structures and algorithms for 3D geometric modeling. Features: Bezier curve and surface B-Spline curve and surface Spin surface Sweep surf
Rust library for string parsing of basic data structures.
afmt Simple Rust library for parsing basic data structures from strings. Usage You can specify string formats to any structure, via the use of the fmt
😎 A CLI tool for fetching data and displaying it nicely
😎 A CLI tool for fetching data and displaying it nicely. (The neofetch for remote data fetching)
Uindex is a data store, for data that can be parsed as sentences in some context-free language.
Uindex - Universal index Uindex is a data store, for data that can be parsed as sentences in some context-free language.
Simple Data Stealer
helfsteal Simple Data Stealer Hi All, I published basic data stealer malware with Rust. FOR EDUCATIONAL PURPOSES. You can use it for Red Team operatio
Quickwit is a big data search engine.
Quickwit This repository will host Quickwit, the big data search engine developed by Quickwit Inc. We will progressively polish and opensource our cod
A proof of concept implementation of cyclic data structures in stable, safe, Rust.
A proof of concept implementation of cyclic data structures in stable, safe, Rust. This demonstrates the combined power of the static-rc crate and the
Data-oriented and data-driven game engine written in Rust
What is Amethyst? Amethyst is a data-driven and data-oriented game engine aiming to be fast and as configurable as possible. Principles These principl
A refreshingly simple data-driven game engine built in Rust
What is Bevy? Bevy is a refreshingly simple data-driven game engine built in Rust. It is free and open-source forever! WARNING Bevy is still in the ve
OpenStreetMap flatdata format and compiler
osmflat Flat OpenStreetMap (OSM) data format providing an efficient random data access through memory mapped files. The data format is described and i
A performant binary encoding for geographic data based on flatbuffers
FlatGeobuf A performant binary encoding for geographic data based on flatbuffers that can hold a collection of Simple Features including circular inte
Rust read/write support for GPS Exchange Format (GPX)
gpx gpx is a library for reading and writing GPX (GPS Exchange Format) files. It uses the primitives provided by geo-types to allow for storage of GPS
Zero-Copy reading and writing of geospatial data.
GeoZero Zero-Copy reading and writing of geospatial data. GeoZero defines an API for reading geospatial data formats without an intermediate represent
Spatial Data Structures for Rust
spade Spade (SPAtial DatastructurEs, obviously!) implements a few nifty data stru
An advanced geospatial data analysis platform
Bringing the power of Whitebox GAT to the world at large This page is related to the stand-alone command-line program and Python scripting API for geo
An implementation of the FP-Growth algorithm in pure Rust.
fp-growth-rs An implementation of the FP-Growth algorithm in pure Rust, which is inspired by enaeseth/python-fp-growth. Usage Add this to your Cargo.t
Proof-of-concept for a memory-efficient data structure for zooming billion-event traces
Proof-of-concept for a gigabyte-scale trace viewer This repo includes: A memory-efficient representation for event traces An unusually simple and memo
A Google-like web search engine that returns the websites most relevant to the user's query, using crawled and indexed textual data and PageRank.
Mini Google Course project for the Architecture of Computer Systems course. Overview: Architecture: We are working on multiple components of the web c
Rayon: A data parallelism library for Rust
Rayon Rayon is a data-parallelism library for Rust. It is extremely lightweight and makes it easy to convert a sequential computation into a parallel
A high-performance, high-reliability observability data pipeline.
What is Vector? Vector is a high-performance, end-to-end (agent & aggregator) observabilit
Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Cleora Cleora is a genus of moths in the family Geometridae. Their scientific name derives from the Ancient Greek geo γῆ or γαῖα "the earth", and metr
High-performance runtime for data analytics applications
Weld Documentation Weld is a language and runtime for improving the performance of data-intensive applications. It optimizes across libraries and func
A highly efficient daemon for streaming data from Kafka into Delta Lake
kafka-delta-ingest The kafka-delta-ingest project aims to build a highly efficient daemon for streaming data through Apache Kafka into Delta Lake. Thi
RiteLinked - LinkedHashMap & LinkedHashSet in Rust
RiteLinked -- HashMap-like containers that hold their key-value pairs in a user controllable order RiteLinked provides more up to date versions of Lin
Easily share data between terminal windows!
abra A tool that makes data sharing between terminal windows easy. abra can be used for displaying info about the current working directory, for split
A Rust application which funnels external webhook event data to an Urbit chat.
Urbit Webhook Funnel This is a simple Rust application which funnels external webhook event data to an Urbit chat. This application is intended to be
A Rust library to extract useful data from HTML documents, suitable for web scraping.
select.rs A library to extract useful data from HTML documents, suitable for web scraping. NOTE: The following example only works in the upcoming rele
High-performance log search engine.
NOTE: This project is under development, please do not depend on it yet as things may break. MinSQL MinSQL is a log search engine designed with simpli
A Rust library for zero-allocation parsing of binary data.
Zero A Rust library for zero-allocation parsing of binary data. Requires Rust version 1.6 or later (requires stable libcore for no_std). See docs for
A data-first Rust-native UI design toolkit.
Druid A data-first Rust-native UI toolkit. Druid is an experimental Rust-native UI toolkit. Its main goal is to offer a polished user experience. Ther
Rusty Object Notation
Rusty Object Notation RON is a simple readable data serialization format that looks similar to Rust syntax. It's designed to support all of Serde's da
pem (jcreekmore/pem-rs): a Rust-based way to parse and encode PEM-encoded data
pem A Rust library for parsing and encoding PEM-encoded data. Documentation Module documentation with examples Usage Add this to your Cargo.toml: [dep
rust-jsonnet - The Google Jsonnet (operation data template language) for Rust
rust-jsonnet Crate rust-jsonnet - The Google Jsonnet (operation data template language) for Rust Google Jsonnet document: (http://google.github.io
I/O and binary data encoding for Rust
nue A collection of tools for working with binary data and POD structs in Rust. pod is an approach at building a safe interface for transmuting POD st
A Rust drawing library for high-quality data plotting on both WASM and native targets, for static and real-time rendering 🦀 📈🚀
Plotters - A Rust drawing library focused on data plotting for both WASM and native applications 🦀 📈 🚀 Plotters is a drawing library designed for rende
Plotly for Rust
Plotly.rs Plotly for Rust A plotting library for Rust powered by Plotly.js. Usage Add this to y
Data plotting library for Rust
plotlib plotlib is a generic data visualisation and plotting library for Rust. It is currently in the very early stages of development. It can current
Parameterized routing for generic resources in Rust
Usher Usher provides an easy way to construct parameterized routing trees in Rust. The nodes of these trees are naturally generic, allowing Usher to le
Rust Persistent Data Structures
Rust Persistent Data Structures Rust Persistent Data Structures provides fully persistent data structures with structural sharing. Setup To use rpds a
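Structural sharing, as mentioned in the entry above, means that a new version of a structure reuses the unchanged parts of the old one instead of copying them. A minimal std-only sketch of the idea follows. It is not the rpds API: the `PersistentStack` type and its methods are hypothetical names chosen for illustration, and `Rc` is assumed as the sharing mechanism (single-threaded only).

```rust
use std::rc::Rc;

/// One immutable list node; `next` is shared between stack versions.
struct Node<T> {
    value: T,
    next: Option<Rc<Node<T>>>,
}

/// Hypothetical persistent stack (not the `rpds` API).
/// `push` and `pop` return a new stack and leave `self` unchanged.
pub struct PersistentStack<T> {
    head: Option<Rc<Node<T>>>,
    len: usize,
}

impl<T> PersistentStack<T> {
    pub fn new() -> Self {
        Self { head: None, len: 0 }
    }

    pub fn len(&self) -> usize {
        self.len
    }

    /// Returns a new stack whose tail is the whole of `self`.
    /// Only one node is allocated; cloning the `Rc` just bumps a counter.
    pub fn push(&self, value: T) -> Self {
        Self {
            head: Some(Rc::new(Node { value, next: self.head.clone() })),
            len: self.len + 1,
        }
    }

    /// Returns the stack without its top element, or `None` if empty.
    pub fn pop(&self) -> Option<Self> {
        self.head.as_ref().map(|node| Self {
            head: node.next.clone(),
            len: self.len - 1,
        })
    }

    pub fn peek(&self) -> Option<&T> {
        self.head.as_ref().map(|node| &node.value)
    }
}
```

Every earlier version stays valid after an update, which is the "fully persistent" property: after `let two = one.push(2)`, both `one` and `two` can still be read, and they share the node that holds `1`.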
A priority queue for Rust with efficient change function.
PriorityQueue This crate implements a Priority Queue with a function to change the priority of an object. Priority and items are stored in an IndexMap
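A queue whose priorities can be changed after insertion needs some way to find an item's current entry. The sketch below shows one std-only way to get that behaviour. It is deliberately not the crate's design (the entry above says the crate stores priorities and items in an IndexMap): here a `BTreeSet` ordered by priority is paired with a `HashMap` from item to priority, and the `ChangeablePriorityQueue` name and its methods are hypothetical.

```rust
use std::collections::{BTreeSet, HashMap};
use std::hash::Hash;

/// Hypothetical max-priority queue with updatable priorities
/// (a swapped-in technique, not the `priority-queue` crate's implementation).
pub struct ChangeablePriorityQueue<T, P> {
    /// Entries ordered by (priority, item); the last entry is the maximum.
    by_priority: BTreeSet<(P, T)>,
    /// Current priority of each item, used to locate its set entry.
    priorities: HashMap<T, P>,
}

impl<T: Ord + Hash + Clone, P: Ord + Clone> ChangeablePriorityQueue<T, P> {
    pub fn new() -> Self {
        Self { by_priority: BTreeSet::new(), priorities: HashMap::new() }
    }

    /// Inserts `item`, or changes its priority if it is already queued.
    /// Returns the previous priority, if any.
    pub fn push(&mut self, item: T, priority: P) -> Option<P> {
        let old = self.priorities.insert(item.clone(), priority.clone());
        if let Some(old_priority) = &old {
            // Drop the stale entry so the item appears only once.
            self.by_priority.remove(&(old_priority.clone(), item.clone()));
        }
        self.by_priority.insert((priority, item));
        old
    }

    /// Removes and returns the item with the highest priority.
    pub fn pop(&mut self) -> Option<(T, P)> {
        let (priority, item) = self.by_priority.pop_last()?;
        self.priorities.remove(&item);
        Some((item, priority))
    }

    /// Returns the highest-priority item without removing it.
    pub fn peek(&self) -> Option<(&T, &P)> {
        self.by_priority.last().map(|(priority, item)| (item, priority))
    }
}
```

In this sketch both `push` and `pop` cost O(log n), and calling `push` again for an existing item acts as the change function. The price is that each item and priority is stored twice, which is why `Clone` is required.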
Easy to use cryptographic framework for data protection: secure messaging with forward secrecy and secure data storage. Has unified APIs across 14 platforms.
Themis provides strong, usable cryptography for busy people General purpose cryptographic library for storage and messaging for iOS (Swift, Obj-C), An
Tools for concurrent programming in Rust
Crossbeam This crate provides a set of tools for concurrent programming: Atomics AtomicCell, a thread-safe mutable memory location.(no_std) AtomicCons
A library for generating fake data in Rust.
Fake A Rust library for generating fake data. Installation Default (rand is required): [dependencies] fake = "2.4" rand = "0.8" If you want to use #[d
A setuptools/wheel/cffi extension to embed binary data in wheels
Milksnake Milksnake is an extension for setuptools that allows you to distribute dynamically linked libraries in Python wheels in the most portable way im
Powerful database anonymizer with flexible rules. Written in Rust.
[Data]nymizer Powerful database anonymizer with flexible rules. Written in Rust. Datanymizer is created & supported by Evrone. What else we develop wi
Check Have I Been Pwned and see if it's time for you to change passwords.
checkpwn Check Have I Been Pwned and see if it's time for you to change passwords. Getting started Install: cargo install checkpwn Update: cargo inst
🔮 Futuristic take on hexdump, made in Rust.
hex (hx) Futuristic take on hexdump. hx accepts a file path as input and outputs a hexadecimal colorized view to stdout. $ hx tests/files/alphanumeric
A high-performance observability data pipeline.
What is Vector? Vector is a high-performance, end-to-end (agent & aggregator) observabili
svgcleaner helps you clean unnecessary data out of your SVG files.
svgcleaner svgcleaner helps you clean up your SVG files, keeping them free from unnecessary data. Table of Contents Purpose Goals Alternatives Charts
a tokio-enabled data store for triple data
terminusdb-store, a tokio-enabled data store for triple data Overview This library implements a way to store triple data - data that consists of a sub
Materialize simplifies application development with streaming data. Incrementally-updated materialized views - in PostgreSQL and in real time. Materialize is powered by Timely Dataflow.
Materialize is a streaming database for real-time applications. Get started Check out our getting started guide. About Materialize lets you ask questi
A fast data collector in Rust
Flowgger is a fast, simple and lightweight data collector written in Rust. It reads log entries over a given protocol, extracts them, decodes them usi
A system to programmatically run data pipelines
Factotum A DAG running tool designed for efficiently running complex jobs with non-trivial dependency trees. The zen of Factotum A Turing-complete job