395 Repositories
Rust data-engineering Libraries
Stdto provides a set of functional traits for conversion between various data representations.
Stdto stdto provides a set of functional traits for conversion between various data representations. | Examples | Docs | Latest Note | stdto = "0.13.0
A simple and powerful IP architecture in Rust.
ipzone Ipzone provides a simple and powerful IP architecture to Rust. | Examples | Docs | Latest Note | ipzone = "0.4.0" Examples use ipzone::prelude:
A reverse-engineering of Speedball 2 for the Sega Megadrive
Reverse engineering of Speedball 2 for Sega Megadrive (Genesis) This repo is a reverse engineering of Speedball 2, using Ghidra to reverse the assembl
Python+Rust implementation of the Probabilistic Principal Component Analysis model
Probabilistic Principal Component Analysis (PPCA) model This project implements a PPCA model in Rust for Python using pyO3 and maturin. In
Pure Rust library for reading / writing DNG files providing access to the raw data in a zero-copy friendly way.
DNG-rs A pure Rust library for reading / writing DNG files providing access to the raw data in a zero-copy friendly way. Also containing code for re
My solutions to Advent of Code 2022.
advent-of-code-2022 Overview This repository contains my solutions to Advent of Code 2022. I decided to take this year as an opportunity to learn R
Displaying data for the Soroban Futurenet.
Soroban Fiddle https://leighmcculloch.github.io/soroban-fiddle Web frontend-only application that displays data on the Soroban Futurenet network. Feat
A simple key-value store with a log-structured, append-only storage architecture where data is encrypted with AES GCM.
akvdb A simple key-value store with a log-structured, append-only storage architecture where data is encrypted with AES GCM. Modified from the actionk
A fast and robust MLOps tool for managing data and pipelines
xvc A Fast and Robust MLOps Swiss-Army Knife in Rust When to use xvc? Machine Learning Engineers: When you manage large quantities of unstructured d
Rusty flow graph processing library
flowing flowing is a flow graph processing library written in Rust. It shall serve as a general-purpose building block for all kinds of dataflow progr
An apocalypse-resistant data storage format for the truly paranoid.
Carbonado An apocalypse-resistant data storage format for the truly paranoid. Designed to keep encrypted, durable, compressed, provably replicated con
rust_aads - Rust Algorithms And Data Structures
rust_aads - Rust Algorithms And Data Structures rust_aads is an open repository with algorithms and data structures, used in computer science and comp
A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms
cozo A general-purpose, transactional, relational database that uses Datalog for query and focuses on graph data and algorithms. Features Relational d
A library for spatial partitioning of 3D data.
Eightfold A library for spatial partitioning of 3D data. Built with nalgebra. Not yet fit for actual use; wait until 1.0.0. Feature Flags spatial :: [
A dead-simple tool for working with data in Kafka
ktool - a tool for Kafka ktool is a dead-simple tool for working with data in Kafka: Copy partitions / topics to disk Replay messages Inspect message
UniSBOM is a tool to build a software bill of materials on any platform with a unified data format.
UniSBOM is a tool to build a software bill of materials on any platform with a unified data format. Work in progress Support MacOS Uses system_profile
Haylou Smart Watch 2 (LS02) reverse-engineering project
Haywatch Haywatch Hello Haylou Watch features Device communication General command structure Pairing Unpairing Battery Firmware Date and time Pulses U
Simple utility to backup/sync data between devices to the cloud
RSink Simple utility to backup/sync data between devices to the cloud Features: Blazingly Fast; Lightweight; Cross-platform, runs everywhere; S
Convert rust coverage data to HTML reports, LCOV files or terminal tables
Coverage Prepare Convert coverage data to HTML reports, LCOV files or terminal tables. coverage-prepare --help: Convert "profraw" coverage data to: *
Databento Binary Encoding (DBZ) - Fast message encoding and storage format for market data
dbz A library (dbz-lib) and CLI tool (dbz-cli) for working with Databento Binary Encoding (DBZ) files. Python bindings for dbz-lib are provided in the
Interesting protection concept I came up with.
Note If this repository is useful to you in any shape or form please give it a star. How it works This protection works by taking all the assembly
Library providing a simple API for Google Firestore to create, update, query, stream, and listen to data
Firestore for Rust Library provides a simple API for Google Firestore: Create or update documents using Rust structures and Serde; Support for queryin
Parse RISC-V opcodes to provide more detailed structured data
riscv-opcodes-parser Parse RISC-V opcodes to provide more detailed structured data. License Licensed under either of Apache License, Version 2.0 (LICE
Concurrent and multi-stage data ingestion and data processing with Rust+Tokio
TokioSky Build concurrent and multi-stage data ingestion and data processing pipelines with Rust+Tokio. TokioSky allows developers to consume data eff
A fast and simple in-memory database with a key-value data model written in Rust
Segment Segment is a simple & fast in-memory database with a key-value data model written in Rust. Features Dynamic keyspaces Keyspace level control o
The powerful analysis platform to explore and visualize data from blockchain.
Mars: The powerful analysis platform to explore and visualize data from Web3 Features Blazing Fast Create from scratch with Rust. Pipeline Processor E
A library and application for lossless, format-preserving, two-pass optimization and repair of Vorbis data, reducing its size without altering any audio information.
OptiVorbis A library and application for lossless, format-preserving, two-pass optimization and repair of Vorbis data, reducing its size without alter
Plow - The ontology package manager
Plow - Ontology package manager Plow is a package management solution for OWL ontologies, with support for specifying dependencies between packages via
qsv: Ultra-fast CSV data-wrangling toolkit
qsv is a command line program for indexing, slicing, analyzing, splitting, enriching, validating & joining CSV files. Commands are simple, fast & composable
A data compatibility project for Minecraft's assets
Data Compat MC Data tool for the Falcon MC project. Description DataCompatMC is a cli tool designed to help deal with the many different versions of M
ReadySet is a lightweight SQL caching engine written in Rust that helps developers enhance the performance and scalability of existing applications.
ReadySet is a SQL caching engine designed to help developers enhance the performance and scalability of their existing database-backed applications. W
OBKV Table Client is a Rust library that can be used to access table data from the OceanBase storage layer.
OBKV Table Client is a Rust library that can be used to access table data from the OceanBase storage layer. Unlike JDBC, it skips the SQL parsing layer, which gives it a significant performance advantage.
Rust libraries for working with GPT (GUID Partition Table) disk data
gpt-disk-rs no_std libraries related to GPT (GUID Partition Table) disk data. There are three Rust packages in this repository: uguid The uguid packag
A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure. (Python wrapper for daachorse)
python-daachorse daachorse is a fast implementation of the Aho-Corasick algorithm using the compact double-array data structure. This is a Python wrap
PRQL is a modern language for transforming data: a simpler and more powerful SQL
PRQL Pipelined Relational Query Language, pronounced "Prequel". PRQL is a modern language for transforming data: a simpler and more powerful SQL. Lik
Seed your development database with real data ⚡️
Seed Your Development Database With Real Data ⚡️ Replibyte is a blazingly fast tool to seed your databases with your production data while keeping sen
Jsonptr - Data structures and logic for resolving, assigning, and deleting by JSON Pointers
jsonptr - JSON Pointers for Rust Data structures and logic for resolving, assigning, and deleting by JSON Pointers (RFC 6901). Usage Resolve JSON Poin
Sort (key, value) data sets that don't fit in memory
kv-par-merge-sort Key-Value Parallel Merge Sort Sort Pod (key, value) data sets that don't fit in memory. This crate provides the kv_par_merge_sort li
Steggy CLI Tool - hides data within the least significant bit of an image
Written in Rust, features a simple cli and a client-side webapp. This tool hides data within the least significant bit of an image. Obfuscation techniques are utilized to make the
Library with support for de/serialization, parsing and executing on data-structures and network messages related to Bitcoin
Rust Bitcoin Library with support for de/serialization, parsing and executing on data-structures and network messages related to Bitcoin. Heads up for
Noria: data-flow for high-performance web applications
Noria: data-flow for high-performance web applications Noria is a new streaming data-flow system designed to act as a fast storage backend for read-he
Vector is a high-performance, end-to-end (agent & aggregator) observability data pipeline that puts you in control of your observability data
What is Vector? Vector is a high-performance, end-to-end (agent & aggregator) observabilit
The Declarative Data Generator
The Declarative Data Generator Synth is a tool for generating realistic data using a declarative data model. Synth is database agnostic and can scale
User-friendly secure computation engine based on secure multi-party computation
CipherCore If you have any questions, or, more generally, would like to discuss CipherCore, please join the Slack community. See a vastly extended ver
Generic and extensible egui widgets to create analog synthesizer-like UI with data-oriented API
egui_cable A generic and extensible data-oriented widget for connecting ports by cables. I created this for the visual programming editor of Hihaheho/D
NSE is a Rust CLI binary and library for extracting real-time data from National Stock Exchange (India)
NSE Check out the sister projects NsePython and SaveKiteEnctoken which are Python & Javascript libraries to use the NSE and Zerodha APIs respectively
A HashMap/Vector hybrid: efficient, ordered key-value data storage in Rust.
hashvec A HashVec is a hash map / dictionary whose key-value pairs are stored (and can be iterated over) in a fixed order, by default the order in whi
An app which reads data from a serial port and serves it on a TCP port.
serial-to-tcp An app which reads data from a serial port and serves it on a TCP port. How to use Clone this repo and build the app as outlined below (
Turns running Rust code into a serializable data structure.
WasmBox WasmBox turns running Rust code into a serializable data structure. It does this by compiling it to WebAssembly and running it in a sandbox. T
Map the Teenage Engineering OP-1 MIDI output to keyboard commands
OP1NPUT Maps the Teenage Engineering OP-1's MIDI output to keyboard keypresses so it may be used as a game controller. This exists because many of the
Convert perf.data files to the Firefox Profiler format
fxprof-perf-convert A converter from the Linux perf perf.data format into the Firefox Profiler format, specifically into the processed profile format.
Modular FFXIV data toolkit written in rust.
ironworks Modular FFXIV data toolkit written in rust. ironworks is pre-1.0, and as such its API should be considered unstable. Breaking API changes wi
Plugin for macro-, mini-quad (quads) to save data in simple local storage, using the Web Storage API in WASM and a local file on native platforms.
quad-storage This is the crate to save data in persistent local storage in miniquad/macroquad environment. In WASM the data persists even if tab or br
Generates a unique hash/identifier for a system given a set of parameters.
uniqueid Generates a unique hash/identifier for a system given a set of parameters. Example usage use uniqueid; pub fn main() { let data = vec
Parsing and encoding of data using the SCTE-35 standard.
SCTE-35 lib and parser for Rust Work in progress! This library provides parsing and encoding of data using the SCTE-35 standard. This standard
Collect cloud usage data, so that it can be combined with impact data of Boavizta API.
cloud-scanner Collect AWS cloud usage data, so that it can be combined with impact data of Boavizta API. Very early work in progress! At the moment
A SOAP client for Brazilian Central Bank's Time Series Management System
Export MacOS iMessage data + run iMessage Diagnostics
imessage-export This crate provides both a library to interact with iMessage data as well as a binary that can perform some useful read-only operation
An Intel PT trace converter from `perf.data` to Fuchsia trace format.
Introduction Recent Intel processors feature the "Intel Processor Trace" feature, which can be used to capture the full execution trace of a program.
Make data-driven table rendering easy with Dioxus
Dioxus Table Make data-driven table rendering easy with Dioxus Installation Until the next release of Dioxus this requires Dioxus nightly from git. Th
Unicorn Emulator Debug Server - Written in Rust, with bindings of C, Go, Java and Python
udbserver - Unicorn Emulator Debug Server When you do emulation with Unicorn Engine, do you want to inspect the inner state during every step? udbserv
A parser for the perf.data format
linux-perf-data This repo contains a parser for the perf.data format which is output by the Linux perf tool. It also contains a main.rs which acts sim
A prefix tree (trie) is a data structure that allows you to store an associative array whose keys are strings
RadixTrie A prefix tree (trie) is a data structure that allows you to store an associative array whose keys are strings. It is a root tree, each edge
NixEl is a Rust library that turns Nix code into a variety of correct, typed, memory-safe data-structures
NixEL Lexer, Parser, Abstract Syntax Tree and Concrete Syntax Tree for the Nix Expressions Language. NixEl is a Rust library that turns Nix code in
A Rust library for creating and managing logs of arbitrary binary data
A Rust library for creating and managing logs of arbitrary binary data. It is presently used to collect sensor data, but it should generally be helpful wherever you need to store time-series data in a nearly (but not strictly) append-only fashion.
Compute a hash of data with WebAssembly
wasm-hasher Computes a hash of data with WebAssembly. Supports md5, sha1, sha2-224, sha2-256, sha2-384, sha2-512, sha3-224, sha3-256, sha3-384, sha3-512, china-sm3 typ
BTM is an incremental data backup mechanism that does not require downtime.
BTM Blockchain Time Machine. BTM is an incremental data backup mechanism that does not require downtime.
Extract React Native app source code from an APK file.
extract-myreact Extract React Native app source code from an APK file.
Provides a mechanism to lay out data into GPU buffers according to WGSL's memory layout rules
Provides a mechanism to lay out data into GPU buffers ensuring WGSL's memory layout requirements are met. Features supports all WGSL host-shareable ty
Rest API to check if a password is in a data breach
easypwned (haveibeenpwned / HIBP) Rest API to check if a password is in a data breach. Works offline - everything stays on your machine! Database is i
Fast, efficient, and robust memory reclamation for concurrent data structures
Seize Fast, efficient, and robust memory reclamation for concurrent data structures. Introduction Concurrent data structures are faced with the proble
A lightweight Rust library for BitVector Rank&Select operations, coupled with a generic Sparse Array implementation
Focus Annotator - a tool for annotating the focal plane of a part of an image
Focus Annotator Focus Annotator is a tool for annotating the focal plane of a part of an image. It is a tool I built in Rust as part of my master's th
A WGPU graphics pipeline, along with simple types used to marshal data to the GPU
renderling This library is a collection of WGPU render pipelines. Shaders are written in GLSL. shaderc is used to compile shaders to SPIR-V. Defini
sblade (switchblade) is an all-in-one multitool capable of simple analysis of any type of data, intended to speed up ethical hacking activities
Dade is data definition for Rust structures.
dade dade is data definition for Rust structures. For easy handling of data, it supports the following: Data validation. Data schema conforms Js
postgres-ical - a PostgreSQL extension that adds features related to parsing RFC-5545 « iCalendar » data from within a PostgreSQL database
Recursive & Iterative Binary Search Tree Implementations within Rust
bst-rs Recursive & Iterative Binary Search Tree Implementations within Rust Table of Contents Personal Goals About Quick Start License Contributing In
Code for connecting an RP2040 to a Bosch BNO055 IMU and having the realtime orientation data be sent to the host machine via serial USB
Code for connecting an RP2040 (via Raspberry Pi Pico) to a Bosch BNO055 IMU (via an Adafruit breakout board) and having the realtime orientation data be sent to the host machine via serial USB.
Open Data Access Layer that connects the whole world together
OpenDAL Open Data Access Layer that connects the whole world together. Status OpenDAL is in alpha stage and has been adopted early by Databend. Welcome
Databend aims to be an open-source, elastic, and reliable serverless data warehouse
An elastic and reliable serverless data warehouse that offers blazing-fast queries and combines the elasticity, simplicity, and low cost of the cloud, built to make the Data Cloud easy
An (unofficial) Rust library for querying db-ip.com data
db_ip An (unofficial) library for querying db-ip.com CSV databases in safe Rust. This library is not affiliated with or endorsed by db-ip.com. Be advi
Rust data structures and client for the PubChem REST API
pubchem.rs Rust data structures and client for the PubChem REST API. Usage Compound Create a Compound to query the PubChem API for a single comp
Library containing implementations of various sequential data-structures.
Bevy is a refreshingly simple data-driven game engine built in Rust
What is Bevy? Bevy is a refreshingly simple data-driven game engine built in Rust. It is free and open-source forever! WARNING Bevy is still in the ve
Gyroflow is an application that can stabilize your video by using motion data from a gyroscope and optionally an accelerometer
Gyroflow is an application that can stabilize your video by using motion data from a gyroscope and optionally an accelerometer. Modern cameras record that data internally (GoPro, Sony, Insta360, etc.), and this application uses it to stabilize the captured footage precisely. It can also use gyro data from an external source (e.g. from Betaflight blackbox).
The Data Highway Substrate-based blockchain node.
DataHighway-Parachain, a parachain on the Polkadot network. Planned features include a decentralized LPWAN roaming hub for LoRaWAN IoT devices and network operator roaming agreements, participative mining, an inter-chain data market, and DAO governance.
SQLite compiled to WASM with pluggable data storage
wasm-sqlite SQLite compiled to WASM with pluggable data storage. Useful to save SQLite in e.g. Cloudflare Durable Objects (example: https://github.com
Turnstile encrypts data so that it can only be decrypted on another computer
Turnstile - One Way Encryption Turnstile encrypts data so that it can only be decrypted on another computer (and can't be decrypted on the encrypting
Rust crate for embedding, manipulating and retrieving data embedded in binaries using linker sections
linkstore is a library that allows you to define global variables in your final compiled binary that can be modified post-compilation.
frawk is a small programming language for writing short programs processing textual data
frawk frawk is a small programming language for writing short programs processing textual data. To a first approximation, it is an implementation of t
hexyl is a simple hex viewer for the terminal. It uses a colored output to distinguish different categories of bytes
hexyl is a simple hex viewer for the terminal. It uses a colored output to distinguish different categories of bytes (NULL bytes, printable ASCII characters, ASCII whitespace characters, other ASCII characters and non-ASCII).
stringsext - search for multi-byte encoded strings in binary data
stringsext - search for multi-byte encoded strings in binary data stringsext is a Unicode enhancement of the GNU strings tool with additional fu
Kepler is a vulnerability database and lookup store and API currently utilising National Vulnerability Database and NPM Advisories as data sources
Kepler Kepler is a vulnerability database and lookup store and API currently utilising National Vulnerability Database and NPM Advisories as data so
Dataflow is a data processing library, primarily for machine learning
Dataflow Dataflow is a data processing library, primarily for machine learning. It provides efficient pipeline primitives to build a directed acyclic
Rust port of Ghidra's SLEIGH decompiler
Rust port of Ghidra's SLEIGH decompiler. This library allows you to decompile or translate machine code for multiple architectures.
A Rust application that inserts Discogs data dumps into Postgres
Discogs-load A Rust application that inserts Discogs data dumps into Postgres. Discogs-load uses a simple state machine with the quick-xml Rust librar
A tool using binrw to read FFXIV data files
last-legend-dob A tool using binrw to read FFXIV data files. Mainly made to harvest the music from the game files for personal consumption, since it t
An implementation of a predicative polymorphic language with bidirectional type inference and algebraic data types
Vanilla Lang Vanilla is a pure functional programming language based on System F, a classic but powerful type system. Merits Simple as it is, Vanilla