32 Repositories
Rust arrow-datafusion Libraries
Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.
DataFusion Federation The goal of this repo is to allow DataFusion to resolve queries across remote query engines while pushing down as much compute a
🦖 Evolve your fixed length data files into Apache Arrow tables, fully parallelized!
🦖 Evolve your fixed length data files into Apache Arrow tables, fully parallelized! 🔎 Overview ... 📦 Installation The easiest way to install evolut
Arrow User-Defined Functions Framework on WebAssembly.
Arrow User-Defined Functions Framework on WebAssembly Example Build the WebAssembly module: cargo build --release -p arrow-udf-wasm-example --target w
Integration between arrow-rs and extendr
arrow_extendr arrow-extendr is a crate that facilitates the transfer of Apache Arrow memory between R and Rust. It utilizes extendr, the {nanoarrow} R
Go to Rust calls with Apache Arrow datatypes.
🔎 Overview alloy is a standalone Go module that enables calls to Rust code with Apache Arrow datatypes through its defined C data interface. The over
Work with bioinformatic files using Arrow, Polars, and/or DuckDB
biobear is a Python library designed for reading and searching bioinformatic file formats, using Rust as its backend and producing Arrow Batch Readers
Read specialized NGS formats as data frames in R, Python, and more.
oxbow Read specialized bioinformatic file formats as data frames in R, Python, and more. File formats create a lot of friction for computational biolo
Fast regex in Rust for Apache Arrow, compiled to WASM
Rust regex in wasm I have been looking for a fast regular expression library in Javascript that runs on Apache Arrow for a few years. Arrow uses UTF-8
Cloud native log storage and management for Kubernetes, containerised workloads
Live Demo | Website | API Workspace on Postman Parseable is an open source, cloud native, log storage and management platform. Parseable helps you ing
A user-friendly TUI for secure file transfers, with arrow-key and VIM-style navigation
gsftp SFTP with an interactive text-based user interface (TUI). Transfer files through an encrypted connection with a visual interface, so you can see
Polars is a blazingly fast DataFrames library implemented in Rust using Apache Arrow Columnar Format as memory model.
Polars Python Documentation | Rust Documentation | User Guide | Discord | StackOverflow Blazingly fast DataFrames in Rust, Python & Node.js Polars is
Control a now blue circle with wasd/arrow keys (intense gameplay)
bevy-moving-circle A yellow circle player controlled with arrow keys/WASD Now you can click to spawn orange cicles lmao (intense action packed gamepla
Arrowdantic is a small Python library backed by a mature Rust implementation of Apache Arrow
Welcome to arrowdantic Arrowdantic is a small Python library backed by a mature Rust implementation of Apache Arrow that can interoperate with Parquet
Terminal based, feature rich, interactive SQL tool
datafusion-tui (dft) DataFusion-tui provides a feature rich terminal application, built with tui-rs, for using DataFusion (and eventually Ballista). I
Rust-based WebAssembly bindings to read and write Apache Parquet files
parquet-wasm WebAssembly bindings to read and write the Parquet format to Apache Arrow. This is designed to be used alongside a JavaScript Arrow imple
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Apache Arrow Powering In-Memory Analytics Apache Arrow is a development platform for in-memory analytics. It contains a set of technologies that enabl
An experimental implementation of Arc against Apache Datafusion
box This is an experimental repository to perform a proof of concept replacement of the Apache Spark executor for Arc with Apache DataFusion. This is
Create full-fledged APIs for static datasets without writing a single line of code.
ROAPI ROAPI automatically spins up read-only APIs for static datasets without requiring you to write a single line of code. It builds on top of Apache
Benchmarks to read parquet to arrow
Parquet benchmarks This repository contains a set of benchmarks of different implementations of Parquet (storage format) - Arrow (in-memory format).
Generated Ryst of Apache Arrow spec
Arrow generated IPC format The generated flatbuffers code for Rust. Note that these files suffered modifications because flatbuffers is unable to comp
This crate allows writing a struct in Rust and have it derive a struct of arrays layed out in memory according to the arrow format.
Arrow2-derive - derive for Arrow2 This crate allows writing a struct in Rust and have it derive a struct of arrays layed out in memory according to th
Fill Apache Arrow record batches from an ODBC data source in Rust.
arrow-odbc Fill Apache Arrow arrays from ODBC data sources. This crate is build on top of the arrow and odbc-api crate and enables you to read the dat
Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow
Parquet2 This is a re-write of the official parquet crate with performance, parallelism and safety in mind. The five main differentiators in compariso
Apache Arrow DataFusion and Ballista query engines
DataFusion is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
Official Rust implementation of Apache Arrow
Native Rust implementation of Apache Arrow Welcome to the implementation of Arrow, the popular in-memory columnar format, in Rust. This part of the Ar
A Rust crate for writing servers that speak PostgreSQL's wire protocol
Convergence A Rust crate for writing servers that speak PostgreSQL's wire protocol. Additionally, the experimental convergence-arrow crate enables con
A Rust DataFrame implementation, built on Apache Arrow
Rust DataFrame A dataframe implementation in Rust, powered by Apache Arrow. What is a dataframe? A dataframe is a 2-dimensional tabular data structure
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Ballista: Distributed Compute Platform Overview Ballista is a distributed compute platform primarily implemented in Rust, powered by Apache Arrow. It
transmute-free Rust library to work with the Arrow format
Arrow2: Transmute-free Arrow This repository contains a Rust library to work with the Arrow format. It is a re-write of the official Arrow crate using
Apache Arrow in WebAssembly
WASM Arrow This package compiles the Rust library of Apache Arrow to WebAssembly. This might be a viable alternative to the pure JavaScript library. R
Command line tool for inspecting Parquet files
pqrs pqrs is a command line tool for inspecting Parquet files This is a replacement for the parquet-tools utility written in Rust Built using the Rust
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Ballista: Distributed Compute Platform Overview Ballista is a distributed compute platform primarily implemented in Rust, powered by Apache Arrow. It