Asynchronous CUDA, NPP and TensorRT for Rust.

Oddity.ai

Last update: Jun 19, 2023

Related tags

Machine learning async-cuda

Overview

Asynchronous CUDA, NPP and TensorRT

ℹ️ Introduction

The async-cuda family of libraries is an experimental set of libraries for interacting with the GPU asynchronously. Since the GPU is just another I/O device (from the point of view of your program), the async model actually fits surprisingly well.

The way it is implemented in async-cuda is that all operations are scheduled on a single runtime thread that drives the GPU. The interface of this library enforces that synchronization happens when it is necessary (and synchronization itself is also asynchronous).

The async-cuda project consists of:

async-cuda-core: CUDA core primitives such as streams and buffers.
async-cuda-npp: Common NPP operations such as resizing and cropping.
async-tensorrt: Minimal wrapper for TensorRT.

🛠 S️️tatus

This project is still a work-in-progress, and will contain bugs. Some parts of the API have not been flushed out yet. Use with caution.

⚠️ Safety warning

The async-cuda crates are intentionally unsafe. Due to the limitations of how async Rust currently works, usage of the async interface of this crate can cause undefined behavior in some rare cases. It is up to the user of this crate to prevent this from happening by following these rules:

No futures produced by functions in this crate may be leaked (either by std::mem::forget or otherwise).
Use a well-behaved runtime (one that will not forget your future) like Tokio or async-std.

Internally, the Future type in this crate schedules a CUDA call on a separate runtime thread. To make the API as ergonomic as possible, the lifetime bounds of the closure (that is sent to the runtime) are tied to the future object. To enforce this bound, the future will block and wait if it is dropped. This mechanism relies on the future being driven to completion, and not forgotten. This is not necessarily guaranteed. Unsafety may arise if either the runtime gives up on or forgets the future, or the caller manually polls the future, then forgets it.

License

Licensed under either of

Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Label Propagation Algorithm by Rust. Label propagation (LP) is graph-based semi-supervised learning (SSL). LGC and CAMLP have been implemented.

label-propagation-rs Label Propagation Algorithm by Rust. Label propagation (LP) is graph-based semi-supervised learning (SSL). A simple LGC and a mor

4 Sep 15, 2021

Asynchronous CUDA, NPP and TensorRT for Rust.

Related tags

Overview

Asynchronous CUDA, NPP and TensorRT

ℹ️ Introduction

🛠 S️️tatus

⚠️ Safety warning

License

Contribution

You might also like...

Tensors and differentiable operations (like TensorFlow) in Rust

A fast, safe and easy to use reinforcement learning framework in Rust.

Rust implementation of real-coded GA for solving optimization problems and training of neural networks

A real-time implementation of "Ray Tracing in One Weekend" using nannou and rust-gpu.

Tensors and dynamic neural networks in pure Rust.

A neural network, and tensor dynamic automatic differentiation implementation for Rust.

K-dimensional tree in Rust for fast geospatial indexing and lookup

Kalman filtering and smoothing in Rust

Label Propagation Algorithm by Rust. Label propagation (LP) is graph-based semi-supervised learning (SSL). LGC and CAMLP have been implemented.

Owner

Oddity.ai

A Rusty CUDA wrapper

Ecosystem of libraries and tools for writing and executing extremely fast GPU code fully in Rust.

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Robust and Fast tokenizations alignment library for Rust and Python

Narwhal and Tusk A DAG-based Mempool and Efficient BFT Consensus.

MesaTEE GBDT-RS : a fast and secure GBDT library, supporting TEEs such as Intel SGX and ARM TrustZone

[WIP] An experimental Java-like language and it's virtual machine, for learning Java and JVM.

Some hacks and failed experiments surrounding nvidia's gamestream protocol and sunshine/moonlight implementations

Msgpack serialization/deserialization library for Python, written in Rust using PyO3, and rust-msgpack. Reboot of orjson. msgpack.org[Python]

Distributed compute platform implemented in Rust, and powered by Apache Arrow.