A repo for learning how to parallelize computations in the GPU using Apple's Metal, in Rust.

Lambdaclass

Last update: Feb 20, 2023

Related tags

Machine learning metal_playground

Overview

Metal playground in rust

Made for learning how to parallelize computations in the GPU using Apple's Metal, in Rust, via the metal crate.

Overview

The source code will contain documented examples of use, growing in complexity, aimed at a final objective of parallelize a Fast Fourier Transform algorithm.

dotprod: learn the basics of Metal and metal-rs, implement a simple dot product between two vectors represented as uint arrays.
matrixprod: a more complex example to learn about grid size and thread groups, implement a product between square matrices.
memory: example to show how to shared memory between CPU and GPU. This example simply creates a vector to be modified from the GPU.

Pre-requisites

XCode
Rust

To run the examples, use the following command:

cargo run --example {example}

To run the memory example some nightly features are needed so +nightly has to be added to the command above.

To re-build all the necessary .metallib files, you can use

make build_metal EXAMPLE={example}

where {example} is the name of the example to run in both commands.

References

Apple's Metal documentation: we recommend to start with "Performing Calculations on a GPU". Note that these docs are in Swift/Obj-C.
miniSTARK: A minimal STARK library built in Rust and gpu-accelerated with Metal.
metal-rs examples

Wonnx - a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web

Wonnx is a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web. Supported Platforms (enabled by wgpu) API Windows Linux &

354 Jan 6, 2023

A gpu accelerated (optional) neural network Rust crate.

Intricate A GPU accelerated library that creates/trains/runs neural networks in pure safe Rust code. Architechture overview Intricate has a layout ver

11 Dec 26, 2022

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

LLaMa 7b in rust This repo contains the popular LLaMa 7b language model, fully implemented in the rust programming language! Uses dfdx tensors and CUD

16 May 8, 2023

A fun, hackable, GPU-accelerated, neural network library in Rust, written by an idiot

Tensorken: A Fun, Hackable, GPU-Accelerated, Neural Network library in Rust, Written by an Idiot (work in progress) Understanding deep learning from t

44 May 6, 2023

Signed distance functions + Rust (CPU & GPU) = ❤️❤️

sdf-playground Signed distance functions + Rust (CPU & GPU) = ❤️❤️ Platforms: Windows, Mac & Linux. About sdf-playground is a demo showcasing how you

5 Nov 16, 2023

rust-gpu CLI driver

rust-gpu-driver Experiment to make rust-gpu more accessible as a GPU shading language in various projects. DISCLAIMER: This is an unstable experiment

9 Feb 16, 2024

A demo repo that shows how to use the latest component model feature in wasmtime to implement a key-value capability defined in a WIT file.

Key-Value Component Demo This repo serves as an example of how to use the latest wasm runtime wasmtime and its component-model feature to build and ex

3 Dec 20, 2022

Open Machine Intelligence Framework for Hackers. (GPU/CPU)

Leaf • Introduction Leaf is a open Machine Learning Framework for hackers to build classical, deep or hybrid machine learning applications. It was ins

5.5k Jan 1, 2023

Damavand is a quantum circuit simulator. It can run on laptops or High Performance Computing architectures, such CPU distributed architectures or multi GPU distributed architectures.

Damavand is a code that simulates quantum circuits. In order to learn more about damavand, refer to the documentation. Development status Core feature

6 Mar 29, 2022

Comments

Make FP class generic enough to accept any `P` value
This issue can be divided into two tasks:

Remove M and R2 constants and update operations so that results are still valid. This will be less efficent than the original implementation.

Convert FP class to a template that takes P as an argument: template <uint32_t P> class Fp. If you want examples of a template class, see: https://github.com/andrewmilson/ministark/blob/main/gpu-poly/src/metal/felt_u128.h.metal#L50-L54
opened by mpaulucci 0

A repo for learning how to parallelize computations in the GPU using Apple's Metal, in Rust.

Related tags

Overview

Metal playground in rust

Overview

Contents

Pre-requisites

References

You might also like...

Wonnx - a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web

A gpu accelerated (optional) neural network Rust crate.

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

A fun, hackable, GPU-accelerated, neural network library in Rust, written by an idiot

Signed distance functions + Rust (CPU & GPU) = ❤️❤️

rust-gpu CLI driver

A demo repo that shows how to use the latest component model feature in wasmtime to implement a key-value capability defined in a WIT file.

Open Machine Intelligence Framework for Hackers. (GPU/CPU)

Damavand is a quantum circuit simulator. It can run on laptops or High Performance Computing architectures, such CPU distributed architectures or multi GPU distributed architectures.

Comments

Make FP class generic enough to accept any `P` value

Owner

Lambdaclass

zenoh-flow aims at providing a zenoh-based data-flow programming framework for computations that span from the cloud to the device.

Rust based Cross-GPU Machine Learning

Open deep learning compiler stack for cpu, gpu and specialized accelerators

A real-time implementation of "Ray Tracing in One Weekend" using nannou and rust-gpu.

Flexible, reusable reinforcement learning (Q learning) implementation in Rust

Ecosystem of libraries and tools for writing and executing extremely fast GPU code fully in Rust.

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

How to: Run Rust code on your NVIDIA GPU

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

A Demo server serving Bert through ONNX with GPU written in Rust with <3