20 Rust Inference Libraries

Infer a JSON schema from example data, produce nonsense synthetic data (drivel) according to the schema

drivel drivel is a command-line tool written in Rust for inferring a schema from an example JSON (or JSON lines) file, and generating synthetic data (

36 Jul 5, 2024

Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.

candle-vllm Efficient platform for inference and serving local LLMs including an OpenAI compatible API server. Features OpenAI compatible API server p

21 Nov 15, 2023

LLaMA2 port for Rust inspired by llama2.c

llama2-rs LLaMA2 port for Rust inspired by llama2.c. TODOs: Implement loading of the model Implement forward pass Implement generation Implement token

4 Aug 27, 2023

`dfx new --type=rust` + burn-rs MNIST web inference example

ic-mnist The frontend provides a canvas where users can draw a digit. The drawn digit is then sent to the backend canister running burn-rs for inferen

4 Jun 25, 2023

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

LLaMa 7b in rust This repo contains the popular LLaMa 7b language model, fully implemented in the rust programming language! Uses dfdx tensors and CUD

16 May 8, 2023

Run LLaMA inference on CPU, with Rust 🦀🚀🦙

LLaMA-rs Do the LLaMA thing, but now in Rust 🦀 🚀 🦙 Image by @darthdeus, using Stable Diffusion LLaMA-rs is a Rust port of the llama.cpp project. Th

2.7k Apr 17, 2023

Rust+OpenCL+AVX2 implementation of LLaMA inference code

RLLaMA RLLaMA is a pure Rust implementation of LLaMA large language model inference.. Supported features Uses either f16 and f32 weights. LLaMA-7B, LL

344 Apr 16, 2023

Run LLaMA inference on CPU, with Rust 🦀🚀🦙

LLaMA-rs Do the LLaMA thing, but now in Rust 🦀 🚀 🦙 Image by @darthdeus, using Stable Diffusion LLaMA-rs is a Rust port of the llama.cpp project. Th

2.7k Apr 17, 2023

pyke Diffusers is a modular Rust library for optimized Stable Diffusion inference 🔮

pyke Diffusers is a modular Rust library for pretrained diffusion model inference to generate images, videos, or audio, using ONNX Runtime as a backen

12 Jan 5, 2023

Using OpenAI Codex's "davinci-edit" Model for Gradual Type Inference

OpenTau: Using OpenAI Codex for Gradual Type Inference Current implementation is focused on TypeScript Python implementation comes next Requirements r

11 Dec 18, 2022

A statically-typed, interpreted programming language, with generics and type inference

Glide A programming language. Currently, this includes: Static typing Generics, with monomorphization Type inference on function calls func identityT

1 Apr 10, 2022

Wonnx - a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web

Wonnx is a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web. Supported Platforms (enabled by wgpu) API Windows Linux &

354 Jan 6, 2023

An implementation of a predicative polymorphic language with bidirectional type inference and algebraic data types

Vinilla Lang Vanilla is a pure functional programming language based on System F, a classic but powerful type system. Merits Simple as it is, Vanilla

73 Aug 4, 2022

A fusion of OTP lib/dialyzer + lib/compiler for regular Erlang with type inference

Typed ERLC The Problem I have a dream, that one day there will be an Erlang compiler, which will generate high quality type-correct code from deduced

35 Sep 5, 2022

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

Sonos' Neural Network inference engine. This project used to be called tfdeploy, or Tensorflow-deploy-rust. What ? tract is a Neural Network inference

1.5k Jan 2, 2023

Orkhon: ML Inference Framework and Server Runtime

Orkhon: ML Inference Framework and Server Runtime Latest Release License Build Status Downloads Gitter What is it? Orkhon is Rust framework for Machin

129 Dec 21, 2022

Snips NLU rust implementation

Snips NLU Rust Installation Add it to your Cargo.toml: [dependencies] snips-nlu-lib = { git = "https://github.com/snipsco/snips-nlu-rs", branch = "mas

327 Dec 26, 2022

Orkhon: ML Inference Framework and Server Runtime

Orkhon: ML Inference Framework and Server Runtime Latest Release License Build Status Downloads Gitter What is it? Orkhon is Rust framework for Machin

129 Dec 21, 2022

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

Sonos' Neural Network inference engine. This project used to be called tfdeploy, or Tensorflow-deploy-rust. What ? tract is a Neural Network inference

1.5k Jan 8, 2023

A static, type inferred and embeddable language written in Rust.

gluon Gluon is a small, statically-typed, functional programming language designed for application embedding. Features Statically-typed - Static typin

2.7k Dec 29, 2022

Rust Inference Resources

Rust inference Libraries

Infer a JSON schema from example data, produce nonsense synthetic data (drivel) according to the schema

Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.

LLaMA2 port for Rust inspired by llama2.c

`dfx new --type=rust` + burn-rs MNIST web inference example

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

Run LLaMA inference on CPU, with Rust 🦀🚀🦙

Rust+OpenCL+AVX2 implementation of LLaMA inference code

Run LLaMA inference on CPU, with Rust 🦀🚀🦙

pyke Diffusers is a modular Rust library for optimized Stable Diffusion inference 🔮

Using OpenAI Codex's "davinci-edit" Model for Gradual Type Inference

A statically-typed, interpreted programming language, with generics and type inference

Wonnx - a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web

An implementation of a predicative polymorphic language with bidirectional type inference and algebraic data types

A fusion of OTP lib/dialyzer + lib/compiler for regular Erlang with type inference

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

Orkhon: ML Inference Framework and Server Runtime

Snips NLU rust implementation

Orkhon: ML Inference Framework and Server Runtime

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

A static, type inferred and embeddable language written in Rust.

Rust Inference Resources

Related tags

Rust inference Libraries

Infer a JSON schema from example data, produce nonsense synthetic data (drivel) according to the schema

Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.

LLaMA2 port for Rust inspired by llama2.c

`dfx new --type=rust` + burn-rs MNIST web inference example

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

Run LLaMA inference on CPU, with Rust 🦀🚀🦙

Rust+OpenCL+AVX2 implementation of LLaMA inference code

Run LLaMA inference on CPU, with Rust 🦀🚀🦙

pyke Diffusers is a modular Rust library for optimized Stable Diffusion inference 🔮

Using OpenAI Codex's "davinci-edit" Model for Gradual Type Inference

A statically-typed, interpreted programming language, with generics and type inference

Wonnx - a GPU-accelerated ONNX inference run-time written 100% in Rust, ready for the web

An implementation of a predicative polymorphic language with bidirectional type inference and algebraic data types

A fusion of OTP lib/dialyzer + lib/compiler for regular Erlang with type inference

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

Orkhon: ML Inference Framework and Server Runtime

Snips NLU rust implementation

Orkhon: ML Inference Framework and Server Runtime

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

A static, type inferred and embeddable language written in Rust.