# Cria - Local llama OpenAI-compatible API
The objective is to serve a local llama-2 model by mimicking an OpenAI API service. The llama-2 model runs on GPU using the ggml-sys crate, built with specific compilation flags.
## Quickstart
- Clone the project:

  ```bash
  git clone [email protected]:AmineDiro/cria.git
  cd cria/
  git submodule update --init --recursive
  ```
-
Build project ( I
β€οΈ cargo !).cargo b --release
- For cuBLAS (NVIDIA GPU) acceleration, use:

  ```bash
  cargo b --release --features cublas
  ```
- For Metal acceleration, use:

  ```bash
  cargo b --release --features metal
  ```

  ⚠️ NOTE: If you have issues building for GPU, check out the building issues section below.
- Download a GGML `.bin` LLaMA-2 quantized model (for example, llama-2-7b).
- Run the API, using the `use-gpu` flag to offload model layers to your GPU:

  ```bash
  ./target/release/cria llama-2 {MODEL_BIN_PATH} --use-gpu --gpu-layers 32
  ```
## Completion Example
You can use the openai Python client, or directly use the sseclient Python library to stream messages. Here is an example using sseclient:
```python
import json
import sys
import time

import sseclient
import urllib3

url = "http://localhost:3000/v1/completions"

# Send the completion request; preload_content=False keeps the
# connection open so the SSE stream can be consumed incrementally.
http = urllib3.PoolManager()
response = http.request(
    "POST",
    url,
    preload_content=False,
    headers={"Content-Type": "application/json"},
    body=json.dumps(
        {
            "prompt": "Morocco is a beautiful country situated in north africa.",
            "temperature": 0.1,
        }
    ),
)

# Print tokens as they arrive over the event stream.
client = sseclient.SSEClient(response)
s = time.perf_counter()
for event in client.events():
    chunk = json.loads(event.data)
    sys.stdout.write(chunk["choices"][0]["text"])
    sys.stdout.flush()
e = time.perf_counter()

print(f"\nGeneration from completion took {e-s:.2f}s!")
```
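Alternatively, here is a minimal sketch using the openai Python package (the pre-1.0 API). It assumes the server accepts any API key, and the `model` name is a placeholder, since cria loads the model at startup:

```python
import openai

# Point the client at the local cria server (assumption: no real key is needed).
openai.api_base = "http://localhost:3000/v1"
openai.api_key = "not-needed"

response = openai.Completion.create(
    model="llama-2",  # placeholder name; the served model is chosen at launch
    prompt="Morocco is a beautiful country situated in north africa.",
    temperature=0.1,
    stream=True,
)
for chunk in response:
    print(chunk["choices"][0]["text"], end="", flush=True)
```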
You can clearly see the generation speed using my M1 GPU.
## Building with GPU issues
I had some issues compiling the llm crate with CUDA support for my RTX 2070 Super (Turing architecture). After some debugging, I found I needed to pass nvcc the correct `gpu-architecture` version; for now, the ggml-sys crate only generates code for the compute_52 and compute_61 architectures. Here is the set of changes to `build.rs`:
```diff
diff --git a/crates/ggml/sys/build.rs b/crates/ggml/sys/build.rs
index 3a6e841..ef1e1b0 100644
--- a/crates/ggml/sys/build.rs
+++ b/crates/ggml/sys/build.rs
@@ -330,8 +330,9 @@ fn enable_cublas(build: &mut cc::Build, out_dir: &Path) {
         .arg("--compile")
         .arg("-cudart")
         .arg("static")
-        .arg("--generate-code=arch=compute_52,code=[compute_52,sm_52]")
-        .arg("--generate-code=arch=compute_61,code=[compute_61,sm_61]")
+        .arg("--generate-code=arch=compute_75,code=[compute_75,sm_75]")
         .arg("-D_WINDOWS")
         .arg("-DNDEBUG")
         .arg("-DGGML_USE_CUBLAS")
@@ -361,8 +362,7 @@ fn enable_cublas(build: &mut cc::Build, out_dir: &Path) {
         .arg("-Illama-cpp/include/ggml")
         .arg("-mtune=native")
         .arg("-pthread")
-        .arg("--generate-code=arch=compute_52,code=[compute_52,sm_52]")
-        .arg("--generate-code=arch=compute_61,code=[compute_61,sm_61]")
+        .arg("--generate-code=arch=compute_75,code=[compute_75,sm_75]")
         .arg("-DGGML_USE_CUBLAS")
         .arg("-I/usr/local/cuda/include")
         .arg("-I/opt/cuda/include")
```
The only thing left to do is to change the `Cargo.toml` file to point to the patched llm crate.
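For illustration, a hypothetical dependency override, assuming the patched llm checkout lives in the repo's submodule directory (the exact path and feature set depend on your setup):

```toml
# Hypothetical: point the llm dependency at the locally patched checkout.
[dependencies]
llm = { path = "./llm/crates/llm", features = ["cublas"] }
```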
## TODO / Roadmap
- Run Llama.cpp on CPU using llm-chain
- Run Llama.cpp on GPU using llm-chain
- Implement `/models` route
- Implement basic `/completions` route
- Implement streaming completions SSE
- Cleanup cargo features with llm
- Support MacOS Metal
- Merge completions / completion_streaming routes in the same endpoint
- Implement `/embeddings` route
- Setup good tracing
- Better errors
- Implement `/chat/completions` route
- Implement streaming chat completions SSE
- Metrics ??
- Batching requests (à la io_uring); see the sketch after this list:
  - For each request, put an entry in a ring-buffer queue: Entry(Flume mpsc (resp_rx, resp_tx))
  - Spawn the model in a separate task that reads from the ring buffer, gets an entry, and puts each token into the response channel
  - Construct a stream from the Flume resp_rx channel and return SSE(stream) to the user
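Here is a hedged, language-agnostic sketch of that batching design in Python, with asyncio queues standing in for the ring buffer and the Flume mpsc channels; all names (`fake_generate`, `model_worker`, `handle_request`) are invented for illustration and are not cria code:

```python
import asyncio

def fake_generate(prompt: str):
    # Toy stand-in for the model's token generator: echoes the prompt's words.
    yield from prompt.split()

async def model_worker(requests: asyncio.Queue):
    # Single task owning the model: pull (prompt, resp_tx) entries off the
    # queue and push generated tokens into each request's response channel.
    while True:
        prompt, resp_tx = await requests.get()
        for token in fake_generate(prompt):
            await resp_tx.put(token)
        await resp_tx.put(None)  # end-of-stream marker

async def handle_request(requests: asyncio.Queue, prompt: str):
    # Per-request handler: enqueue the work, then stream tokens back.
    # This is the side that would be wrapped into an SSE response.
    resp_rx: asyncio.Queue = asyncio.Queue()
    await requests.put((prompt, resp_rx))
    while (token := await resp_rx.get()) is not None:
        yield token

async def main():
    requests: asyncio.Queue = asyncio.Queue(maxsize=64)  # ring-buffer stand-in
    asyncio.create_task(model_worker(requests))
    async for token in handle_request(requests, "hello from a batched request"):
        print(token)

asyncio.run(main())
```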