format whisper transcripts to .srt

Mike Dallas

Last update: Jul 21, 2023

Related tags

Command-line subtitles openai srt transcription whisper whisper-cpp

Overview

whispersub

A dead simple utility to format the output of OpenAI's whisper model (or whisper.cpp) into an .srt file.

Usage

whispersub input.txt -o output.srt

you can also pipe the output of whisper.cpp into whispersub

whisper-cpp --file audio.wav --language en --model ggml-medium.en.bin | whispersub

or use a little hellper function to extract the audio from a video, pipe it to whisper.cpp and then to whispersub

makesub () {
    filename=$(basename -- "$1")
    filename="${filename%.*}"
    model=${HOME}/.local/share/whisper/ggml-medium.en.bin
    ffmpeg -i "$1" -vn -acodec pcm_s16le -ar 16000 -ac 2 -f wav - | 
    nice -n 20 whisper-cpp --threads "$(nproc)" --file - --language en --model "$model" |
    whispersub -o "${filename}.en.srt"
}

makesub video.mp4

UniSBOM is a tool to build a software bill of materials on any platform with a unified data format.

UniSBOM is a tool to build a software bill of materials on any platform with a unified data format. Work in progress Support MacOS Uses system_profile

32 Nov 2, 2022

An apocalypse-resistant data storage format for the truly paranoid.

Carbonado An apocalypse-resistant data storage format for the truly paranoid. Designed to keep encrypted, durable, compressed, provably replicated con

30 Dec 29, 2022

Grid-based drum sequencer plugin as MIDI FX in CLAP/VST3 format

dr-seq Grid-based drum sequencer plugin as MIDI FX in CLAP/VST3 format. WARNING: This project is in a very early state. So there is no guarantee for a

6 Jan 29, 2023

jf "jf: %q" "JSON Format"

jf jf "jf: %q" "JSON Format" jf is a jo alternative to help safely format and print JSON objects in the commandline. However, unlike jo, where you bui

15 Apr 1, 2023

A tool to filter sites in a FASTA-format whole-genome pseudo-alignment

Core-SNP-filter This is a tool to filter sites (i.e. columns) in a FASTA-format whole-genome pseudo-alignment based on: Whether the site contains vari

15 Apr 2, 2023

Polyexen demo of Plonkish Arithmetiation Format (Plaf) on the zkevm-circuits

Plaf demo This is a demo of Plaf: Plonkish Arithmetiation Format on the zkevm-circuits Steps to run this: Clone these three repositories in the same f

17 Apr 6, 2023

a command-line tool that transforms a Git repository into a minimal format for ChatGPT queries

gprepo /dʒiːpiːˈɹi:pi:oʊ/ a command-line tool that transforms a Git repository into a minimal format for ChatGPT queries. Features Excludes LICENSE an

6 Apr 20, 2023

Coffee is a loader for ELF (Executable and Linkable Format) object files written in Rust

Coffee is a loader for ELF (Executable and Linkable Format) object files written in Rust. It provides a mechanism to load and parse ELF files similar to COFFLoader, but specifically designed for ELF files used in Unix-like systems.

13 Jun 22, 2023

A tool to format codeblocks inside markdown and org documents.

cbfmt (codeblock format) A tool to format codeblocks inside markdown, org, and restructuredtext documents. It iterates over all codeblocks, and format

126 May 26, 2023

Releases(v0.1.0)

v0.1.0(Jul 15, 2023)

Source code(tar.gz)
Source code(zip)
whispersub_v0.1.0_x86_64-apple-darwin.tar.gz(990.88 KB)
whispersub_v0.1.0_x86_64-apple-darwin.tar.gz.sha256sum(109 bytes)
whispersub_v0.1.0_x86_64-pc-windows-gnu.zip(2.43 MB)
whispersub_v0.1.0_x86_64-pc-windows-gnu.zip.sha256sum(108 bytes)
whispersub_v0.1.0_x86_64-unknown-linux-musl.tar.gz(1.83 MB)
whispersub_v0.1.0_x86_64-unknown-linux-musl.tar.gz.sha256sum(115 bytes)

Owner

Mike Dallas

GitHub

Given a set of kmers (fasta format) and a set of sequences (fasta format), this tool will extract the sequences containing the kmers.

Kmer2sequences Description Given a set of kmers (fasta / fastq [.gz] format) and a set of sequences (fasta / fastq [.gz] format), this tool will extra

22 Sep 16, 2023

⚗️ Superfast CLI interface for the conventional commits commit format

resin ⚗️ Superfast CLI interface for the conventional commits commit format ❓ What is resin? resin is a CLI (command-line interface) tool that makes i

23 Oct 12, 2022

⚗️ Superfast CLI interface for the conventional commits commit format

resin ⚗️ Superfast CLI interface for the conventional commits commit format ❓ What is resin? resin is a CLI (command-line interface) tool that makes i

23 Oct 12, 2022

Crate to generate files in ROFF format (Rust)

roffman A crate to generate roff man pages. Usage Add the following to the Cargo.toml: [dependencies] roffman = "0.3" Example use roffman::{Roff, Roff

23 Jul 13, 2022

CLI tool that make it easier to perform multiple lighthouse runs towards a single target and output the result in a plotable format.

Lighthouse Aggregator CLI tool that make it easier to perform multiple lighthouse runs towards a single target and output the result in a "plotable" f

1 Jan 12, 2022

format whisper transcripts to .srt

Related tags

Overview

whispersub

Usage

You might also like...

UniSBOM is a tool to build a software bill of materials on any platform with a unified data format.

An apocalypse-resistant data storage format for the truly paranoid.

Grid-based drum sequencer plugin as MIDI FX in CLAP/VST3 format

jf "jf: %q" "JSON Format"

A tool to filter sites in a FASTA-format whole-genome pseudo-alignment

Polyexen demo of Plonkish Arithmetiation Format (Plaf) on the zkevm-circuits

a command-line tool that transforms a Git repository into a minimal format for ChatGPT queries

Coffee is a loader for ELF (Executable and Linkable Format) object files written in Rust

A tool to format codeblocks inside markdown and org documents.

Releases(v0.1.0)

v0.1.0(Jul 15, 2023)

Owner

Mike Dallas

Given a set of kmers (fasta format) and a set of sequences (fasta format), this tool will extract the sequences containing the kmers.

⚗️ Superfast CLI interface for the conventional commits commit format

⚗️ Superfast CLI interface for the conventional commits commit format

Crate to generate files in ROFF format (Rust)

CLI tool that make it easier to perform multiple lighthouse runs towards a single target and output the result in a plotable format.

Single File Assets is a file storage format for images

CLI application to run clang-format on a set of files specified using globs in a JSON configuration file.

Format codebase in documentation 🦤

A low-level MVCC file format for storing blobs.

Databento Binary Encoding (DBZ) - Fast message encoding and storage format for market data