`grep` but with PEG patterns. Define grammars (e.g. `digit`), functions for matching. No more regex syntax!

IchHabeKeineNamen

Last update: Apr 18, 2023

Related tags

Overview

PEG

`peggrep`

Example file demo_file:

THIS LINE IS THE 1ST UPPER CASE LINE IN THIS FILE.
this line is the 1st lower case line in this file.
This Line Has All Its First Character Of The Word With Upper Case.

Two lines above this line is empty.
And this is the last line.

Match literals

$ peggrep "'this'" demo_file
this line is the 1st lower case line in this file.
Two lines above this line is empty.
And this is the last line.

'this': match the literal string "this"

Match `this.*empty`

$ peggrep "'this' (!'empty' .)* 'empty'" demo_file
Two lines above this line is empty.

!'empty':
- def: true if the current position is not followed by "empty"
- ! does not consume any characters
.: consume one character unconditionally
(!'empty' .)*:
- def: match any number of characters that are not followed by "empty"
- when the position is at the e of "empty", the (!'empty' .)* exits
- why not just .* 'empty':
  - * is greedy in PEG
  - .* will consume "empty" without exiting, so 'empty' has no chance to match
The final 'empty': match and consume the "empty"

To make life easier, we can make (!'empty' .)* 'empty' into a function:

Write a grammar file:

// grammar.peg
until[str] <- (!str .)* str
            ;

Run with the grammar file:

$ peggrep -g grammar.peg "'this' until['empty']" demo_file
Two lines above this line is empty.

Match digits

Steps:

Write a grammar file:

// grammar.peg
digit <- '0' / '1' / '2' / '3' / '4' / '5' / '6' / '7' / '8' / '9'
       ;

Run with the grammar file:

$ peggrep -g grammar.peg "digit+" demo_file
THIS LINE IS THE 1ST UPPER CASE LINE IN THIS FILE.
this line is the 1st lower case line in this file.

Grammar file from environment variable

We don't want to write a grammar file every time we want to use it. We can use the environment variable PEGGREP_GRAMMAR to specify the grammar file:

$ export PEGGREP_GRAMMAR="/absolute/path/to/grammar.peg"
$ peggrep "'this' until['empty']" demo_file
Two lines above this line is empty.

Match all private functions

Steps:

Write a grammar file:

// grammar.peg
space <- ' ' / '\t' / '\r' / '\n'
       ;
_     <- space*
       ;

Run:

$ export PEGGREP_GRAMMAR="/absolute/path/to/grammar.peg"
$ peggrep -^ "_ !'pub' 'fn'" src/*.rs

-^: match at the start of a line
_:
- predefined in the grammar file
- to match any number of spaces

You might also like...

Kalker (or "kalk") is a calculator program/website that supports user-defined variables, functions, derivation, and integration

Kalker (or "kalk") is a calculator program/website that supports user-defined variables, functions, derivation, and integration. It runs on Windows, macOS, Linux, Android, and in web browsers (with WebAssembly).

1.2k Dec 27, 2022

The simplest way to de-Google your life and business: Inbox, Calendar, Files, Contacts & much more

Bloom The all-in-one private workspace Try it for free! You no longer trust tech monopolies with your data? You are done with your privacy invaded by

1.6k Dec 26, 2022

Milho (corn in portuguese) is a toy dialect of Lisp written as a way to learn more about compilers

Milho (corn in portuguese) is a toy dialect of Lisp written as a way to learn more about compilers. There are implementations in rust and go

27 May 4, 2022

A Discord bot focused on addressing the inherent problems with Discord, to allow a more socialist/anarchist organization of servers.

ACABot A Discord bot focused on addressing the inherent problems with Discord, to allow a more socialist/anarchist organization of servers (or "guilds

4 May 3, 2022

Rust bindings for libjuice. Look at datachannel-rs if you need more batteries.

3 Sep 25, 2022

Utility to quickly setup Starcraft Broodwar matches between 2 or more bots

BWAIShotgun Utility to quickly setup Starcraft Broodwar matches between 2 or more bots Be aware that all bots will be executed directly, without any l

5 Nov 25, 2022

An abstraction build on top of discord-rich-presence that makes possible to use it in a more declarative way

Declarative Discord Rich Presence This library is an abstraction build on top of discord-rich-presence crate that allows you to use it in a more decla

2 Sep 7, 2022

CFD is a tool that allows you to check one or more domains to see if they are protected by CloudFlare or not.

CFD is a tool that allows you to check one or more domains to see if they are protected by CloudFlare or not. The check is carried out based on five criteria: 3 headers in the HTTP response, IP, and SSL certificate issuer. The check result can be displayed on the screen or saved to a file.

13 Apr 7, 2023

A Simple, But amazing telegram bot, Made using the Rust language!

Telegram bot in Rust A fun Telegram bot made using Rust language.

2 Dec 21, 2021

`grep` but with PEG patterns. Define grammars (e.g. `digit`), functions for matching. No more regex syntax!

Related tags

Overview

PEG

`peggrep`

Match literals

Match `this.*empty`

Match digits

Grammar file from environment variable

Match all private functions

You might also like...

Kalker (or "kalk") is a calculator program/website that supports user-defined variables, functions, derivation, and integration

The simplest way to de-Google your life and business: Inbox, Calendar, Files, Contacts & much more

Milho (corn in portuguese) is a toy dialect of Lisp written as a way to learn more about compilers

A Discord bot focused on addressing the inherent problems with Discord, to allow a more socialist/anarchist organization of servers.

Rust bindings for libjuice. Look at datachannel-rs if you need more batteries.

Utility to quickly setup Starcraft Broodwar matches between 2 or more bots

An abstraction build on top of discord-rich-presence that makes possible to use it in a more declarative way

CFD is a tool that allows you to check one or more domains to see if they are protected by CloudFlare or not.

A Simple, But amazing telegram bot, Made using the Rust language!

Releases(v0.1.0)

v0.1.0(Apr 17, 2023)

Owner

IchHabeKeineNamen

Define safe interfaces to MMIO and CPU registers with ease

Const equivalents of many [`bytemuck`] functions, and a few additional const functions.

Rust Keeper bots that run various functions, from liquidations, to orderbook cranks, and more.

`Debug` in rust, but only supports valid rust syntax and outputs nicely formatted using pretty-please

a simple compiled language i made in rust. it uses intermediate representation (IR) instead of an abstract syntax tree (AST).

Analogous, indented syntax for the Rust programming language.

tr-lang is a language that aims to bring programming language syntax closer to Turkish.

A syntax exploration of eventually stable Rust Iterator items

Rust macro to use a match-like syntax as a elegant alternative to nesting if-else statement

A Rust proc-macro crate which derives functions to compile and parse back enums and structs to and from a bytecode representation

`grep` but with PEG patterns. Define grammars (e.g. `digit`), functions for matching. No more regex syntax!

Related tags

Overview

PEG

peggrep

Match literals

Match this.*empty

Match digits

Grammar file from environment variable

Match all private functions

You might also like...

Kalker (or "kalk") is a calculator program/website that supports user-defined variables, functions, derivation, and integration

The simplest way to de-Google your life and business: Inbox, Calendar, Files, Contacts & much more

Milho (corn in portuguese) is a toy dialect of Lisp written as a way to learn more about compilers

A Discord bot focused on addressing the inherent problems with Discord, to allow a more socialist/anarchist organization of servers.

Rust bindings for libjuice. Look at datachannel-rs if you need more batteries.

Utility to quickly setup Starcraft Broodwar matches between 2 or more bots

An abstraction build on top of discord-rich-presence that makes possible to use it in a more declarative way

CFD is a tool that allows you to check one or more domains to see if they are protected by CloudFlare or not.

A Simple, But amazing telegram bot, Made using the Rust language!

Releases(v0.1.0)

v0.1.0(Apr 17, 2023)

Owner

IchHabeKeineNamen

Define safe interfaces to MMIO and CPU registers with ease

Const equivalents of many [`bytemuck`] functions, and a few additional const functions.

Rust Keeper bots that run various functions, from liquidations, to orderbook cranks, and more.

`Debug` in rust, but only supports valid rust syntax and outputs nicely formatted using pretty-please

a simple compiled language i made in rust. it uses intermediate representation (IR) instead of an abstract syntax tree (AST).

Analogous, indented syntax for the Rust programming language.

tr-lang is a language that aims to bring programming language syntax closer to Turkish.

A syntax exploration of eventually stable Rust Iterator items

Rust macro to use a match-like syntax as a elegant alternative to nesting if-else statement

A Rust proc-macro crate which derives functions to compile and parse back enums and structs to and from a bytecode representation

`peggrep`

Match `this.*empty`