Motoko concrete syntax parser in Rust.

DFINITY

Last update: Dec 15, 2022

Related tags

Utilities motoko.rs

Overview

motoko.rs

Motoko concrete syntax parser and dynamic evaluator (VM) in Rust.

Motoko VM

The Motoko VM explores a more dynamic way for Motoko to execute.

The VM runs programs that parse without type-checking them, and it issues dynamic type errors when execution fails to progress.

The VM executes source syntax trees directly, for greater simplicity and greater transparency over more advanced, compilation-based techniques.

Priorities (WIP)

Relaxed semantics for programs with type errors, permitting limited execution.
Faithfully cover full Motoko language. Support base library.
Permit versioning, snapshots, transactions and rollback.

Out of scope

No complex, high-performance VM techniques, like just-in-time compilation.
Not a compiler from Motoko to Wasm.
No static type system here. See compiler for static typing..

Comments

Active trait.
Preparing for actors, and multiple "active" evaluation contexts, this PR introduces the Active trait, which provides an abstraction over what we had been calling Core, permitting us to abstract and reuse the current evaluation logic for two distinct situations in the new, redefined Core.

Because we also sometimes borrowed Core as read-only, we also introduce ActiveBorrow to capture that use case.

To do a demo with a single actor, Core needs to "enlarge" its scope to encompass:

an agent's evaluation context (or several, but only one to start); it activates the actor, and scripts its tests/demos.

an evaluation context for an actor that is activated by this agent; it provides a public interface, and has an internal store that persists across program edits.
opened by matthewhammer 8
Actors.
Continues #126, implementing Actors.

This test demonstrates how to reproduce the following steps, all of which now work:

agent creates a local actor, with a name Counter.

agent sends a local actor Counter a message (calling a method)

Counter's methods execute with its internal state, and variables.

When each method completes, it responds to the agent, which resumes control.

agent upgrades actor Counter by re-using the same name later in the script (proof of concept form of upgrade, for unit testing here)

the upgrade "works correctly," in the sense that:

the new method definitions run now, and the old ones are replaced.

the store retains its variable's values from before, after the upgrade.

actor Counter = { var x = 0; public func get() /*: async Nat*/ { x }; public func inc() { x := x + 1 }; }; assert (Counter.get() == 0); Counter.inc(); assert (Counter.get() == 1); actor Counter { var x = 0; public func get() /*: async Nat*/ { x }; public func inc() { x := x + 2 }; }; assert (Counter.get() == 1); Counter.inc(); assert (Counter.get() == 3); #ok
opened by matthewhammer 5
VM Opt: for loop fast path for opaque iterators.

A for loop whose iterator is an opaque object (see #120) can avoid the two-phase stepping logic of a general for loop, and even avoid popping the for loop's (new, special) stack frame.

opened by matthewhammer 4
AST source locations
Resolves #7.

Progress:

[x] Find a good way to convert usize ranges to line/column numbers

[x] Refactor Exp_, Type_, etc. to include source locations

[x] Refactor the lexer to use the same source location pattern

[x] Update usages throughout the codebase to account for these changes

[x] Update the parser to preserve source locations from tokens
opened by rvanasa 3
Parse the Motoko base library

Adds a unit test which displays the status of each base library module. Once we can parse more of the modules, I'll simplify the output to show an aggregated view of the successes and errors.

opened by rvanasa 2
Implement a simple token-tree-based formatter
A very basic, slightly usable, permissive lexer-based Motoko code formatter.

Example provisional CLI usage (using a line width of 30):

cargo run -- format 'actor Main {public query func main():async Nat{let(a,b)=(1,2)/*{a;a;a/*))))*/;a{{{*/;ignore "abc";"Hello"}}; await Main.hello()' 30

The current formatter "configuration" lives at the bottom of format.rs in a function called get_space(), which determines what type of whitespace should be inserted between two given TokenTree values. This seems to be powerful enough to cover all the syntax in Motoko as long as the token trees are constructed correctly. Depending on how many rules we end up needing overall, I might try refactoring these definitions into a TOML or JSON configuration file.
opened by rvanasa 2
Whitespace/Comments

For our project, we want comments to be retained in the AST, to retain them in the formatted output.

For lalrpop, need to preprocess input to make whitespace in comments parseable. Maybe other steps are needed too.

opened by matthewhammer 2
Optimize VM: Opaque pointers (vs existing var-pointer).
Consider this program

let rands = func(count){ var c = 0; { next = func() { if (c == count) { null } else { c := c + 1; let (n, i) = prim "fastRandIterNext" rand_; rand_ := i; n } } } }; let j = rands(5); for (x in j) { // Do something iterative with x. }

(also same as here https://github.com/dfinity/canister-profiling/pull/16/files#diff-ffc8c4b16dc3edfe909f12892610b23d05804b5fe2195d9f544b1933e347d3e0R13 )

When I run this program (no body in the for loop), I get 54 reductions, and 310 total steps before termination.

Most of the steps are evaluating the next function in the VM’s eval logic — What if I were to internalize all of these steps into the Rust side? This PR explores such an idea.
opened by matthewhammer 1
Rust -> Motoko value conversion

Adds a way to map Rust values directly to their corresponding Motoko values, preserving details such as variant names.

To do: implement a similar Motoko -> Rust conversion to replace the current usage of serde_json::Value.

opened by rvanasa 1
Add pattern matching shorthand functions to `Value`

Includes high-performance Value.to_nat(), Value.to_array(), etc. functions for unwrapping a Value as an alternative to using the (slightly slower) Value.convert() function.

opened by rvanasa 1
VM: Modules.
This PR implements modules for the VM. (WIP)

For each module, combine names defined in the module to create a valid environment. Reusing the same name is an Interruption

combine names in nested modules, and also permit outer names to flow into these internal modules. Unlike inRust, in Motoko, all names in super scopes flow into subscopes.

Unlike other constructs, modules require doing a lot of steps that do not correspond to ordinary execution, but rather, correspond to phases just before or during type checking.

These steps are necessary to resolve identifiers and module projections in real Motoko programs, and involve resolving nested recursive definitions.

Each named module definition introduces a recursive scope that we construct incrementally, in a step-wise manner. When they nest, additional steps propagate the recursion between the nested levels.

Why?

We could eschew a step-based approach, and do the module-level resolution of paths in a batch style, with batch-style errors each time an identifier is used twice. However, that has two drawbacks:

Goes against the rest of the project, where we show internal progress at every possible step and interrupt it immediately when there is an issue. A batch algorithm is not like this, and would introduce another "mode" of interacting with programs that happens before they "run step by step". That would complicate things, including the UX design, potentially.

If we ever want to improvise with other pre-execution steps, this would introduce a place to do them. Such steps may include experimental intensional features, like let box; as with modules, they would step before the program evaluates.

Scoping using nested modules

Consider this nested module declaration:

module { module X { /* module Y { }; */ // will shadow Y sibling of X public module M { func f () : Nat { g();h N.g(); M.g(); /* X.N.g() */ x }; public func g () { Y.y(); Y.Z.z() }; }; module N { public func g () { Y.y(); /* Z.z() /* error. */ */ }; }; public let x = 5; }; module Y { public module Z { public func z() { Y.y(); y() } }; public func y() { X.M.g() }; }; }

Notice:

function X.M.g can "see" Y.y and Y.Z.z.

function Y.Z.z can "see" function "y" and also see the same function as Y.y.

function X.N.g can "see" Y.y but not Z.z (Y.Z.z() works though)

This example could be smaller and simpler. It's something I have been using as I poke at the compiler to test my understanding of the language, and having it be about this large as been helpful.
opened by matthewhammer 1
Implement all Motoko primitives
[ ] prim "abs"

[ ] prim "lsh_Nat"

[ ] prim "rsh_Nat"

[ ] prim "idlHash"

[x] prim "print"

[ ] prim "trap"

[ ] prim "rts_version"

[ ] prim "rts_memory_size"

[ ] prim "rts_heap_size"

[ ] prim "rts_total_allocation"

[ ] prim "rts_reclaimed"

[ ] prim "rts_max_live_size"

[ ] prim "rts_callback_table_count"

[ ] prim "rts_callback_table_size"

[ ] prim "crc32Hash"

[ ] prim "num_conv_Int_Int64"

[ ] prim "num_conv_Int_Int32"

[ ] prim "num_conv_Int_Int16"

[ ] prim "num_conv_Int_Int8"

[ ] prim "num_conv_Nat_Nat64"

[ ] prim "num_conv_Nat_Nat32"

[ ] prim "num_conv_Nat_Nat16"

[ ] prim "num_conv_Nat_Nat8"

[ ] prim "num_wrap_Int_Int64"

[ ] prim "num_wrap_Int_Int32"

[ ] prim "num_wrap_Int_Int16"

[ ] prim "num_wrap_Int_Int8"

[ ] prim "num_wrap_Int_Nat64"

[ ] prim "num_wrap_Int_Nat32"

[ ] prim "num_wrap_Int_Nat16"

[ ] prim "num_wrap_Int_Nat8"

[ ] prim "num_wrap_Int64_Nat64"

[ ] prim "num_wrap_Nat64_Int64"

[ ] prim "num_wrap_Int32_Nat32"

[ ] prim "num_wrap_Nat32_Int32"

[ ] prim "num_wrap_Int16_Nat16"

[ ] prim "num_wrap_Nat16_Int16"

[ ] prim "num_wrap_Int8_Nat8"

[ ] prim "num_wrap_Nat8_Int8"

[ ] prim "num_wrap_Char_Nat32"

[ ] prim "num_conv_Nat32_Char"

[ ] prim "conv_Char_Text"

[ ] prim "char_to_upper"

[ ] prim "char_to_lower"

[ ] prim "char_is_whitespace"

[ ] prim "char_is_lowercase"

[ ] prim "char_is_uppercase"

[ ] prim "char_is_alphabetic"

[ ] prim "decodeUtf8"

[ ] prim "encodeUtf8"

[ ] prim "text_compare"

[ ] prim "popcnt8"

[ ] prim "clz8"

[ ] prim "ctz8"

[ ] prim "btst8"

[ ] prim "popcnt16"

[ ] prim "clz16"

[ ] prim "ctz16"

[ ] prim "btst16"

[ ] prim "popcnt32"

[ ] prim "clz32"

[ ] prim "ctz32"

[ ] prim "btst32"

[ ] prim "popcnt64"

[ ] prim "clz64"

[ ] prim "ctz64"

[ ] prim "btst64"

[ ] prim "popcnt8"

[ ] prim "clz8"

[ ] prim "ctz8"

[ ] prim "btst8"

[ ] prim "popcnt16"

[ ] prim "clz16"

[ ] prim "ctz16"

[ ] prim "btst16"

[ ] prim "popcnt32"

[ ] prim "clz32"

[ ] prim "ctz32"

[ ] prim "btst32"

[ ] prim "popcnt64"

[ ] prim "clz64"

[ ] prim "ctz64"

[ ] prim "btst64"

[ ] prim "fabs"

[ ] prim "fsqrt"

[ ] prim "fceil"

[ ] prim "ffloor"

[ ] prim "ftrunc"

[ ] prim "fnearest"

[ ] prim "fmin"

[ ] prim "fmax"

[ ] prim "fcopysign"

[ ] prim "num_conv_Float_Int"

[ ] prim "num_conv_Int_Float"

[ ] prim "num_conv_Float_Int64"

[ ] prim "num_conv_Int64_Float"

[ ] prim "fmtFloat->Text"

[ ] prim "fsin"

[ ] prim "fcos"

[ ] prim "ftan"

[ ] prim "fasin"

[ ] prim "facos"

[ ] prim "fatan"

[ ] prim "fatan2"

[ ] prim "fexp"

[ ] prim "flog"

[ ] prim "Array.init"

[ ] prim "Array.tabulate"

[ ] prim "blobToArray"

[ ] prim "blobToArrayMut"

[ ] prim "arrayToBlob"

[ ] prim "arrayMutToBlob"

[ ] prim "cast"

[ ] prim "cast"

[ ] prim "cast"

[ ] prim "time"

[ ] prim "cast"

[ ] prim "cast"

[ ] prim "cast"

[ ] prim "cyclesBalance"

[ ] prim "cyclesAvailable"

[ ] prim "cyclesAccept"

[ ] prim "setCertifiedData"

[ ] prim "getCertificate"

[ ] prim "stableMemorySize"

[ ] prim "stableMemoryGrow"

[ ] prim "stableMemoryLoadNat32"

[ ] prim "stableMemoryStoreNat32"

[ ] prim "stableMemoryLoadNat8"

[ ] prim "stableMemoryStoreNat8"

[ ] prim "stableMemoryLoadNat16"

[ ] prim "stableMemoryStoreNat16"

[ ] prim "stableMemoryLoadNat64"

[ ] prim "stableMemoryStoreNat64"

[ ] prim "stableMemoryLoadInt32"

[ ] prim "stableMemoryStoreInt32"

[ ] prim "stableMemoryLoadInt8"

[ ] prim "stableMemoryStoreInt8"

[ ] prim "stableMemoryLoadInt16"

[ ] prim "stableMemoryStoreInt16"

[ ] prim "stableMemoryLoadInt64"

[ ] prim "stableMemoryStoreInt64"

[ ] prim "stableMemoryLoadFloat"

[ ] prim "stableMemoryStoreFloat"

[ ] prim "stableMemoryLoadBlob"

[ ] prim "stableMemoryStoreBlob"

[ ] prim "stableVarQuery"

[ ] prim "performanceCounter"
opened by matthewhammer 0
VM implementation of equality should respect a record's narrowed typing.
Suppose we introduce two fields, but use an annotation to "hide" the second one:

let r1 : { x : Nat } = { x = 3; y = 5 }; r1 == { x = 3 }

The equality check should evaluate to true, but currently we do not respect these type annotations, and the field y will not be hidden.
opened by matthewhammer 0
VM implementation of value equality needs to do more.
In Motoko, values of the same (non higher order) type can be compared for equality.

For records with mutable fields, the pointer identity of the field is dereferenced by the official Motoko comparison logic, not used to distinguish records whose values are the same, but whose pointers are not. In those cases, the records are identified by the implementation of ==.

However, in our Rust-based implementation, we currently do not dereference pointers when doing equality checks, but should.

For instance, this check should evaluate to true, but it will not currently:

{ var x = 0 } == { var x = 0 }

The two variables x are allocated in different pointers, but hold the same value.
opened by matthewhammer 0

Owner

DFINITY

The Internet Computer aims to reinvent the internet as a computer to host secure software and a new breed of open internet services.

GitHub

Experimental syntax for Rust

Osy.rs Experimental syntax for Rust Hey everyone, this readme needs work! The spec has been roughed out in Osy.rs_spec.alpha, but the file could be be

3 Dec 17, 2021

SWC plugin for transform Vue3-jsx syntax

swc-plugin-transform-vue3-jsx ?? SWC plugin for faster conversion vue3-jsx. Installation npm npm install swc-plugin-transform-vue3-jsx -D yarn yarn ad

4 Oct 19, 2022

Lapce vue plugin, support vue (SFC) syntax highlight, autocomplate,types check

Lapce Plugin for Vue (based volar) Preview Usage Required: Lapce version must be greater than 2.0, and you can use Lapce nightly version. click here t

32 Dec 26, 2022

Rustymind is a driver and parser for NeuroSky MindWave EEG headset written in pure Rust.

Rustymind is a driver and parser for NeuroSky MindWave EEG headset written in pure Rust. You can use it to connect, interact, and plot real time data from the headset.

34 Sep 13, 2022

λ-calculus parser made by rust

Lambda Calculus Parser This is a parser for λ-calculus expressions. It takes a λ-terms as input, parses it and returns a JSON representation of the te

4 Apr 17, 2023

Lightweight compile-time UUID parser.

compiled-uuid Anywhere you're building Uuids from a string literal, you should use uuid. Motivation If you want to use a fixed Uuid throughout your pr

10 Dec 8, 2022

Parser for UltraStar Deluxe song files

This is a rust parser for USDX song files. Files are written as a plaintext files that contain data about the song and notes/lyrics.

1 Apr 3, 2022

k-mer counter in Rust using the rust-bio and rayon crates

krust is a k-mer counter written in Rust and run from the command line that will output canonical k-mers and their frequency across the records in a f

14 Jan 7, 2023

Experimental Rust tool for generating FFI definitions allowing many other languages to call Rust code

Diplomat is an experimental Rust tool for generating FFI definitions allowing many other languages to call Rust code. With Diplomat, you can simply define Rust APIs to be exposed over FFI and get high-level C, C++, and JavaScript bindings automatically!

255 Dec 30, 2022

Aws-sdk-rust - AWS SDK for the Rust Programming Language

The AWS SDK for Rust This repo contains the new AWS SDK for Rust (the SDK) and its public roadmap. Please Note: The SDK is currently released as a dev

2k Jan 3, 2023

Rust + Yew + Axum + Tauri, full-stack Rust development for Desktop apps.

rust-yew-axum-tauri-desktop template Rust + Yew + Axum + Tauri, full-stack Rust development for Desktop apps. Crates frontend: Yew frontend app for de

54 Dec 23, 2022

A lightning fast version of tmux-fingers written in Rust, copy/pasting tmux like vimium/vimperator

tmux-thumbs A lightning fast version of tmux-fingers written in Rust for copy pasting with vimium/vimperator like hints. Usage Press ( prefix + Space

598 Jan 2, 2023

A command-line tool collection to assist development written in RUST

dtool dtool is a command-line tool collection to assist development Table of Contents Description Usage Tips Installation Description Now dtool suppor

314 Dec 18, 2022

Rust mid-level IR Abstract Interpreter

MIRAI MIRAI is an abstract interpreter for the Rust compiler's mid-level intermediate representation (MIR). It is intended to become a widely used sta

793 Jan 2, 2023

Migrate C code to Rust

C2Rust helps you migrate C99-compliant code to Rust. The translator (or transpiler) produces unsafe Rust code that closely mirrors the input C code. T

3k Jan 8, 2023

C to Rust translator

Corrode: Automatic semantics-preserving translation from C to Rust This program reads a C source file and prints an equivalent module in Rust syntax.

2.1k Dec 14, 2022

Astronomical algorithms in Rust

astro-rust Contents API Docs About Usage Contributing References About astro-rust is a library of advanced astronomical algorithms for the Rust progra

213 Jan 7, 2023

A Rust library for calculating sun positions

sun A rust port of the JS library suncalc. Install Add the following to your Cargo.toml [dependencies] sun = "0.2" Usage pub fn main() { let unixti

36 Dec 28, 2022

Macro for Python-esque comprehensions in Rust

Cute Macro for Python-esque list comprehensions in Rust. The c! macro implements list and hashmap comprehensions similar to those found in Python, all

306 Jan 6, 2023

Motoko concrete syntax parser in Rust.

Related tags

Overview

motoko.rs

Motoko VM

Priorities (WIP)

Out of scope

Comments

Why?

Scoping using nested modules

Owner

DFINITY

Experimental syntax for Rust

SWC plugin for transform Vue3-jsx syntax

Lapce vue plugin, support vue (SFC) syntax highlight, autocomplate,types check

Rustymind is a driver and parser for NeuroSky MindWave EEG headset written in pure Rust.

λ-calculus parser made by rust

Lightweight compile-time UUID parser.

Parser for UltraStar Deluxe song files

k-mer counter in Rust using the rust-bio and rayon crates

Experimental Rust tool for generating FFI definitions allowing many other languages to call Rust code

Aws-sdk-rust - AWS SDK for the Rust Programming Language

Rust + Yew + Axum + Tauri, full-stack Rust development for Desktop apps.

A lightning fast version of tmux-fingers written in Rust, copy/pasting tmux like vimium/vimperator

A command-line tool collection to assist development written in RUST

Rust mid-level IR Abstract Interpreter

Migrate C code to Rust

C to Rust translator

Astronomical algorithms in Rust

A Rust library for calculating sun positions

Macro for Python-esque comprehensions in Rust

Scoping using nested `module`s