Data analysis infrastructure for the Neo N3 blockchain.

edge

Last update: Jan 18, 2023

Related tags

Database api rust gui sqlite blockchain neo indexer n3 neo-go shrike

Overview

Shrike

Shrike is a set of tools built for the purpose of Neo blockchain data analysis. The infrastructure comprises three components:

Indexer - Synchronizes a NeoGo node, retrieves blockchain data, and processes it into a relational DB.

API - Serves a set of useful queries for indexed data over a REST API. Used to power the GUI and hopefully other third-party applications in the future.

GUI - A simple web interface for interacting with the data made available by Shrike. A hosted version of this application may be found here.

You can find instructions on how to operate each of the components independently in the respective sections below.

Pull requests and suggestions are welcomed in any of the components. There are innumerable ways to improve the code and broaden the featureset.

Indexer

A 🔥 blazingly fast 🔥 chain indexer for Neo N3.

Indexer is the first and primary component. It was built with personal projects in mind, and as a learning experience, so is very much a WIP. However, it should be just about safe for human consumption.

The Indexer oversees three functions:

Synchronize a NeoGo instance.
Fetch block, transaction, and application log data.
Process and store the chain data into SQLite tables.

Requirements

The latest stable Rust version. I recommend using Rustup.
The NeoGo v0.101.0 binary for your platform. Get that here. Indexer has not been tested on any platform except Windows 10.
(Optional) An SQLite-compatible DB browser/query editor. For simplicity I enjoy DB Browser, going more advanced you might prefer DBeaver.

Quickstart

Clone or otherwise download the Indexer folder.
Drop your NeoGo binary in the root directory (where Cargo.toml lives). On Windows, rename the binary to neogo.exe. On other platforms, you'll likely need to edit main.rs to use the correct path in spawn::NeoGo::new().
Open the root directory in a terminal and enter cargo run --release to build and run.
Do something else for a while.

Notes

^{All figures below are accurate on my machine as of block height ~2.7M.}

Database structure

The database has two tables: blocks and transactions. They are modelled to closely match their typical NeoRPC forms, with some allowances made for SQL and the cramming of the relevant parts of their respective application logs into each.

I'm not against the idea of changing the tables, depending on feedback, if there's good reason for it. I also plan to add contracts and perhaps balances or transfers, depending on if I have a use case for them. Feel free to make a PR if you want to expedite that process.

NeoGo sync time

Indexer will wait for its NeoGo instance to sync before it will start fetching data. Syncing NeoGo currently takes a little over an hour. You can speed it up by adjusting the config to SkipBlockVerification, but this is not advised. Once you have caught up to the chain head once, sync time is generally negligible.

Indexing time

Indexer works quickly and quietly, you can use your machine as you usually would while it runs. Once syncing is complete, fully populating the block and transaction tables from scratch takes me less than 15 minutes.

Storage requirements

You'll need a healthy amount of storage space to use the Indexer, slightly more than is required to sync a node on its own. My chain folder is currently 26.6GB and the Shrike DB is 7.18GB. Extrapolate from there to determine how much headroom you need to account for future blockchain growth, depending on your use case.

Alternative networks

You can point Indexer at any Neo N3 network that is compatible with the current NeoGo version used by the program. This can be done by adjusting the protocol config file. References can be found here. You may have to adjust the NODE_PATH in rpc.rs if you alter the RPC port.

Acknowledgements

Thanks to the NeoGo team for their excellent software and documentation. Also thanks to @liaojinghui, whose work on neo-rs saved me a lot of headache with the cumbersome task of converting script hashes to public addresses.

API

An Actix Web-based service that performs various queries against indexed data and serves the responses. Only relatively basic queries are implemented so far. There is currently no caching for queries that only need to be performed once per block, it will scale very poorly to multiple users until then.

Quickstart

Clone or otherwise download the API folder.
Get a copy of the Shrike DB from the download page (TODO) or by running the Indexer. Adjust the file path in main.rs via the DB_PATH constant.
Use cargo run or cargo run --release to serve the API.
Make your requests! The default path for the API when run locally is as follows: http://127.0.0.1:8080/v1/module/method/parameter.

A hosted version will be provided in the future.

API Reference

TODO

GUI

A simple web application built using SolidJS (SolidStart) and PicoCSS. It was created to give a way for regular users to leverage Shrike, but power users will be better served by running custom queries against their own copy of the Shrike DB.

Quickstart

Clone or otherwise download the GUI folder.
Run the API following the above instructions, or update the path in /constants/index.js to use the hosted version (coming soon).
Serve the GUI locally with npm run dev and open it in your browser at http://127.0.0.1:5173/.

It's not a novel data sturcture just AVL and Btree for rust

This crate named as ABtree but this not means it is a novel data sturcture. It’s just AVL tree and Btree. For the Btree, what makes it different from

3 Jun 20, 2022

RedisJSON - a JSON data type for Redis

RedisJSON RedisJSON is a Redis module that implements ECMA-404 The JSON Data Interchange Standard as a native data type. It allows storing, updating a

3.4k Jan 1, 2023

A Rust application that inserts Discogs data dumps into Postgres

Discogs-load A Rust application that inserts Discogs data dumps into Postgres. Discogs-load uses a simple state machine with the quick-xml Rust librar

7 Dec 9, 2022

SQLite compiled to WASM with pluggable data storage

wasm-sqlite SQLite compiled to WASM with pluggable data storage. Useful to save SQLite in e.g. Cloudflare Durable Objects (example: https://github.com

36 Dec 7, 2022

Databend aimed to be an open source elastic and reliable serverless data warehouse,

An elastic and reliable Serverless Data Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy

5k Jan 3, 2023

Open Data Access Layer that connect the whole world together

OpenDAL Open Data Access Layer that connect the whole world together. Status OpenDAL is in alpha stage and has been early adopted by databend. Welcome

302 Jan 4, 2023

postgres-ical - a PostgreSQL extension that adds features related to parsing RFC-5545 « iCalendar » data from within a PostgreSQL database

1 Feb 23, 2022

Plugin for macro-, mini-quad (quads) to save data in simple local storage using Web Storage API in WASM and local file on a native platforms.

quad-storage This is the crate to save data in persistent local storage in miniquad/macroquad environment. In WASM the data persists even if tab or br