Machine learning framework for building object trackers and similarity search engines

In-Sight

Last update: Dec 28, 2022

Related tags

Machine learning rust machine-learning artificial-intelligence object-tracking feature-matching similarity-search

Overview

Similari

Similari is a framework that helps build sophisticated tracking systems. The operations it most commonly implements efficiently are collecting observed object features, finding similar objects, and merging them into tracks based on features and attributes.

With Similari one can develop highly efficient parallelized SORT, DeepSORT, and other sophisticated single-observer (e.g. a single camera) or multi-observer tracking engines.

Introduction

The primary purpose of Similari is to provide the means to build sophisticated in-memory object tracking engines.

The framework helps build various kinds of tracking or similarity search engines. The simplest kind holds feature vectors and allows comparing new vectors against the ones kept in the database. More sophisticated engines operate over tracks: series of observations of the same feature collected during an object's lifecycle. Such systems are often used in video processing and in other systems where the observer receives fuzzy or changing observation results.
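The simplest kind of engine described above can be sketched in a few lines. This is an illustration only, not the Similari API: the VectorStore type, its add and nearest methods, and the choice of Euclidean distance are all assumptions made for the example.

```rust
// Hypothetical sketch: these types are NOT part of the Similari API.

/// A stored feature vector with an identifier.
struct Entry {
    id: u64,
    feature: Vec<f32>,
}

/// Minimal in-memory store that keeps feature vectors.
#[derive(Default)]
struct VectorStore {
    entries: Vec<Entry>,
}

impl VectorStore {
    fn add(&mut self, id: u64, feature: Vec<f32>) {
        self.entries.push(Entry { id, feature });
    }

    /// Returns up to `top` (id, distance) pairs, closest first.
    /// Stored vectors whose length differs from the query are ignored.
    fn nearest(&self, query: &[f32], top: usize) -> Vec<(u64, f32)> {
        let mut result: Vec<(u64, f32)> = self
            .entries
            .iter()
            .filter(|e| e.feature.len() == query.len())
            .map(|e| (e.id, euclidean(&e.feature, query)))
            .collect();
        result.sort_by(|a, b| a.1.total_cmp(&b.1));
        result.truncate(top);
        result
    }
}

/// Euclidean distance between two vectors of equal length.
fn euclidean(a: &[f32], b: &[f32]) -> f32 {
    a.iter()
        .zip(b)
        .map(|(x, y)| (x - y) * (x - y))
        .sum::<f32>()
        .sqrt()
}

fn main() {
    let mut store = VectorStore::default();
    store.add(1, vec![0.0, 0.0]);
    store.add(2, vec![3.0, 4.0]);
    store.add(3, vec![1.0, 0.0]);

    // Compare a new vector against everything kept in the store.
    for (id, distance) in store.nearest(&[0.0, 0.0], 2) {
        println!("id={id} distance={distance}");
    }
}
```

A track-based engine extends this idea by keeping many observations per object instead of one vector.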

Applicability Notes

Although Similari allows building various similarity engines, there are competing tools that may sometimes (or often) be a better choice. This section explains where Similari is applicable and what alternatives exist.

Similari fits best for tasks where objects are described by multiple observations of a certain feature class rather than by a single feature vector, and where object behavior is dynamic: you remove objects from the index or modify them as often as you add new ones. This is a very important point: Similari is less efficient than tools designed for growing or static object spaces.

Fit: tracking a person across a room: person ReID, age/gender, and face features are collected multiple times during tracking and used to merge tracks or provide aggregated results at the end of the track;
Not fit: a plagiarism database, where a single document is described by several (or just one) constant ReID vectors, and documents are added but not removed. The task is to find the top X documents most similar to the one being checked.

If your task looks like the Not fit case, you can still use Similari, but you are probably looking for HNSW or NMS implementations:

HNSW Rust - Link
HNSW C/Python - link
NMS - link

Objects in the Similari index support the following features:

Track lifecycle - the object is represented by its lifecycle (track): it appears, evolves, and disappears. During its lifetime the object evolves according to its behavioral properties (attributes and feature observations).
Feature Observation - Similari assumes that an object is observed by an observer entity that collects its features multiple times. Those features are represented by vectors of float numbers and observation attributes. When an observation happens, the track is updated with the gathered features. Later observations are used to find similar tracks in the index and merge them.
Track Attributes - Arbitrary attributes describe additional track properties aside from feature observations. Attributes are crucial when you compare objects in the wild, because certain attribute combinations make objects incompatible, like animal_type, which prohibits comparing dogs with cats. Another popular use of attributes is a spatial or temporal characteristic of an object, e.g. objects located far apart at the same time cannot be compared. Attributes in Similari are dynamic: they evolve on every feature observation addition and when objects are merged. They are used both in distance calculations and in compatibility checks (which reduce the compute space by skipping incompatible objects).

If you are planning to use Similari to search in a huge index, consider using object attributes to decrease the lookup space. If the attributes of two tracks are not compatible, their distance calculations are skipped.
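The way attribute compatibility shrinks the lookup space can be shown with a small sketch. Everything here is hypothetical and is not the Similari API: the Attributes and Track types, the animal_type field used as the compatibility rule, and the choice of the smallest pairwise Euclidean distance as the track distance are assumptions made for illustration.

```rust
// Illustrative sketch only: these types are hypothetical, not the Similari API.

/// Attributes that decide whether two tracks may be compared at all.
struct Attributes {
    animal_type: String,
}

impl Attributes {
    /// Assumed rule: only tracks of the same animal type are comparable.
    fn compatible(&self, other: &Attributes) -> bool {
        self.animal_type == other.animal_type
    }
}

/// A track: attributes plus every feature vector observed so far.
struct Track {
    id: u64,
    attributes: Attributes,
    observations: Vec<Vec<f32>>,
}

fn euclidean(a: &[f32], b: &[f32]) -> f32 {
    a.iter()
        .zip(b)
        .map(|(x, y)| (x - y) * (x - y))
        .sum::<f32>()
        .sqrt()
}

/// Smallest distance between any pair of observations of the two tracks.
/// Returns None when either track has no observations.
fn track_distance(a: &Track, b: &Track) -> Option<f32> {
    let mut best: Option<f32> = None;
    for x in &a.observations {
        for y in &b.observations {
            let d = euclidean(x, y);
            best = Some(best.map_or(d, |m| m.min(d)));
        }
    }
    best
}

/// Compares `query` only against compatible tracks. Incompatible tracks are
/// filtered out before any distance is computed.
fn lookup(query: &Track, index: &[Track]) -> Vec<(u64, f32)> {
    index
        .iter()
        .filter(|t| query.attributes.compatible(&t.attributes))
        .filter_map(|t| track_distance(query, t).map(|d| (t.id, d)))
        .collect()
}

/// Small constructor helper for the example.
fn track(id: u64, animal: &str, observations: Vec<Vec<f32>>) -> Track {
    Track {
        id,
        attributes: Attributes { animal_type: animal.to_string() },
        observations,
    }
}

fn main() {
    let index = vec![
        track(1, "dog", vec![vec![0.0, 0.0], vec![1.0, 1.0]]),
        track(2, "cat", vec![vec![0.0, 0.0]]),
    ];
    let query = track(3, "dog", vec![vec![1.0, 2.0]]);

    // Only the dog track is a candidate; the cat track is never measured.
    for (id, distance) in lookup(&query, &index) {
        println!("candidate track {id}: distance {distance}");
    }
}
```

The cheaper the compatibility check is relative to the distance calculation, the more such a filter pays off on a large index.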

Performance

To keep the calculations performant the framework uses:

ultraviolet - fast SIMD computations.

Parallel computations are implemented with index sharding and a dedicated pool of worker threads.
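The idea of sharding an index and computing distances in parallel can be sketched with the standard library alone. This is not how Similari is implemented: the ShardedIndex type is hypothetical, entries are assigned to shards by id modulo the shard count as an assumption, and scoped threads spawned per query (std::thread::scope, Rust 1.63 or newer) stand in for a persistent worker pool.

```rust
use std::thread;

/// Hypothetical sharded index: entries are distributed across shards by id.
struct ShardedIndex {
    shards: Vec<Vec<(u64, Vec<f32>)>>,
}

fn euclidean(a: &[f32], b: &[f32]) -> f32 {
    a.iter()
        .zip(b)
        .map(|(x, y)| (x - y) * (x - y))
        .sum::<f32>()
        .sqrt()
}

impl ShardedIndex {
    fn new(shard_count: usize) -> Self {
        assert!(shard_count > 0, "at least one shard is required");
        Self { shards: vec![Vec::new(); shard_count] }
    }

    /// Assumed placement rule: shard = id modulo shard count.
    fn add(&mut self, id: u64, feature: Vec<f32>) {
        let shard = (id % self.shards.len() as u64) as usize;
        self.shards[shard].push((id, feature));
    }

    /// Computes distances with one scoped thread per shard, then merges the
    /// per-shard results and sorts them by ascending distance.
    fn distances(&self, query: &[f32]) -> Vec<(u64, f32)> {
        let mut merged: Vec<(u64, f32)> = thread::scope(|scope| {
            let handles: Vec<_> = self
                .shards
                .iter()
                .map(|shard| {
                    scope.spawn(move || {
                        shard
                            .iter()
                            .map(|(id, feature)| (*id, euclidean(feature, query)))
                            .collect::<Vec<_>>()
                    })
                })
                .collect();
            handles
                .into_iter()
                .flat_map(|handle| handle.join().expect("worker thread panicked"))
                .collect()
        });
        merged.sort_by(|a, b| a.1.total_cmp(&b.1));
        merged
    }
}

fn main() {
    let mut index = ShardedIndex::new(4);
    for id in 0..10u64 {
        index.add(id, vec![id as f32]);
    }
    for (id, distance) in index.distances(&[0.0]).iter().take(3) {
        println!("id={id} distance={distance}");
    }
}
```

A long-lived pool avoids the cost of spawning threads on every query, which matters when queries are short.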

The performance of vector operations depends heavily on the optimization level of the build. At low or default optimization levels Rust may not vectorize f32 operations, so make sure the proper optimization level is configured when running benchmarks.

Rust optimizations

Use RUSTFLAGS="-C target-cpu=native" to enable all CPU features available on the build machine, such as AVX and AVX2. This benefits ultraviolet.

Alternatively, you can add build instructions to .cargo/config (named .cargo/config.toml in recent Cargo versions):

[build]
rustflags = "-C target-cpu=native"

Take a look at benchmarks for numbers.

Numbers

IoU tracking benchmark for N simultaneously observed objects run on 4 cores of Intel(R) Core(TM) i5-7440HQ CPU @ 2.80GHz. The benchmark doesn't use heuristics that separate the observed objects based on object distances.

The benchmark is located at benches/iou_tracker.rs.

10 objects   :     261,184 ns/iter (+/- 170,940)     [3800 FPS]
100 objects  :   1,440,733 ns/iter (+/- 361,937)     [ 694 FPS]
500 objects  :  17,705,508 ns/iter (+/- 5,622,983)   [  57 FPS]
1000 objects :  58,834,824 ns/iter (+/- 12,626,173)  [  17 FPS]

Feature (256 @ f32) tracking benchmark for N simultaneously observed objects run on 4 cores of Intel(R) Core(TM) i5-7440HQ CPU @ 2.80GHz. The benchmark doesn't use heuristics that separate the observed objects based on object distances.

The benchmark is located at benches/feature_tracker.rs.

10 objects   :     101,465 ns/iter (+/- 10,056)      [9900 FPS]
100 objects  :   4,020,673 ns/iter (+/- 877,444)     [ 250 FPS]
500 objects  :  61,716,729 ns/iter (+/- 11,215,929)  [  16 FPS]
1000 objects : 235,187,877 ns/iter (+/- 89,734,978)  [   4 FPS]
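The FPS figures in brackets follow from the ns/iter values: one iteration processes one frame, so FPS = 1,000,000,000 / (ns per iteration), and the listed figures appear to be rounded. A small sketch that recomputes them, using a hypothetical helper named fps and the numbers copied from the two tables:

```rust
/// Converts an average iteration time in nanoseconds into iterations per second.
fn fps(ns_per_iter: u64) -> f64 {
    1_000_000_000.0 / ns_per_iter as f64
}

fn main() {
    // (objects, ns/iter) pairs copied from the IoU benchmark table.
    let iou = [
        (10u32, 261_184u64),
        (100, 1_440_733),
        (500, 17_705_508),
        (1000, 58_834_824),
    ];
    // (objects, ns/iter) pairs copied from the feature benchmark table.
    let feature = [
        (10u32, 101_465u64),
        (100, 4_020_673),
        (500, 61_716_729),
        (1000, 235_187_877),
    ];

    for (objects, ns) in iou {
        println!("IoU, {objects} objects: {:.1} FPS", fps(ns));
    }
    for (objects, ns) in feature {
        println!("Feature, {objects} objects: {:.1} FPS", fps(ns));
    }
}
```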

Manuals and Articles

Collected articles about Similari:

IoU object tracker example at Medium
Re-ID object tracker example at Medium

Usage Examples

Take a look at samples in the repo:

examples/simple.rs for an idea of simple usage.
examples/track_merging.rs for an idea of intra-cam track merging.
examples/incremental_track_build.rs for a very simple feature-based tracker.
examples/iou_tracker.rs for a very simple IoU tracker (without a Kalman filter).
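For readers new to IoU tracking, the metric itself is easy to sketch. The code below is not taken from examples/iou_tracker.rs and does not use the Similari API: the BBox type, the iou function, and the best_match helper with its threshold parameter are assumptions made for illustration, restricted to axis-aligned boxes.

```rust
/// Axis-aligned bounding box given by its top-left corner and size.
#[derive(Clone, Copy, Debug)]
struct BBox {
    x: f32,
    y: f32,
    w: f32,
    h: f32,
}

/// Intersection over union of two axis-aligned boxes, in the range [0, 1].
fn iou(a: &BBox, b: &BBox) -> f32 {
    let left = a.x.max(b.x);
    let top = a.y.max(b.y);
    let right = (a.x + a.w).min(b.x + b.w);
    let bottom = (a.y + a.h).min(b.y + b.h);
    // Negative width or height means the boxes do not overlap.
    let intersection = (right - left).max(0.0) * (bottom - top).max(0.0);
    let union = a.w * a.h + b.w * b.h - intersection;
    if union > 0.0 { intersection / union } else { 0.0 }
}

/// Index of the track box that overlaps the detection best, provided the
/// overlap reaches `threshold`. Returns None when no track qualifies.
fn best_match(tracks: &[BBox], detection: &BBox, threshold: f32) -> Option<usize> {
    tracks
        .iter()
        .enumerate()
        .map(|(i, t)| (i, iou(t, detection)))
        .filter(|(_, v)| *v >= threshold)
        .max_by(|a, b| a.1.total_cmp(&b.1))
        .map(|(i, _)| i)
}

fn main() {
    let tracks = [
        BBox { x: 0.0, y: 0.0, w: 2.0, h: 2.0 },
        BBox { x: 10.0, y: 10.0, w: 2.0, h: 2.0 },
    ];
    let detection = BBox { x: 0.5, y: 0.5, w: 2.0, h: 2.0 };

    match best_match(&tracks, &detection, 0.3) {
        Some(i) => println!("detection continues track {i}"),
        None => println!("detection starts a new track"),
    }
}
```

A tracker without a Kalman filter compares a new detection with the last observed box of each track, so it works best when objects move little between frames.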


Comments

Build without Python support?

I have the same issue as https://github.com/insight-platform/Similari/issues/67 with roughly the same configuration. Because my project is made for multiple host configurations, adding a dependency on Python through PyO3 without a very good reason is almost a deal breaker.

Are there any plans to make the Python bindings an optional feature?

opened by higmo 5

Medium example won't build. Pyo3 issue?

Following your example from https://medium.com/@kudryavtsev_ia/high-performance-sort-tracker-in-rust-9a1dd18c259c

Under macOS 12.5 with Rust 1.62.1, the example won't build for the x86_64 and arm64 targets, as there seems to be an issue with PyO3.

See the attached build log. build.log

opened by skahl 3

Fix build on macOS

Add a build.rs to set the appropriate build flags for macOS, following the recommendations here: https://pyo3.rs/v0.16.4/building_and_distribution.html#macos

Relates to https://github.com/insight-platform/Similari/issues/67.

opened by higmo 2

Avoid Visual Feature Using and Collecting for Overlapped Boxes

When boxes overlap heavily, the features tend to be of poor quality. This request introduces the requirement to avoid using such features in comparisons or collecting them into the track for future use.

opened by bwsw 1

Releases(v0.22.9)

v0.22.9(Oct 31, 2022)

v0.22.7(Aug 28, 2022)
2D Point Kalman filter;

2D Point Vector Kalman filter;

Some code improvements;

v0.22.4(Aug 19, 2022)

Updates and improvements to the initial v0.22.0
v0.22.0(Aug 15, 2022)
Stabilized Framework API

Ready-to-use SORT tracker for axis-aligned and rotated bounding boxes

Experimental VisualSORT tracker for building DeepSORT-like flavors;

Python interface for SORT and VisualSORT.

v0.21.1(Aug 11, 2022)

Release includes improved Python bindings for SORT (Maha, IoU), DeepSORT flavour, NMS, Kalman filter and polygon clipping.
v0.20.7(Aug 5, 2022)
Python interface for:

Kalman filter for oriented, non-oriented boxes, Mahalanobis distance for oriented boxes;

NMS, parallel NMS for oriented, non-oriented boxes;

Clipping for oriented and non-oriented boxes;

SORT with IoU metric;

SORT with Mahalanobis metric;

v0.19.1(Jul 30, 2022)
Oriented bounding boxes support;

SORT with oriented bounding boxes;

SORT optimizations for oriented bounding boxes.

v0.18.1(Jul 25, 2022)
New Iterator API that decreases processing latency.

Kalman filter for BBoxes to ease SORT and DeepSORT implementations.

v0.17.5(Jul 20, 2022)

Minor release that includes a new API for querying the store.
v0.17.3(Jul 20, 2022)
Improved benchmarks

Improved samples

v0.16.0(Jul 18, 2022)
Improved several benchmarks

Added non-blocking merge operation

v0.8.4(Jul 14, 2022)
Added new methods for Track object

Implemented test sample for a feature-based incremental tracker with bounding box collecting.

v0.8.1(Jul 14, 2022)
Parametrized observation attributes;

Improved code structure and readability;

No merge history flag supported

v0.6.2-packed-simd-nightly(Jul 12, 2022)

Non-production release that requires a nightly compiler.
v0.6.2(Jul 12, 2022)

0.6.2 introduces a new option to filter distances earlier. It allows cutting off excess elements within workers before the aggregation.
v0.4.3(Jul 10, 2022)
New SIMD backend - ultraviolet

Performance increased almost twice

v0.3.1(Jul 7, 2022)

Release 0.3.1 introduces a better parallel computation model that relies on pooled executors rather than the Rayon library. Rayon was removed from the dependencies. The improvement made the operations 2-3 times faster.
v0.2.6(Jul 1, 2022)

Release notes

Version 0.2.6 introduces safe update, merge, and optimize operations. Prior to 0.2.6, when an operation returned Err(e), the internal state of the failed object was undefined. Starting from 0.2.6, a failed operation rolls the object's state back to the initial one.
v0.2.4(Jul 1, 2022)

Documentation update, no functionality change versus 0.2.3.
v0.2.3(Jun 30, 2022)

Implemented performance benchmarks
v0.2.2(Jun 30, 2022)
Additional vector store methods implemented:

add a vector to the store;

merge store-owned vectors;

merge external vector with vector in store.

v0.2.1(Jun 30, 2022)


Owner

In-Sight

In-Sight Platform OSS components
