December 15, 2020

365 words 2 mins read

signals-dev/Orion

A machine learning library for detecting anomalies in signals.


repo name	signals-dev/Orion
repo link	https://github.com/signals-dev/Orion
homepage	https://signals-dev.github.io/Orion
language	Jupyter Notebook
size (curr.)	13340 kB
stars (curr.)	47
created	2018-07-24
license	MIT License

Orion

License: MIT
Development Status: Pre-Alpha
Homepage: https://github.com/signals-dev/Orion
Documentation: https://signals-dev.github.io/Orion

Overview

Orion is a machine learning library built for unsupervised time series anomaly detection. With a given time series data, we provide a number of “verified” ML pipelines (a.k.a Orion pipelines) that identify rare patterns and flag them for expert review.

The library makes use of a number of automated machine learning tools developed under Data to AI Lab at MIT.

Recent news: Read about using an Orion pipeline on NYC taxi dataset in a blog series part 1, part 2 and part 3.

Quickstart

Install with pip

The easiest and recommended way to install Orion is using pip:

pip install orion-ml

This will pull and install the latest stable release from PyPi.

In the following example we show how to use one of the Orion Pipelines.

Fit an Orion pipeline

We will load a demo data for this example:

from orion.data import load_signal

train_data = load_signal('S-1-train')
train_data.head()

which should show a signal with timestamp and value.

    timestamp     value
0  1222819200 -0.366359
1  1222840800 -0.394108
2  1222862400  0.403625
3  1222884000 -0.362759
4  1222905600 -0.370746

In this example we use lstm_dynamic_threshold pipeline and set some hyperparameters (in this case training epochs as 5).

from orion import Orion

hyperparameters = {
    'keras.Sequential.LSTMTimeSeriesRegressor#1': {
        'epochs': 5,
        'verbose': True
    }
}

orion = Orion(
    pipeline='lstm_dynamic_threshold',
    hyperparameters=hyperparameters
)

orion.fit(train_data)

Detect anomalies using the fitted pipeline

Once it is fitted, we are ready to use it to detect anomalies in our incoming time series:

new_data = load_signal('S-1-new')
anomalies = orion.detect(new_data)

:warning: Depending on your system and the exact versions that you might have installed some WARNINGS may be printed. These can be safely ignored as they do not interfere with the proper behavior of the pipeline.

The output of the previous command will be a pandas.DataFrame containing a table of detected anomalies:

        start         end     score
0  1394323200  1399701600  0.673494

Resources

Additional resources that might be of interest:

Learn about benchmarking pipelines.
Read about pipeline evaluation.
More about database design.

signals-dev/Orion

Orion

Overview

Quickstart

Install with pip

Fit an Orion pipeline

Detect anomalies using the fitted pipeline

Resources

mfrdixon/ML_Finance_Codes

aws-samples/aws-machine-learning-university-accelerated-nlp

mims-harvard/TDC

aws-samples/aws-machine-learning-university-accelerated-cv

SeldonIO/seldon-core

LiYangHart/Hyperparameter-Optimization-of-Machine-Learning-Algorithms

MicrosoftDocs/ml-basics

dafriedman97/mlbook

jonkrohn/ML-foundations