February 4, 2021

734 words 4 mins read

MAIF/shapash

MAIF/shapash

Shapash makes Machine Learning models transparent and understandable by everyone

repo name MAIF/shapash
repo link https://github.com/MAIF/shapash
homepage https://maif.github.io/shapash/
language Jupyter Notebook
size (curr.) 25026 kB
stars (curr.) 467
created 2020-04-29
license Apache License 2.0

Shapash

Actions Status PyPI Python Version PyPI - License documentation badge

Shapash is a Python library which aims to make machine learning interpretable and understandable by everyone. It provides several types of visualization that display explicit labels that everyone can understand.

Data Scientists can more easily understand their models and share their results. End users can understand the decision proposed by a model using a summary of the most influential criteria.

  • Readthedocs: documentation badge

The objectives of shapash:

  • To display clear and understandable results: Plots and outputs use explicit labels for each feature and its values:
  • To allow Data Scientists to quickly understand their models by using a webapp to easily navigate between global and local explainability, and understand how the different features contribute: Live Demo Shapash-Monitor
  • To Summarize and export the local explanation:

Shapash proposes a short and clear local explanation. It allows each user, whatever their Data backround, to understand a local prediction of a supervised model thanks to an summarized and explicit explanation

  • To discuss results: Shapash allows Data Scientists to easily share and discuss their results with non-Data users

  • Use Shapash to deploy interpretability part of your project: From model training to deployment (API or Batch Mode)

How does shapash work?

Shapash is an overlay package for libraries dedicated to the interpretability of models. It uses Shap or Lime backend to compute contributions. Shapash builds on the different steps necessary to build a machine learning model to make the results understandable

User Manual

Shapash works for Regression, Binary Classification or Multiclass problem. It is compatible with many models: Catboost, Xgboost, LightGBM, Sklearn Ensemble, Linear models, SVM. Shapash can use category-encoders object, sklearn ColumnTransformer or simply features dictionary.

  • Category_encoder: OneHotEncoder, OrdinalEncoder, BaseNEncoder, BinaryEncoder, TargetEncoder
  • Sklearn ColumnTransformer: OneHotEncoder, OrdinalEncoder, StandardScaler, QuantileTransformer, PowerTransformer

You can use Shap or Lime, compute your own local contributions and use Shapash to display plot or summarize it.

Using Shapash is simple and requires only a few lines of code. Most parameters are optional, you can displays plots effortlessly.

But you can also tune plots and outputs, specifying labels dict, features dict, encoders, predictions, … : The more you specify parameters, options, dictionaries and more the outputs will be understandable

Installation

pip install shapash

Getting Started: 3 minutes to Shapash

The 4 steps to display results:

  • Step 1: Declare SmartExplainer Object

    You can declare features dict here to specify the labels to display

from shapash.explainer.smart_explainer import SmartExplainer
xpl = SmartExplainer(features_dict=house_dict) # optional parameter
  • Step 2: Compile Model, Dataset, Encoders, …

    There are 2 mandatory parameters in compile method: Model and Dataset

xpl.compile(
    x=Xtest,
    model=regressor,
    preprocessing=encoder, # Optional: compile step can use inverse_transform method
    y_pred=y_pred, # Optional
    postprocessing=postprocess # Optional: see tutorial postprocessing
)
  • Step 3: Display output

    There are several outputs and plots available. for example, you can launch the web app:

app = xpl.run_app()

Live Demo Shapash-Monitor

  • Step 4: From training to deployment : SmartPredictor Object

    Shapash provides a SmartPredictor object to deploy the summary of local explanation for the operational needs. It is an object dedicated to deployment, lighter than SmartExplainer with additional consistency checks. SmartPredictor can be used with an API or in batch mode. It provides predictions, detailed or summarized local explainability using appropriate wording.

predictor = xpl.to_smartpredictor()

See the tutorial part to know how to use the SmartPredictor object

Tutorials

This github repository offers a lot of tutorials to allow you to start more concretely in the use of Shapash.

More Precise Overview

More details about charts and plots

The different ways to use Encoders and Dictionaries

Better displaying data with postprocessing

How to use shapash with Shap or Lime compute

Deploy local explainability in production

comments powered by Disqus