December 10, 2018

mil-tokyo/webdnn

The Fastest DNN Running Framework on Web Browser


repo name	mil-tokyo/webdnn
repo link	https://github.com/mil-tokyo/webdnn
homepage	https://mil-tokyo.github.io/webdnn
language	Python
size (curr.)	25455 kB
stars (curr.)	1702
created	2017-04-06
license	Other

WebDNN: Fastest DNN Execution Framework on Web Browser

WebDNN is an open source software framework for executing pre-trained deep neural network (DNN) models in the web browser.

WebDNN can execute DNN models trained with the following deep learning frameworks in the web browser.

TensorFlow (v1.2.0 - v1.4.0)
Keras (v2.1.3 - )
PyTorch (v0.3.0 - v0.4.1)
Chainer (v1.23.0 - v4.0.0)
Caffe

Why is WebDNN needed?

Deep neural networks (DNNs) are attracting much attention for use in many applications. However, they require a lot of computational resources, and setting up an execution environment with hardware acceleration such as GPGPU takes considerable effort. Providing DNN applications to end users is therefore very hard.

WebDNN solves this problem by using the web browser as an installation-free DNN execution framework. The framework optimizes a trained DNN model to compress the model data and accelerate execution, and runs it with novel JavaScript APIs such as WebAssembly and WebMetal to achieve zero-overhead execution. Empirical evaluations showed that it achieved more than 200x acceleration.

Note: WebGPU, introduced by Apple, was renamed to WebMetal in 2019. WebDNN 1.2.8 supports both WebMetal and the old name WebGPU for compatibility. The string constant currently used is webgpu, but it will be changed to webmetal in a future version.

Performance

Processing time was compared with Keras.js.
Test environment:
- MacBook Pro (Early 2015)
- macOS 10.12.4 Sierra
- Intel Core i5 2.7 GHz CPU
- 16 GB Memory
- Intel Iris Graphics 6100 GPU
- Safari Technology Preview 30
Models: VGG16[1], Inception-v3[4], and ResNet50[2].
Input Shape: (1, 299, 299, 3) for Inception-v3, (1, 224, 224, 3) for others.
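
To give a sense of the input sizes in this benchmark, the following sketch computes how many values each input tensor holds. It assumes 32-bit floats (4 bytes per value), which the benchmark description does not state, and the helper name `tensor_size` is illustrative only.

```python
from math import prod

# Input shapes from the benchmark description (batch, height, width, channels).
INPUT_SHAPES = {
    "Inception-v3": (1, 299, 299, 3),
    "VGG16": (1, 224, 224, 3),
    "ResNet50": (1, 224, 224, 3),
}

BYTES_PER_VALUE = 4  # assumption: float32 inputs


def tensor_size(shape):
    """Return (number of elements, size in bytes) for a dense tensor."""
    count = prod(shape)
    return count, count * BYTES_PER_VALUE


for name, shape in INPUT_SHAPES.items():
    count, size = tensor_size(shape)
    print(f"{name}: {count} values, {size} bytes")
```

Under that assumption, the Inception-v3 input is roughly 1.8 times the size of the 224x224 inputs used for the other two models.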

Benchmark result with Keras.js

Elapsed time per image is shown on the vertical axis, on a logarithmic scale.

WebDNN with the WebMetal backend was significantly faster than Keras.js. WebDNN with the WebAssembly backend was comparable to the GPU backend of Keras.js. For every DNN model and backend, WebDNN was faster. The speed improvement is larger when the optimizations in the graph transpiler are applied.

Getting started in 30 seconds

Let’s convert a pre-trained ResNet50 Keras model[3] and execute it in your web browser.

First, save the pre-trained ResNet50 model provided by Keras.

from keras.applications import resnet50
model = resnet50.ResNet50(include_top=True, weights='imagenet')
model.save("resnet50.h5")

Next, convert the model with the CLI. The model is optimized during this phase.

python ./bin/convert_keras.py resnet50.h5 --input_shape '(1,224,224,3)' --out output
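
If the conversion is part of a larger Python workflow, the same command can be launched from a script. The following is a minimal sketch that uses only the script path and flags shown in the command above; the helper names `build_convert_command` and `convert` are hypothetical and not part of WebDNN.

```python
import subprocess
import sys


def build_convert_command(model_path, input_shape, out_dir):
    """Build the argument list for the convert_keras.py call shown above."""
    # The CLI takes the shape as a string such as '(1,224,224,3)'.
    shape_arg = "(" + ",".join(str(dim) for dim in input_shape) + ")"
    return [
        sys.executable,            # the Python interpreter running this script
        "./bin/convert_keras.py",  # path as given in the command above
        model_path,
        "--input_shape", shape_arg,
        "--out", out_dir,
    ]


def convert(model_path, input_shape, out_dir):
    """Run the conversion; raises CalledProcessError if the script fails."""
    command = build_convert_command(model_path, input_shape, out_dir)
    subprocess.run(command, check=True)


# Show the arguments without running the conversion.
print(build_convert_command("resnet50.h5", (1, 224, 224, 3), "output")[1:])
```

Passing the arguments as a list avoids shell quoting problems with the parentheses in the shape string. Calling `convert("resnet50.h5", (1, 224, 224, 3), "output")` would run the script, provided the working directory is the WebDNN repository root.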

The generated files (called a Descriptor) can then be loaded and executed from JavaScript as follows:

let runner, image, probabilities;

async function init() {
    // Initialize descriptor runner
    runner = await WebDNN.load('./output');
    image = runner.inputs[0]; 
    probabilities = runner.outputs[0];
}

async function run() {
    // Set the value into input variable.
    image.set(await WebDNN.Image.getImageArray('./input_image.png'));
    
    // Run
    await runner.run(); 

    // Show the result
    console.log('Output', WebDNN.Math.argmax(probabilities));
}
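
The last step of the example reduces the output vector to a class index. As an illustration of that reduction, here is a minimal Python sketch, assuming the output is a flat sequence of per-class scores. `top_k` is a hypothetical helper, not part of the WebDNN API, and the scores are made up.

```python
def top_k(scores, k=1):
    """Return the indices of the k largest scores, best first."""
    # Sort the indices by their score, highest score first.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Made-up scores for a 5-class model; an ImageNet classifier has 1000 classes.
probabilities = [0.05, 0.10, 0.60, 0.20, 0.05]

print(top_k(probabilities))       # index of the most likely class
print(top_k(probabilities, k=3))  # indices of the three most likely classes
```

The same reduction can be used on the Keras side to check that the converted model and the original model agree on the predicted class for a given image.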

WebDNN also supports Caffemodel and Chainer models.

For more information, please see the documents.

Setup

Please see the documents.

A Docker image is also provided. See docker.

Applications / demos using WebDNN

http://make.girls.moe/#/ - MakeGirls.moe - Create Anime Characters with A.I.!
https://new3rs.github.io/AZ.js/index.html (Japanese ver) - Go AI (JavaScript version of Pyaq)
https://milhidaka.github.io/chainer-image-caption/ - Generating image caption demo
https://github.com/milhidaka/webdnn-exercise - Exercise of basic usage of WebDNN
https://emotionaltrackingsdk.morphcast.com - Emotion recognition from camera by Cynny, see [https://www.morphcast.com] for more info
https://github.com/zaghaghi/nsfw-webdnn - Yahoo Open NSFW inside your browser

[1] Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the International Conference on Learning Representations (ICLR).
[2] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR). https://github.com/KaimingHe/deep-residual-networks
[3] Applications - Keras Documentation
[4] Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the Inception Architecture for Computer Vision. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR).
