XiaohangZhan/deocclusion
Code for our CVPR 2020 work.
repo name | XiaohangZhan/deocclusion
repo link | https://github.com/XiaohangZhan/deocclusion
language | Python
size (curr.) | 30205 kB
stars (curr.) | 334
created | 2020-03-10
license | Apache License 2.0
Paper
Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy, “Self-Supervised Scene De-occlusion”, accepted to CVPR 2020 as an Oral Paper. [Project page].
For further information, please contact Xiaohang Zhan.
Demo Video
- Watch the full demo video on YouTube or bilibili. The demo video contains vivid explanations of the idea and interesting applications.
- Below is an application of scene de-occlusion: image manipulation.
Requirements
- pytorch>=0.4.1
- pip install -r requirements.txt
Data Preparation
COCOA dataset proposed in Semantic Amodal Segmentation.
- Download COCO2014 train and val images from here and unzip them.
- Download COCOA annotations from here and untar them.
- Ensure the COCOA folder looks like:

```
COCOA/
  |-- train2014/
  |-- val2014/
  |-- annotations/
    |-- COCO_amodal_train2014.json
    |-- COCO_amodal_val2014.json
    |-- COCO_amodal_test2014.json
    |-- ...
```

- Create a symbolic link:

```shell
cd deocclusion
mkdir data
cd data
ln -s /path/to/COCOA
```
KINS dataset proposed in Amodal Instance Segmentation with KINS Dataset.
- Download the left color images of object data in the KITTI dataset from here and unzip them.
- Download KINS annotations from here, corresponding to this commit.
- Ensure the KINS folder looks like:

```
KINS/
  |-- training/image_2/
  |-- testing/image_2/
  |-- instances_train.json
  |-- instances_val.json
```

- Create a symbolic link:

```shell
cd deocclusion/data
ln -s /path/to/KINS
```
LVIS dataset
- Download the training and validation sets from here.
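
With all three datasets linked, it can help to verify the layout before training. Below is a minimal, hedged sanity-check sketch; it is not part of this repository, and the paths simply mirror the folder trees listed above (LVIS is omitted because its layout is not spelled out here). Run it from the deocclusion root:

```python
import os

# Illustrative sanity check for the data/ layout described above.
# Not part of the deocclusion repository; the paths mirror the trees
# shown in this README (LVIS is skipped since its layout is not given).
EXPECTED = [
    "COCOA/train2014",
    "COCOA/val2014",
    "COCOA/annotations/COCO_amodal_train2014.json",
    "COCOA/annotations/COCO_amodal_val2014.json",
    "COCOA/annotations/COCO_amodal_test2014.json",
    "KINS/training/image_2",
    "KINS/testing/image_2",
    "KINS/instances_train.json",
    "KINS/instances_val.json",
]

missing = [p for p in EXPECTED if not os.path.exists(os.path.join("data", p))]
if missing:
    print("missing entries:", *missing, sep="\n  ")
else:
    print("data layout looks complete")
```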
Run Demos
- Download the released models here and put the released folder under deocclusion.
- Run demos/demo_cocoa.ipynb or demos/demo_kins.ipynb.
Train
Train PCNet-M
- Train (taking COCOA for example):

```shell
sh experiments/COCOA/pcnet_m/train.sh
```

- Monitor training status and visual results using tensorboard:

```shell
sh tensorboard.sh $PORT
```
Train PCNet-C
- Download the pre-trained image inpainting model (trained with partial convolution) here and save it to pretrains/partialconv.pth.
- Convert the model to accept 4-channel inputs, as sketched below:

```shell
python tools/convert_pcnetc_pretrain.py
```
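
For intuition: adapting an RGB inpainting network to a 4-channel input typically means widening the first convolution's weight tensor and zero-initializing the extra input slice, so the pre-trained behavior is preserved at initialization. The sketch below only illustrates that idea; the flat checkpoint structure and the key name first_conv.weight are hypothetical, and tools/convert_pcnetc_pretrain.py remains the authoritative implementation:

```python
import torch

# Hedged sketch: widen a conv weight from 3 to 4 input channels by
# appending a zero-initialized slice, leaving the RGB weights intact.
# Assumes a flat state dict and a hypothetical key "first_conv.weight";
# the real conversion is done by tools/convert_pcnetc_pretrain.py.
state = torch.load("pretrains/partialconv.pth", map_location="cpu")

w = state["first_conv.weight"]            # shape: (out_ch, 3, kH, kW)
zeros = torch.zeros(w.size(0), 1, w.size(2), w.size(3), dtype=w.dtype)
state["first_conv.weight"] = torch.cat([w, zeros], dim=1)  # (out_ch, 4, kH, kW)

torch.save(state, "pretrains/partialconv_input4.pth")
```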
- Train (taking COCOA for example):

```shell
sh experiments/COCOA/pcnet_c/train.sh
```

- Monitor training status and visual results using tensorboard:

```shell
sh tensorboard.sh $PORT
```
Evaluate
- Execute:

```shell
sh tools/test_cocoa.sh
```
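
The test script covers the full evaluation protocol; for reference, amodal-mask quality is commonly summarized with intersection-over-union between predicted and ground-truth masks. A self-contained illustration of that metric (not the repo's evaluation code):

```python
import numpy as np

def mask_iou(pred, gt):
    """IoU between two binary masks of shape (H, W)."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:                     # both masks empty
        return 1.0
    return np.logical_and(pred, gt).sum() / union

# Toy example: two overlapping 4x4 squares on an 8x8 canvas.
a = np.zeros((8, 8), dtype=np.uint8); a[2:6, 2:6] = 1
b = np.zeros((8, 8), dtype=np.uint8); b[3:7, 3:7] = 1
print(round(mask_iou(a, b), 3))        # 9 / 23 -> 0.391
```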
Bibtex
@inproceedings{zhan2020self,
  author = {Zhan, Xiaohang and Pan, Xingang and Dai, Bo and Liu, Ziwei and Lin, Dahua and Loy, Chen Change},
  title = {Self-Supervised Scene De-occlusion},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2020}
}
Acknowledgement
We used the code and models of GCA-Matting in our demo.