August 19, 2019

2102 words 10 mins read

huseinzol05/NLP-Models-Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0


repo name	huseinzol05/NLP-Models-Tensorflow
repo link	https://github.com/huseinzol05/NLP-Models-Tensorflow
homepage
language	Jupyter Notebook
size (curr.)	48095 kB
stars (curr.)	1167
created	2018-04-23
license	MIT License

NLP-Models-Tensorflow, Gathers machine learning and tensorflow deep learning models for NLP problems, code simplify inside Jupyter Notebooks 100%.

Abstractive Summarization
Chatbot
Dependency Parser
Entity Tagging
Extractive Summarization
Generator
Language Detection
Neural Machine Translation
OCR
POS Tagging
Question-Answers
Sentence pairs
Speech-to-Text
Spelling correction
SQUAD Question-Answers
Stemming
Text Augmentation
Text Classification
Text Similarity
Text-to-Speech
Topic Generator
Topic Modeling
Unsupervised Extractive Summarization
Vectorizer
Old-to-Young Vocoder
Visualization
Attention

Objective

Original implementations are quite complex and not really beginner friendly. So I tried to simplify most of it. Also, there are tons of not-yet release papers implementation. So feel free to use it for your own research!

I will attached github repositories for models that I not implemented from scratch, basically I copy, paste and fix those code for deprecated issues.

Tensorflow version

Tensorflow version 1.13 and above only, not included 2.X version. 1.13 < Tensorflow < 2.0

pip install -r requirements.txt

Abstractive Summarization

Trained on India news.

Accuracy based on 10 epochs only, calculated using word positions.

LSTM Seq2Seq using topic modelling, test accuracy 13.22%
LSTM Seq2Seq + Luong Attention using topic modelling, test accuracy 12.39%
LSTM Seq2Seq + Beam Decoder using topic modelling, test accuracy 10.67%
LSTM Bidirectional + Luong Attention + Beam Decoder using topic modelling, test accuracy 8.29%
Pointer-Generator + Bahdanau, https://github.com/xueyouluo/my_seq2seq, test accuracy 15.51%
Copynet, test accuracy 11.15%
Pointer-Generator + Luong, https://github.com/xueyouluo/my_seq2seq, test accuracy 16.51%
Dilated Seq2Seq, test accuracy 10.88%
Dilated Seq2Seq + Self Attention, test accuracy 11.54%
BERT + Dilated CNN Seq2seq, test accuracy 13.5%
self-attention + Pointer-Generator, test accuracy 4.34%
Dilated-CNN Seq2seq + Pointer-Generator, test accuracy 5.57%

Chatbot

Trained on Cornell Movie Dialog corpus, accuracy table in chatbot.

Seq2Seq-manual
Seq2Seq-API Greedy
Bidirectional Seq2Seq-manual
Bidirectional Seq2Seq-API Greedy
Bidirectional Seq2Seq-manual + backward Bahdanau + forward Luong
Bidirectional Seq2Seq-API + backward Bahdanau + forward Luong + Stack Bahdanau Luong Attention + Beam Decoder
Bytenet
Capsule layers + LSTM Seq2Seq-API + Luong Attention + Beam Decoder
End-to-End Memory Network
Attention is All you need
Transformer-XL + LSTM
GPT-2 + LSTM
Tacotron + Beam decoder

Basic cell Seq2Seq-manual
LSTM Seq2Seq-manual
GRU Seq2Seq-manual
Basic cell Seq2Seq-API Greedy
LSTM Seq2Seq-API Greedy
GRU Seq2Seq-API Greedy
Basic cell Bidirectional Seq2Seq-manual
LSTM Bidirectional Seq2Seq-manual
GRU Bidirectional Seq2Seq-manual
Basic cell Bidirectional Seq2Seq-API Greedy
LSTM Bidirectional Seq2Seq-API Greedy
GRU Bidirectional Seq2Seq-API Greedy
Basic cell Seq2Seq-manual + Luong Attention
LSTM Seq2Seq-manual + Luong Attention
GRU Seq2Seq-manual + Luong Attention
Basic cell Seq2Seq-manual + Bahdanau Attention
LSTM Seq2Seq-manual + Bahdanau Attention
GRU Seq2Seq-manual + Bahdanau Attention
LSTM Bidirectional Seq2Seq-manual + Luong Attention
GRU Bidirectional Seq2Seq-manual + Luong Attention
LSTM Bidirectional Seq2Seq-manual + Bahdanau Attention
GRU Bidirectional Seq2Seq-manual + Bahdanau Attention
LSTM Bidirectional Seq2Seq-manual + backward Bahdanau + forward Luong
GRU Bidirectional Seq2Seq-manual + backward Bahdanau + forward Luong
LSTM Seq2Seq-API Greedy + Luong Attention
GRU Seq2Seq-API Greedy + Luong Attention
LSTM Seq2Seq-API Greedy + Bahdanau Attention
GRU Seq2Seq-API Greedy + Bahdanau Attention
LSTM Seq2Seq-API Beam Decoder
GRU Seq2Seq-API Beam Decoder
LSTM Bidirectional Seq2Seq-API + Luong Attention + Beam Decoder
GRU Bidirectional Seq2Seq-API + Luong Attention + Beam Decoder
LSTM Bidirectional Seq2Seq-API + backward Bahdanau + forward Luong + Stack Bahdanau Luong Attention + Beam Decoder
GRU Bidirectional Seq2Seq-API + backward Bahdanau + forward Luong + Stack Bahdanau Luong Attention + Beam Decoder
Bytenet
LSTM Seq2Seq + tf.estimator
Capsule layers + LSTM Seq2Seq-API Greedy
Capsule layers + LSTM Seq2Seq-API + Luong Attention + Beam Decoder
LSTM Bidirectional Seq2Seq-API + backward Bahdanau + forward Luong + Stack Bahdanau Luong Attention + Beam Decoder + Dropout + L2
DNC Seq2Seq
LSTM Bidirectional Seq2Seq-API + Luong Monotic Attention + Beam Decoder
LSTM Bidirectional Seq2Seq-API + Bahdanau Monotic Attention + Beam Decoder
End-to-End Memory Network + Basic cell
End-to-End Memory Network + LSTM cell
Attention is all you need
Transformer-XL
Attention is all you need + Beam Search
Transformer-XL + LSTM
GPT-2 + LSTM
CNN Seq2seq
Conv-Encoder + LSTM
Tacotron + Greedy decoder
Tacotron + Beam decoder
Google NMT

Dependency-Parser

Trained on CONLL English Dependency. Train set to train, dev and test sets to test.

Stackpointer and Biaffine-attention originally from https://github.com/XuezheMax/NeuroNLP2 written in Pytorch.

Accuracy based on arc, types and root accuracies after 15 epochs only.

Bidirectional RNN + CRF + Biaffine, arc accuracy 70.48%, types accuracy 65.18%, root accuracy 66.4%
Bidirectional RNN + Bahdanau + CRF + Biaffine, arc accuracy 70.82%, types accuracy 65.33%, root accuracy 66.77%
Bidirectional RNN + Luong + CRF + Biaffine, arc accuracy 71.22%, types accuracy 65.73%, root accuracy 67.23%
BERT Base + CRF + Biaffine, arc accuracy 64.30%, types accuracy 62.89%, root accuracy 74.19%
Bidirectional RNN + Biaffine Attention + Cross Entropy, arc accuracy 72.42%, types accuracy 63.53%, root accuracy 68.51%
BERT Base + Biaffine Attention + Cross Entropy, arc accuracy 72.85%, types accuracy 67.11%, root accuracy 73.93%
Bidirectional RNN + Stackpointer, arc accuracy 61.88%, types accuracy 48.20%, root accuracy 89.39%
XLNET Base + Biaffine Attention + Cross Entropy, arc accuracy 74.41%, types accuracy 71.37%, root accuracy 73.17%

Entity-Tagging

Trained on CONLL NER.

Bidirectional RNN + CRF, test accuracy 96%
Bidirectional RNN + Luong Attention + CRF, test accuracy 93%
Bidirectional RNN + Bahdanau Attention + CRF, test accuracy 95%
Char Ngrams + Bidirectional RNN + Bahdanau Attention + CRF, test accuracy 96%
Char Ngrams + Bidirectional RNN + Bahdanau Attention + CRF, test accuracy 96%
Char Ngrams + Residual Network + Bahdanau Attention + CRF, test accuracy 69%
Char Ngrams + Attention is you all Need + CRF, test accuracy 90%
BERT, test accuracy 99%
XLNET-Base, test accuracy 99%

Extractive Summarization

Trained on CNN News dataset.

Accuracy based on ROUGE-2.

LSTM RNN, test accuracy 16.13%
Dilated-CNN, test accuracy 15.54%
Multihead Attention, test accuracy 26.33%
BERT-Base

Generator

Trained on Shakespeare dataset.

Character-wise RNN + LSTM
Character-wise RNN + Beam search
Character-wise RNN + LSTM + Embedding
Word-wise RNN + LSTM
Word-wise RNN + LSTM + Embedding
Character-wise + Seq2Seq + GRU
Word-wise + Seq2Seq + GRU
Character-wise RNN + LSTM + Bahdanau Attention
Character-wise RNN + LSTM + Luong Attention
Word-wise + Seq2Seq + GRU + Beam
Character-wise + Seq2Seq + GRU + Bahdanau Attention
Word-wise + Seq2Seq + GRU + Bahdanau Attention
Character-wise Dilated CNN + Beam search
Transformer + Beam search
Transformer XL + Beam search

Language-detection

Trained on Tatoeba dataset.

Fast-text Char N-Grams

Neural Machine Translation

Trained on English-Vietnam, accuracy table in neural-machine-translation.

Bytenet
Capsule layers + LSTM Seq2Seq-API + Luong Attention + Beam Decoder
End-to-End Memory Network
Attention is All you need
Conv Seq2Seq
BERT + Transformer Decoder
XLNET + Transformer Decoder

basic-seq2seq-manual
lstm-seq2seq-manual
gru-seq2seq-manual
basic-seq2seq-api-greedy
lstm-seq2seq-api-greedy
gru-seq2seq-greedy
basic-birnn-seq2seq-manual
lstm-birnn-seq2seq-manual
gru-birnn-seq2seq-manual
basic-birnn-seq2seq-greedy
lstm-birnn-seq2seq-greedy
gru-birnn-seq2seq-greedy
basic-seq2seq-luong
lstm-seq2seq-luong
gru-seq2seq-luong
basic-seq2seq-bahdanau
lstm-seq2seq-bahdanau
gru-seq2seq-bahdanau
basic-birnn-seq2seq-bahdanau
lstm-birnn-seq2seq-bahdanau
gru-birnn-seq2seq-bahdanau
basic-birnn-seq2seq-luong
lstm-birnn-seq2seq-luong
gru-birnn-seq2seq-luong
lstm-seq2seq-contrib-greedy-luong
gru-seq2seq-contrib-greedy-luong
lstm-seq2seq-contrib-greedy-bahdanau
gru-seq2seq-contrib-greedy-bahdanau
lstm-seq2seq-contrib-beam-bahdanau
gru-seq2seq-contrib-beam-bahdanau
lstm-birnn-seq2seq-contrib-beam-luong
gru-birnn-seq2seq-contrib-beam-luong
lstm-birnn-seq2seq-contrib-luong-bahdanau-beam
gru-birnn-seq2seq-contrib-luong-bahdanau-beam
bytenet-greedy
capsule-lstm-seq2seq-contrib-greedy
capsule-gru-seq2seq-contrib-greedy
dnc-seq2seq-bahdanau-greedy
dnc-seq2seq-luong-greedy
lstm-birnn-seq2seq-beam-luongmonotic
lstm-birnn-seq2seq-beam-bahdanaumonotic
memory-network-lstm-seq2seq-contrib
attention-is-all-you-need-beam
conv-seq2seq
conv-encoder-lstm-decoder
dilated-conv-seq2seq
gru-birnn-seq2seq-greedy-residual
google-nmt
bert-transformer-decoder-beam
xlnet-base-transformer-decoder-beam

OCR (optical character recognition)

CNN + LSTM RNN, test accuracy 100%
Im2Latex, test accuracy 100%

POS-Tagging

Trained on CONLL POS.

Bidirectional RNN + CRF, test accuracy 92%
Bidirectional RNN + Luong Attention + CRF, test accuracy 91%
Bidirectional RNN + Bahdanau Attention + CRF, test accuracy 91%
Char Ngrams + Bidirectional RNN + Bahdanau Attention + CRF, test accuracy 91%
Char Ngrams + Bidirectional RNN + Bahdanau Attention + CRF, test accuracy 91%
Char Ngrams + Residual Network + Bahdanau Attention + CRF, test accuracy 3%
Char Ngrams + Attention is you all Need + CRF, test accuracy 89%
BERT, test accuracy 99%

Question-Answers

Trained on bAbI Dataset.

End-to-End Memory Network + Basic cell
End-to-End Memory Network + GRU cell
End-to-End Memory Network + LSTM cell
Dynamic Memory

Sentence-pair

Trained on Cornell Movie–Dialogs Corpus

BERT

Speech to Text

Trained on Toronto speech dataset.

Tacotron, https://github.com/Kyubyong/tacotron_asr, test accuracy 77.09%
BiRNN LSTM, test accuracy 84.66%
BiRNN Seq2Seq + Luong Attention + Cross Entropy, test accuracy 87.86%
BiRNN Seq2Seq + Bahdanau Attention + Cross Entropy, test accuracy 89.28%
BiRNN Seq2Seq + Bahdanau Attention + CTC, test accuracy 86.35%
BiRNN Seq2Seq + Luong Attention + CTC, test accuracy 80.30%
CNN RNN + Bahdanau Attention, test accuracy 80.23%
Dilated CNN RNN, test accuracy 31.60%
Wavenet, test accuracy 75.11%
Deep Speech 2, test accuracy 81.40%
Wav2Vec Transfer learning BiRNN LSTM, test accuracy 83.24%

Spelling correction

BERT-Base
XLNET-Base
BERT-Base Fast
BERT-Base accurate

SQUAD Question-Answers

Trained on SQUAD Dataset.

BERT,

{"exact_match": 77.57805108798486, "f1": 86.18327335287402}

Stemming

Trained on English Lemmatization.

LSTM + Seq2Seq + Beam
GRU + Seq2Seq + Beam
LSTM + BiRNN + Seq2Seq + Beam
GRU + BiRNN + Seq2Seq + Beam
DNC + Seq2Seq + Greedy
BiRNN + Bahdanau + Copynet

Text Augmentation

Pretrained Glove
GRU VAE-seq2seq-beam TF-probability
LSTM VAE-seq2seq-beam TF-probability
GRU VAE-seq2seq-beam + Bahdanau Attention TF-probability
VAE + Deterministic Bahdanau Attention, https://github.com/HareeshBahuleyan/tf-var-attention
VAE + VAE Bahdanau Attention, https://github.com/HareeshBahuleyan/tf-var-attention
BERT-Base + Nucleus Sampling
XLNET-Base + Nucleus Sampling