zhedongzheng/tensorflow-nlp
Code for Natural Language Processing and Text Generation in TensorFlow 2.x / 1.x
| repo name | zhedongzheng/tensorflow-nlp |
|---|---|
| repo link | https://github.com/zhedongzheng/tensorflow-nlp |
| homepage | |
| language | Jupyter Notebook |
| size (curr.) | 3656 kB |
| stars (curr.) | 1404 |
| created | 2017-03-12 |
| license | MIT License |
- These scripts have been run on Google Colab, which provides free GPUs
Contents

- Natural Language Processing (自然语言处理)
  - IMDB
    - TF-IDF + Logistic Regression
    - FastText
    - Attention
    - Sliced LSTM
  - SNLI
    - DAM
    - MatchPyramid
    - ESIM
    - RE2
- Chatbot (对话机器人)
  - Single-turn (单轮对话)
    - Spoken Language Understanding (对话理解)
      - ATIS
        - RNN Seq2Seq + Attention
        - Transformer
  - Multi-turn (多轮对话)
    - Multi-turn Dialogue Rewriting (多轮对话改写)
      - RNN Seq2Seq + Attention + Dynamic Memory
    - Semantic Parsing for Task Oriented Dialog
      - RNN Seq2Seq + Attention
      - Transformer
    - bAbI
      - Dynamic Memory Network
- Word Extraction
- Text Vectorization
- Word Segmentation
- Knowledge Graph (知识图谱)
  - Knowledge Graph Inference (知识图谱推理)
    - WN18
      - DistMult
      - TuckER
      - ComplEx
- Movielens 1M
  - Fusion
    - Classification
    - Regression
Text Classification
└── finch/tensorflow2/text_classification/imdb
│
├── data
│ └── glove.840B.300d.txt # pretrained embedding, download and put here
│ └── make_data.ipynb # step 1. make data and vocab: train.txt, test.txt, word.txt
│ └── train.txt # incomplete sample, format <label, text> separated by \t
│ └── test.txt # incomplete sample, format <label, text> separated by \t
│ └── train_bt_part1.txt # (back-translated) incomplete sample, format <label, text> separated by \t
│
├── vocab
│ └── word.txt # incomplete sample, list of words in vocabulary
│
└── main
└── attention_linear.ipynb # step 2: train and evaluate model
└── attention_conv.ipynb # step 2: train and evaluate model
└── fasttext_unigram.ipynb # step 2: train and evaluate model
└── fasttext_bigram.ipynb # step 2: train and evaluate model
└── sliced_rnn.ipynb # step 2: train and evaluate model
└── sliced_rnn_bt.ipynb # step 2: train and evaluate model
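
For reference, a minimal sketch (not one of the notebooks above) of the TF-IDF + Logistic Regression baseline listed for this task, reading the tab-separated `<label, text>` files that `make_data.ipynb` produces; the relative paths are assumptions about your local layout.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

def load(path):
    """Read a tab-separated <label, text> file into parallel lists."""
    labels, texts = [], []
    with open(path, encoding='utf-8') as f:
        for line in f:
            if not line.strip():
                continue
            label, text = line.rstrip('\n').split('\t', 1)
            labels.append(label)
            texts.append(text)
    return labels, texts

y_train, x_train = load('../data/train.txt')   # assumed relative paths
y_test, x_test = load('../data/test.txt')

vectorizer = TfidfVectorizer(ngram_range=(1, 2), min_df=2)
x_train_vec = vectorizer.fit_transform(x_train)
x_test_vec = vectorizer.transform(x_test)

clf = LogisticRegression(max_iter=1000)
clf.fit(x_train_vec, y_train)
print('test accuracy:', accuracy_score(y_test, clf.predict(x_test_vec)))
```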
- Task: IMDB
  - Model: TF-IDF + Logistic Regression
  - Model: FastText
  - Model: Feedforward Attention
  - Model: Sliced RNN
    - TensorFlow 2
      - <Notebook> Sliced LSTM + Back-Translation -> 91.7% Testing Accuracy
      - <Notebook> Sliced LSTM + Back-Translation + Char Embedding -> 92.3% Testing Accuracy
      - <Notebook> Sliced LSTM + Back-Translation + Char Embedding + Label Smoothing -> 92.5% Testing Accuracy (label smoothing is sketched after this list)
        - This result (without transfer learning) is higher than CoVe (with transfer learning)
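
Label smoothing, used in the best-scoring notebook above, can be expressed with the stock Keras loss. A minimal sketch; the smoothing factor 0.1 and the toy batch are assumptions, not the repo's exact settings.

```python
import tensorflow as tf

# label smoothing softens the one-hot targets before computing cross-entropy
loss_fn = tf.keras.losses.CategoricalCrossentropy(label_smoothing=0.1)

# toy 2-class (pos/neg) IMDB-style batch: one-hot targets vs predicted probabilities
y_true = tf.constant([[0., 1.], [1., 0.]])
y_pred = tf.constant([[0.2, 0.8], [0.6, 0.4]])
print(loss_fn(y_true, y_pred).numpy())
```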
Text Matching
└── finch/tensorflow2/text_matching/snli
│
├── data
│ └── glove.840B.300d.txt # pretrained embedding, download and put here
│ └── download_data.ipynb # step 1. run this to download snli dataset
│ └── make_data.ipynb # step 2. run this to generate train.txt, test.txt, word.txt
│ └── train.txt # incomplete sample, format <label, text1, text2> separated by \t
│ └── test.txt # incomplete sample, format <label, text1, text2> separated by \t
│
├── vocab
│ └── word.txt # incomplete sample, list of words in vocabulary
│
└── main
└── dam.ipynb # step 3. train and evaluate model
└── esim.ipynb # step 3. train and evaluate model
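
Before the DAM / ESIM notebooks, the tab-separated `<label, text1, text2>` files can be streamed with `tf.data`. A rough sketch under an assumed relative path; tokenisation is only whitespace splitting and vocabulary lookup is omitted.

```python
import tensorflow as tf

def parse(line):
    # each line is "<label>\t<text1>\t<text2>"
    fields = tf.strings.split(line, '\t')
    label, text1, text2 = fields[0], fields[1], fields[2]
    return (tf.strings.split(text1), tf.strings.split(text2)), label

dataset = (tf.data.TextLineDataset('../data/train.txt')   # assumed relative path
             .map(parse)
             .shuffle(10_000))

for (premise, hypothesis), label in dataset.take(1):
    print(premise.numpy(), hypothesis.numpy(), label.numpy())
```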
- Task: SNLI
  - Model: DAM
    - TensorFlow 2
      - <Notebook> DAM -> 85.3% Testing Accuracy
        - The accuracy of this implementation is higher than that of the UCL MR Group (84.6%)
  - Model: Match Pyramid
    - TensorFlow 2
      - <Notebook> Match Pyramid + Multiway Attention -> 87.1% Testing Accuracy
        - The accuracy of this model is 0.3% below ESIM, but it runs roughly twice as fast as ESIM
  - Model: ESIM
    - TensorFlow 2
      - <Notebook> ESIM -> 87.4% Testing Accuracy
        - The accuracy of this implementation is slightly higher than that of the UCL MR Group (87.2%)
  - Model: RE2
Topic Modelling
- Data: Some Book Titles
  - Model: TF-IDF + LDA
    - PySpark
    - Sklearn + pyLDAvis (see the sketch after this list)
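
A minimal sketch in the spirit of the "TF-IDF + LDA" Sklearn notebook; the book titles below are placeholder data, and pyLDAvis rendering is left out.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import LatentDirichletAllocation

titles = ["machine learning in action", "deep learning with python",
          "the art of cooking", "french cooking at home"]

# TF-IDF features feeding LDA, then print the top terms per topic
vec = TfidfVectorizer()
X = vec.fit_transform(titles)

lda = LatentDirichletAllocation(n_components=2, random_state=0)
doc_topics = lda.fit_transform(X)

terms = vec.get_feature_names_out()
for k, topic in enumerate(lda.components_):
    top = topic.argsort()[-3:][::-1]
    print('topic', k, [terms[i] for i in top])
```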
Spoken Language Understanding
└── finch/tensorflow2/spoken_language_understanding/atis
│
├── data
│ └── glove.840B.300d.txt # pretrained embedding, download and put here
│ └── make_data.ipynb # step 1. run this to generate vocab: word.txt, intent.txt, slot.txt
│ └── atis.train.w-intent.iob # incomplete sample, format <text, slot, intent>
│ └── atis.test.w-intent.iob # incomplete sample, format <text, slot, intent>
│
├── vocab
│ └── word.txt # list of words in vocabulary
│ └── intent.txt # list of intents in vocabulary
│ └── slot.txt # list of slots in vocabulary
│
└── main
└── bigru.ipynb # step 2. train and evaluate model
└── bigru_self_attn.ipynb # step 2. train and evaluate model
└── transformer.ipynb # step 2. train and evaluate model
└── transformer_elu.ipynb # step 2. train and evaluate model
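
A rough sketch (layer sizes and vocabulary counts are placeholders, not the real ATIS numbers) of the joint idea behind `bigru.ipynb`: a bi-directional GRU encoder whose per-token states feed a slot tagger and whose pooled state feeds an intent classifier.

```python
import tensorflow as tf

VOCAB_SIZE, N_INTENTS, N_SLOTS = 10_000, 25, 130   # placeholder sizes

words = tf.keras.Input(shape=(None,), dtype='int32')                       # token ids
x = tf.keras.layers.Embedding(VOCAB_SIZE, 300)(words)                      # GloVe-sized vectors
h = tf.keras.layers.Bidirectional(
        tf.keras.layers.GRU(128, return_sequences=True))(x)                # contextual states

slot_logits = tf.keras.layers.Dense(N_SLOTS, name='slots')(h)              # one tag per token
pooled = tf.keras.layers.GlobalMaxPooling1D()(h)                           # utterance summary
intent_logits = tf.keras.layers.Dense(N_INTENTS, name='intent')(pooled)    # one intent per utterance

model = tf.keras.Model(words, [slot_logits, intent_logits])
model.compile(
    optimizer='adam',
    loss={'slots': tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
          'intent': tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)})
model.summary()
```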
- Task: ATIS (the micro-F1 metric is sketched after this list)
  - Model: Bi-directional RNN
    - TensorFlow 2
      - 97.8% Intent Micro-F1, 95.5% Slot Micro-F1 on Testing Data
    - TensorFlow 1
      - 97.2% Intent Micro-F1, 95.7% Slot Micro-F1 on Testing Data
  - Model: Transformer
    - TensorFlow 2
      - 97.5% Intent Micro-F1, 94.9% Slot Micro-F1 on Testing Data
      - <Notebook> Transformer + ELU activation -> 97.2% Intent Micro-F1, 95.5% Slot Micro-F1 on Testing Data
      - <Notebook> Bi-GRU + Transformer -> 97.7% Intent Micro-F1, 95.8% Slot Micro-F1 on Testing Data
  - Model: ELMO Embedding
    - TensorFlow 1
      - <Notebook> ELMO (the first LSTM hidden state) + Bi-GRU -> 97.6% Intent Micro-F1, 96.2% Slot Micro-F1 on Testing Data
      - <Notebook> ELMO (weighted sum of 3 layers) + Bi-GRU -> 97.6% Intent Micro-F1, 96.1% Slot Micro-F1 on Testing Data
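
The Intent / Slot Micro-F1 numbers above are the kind of score scikit-learn's `f1_score` computes with `average='micro'`; the arrays below are placeholders, not the actual ATIS predictions.

```python
from sklearn.metrics import f1_score

# one intent label per utterance
intent_true = ['flight', 'airfare', 'flight']
intent_pred = ['flight', 'flight', 'flight']
print('intent micro-F1:', f1_score(intent_true, intent_pred, average='micro'))

# slot tags are flattened across all tokens before scoring
slot_true = ['O', 'B-fromloc', 'O', 'B-toloc']
slot_pred = ['O', 'B-fromloc', 'O', 'O']
print('slot micro-F1:', f1_score(slot_true, slot_pred, average='micro'))
```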
Generative Dialog
└── finch/tensorflow1/free_chat/chinese_gaoq1
│
├── data
│ └── make_data.ipynb # step 1. run this to generate vocab {char.txt} and data {reduce.txt & core.txt}
│
├── vocab
│ └── char.txt # list of chars in vocabulary for chinese
│ └── cc.zh.300.vec # fastText pretrained embedding downloaded from external
│ └── char.npy # chinese characters and their embedding values (300 dim)
│
└── main
└── lstm_seq2seq_train.ipynb # step 2. train and evaluate model
└── lstm_seq2seq_export.ipynb # step 3. export trained tf model
└── lstm_seq2seq_predict.ipynb # step 4. end-to-end inference
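
One plausible way (an assumption, not necessarily what `make_data.ipynb` does) to produce `char.npy` from `char.txt` and `cc.zh.300.vec`, relying on the standard fastText `.vec` text format.

```python
import numpy as np

chars = [line.rstrip('\n') for line in open('../vocab/char.txt', encoding='utf-8')]

# fastText .vec files start with a "<count> <dim>" header, then "<token> <300 floats>" per line
vectors = {}
with open('../vocab/cc.zh.300.vec', encoding='utf-8') as f:
    next(f)
    for line in f:
        parts = line.rstrip().split(' ')
        vectors[parts[0]] = np.asarray(parts[1:], dtype='float32')

# characters missing from fastText fall back to small random vectors
embedding = np.stack([
    vectors.get(c, np.random.normal(scale=0.1, size=300).astype('float32'))
    for c in chars])
np.save('../vocab/char.npy', embedding)
print(embedding.shape)   # (len(chars), 300)
```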
- Task: Chinese Free Chat (the two metrics are sketched after this list)
  - Data
  - Model: RNN Seq2Seq + Attention
    - TensorFlow 1
      - LSTM + Attention + Beam Search -> 28.6 Perplexity & 10.5 BLEU-2
  - Model: Transformer
    - TensorFlow 1
      - Transformer (6 Layers, 8 Heads) -> 29.4 Perplexity & 12.1 BLEU-2
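
For context on the metrics above: perplexity is the exponential of the mean per-token cross-entropy, and BLEU-2 can be computed with NLTK at the character level for Chinese. The numbers and sentences below are placeholders.

```python
import math
from nltk.translate.bleu_score import sentence_bleu

mean_nll = 3.35                      # placeholder: average negative log-likelihood per token
print('perplexity:', math.exp(mean_nll))

# character-level BLEU-2 (weights over 1-grams and 2-grams only)
reference = [list('今天天气不错')]
hypothesis = list('今天天气很好')
print('BLEU-2:', sentence_bleu(reference, hypothesis, weights=(0.5, 0.5)))
```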
Semantic Parsing
└── finch/tensorflow1/semantic_parsing/tree_slu
│
├── data
│ └── glove.840B.300d.txt # pretrained embedding, download and put here
│ └── make_data.ipynb # step 1. run this to generate vocab: word.txt, intent.txt, slot.txt
│ └── train.tsv # incomplete sample, format <text, tokenized_text, tree>
│ └── test.tsv # incomplete sample, format <text, tokenized_text, tree>
│
├── vocab
│ └── source.txt # list of words in vocabulary for source (of seq2seq)
│ └── target.txt # list of words in vocabulary for target (of seq2seq)
│
└── main
└── lstm_transformer.ipynb # step 2. train and evaluate model
└── lstm_seq2seq_multi_attn.ipynb # step 2. train and evaluate model
- Task: Semantic Parsing for Task Oriented Dialog (exact match is sketched after this list)
  - Model: RNN Seq2Seq + Attention
    - TensorFlow 2
      - <Notebook> LSTM + Attention + Beam Search -> 72.4% Exact Match Accuracy on Testing Data
    - TensorFlow 1
      - <Notebook> ELMO + LSTM + Attention + Beam Search + Label Smoothing -> 74.8% Exact Match Accuracy on Testing Data
  - Model: Transformer
    - TensorFlow 1 + Texar
      - <Notebook> ELMO + Transformer + Beam Search + Label Smoothing -> 73.3% Exact Match Accuracy on Testing Data
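
A small sketch of the exact-match metric reported above: a prediction counts only if the generated tree string equals the reference exactly. The bracketed trees are illustrative strings, not samples from the dataset.

```python
def exact_match(preds, refs):
    """Fraction of predictions that match the reference string exactly."""
    return sum(p == r for p, r in zip(preds, refs)) / len(refs)

preds = ['[IN:GET_WEATHER [SL:LOCATION boston ] ]',
         '[IN:GET_EVENT concerts ]']
refs = ['[IN:GET_WEATHER [SL:LOCATION boston ] ]',
        '[IN:GET_EVENT [SL:CATEGORY_EVENT concerts ] ]']
print(exact_match(preds, refs))   # 0.5
```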
Knowledge Graph Inference
└── finch/tensorflow2/knowledge_graph_completion/wn18
│
├── data
│ └── download_data.ipynb # step 1. run this to download wn18 dataset
│ └── make_data.ipynb # step 2. run this to generate vocabulary: entity.txt, relation.txt
│ └── wn18 # wn18 folder (will be auto created by download_data.ipynb)
│ └── train.txt # incomplete sample, format <entity1, relation, entity2> separated by \t
│ └── valid.txt # incomplete sample, format <entity1, relation, entity2> separated by \t
│ └── test.txt # incomplete sample, format <entity1, relation, entity2> separated by \t
│
├── vocab
│ └── entity.txt # incomplete sample, list of entities in vocabulary
│ └── relation.txt # incomplete sample, list of relations in vocabulary
│
└── main
└── distmult_1-N.ipynb # step 3. train and evaluate model
- Task: WN18
  - We use 1-N fast evaluation to greatly speed up the evaluation process (sketched after this list)
  - MRR: Mean Reciprocal Rank
  - Model: DistMult
    - TensorFlow 2
    - TensorFlow 1
  - Model: TuckER
    - TensorFlow 2
  - Model: ComplEx
    - TensorFlow 2
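
A minimal NumPy sketch of DistMult scoring with the 1-N trick and the MRR metric mentioned above; the embeddings and ids are random placeholders, not trained values from the notebooks.

```python
import numpy as np

n_entities, n_relations, dim = 40943, 18, 200      # WN18-sized counts, dim is a placeholder
rng = np.random.default_rng(0)
E = rng.normal(size=(n_entities, dim)).astype('float32')    # entity embeddings
R = rng.normal(size=(n_relations, dim)).astype('float32')   # relation embeddings

def score_all_objects(s, r):
    # DistMult scores <e_s, w_r, e_o>; the 1-N form scores every candidate
    # object with a single matrix product instead of looping over entities.
    return (E[s] * R[r]) @ E.T              # shape: (n_entities,)

def mrr(ranks):
    # Mean Reciprocal Rank over the ranks assigned to the true entities
    return float(np.mean(1.0 / np.asarray(ranks, dtype='float64')))

scores = score_all_objects(s=0, r=3)
true_o = 42                                 # placeholder gold object id
rank = 1 + int((scores > scores[true_o]).sum())
print(rank, mrr([rank]))
```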
Knowledge Graph Construction
- Data Scraping
- SPARQL
- Neo4j + Cypher
Question Answering
└── finch/tensorflow1/question_answering/babi
│
├── data
│ └── make_data.ipynb # step 1. run this to generate vocabulary: word.txt
│ └── qa5_three-arg-relations_train.txt # one complete example of babi dataset
│ └── qa5_three-arg-relations_test.txt # one complete example of babi dataset
│
├── vocab
│ └── word.txt # complete list of words in vocabulary
│
└── main
└── dmn_train.ipynb
└── dmn_serve.ipynb
└── attn_gru_cell.py
- Task: bAbI
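
A sketch (not the repo's `make_data.ipynb`) of parsing the bAbI text format used by the two data files above: each line is `<id> <sentence>`, stories restart at id 1, and question lines append `\t<answer>\t<supporting fact ids>`.

```python
def parse_babi(path):
    """Return (story sentences, question, answer, supporting ids) tuples."""
    samples, story = [], []
    with open(path) as f:
        for line in f:
            idx, text = line.rstrip('\n').split(' ', 1)
            if int(idx) == 1:
                story = []                         # a new story begins
            if '\t' in text:                       # question line
                question, answer, supports = text.split('\t')
                samples.append((list(story), question.strip(), answer, supports))
            else:
                story.append(text)
    return samples

samples = parse_babi('../data/qa5_three-arg-relations_train.txt')  # assumed relative path
print(samples[0])
```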
Text Transformation
- Word Extraction
  - Chinese
- Text Vectorization
  - Chinese
- Word Segmentation
  - Chinese
  - Custom TensorFlow Op added by applenob
Recommender System
└── finch/tensorflow1/recommender/movielens
│
├── data
│ └── make_data.ipynb # run this to generate vocabulary
│
├── vocab
│ └── user_job.txt
│ └── user_id.txt
│ └── user_gender.txt
│ └── user_age.txt
│ └── movie_types.txt
│ └── movie_title.txt
│ └── movie_id.txt
│
└── main
└── dnn_softmax.ipynb
└── dnn_mse.ipynb
- Task: Movielens 1M
  - Model: Fusion (sketched after this list)
    - TensorFlow 1
      - MAE: Mean Absolute Error
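
A rough sketch of the fusion idea behind `dnn_mse.ipynb`: embed each categorical id, concatenate user and movie features, and regress the rating with MSE (swap the head for a softmax over rating classes to get the `dnn_softmax.ipynb` variant). The vocabulary sizes are placeholders, and the multi-valued `movie_types` / `movie_title` features are omitted for brevity.

```python
import tensorflow as tf

# placeholder vocabulary sizes for the id files listed above
FEATURES = {'user_id': 6041, 'user_gender': 2, 'user_age': 7,
            'user_job': 21, 'movie_id': 3953}

inputs, embedded = [], []
for name, vocab_size in FEATURES.items():
    inp = tf.keras.Input(shape=(), dtype='int32', name=name)
    emb = tf.keras.layers.Embedding(vocab_size, 32)(inp)   # (batch, 32)
    inputs.append(inp)
    embedded.append(emb)

x = tf.keras.layers.Concatenate()(embedded)                # fuse user + movie features
x = tf.keras.layers.Dense(256, activation='relu')(x)
rating = tf.keras.layers.Dense(1)(x)                       # regression head

model = tf.keras.Model(inputs, rating)
model.compile(optimizer='adam', loss='mse', metrics=['mae'])
model.summary()
```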
Multi-turn Dialogue Rewriting
└── finch/tensorflow1/multi_turn_rewrite/chinese/
│
├── data
│ └── make_data.ipynb # run this to generate vocab, split train & test data, make pretrained embedding
│
├── vocab
│ └── cc.zh.300.vec # fastText pretrained embedding downloaded from external
│ └── char.npy # chinese characters and their embedding values (300 dim)
│ └── char.txt # list of chinese characters used in this project
│
└── main
└── baseline_lstm_train.ipynb
└── baseline_lstm_export.ipynb
└── baseline_lstm_predict.ipynb
- Task: Chinese Multi-turn Dialogue Rewriting
  - Model: RNN Seq2Seq + Attention + Dynamic Memory
    - TensorFlow 1
      - <Notebook> LSTM + Attention + Memory + Beam Search -> BLEU-1: 95.0, BLEU-2: 89.4, BLEU-4: 79.0, EM: 56.7%