Tree-Structured Long Short-Term Memory Networks

An implementation of the Tree-LSTM architectures described in the paper Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks by Kai Sheng Tai, Richard Socher, and Christopher Manning.

Requirements

Torch7
penlight
nn
nngraph
optim
Java >= 8 (for Stanford CoreNLP utilities)
Python >= 2.7

The Torch/Lua dependencies can be installed using luarocks. For example:

luarocks install nngraph

Usage

First run the following script:

./fetch_and_preprocess.sh

This downloads the following data:

SICK dataset (semantic relatedness task)
Stanford Sentiment Treebank (sentiment classification task)
Glove word vectors (Common Crawl 840B) -- Warning: this is a 2GB download!

and the following libraries:

The preprocessing script generates dependency parses of the SICK dataset using the Stanford Neural Network Dependency Parser.

Alternatively, the download and preprocessing scripts can be called individually.

Semantic Relatedness

The goal of this task is to predict similarity ratings for pairs of sentences. We train and evaluate our models on the Sentences Involving Compositional Knowledge (SICK) dataset.

To train models for the semantic relatedness prediction task on the SICK dataset, run:

th relatedness/main.lua --model <dependency|constituency|lstm|bilstm> --layers <num_layers> --dim <mem_dim> --epochs <num_epochs>

where:

model: the LSTM variant to train (default: dependency, i.e. the Dependency Tree-LSTM)
layers: the number of layers (default: 1, ignored for Tree-LSTMs)
dim: the LSTM memory dimension (default: 150)
epochs: the number of training epochs (default: 10)

Sentiment Classification

The goal of this task is to predict sentiment labels for sentences. For this task, we use the Stanford Sentiment Treebank dataset. Here, there are two sub-tasks: binary and fine-grained. In the binary sub-task, the sentences are labeled positive or negative. In the fine-grained sub-task, the sentences are labeled very positive, positive, neutral, negative or very negative.

To train models for the sentiment classification task on the Stanford Sentiment Treebank, run:

th sentiment/main.lua --model <constituency|dependency|lstm|bilstm> --layers <num_layers> --dim <mem_dim> --epochs <num_epochs>

This trains a Constituency Tree-LSTM model for the "fine-grained" 5-class classification sub-task.

For the binary classification sub-task, run with the -b or --binary flag, for example:

th sentiment/main.lua -m constituency -b

Predictions are written to the predictions directory and trained model parameters are saved to the trained_models directory.

See the paper for more details on these experiments.

Third-party Implementations

A Tensorflow Fold re-implementation of the Tree-LSTM for sentiment classification

Name	Name	Last commit message	Last commit date
Latest commit kaishengtai Add pointer to third-party reimplementations Jul 30, 2017 6ab39ea · Jul 30, 2017 History 21 Commits
layers	layers	Initial commit	Apr 2, 2015
lib	lib	Produce dependency parses of SST	May 29, 2015
models	models	Added Dependency Tree-LSTM for sentiment.	May 30, 2015
relatedness	relatedness	Added Dependency Tree-LSTM for sentiment.	May 30, 2015
scripts	scripts	Merge remote-tracking branch 'upstream/master'	Dec 24, 2015
sentiment	sentiment	Added Dependency Tree-LSTM for sentiment.	May 30, 2015
util	util	Added Dependency Tree-LSTM for sentiment.	May 30, 2015
.gitignore	.gitignore	Updated link to GloVe vectors	Dec 17, 2015
LICENSE.txt	LICENSE.txt	Initial commit	Apr 2, 2015
README.md	README.md	Add pointer to third-party reimplementations	Jul 30, 2017
fetch_and_preprocess.sh	fetch_and_preprocess.sh	Updated link to GloVe vectors	Dec 17, 2015
init.lua	init.lua	Added Dependency Tree-LSTM for sentiment.	May 30, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tree-Structured Long Short-Term Memory Networks

Requirements

Usage

Semantic Relatedness

Sentiment Classification

Third-party Implementations

About

Releases

Packages

Contributors 2

Languages

License

stanfordnlp/treelstm

Folders and files

Latest commit

History

Repository files navigation

Tree-Structured Long Short-Term Memory Networks

Requirements

Usage

Semantic Relatedness

Sentiment Classification

Third-party Implementations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages