JST (Joint Sentiment Topic Model)

This is a Java implementation of the Joint Sentiment Topic (JST) model. JST can be used for sentiment analysis and emotion detection.

Usage

Method 1: Build a jar with jst.core.JST as the main class, then run the jar.

Method 2: Run jst.core.Run.

Extracted Topics

Topics were extracted from a Chinese news dataset. Check the .stwords file in the model directory to see all of the topics.

Lexicon

The format of the sentiment or emotion lexicon file is as follows:

S senti_name_1 senti_name_2 ... senti_name_S

token_1 token_sentiment_distribution_1

token_2 token_sentiment_distribution_2

.

token_m token_sentiment_distribution_m

where S is the number of sentiments and each token sentiment distribution is an S-dimensional vector whose components are separated by blanks.

See lexicon.txt in the data directory for an example.
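
To make the lexicon layout concrete, here is a minimal sketch of a reader for it. The class LexiconSketch and its parse method are hypothetical and are not part of the jst package. The sketch assumes that fields are separated by whitespace and that blank lines can be ignored.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not part of the jst package): reads the lexicon layout
// "S name_1 ... name_S" followed by one "token v_1 ... v_S" line per token.
final class LexiconSketch {
    final List<String> sentimentNames = new ArrayList<>();
    final Map<String, double[]> distributions = new LinkedHashMap<>();

    static LexiconSketch parse(List<String> lines) {
        LexiconSketch lexicon = new LexiconSketch();
        int s = -1; // number of sentiments, unknown until the header is read
        for (String raw : lines) {
            String line = raw.trim();
            if (line.isEmpty()) {
                continue; // assumption: blank lines carry no meaning
            }
            String[] parts = line.split("\\s+");
            if (s < 0) {
                // Header line: S followed by the S sentiment names.
                s = Integer.parseInt(parts[0]);
                if (parts.length != s + 1) {
                    throw new IllegalArgumentException("header must list " + s + " names: " + line);
                }
                lexicon.sentimentNames.addAll(Arrays.asList(parts).subList(1, parts.length));
                continue;
            }
            // Token line: the token followed by its S-dimensional distribution.
            if (parts.length != s + 1) {
                throw new IllegalArgumentException("expected " + s + " values: " + line);
            }
            double[] dist = new double[s];
            for (int i = 0; i < s; i++) {
                dist[i] = Double.parseDouble(parts[i + 1]);
            }
            lexicon.distributions.put(parts[0], dist);
        }
        if (s < 0) {
            throw new IllegalArgumentException("lexicon has no header line");
        }
        return lexicon;
    }

    public static void main(String[] args) {
        // Made-up entries for illustration only; they are not taken from lexicon.txt.
        List<String> demo = Arrays.asList("2 positive negative", "good 0.9 0.1", "bad 0.2 0.8");
        LexiconSketch lexicon = parse(demo);
        System.out.println(lexicon.sentimentNames + " " + lexicon.distributions.keySet());
    }
}
```

Checking the width of every token line against S catches a truncated distribution at load time, before it can reach training.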

Data Format

N

doc_sentiment_distribution_1#word_1 word_2 ... word_d1

doc_sentiment_distribution_2#word_1 word_2 ... word_d2

.

doc_sentiment_distribution_N#word_1 word_2 ... word_dN

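where N is the number of documents and each document sentiment distribution is an S-dimensional vector whose components are separated by blanks. The # character separates the distribution from the words of the document.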
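
The data layout can be read with a few lines of Java. The sketch below is illustrative only: CorpusSketch and Doc are hypothetical names that are not part of the jst package. It assumes whitespace-separated fields and that blank lines can be skipped.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not part of the jst package): reads a data file whose
// first line is N and whose remaining lines are "v_1 ... v_S#word_1 ... word_d".
final class CorpusSketch {
    static final class Doc {
        final double[] sentiment;
        final List<String> words;

        Doc(double[] sentiment, List<String> words) {
            this.sentiment = sentiment;
            this.words = words;
        }
    }

    static List<Doc> parse(List<String> lines) {
        List<Doc> docs = new ArrayList<>();
        int expected = -1; // N, unknown until the first non-blank line is read
        for (String raw : lines) {
            String line = raw.trim();
            if (line.isEmpty()) {
                continue; // assumption: blank lines carry no meaning
            }
            if (expected < 0) {
                expected = Integer.parseInt(line);
                continue;
            }
            int hash = line.indexOf('#');
            if (hash < 0) {
                throw new IllegalArgumentException("missing '#': " + line);
            }
            // Left of '#': the document sentiment distribution.
            String[] nums = line.substring(0, hash).trim().split("\\s+");
            double[] sentiment = new double[nums.length];
            for (int i = 0; i < nums.length; i++) {
                sentiment[i] = Double.parseDouble(nums[i]);
            }
            // Right of '#': the already segmented or tokenized words.
            String body = line.substring(hash + 1).trim();
            List<String> words = body.isEmpty()
                    ? new ArrayList<String>()
                    : Arrays.asList(body.split("\\s+"));
            docs.add(new Doc(sentiment, words));
        }
        if (docs.size() != expected) {
            throw new IllegalArgumentException("header says " + expected + " documents, found " + docs.size());
        }
        return docs;
    }

    public static void main(String[] args) {
        // Made-up documents for illustration only.
        List<Doc> docs = parse(Arrays.asList("2", "0.7 0.3#great movie", "0.1 0.9#boring plot twist"));
        System.out.println(docs.size() + " documents, first has " + docs.get(0).words.size() + " words");
    }
}
```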

Demo

A demo Chinese dataset is provided in the data directory. As a preprocessing step, Chinese text must be segmented into words and English text must be tokenized. Run jst.core.Run to train a new model.
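
For English input, the preprocessing step could look like the sketch below. PreprocessSketch is a hypothetical class, not part of the jst package, and its tokenization rule (lowercase, then split on anything other than letters, digits, and apostrophes) is an assumption chosen for brevity. Chinese text needs a proper word segmenter instead. The toDataLine method writes one document in the distribution#words layout described under Data Format.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical preprocessing helper (not part of the jst package).
final class PreprocessSketch {
    // Naive English tokenizer: lowercase, then split on every run of characters
    // that is not a letter, digit, or apostrophe. This rule is an assumption.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9']+")) {
            if (!t.isEmpty()) {
                tokens.add(t); // skip the empty piece produced by a leading delimiter
            }
        }
        return tokens;
    }

    // Formats one document as "v_1 ... v_S#word_1 ... word_d".
    static String toDataLine(double[] sentiment, String text) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < sentiment.length; i++) {
            if (i > 0) {
                sb.append(' ');
            }
            sb.append(sentiment[i]);
        }
        sb.append('#').append(String.join(" ", tokenize(text)));
        return sb.toString();
    }

    public static void main(String[] args) {
        // Made-up document and distribution for illustration only.
        System.out.println(toDataLine(new double[] {0.8, 0.2}, "The plot was great, really!"));
    }
}
```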

Author

zhikai.zhang
Email [email protected]
Blog http://zhikaizhang.cn
