BioPre is a supervised neural network model with bag-of-words embeddings. It predicts the entity mentions in the body of a scientific article from the corresponding metadata, including the abstract and author information.
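To make the idea concrete, here is a minimal sketch of a bag-of-words network that maps the entities of an abstract to per-entity probabilities for the body. It is not the code in train.py: the class name TinyBowNet, the single tanh hidden layer, the sigmoid outputs, the learning rate and the toy data are all assumptions made for illustration, and author information is left out.

```python
import numpy as np


def bag_of_words(entity_ids, vocab_size):
    """Encode a list of entity ids as a multi-hot vector of length vocab_size."""
    vec = np.zeros(vocab_size, dtype=np.float32)
    vec[list(entity_ids)] = 1.0
    return vec


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class TinyBowNet:
    """Hypothetical one-hidden-layer net: abstract multi-hot -> body entity probabilities."""

    def __init__(self, vocab_size, hidden=16, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (vocab_size, hidden))
        self.b1 = np.zeros(hidden)
        self.W2 = rng.normal(0.0, 0.1, (hidden, vocab_size))
        self.b2 = np.zeros(vocab_size)

    def forward(self, x):
        h = np.tanh(x @ self.W1 + self.b1)
        return h, sigmoid(h @ self.W2 + self.b2)

    def train_step(self, x, y, lr=0.1):
        """One gradient-descent step on binary cross-entropy; returns the mean loss."""
        h, p = self.forward(x)
        # Gradient of BCE (summed over entities, averaged over the batch) w.r.t. the logits.
        d_out = (p - y) / x.shape[0]
        # Back-propagate through the tanh hidden layer before touching the weights.
        d_h = (d_out @ self.W2.T) * (1.0 - h ** 2)
        self.W2 -= lr * (h.T @ d_out)
        self.b2 -= lr * d_out.sum(axis=0)
        self.W1 -= lr * (x.T @ d_h)
        self.b1 -= lr * d_h.sum(axis=0)
        eps = 1e-9
        return float(-np.mean(y * np.log(p + eps) + (1.0 - y) * np.log(1.0 - p + eps)))


# Toy data (made up): two abstracts and the entities found in their bodies.
X = np.stack([bag_of_words([0, 1], 6), bag_of_words([2, 3], 6)])
Y = np.stack([bag_of_words([1, 4], 6), bag_of_words([3, 5], 6)])

net = TinyBowNet(vocab_size=6)
losses = [net.train_step(X, Y) for _ in range(300)]
print("first loss:", losses[0], "last loss:", losses[-1])
```

Treating the task as multi-label prediction, with one sigmoid per vocabulary entry, lets the network propose any number of body entities for one abstract.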

You can use:

train.py

to train a model on abstract-body pairs (author information is optional).

Run "train.py -h" to check the usage.

You can use:

predict.py

to predict entity mentions in the body of an article with a model trained using train.py.

Run "predict.py -h" to check the usage.

All articles should be annotated as entity lists in *.csv format, and all vocabularies should be in *.json format.
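The exact file layouts are not documented here, so the following is only a sketch of how such inputs could be turned into a bag-of-words vector. It assumes a CSV with one entity identifier per row in the first column, and a JSON vocabulary that maps each entity identifier to a contiguous integer index starting at 0. The function names and the sample entities are hypothetical. Check the files under data_sample for the real format.

```python
import csv
import io
import json


def load_vocabulary(fp):
    """Load a JSON vocabulary, assumed to map entity id -> integer index."""
    return json.load(fp)


def read_entity_list(fp):
    """Read an entity list, assumed to hold one entity id per CSV row (first column)."""
    return [row[0].strip() for row in csv.reader(fp) if row and row[0].strip()]


def to_multi_hot(entities, vocab):
    """Build a multi-hot vector; entities missing from the vocabulary are skipped."""
    vec = [0] * len(vocab)
    for entity in entities:
        idx = vocab.get(entity)
        if idx is not None:
            vec[idx] = 1
    return vec


# In-memory stand-ins for a *.json vocabulary and a *.csv entity list (made-up content).
vocab = load_vocabulary(io.StringIO('{"BRCA1": 0, "TP53": 1, "EGFR": 2}'))
abstract_entities = read_entity_list(io.StringIO("TP53\nBRCA1\nUNKNOWN_GENE\n"))
print(to_multi_hot(abstract_entities, vocab))  # prints [1, 1, 0]
```

With real files, the io.StringIO objects would be replaced by open(path, newline="") for the CSV and open(path) for the JSON.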

Failed attempts are in the failed_attempts folder, which contains a binary classifier, an LSTM model, and some scripts for making up data.

Having problems? Look into the code yourself!

Still stuck, or want the training data or the optimized models? Contact me at zhengyl940425@gmail.com!

Latest commit: NiMaZi, "1" (84873fb), May 25, 2018; 888 commits in history

Name	Last commit message	Last commit date
data_sample	new json	May 25, 2018
failed_attempts	failed	May 25, 2018
models	remove	May 25, 2018
utils	load with util	May 25, 2018
.gitignore	with data sample	May 25, 2018
README.md	1	May 25, 2018
predict.py	author net predict	May 25, 2018
train.py	load with util	May 25, 2018

NiMaZi/BioPre

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages