video caption

Introduction

This is a python project for video captioning, using hLSTMat model on the msvd or msr-vtt dataset.

How to use the code?

Data

download msvd dataset
download msr-vtt dataset
extract video feature using https://github.com/Cppowboy/video_feature_extractor

Requirements

python 2.7
tensorflow
tensorboard
numpy
pandas
pickle

Run

First, you need to change the data paths in data_engine.py to your own paths.
Use python train.py to run the train script. use tensorboard --logdir your_log_dir to visualize the train procedure and show the scores.

Reference

https://github.com/zhaoluffy/hLSTMat
https://github.com/yunjey/show-attend-and-tell
Song, Jingkuan, et al. "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning." arXiv preprint arXiv:1706.01231 (2017).

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
data		data
model_hLSTMat		model_hLSTMat
.gitignore		.gitignore
LICENSE		LICENSE
data_engine.py		data_engine.py
readme.md		readme.md
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

video caption

Introduction

How to use the code?

Data

Requirements

Run

Reference

About

Releases

Packages

Languages

License

Cppowboy/video_caption

Folders and files

Latest commit

History

Repository files navigation

video caption

Introduction

How to use the code?

Data

Requirements

Run

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages