YJ Captions 26k Dataset

We have developed a Japanese version of the MS COCO caption dataset, which we call YJ Captions 26k Dataset. It is created to facilitate the development of image captioning in Japanese language. Each Japanese caption describes the specified image provided in MS COCO dataset and each image has 5 captions.

Annotation Format

The annotations are stored using the JSON file format. The annotation scheme is the same as that of MS COCO. Please see the section on Image Caption Annotations.

License

Creative Commons Attribution 4.0 License

Citation

@InProceedings{P16-1168,
  author = "Miyazaki, Takashi and Shimizu, Nobuyuki",
  title = "Cross-Lingual Image Caption Generation",
  booktitle = "Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
  year = "2016",
  publisher = "Association for Computational Linguistics",
  pages = "1780--1790",
  location = "Berlin, Germany",
  doi = "10.18653/v1/P16-1168",
  url = "http://aclweb.org/anthology/P16-1168"
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
yjcaptions26k.zip		yjcaptions26k.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YJ Captions 26k Dataset

Annotation Format

License

Citation

About

Releases

Packages

yahoojapan/YJCaptions

Folders and files

Latest commit

History

Repository files navigation

YJ Captions 26k Dataset

Annotation Format

License

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages