Skip to content

Latest commit

 

History

History
6 lines (3 loc) · 450 Bytes

README.md

File metadata and controls

6 lines (3 loc) · 450 Bytes

JADES

JADES is a dataset for text simplification in Japanese, described in "JADES: New Text Simplification Dataset in Japanese Targeted at Non-Native Speakers" (the paper will be available soon).

jades.json includes complex-simple sentence pairs, as well as their metadata such as original or translated English sentences. The tokenization in paper are done with Sudachi (ver. 0.6.5, mode A).