Note: This version is NOT compatible with v0.x due to the changes in some target answers.
Fixed various normalization and encoding issues in the dataset files. This effectively changes some target answers (e.g., 19901991
is now corrected as 1990-1991
-- the non-ascii dash was dropped during dataset construction).
The compact version does not include raw HTML pages.