diff --git a/data/10k-v2.parquet b/data/10k-v2.parquet new file mode 100644 index 0000000..231686a Binary files /dev/null and b/data/10k-v2.parquet differ diff --git a/data/README.md b/data/README.md index b4bfe07..9486942 100644 --- a/data/README.md +++ b/data/README.md @@ -18,7 +18,32 @@ --> # Test data files for Parquet compatibility and regression testing -TODO: Document what each file is + +## 10k-v2.parquet +This file consists of 10k rows written in v2 page format with this type: + +``` +message test { + required binary binary_field, + required int32 int32_field, + required int64 int64_field, + required boolean boolean_field, + required float float_field, + required double double_field, + required fixed_len_byte_array(1024) flba_field, + required int96 int96_field +} +``` + +Filled with random values. It is used for performance benchmarks. + +Originally from https://github.com/sunchao/parquet-rs/blob/master/data/10k-v2.parquet + +## stock_simulated.parquet + +Simulated stock market data, used for performance benchmarks. + +Originally from https://github.com/sunchao/parquet-rs/blob/master/data/stock_simulated.parquet ## Encrypted Files diff --git a/data/stock_simulated.parquet b/data/stock_simulated.parquet new file mode 100644 index 0000000..9d2c5fb Binary files /dev/null and b/data/stock_simulated.parquet differ