Add sorting of chunks to evaluation #1588

bartekkuncer · 2022-04-09T00:03:49Z

Description

This change sorts chunks before running evaluation to keep padding to a minimum and thereby improve performance.

Since every input feature has a unique qas_id, it can be used for sorting. With sorting, the evaluation function works as follows:

Step 1 is performed so that the chunks and their inference results can easily be put back in the proper order in step 5, ready for evaluation in step 6.
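For readers unfamiliar with the technique, here is a minimal sketch of the general idea in plain Python. It is not the code from this PR: it assumes chunks are plain lists of token ids, it remembers original list positions instead of using qas_id, and the names `pad_batch`, `count_padding`, `evaluate_sorted` and `infer` are hypothetical.

```python
from typing import Callable, List, Sequence


def pad_batch(batch: Sequence[Sequence[int]], pad_id: int = 0) -> List[List[int]]:
    """Pad every chunk in the batch to the length of the longest one."""
    max_len = max(len(chunk) for chunk in batch)
    return [list(chunk) + [pad_id] * (max_len - len(chunk)) for chunk in batch]


def count_padding(chunks: Sequence[Sequence[int]], batch_size: int) -> int:
    """Count the pad tokens needed when batching chunks in the given order."""
    total = 0
    for start in range(0, len(chunks), batch_size):
        batch = chunks[start:start + batch_size]
        max_len = max(len(chunk) for chunk in batch)
        total += sum(max_len - len(chunk) for chunk in batch)
    return total


def evaluate_sorted(
    chunks: Sequence[Sequence[int]],
    batch_size: int,
    infer: Callable[[List[List[int]]], list],
) -> list:
    """Run inference on length-sorted batches, return results in input order."""
    # Remember each chunk's original position (assumption: the PR uses
    # qas_id for this purpose; list indices are used here for simplicity).
    order = sorted(range(len(chunks)), key=lambda i: len(chunks[i]))
    results = [None] * len(chunks)
    for start in range(0, len(order), batch_size):
        indices = order[start:start + batch_size]
        outputs = infer(pad_batch([chunks[i] for i in indices]))
        # Write each output back to the slot of the chunk it belongs to.
        for i, out in zip(indices, outputs):
            results[i] = out
    return results
```

Because similar-length chunks end up in the same batch, fewer pad tokens are processed, while the results are still returned in the original order so the downstream evaluation is unaffected.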

Results for max_seq_length=128, doc_stride=32:
no sort:

sorted:

Performance did not improve much because most chunks have the same length of 128, a consequence of the relatively small values of max_seq_length and doc_stride.

Results for max_seq_length=512, doc_stride=128 (default values in run_squad.py script):
no sort:

sorted:

As you can see, performance improved significantly (~20%) without any loss of accuracy.

cc @dmlc/gluon-nlp-team

Add sorting of chunks to evaluation

3d27013

bartekkuncer requested a review from a team as a code owner April 9, 2022 00:03

bartekkuncer added 3 commits April 20, 2022 12:49

Fix sorting chunks with quantization

fa89f9e

Add sort flag

d1c078d

Fix bad commit

cf93fef

bgawrych approved these changes Jul 12, 2022
