Skip to content

Latest commit

 

History

History

dataset

Test Data Set

Sample files for testing the fuzzy search algorithms.

Folder Contents

The following .txt files contain sample data set, one entry for line (LF EOL):

file data set name entries
english_wordlist_2k.txt English words list 2k 1,510
english_wordlist_355k.txt English words list 355k 354,986
english_wordlist_58k.txt English words list 58k 58,110
hearthstone_cardlist.txt Hearthstone cards list 743
magicthegathering_cardlist.txt Magic: The Gathering cards list 15,893
ue4_filenames.txt Unreal Engine 4 filenames 13,164

For JavaScript testing, the following file defines all data sets (except the 355k Wordlist) as arrays in the dataSets object:

All files were taken from Forrest Smith's lib_fts project:

The original sources of the files are unknown.