Sample files for testing the fuzzy search algorithms.
The following .txt
files contain sample data set, one entry for line (LF
EOL):
file | data set name | entries |
---|---|---|
english_wordlist_2k.txt |
English words list 2k | 1,510 |
english_wordlist_355k.txt |
English words list 355k | 354,986 |
english_wordlist_58k.txt |
English words list 58k | 58,110 |
hearthstone_cardlist.txt |
Hearthstone cards list | 743 |
magicthegathering_cardlist.txt |
Magic: The Gathering cards list | 15,893 |
ue4_filenames.txt |
Unreal Engine 4 filenames | 13,164 |
For JavaScript testing, the following file defines all data sets (except the 355k Wordlist) as arrays in the dataSets
object:
All files were taken from Forrest Smith's lib_fts project:
- https://github.com/forrestthewoods/lib_fts/tree/master/tests/fuzzy_match/data/
The original sources of the files are unknown.