Axel Ngonga edited this page Feb 20, 2014 · 2 revisions

This project implements the indexing of TSV files using ElasticSearch. It assumes that an ElasticSearch server is installed somewhere (see parameters) and implements both uploading data from a TSV file and searching through that data.

java -jar fileName IP Port shardName

is the most basic call. You can also create a white list of URIs (one per line) and pass that file as an additional final argument:

java -jar fileName IP Port shardName whiteList
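The white list is described only as a file with one URI per line. As a minimal sketch of producing such a file from Java, the following writes a list of URIs in that layout. The class name `WhiteListWriter`, the file name `whiteList.txt`, and the example URIs are hypothetical and not part of this project.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of the project: builds a white list file.
class WhiteListWriter {

    // Writes the given URIs to the target file as UTF-8 text, one URI per line.
    public static Path writeWhiteList(Path target, List<String> uris) throws IOException {
        return Files.write(target, uris, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Placeholder URIs; replace them with the URIs you want on the white list.
        List<String> uris = Arrays.asList(
                "http://example.org/resource/A",
                "http://example.org/resource/B");
        Path file = writeWhiteList(Paths.get("whiteList.txt"), uris);
        System.out.println("Wrote " + uris.size() + " URIs to " + file.toAbsolutePath());
    }
}
```

The resulting file can then be passed as the last argument of the command line call.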

You can also use the tool directly from Java. Indexing is then initialised by calling

IndexingModule im = new IndexingModule();

The parameters for indexing are global variables (see code).

Map<String, Float> searchIndex(String searchString)

runs a fuzzy search for the string given in searchString.
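The page does not say what the returned `Map<String, Float>` contains. Assuming the keys are matched entries and the values are relevance scores where higher means a better match, a result map of that shape could be ranked as in the sketch below. The class `SearchResultRanking`, the method `rankByScore`, and the sample data are hypothetical; the sample map only stands in for a value returned by `searchIndex`.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of the project: orders a result map by score.
class SearchResultRanking {

    // Returns the map entries sorted by descending value.
    // Assumption: the Float value is a score and higher means a better match.
    public static List<Map.Entry<String, Float>> rankByScore(Map<String, Float> results) {
        List<Map.Entry<String, Float>> ranked = new ArrayList<>(results.entrySet());
        ranked.sort(Map.Entry.comparingByValue(Comparator.reverseOrder()));
        return ranked;
    }

    public static void main(String[] args) {
        // Made-up data standing in for the map returned by searchIndex(searchString).
        Map<String, Float> results = new LinkedHashMap<>();
        results.put("http://example.org/resource/A", 0.42f);
        results.put("http://example.org/resource/B", 1.37f);
        results.put("http://example.org/resource/C", 0.98f);

        for (Map.Entry<String, Float> entry : rankByScore(results)) {
            System.out.println(entry.getValue() + "\t" + entry.getKey());
        }
    }
}
```

If the values turn out to be distances rather than scores, the ordering should be reversed by sorting with `Map.Entry.comparingByValue()` instead.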
