Tech Search is a search engine that lets you search the Wikipedia pages of the biggest tech companies in four different languages. Pages for companies such as Apple, Google, Facebook, Twitter and Uber were first crawled with Nutch and then indexed with Solr to allow fast searching. After a search is performed and the results are presented, if the DuckDuckGo API considers the user's query ambiguous, a list of disambiguation options is shown as well. This helps the user find the relevant Wikipedia page as quickly as possible.
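The disambiguation step can be illustrated with a minimal Python sketch. It is not the project's actual code: it assumes the public DuckDuckGo Instant Answer endpoint is used, and the function names (`build_ddg_url`, `disambiguation_options`, `fetch_options`) are hypothetical, chosen only for this example.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the public Instant Answer endpoint; the project may call it differently.
DDG_ENDPOINT = "https://api.duckduckgo.com/"


def build_ddg_url(query):
    """Build an Instant Answer API request URL for the given query."""
    params = {"q": query, "format": "json", "no_html": 1}
    return DDG_ENDPOINT + "?" + urlencode(params)


def disambiguation_options(payload):
    """Return (text, url) pairs if the API marked the query as ambiguous.

    A response whose "Type" is "D" is a disambiguation result. Its
    "RelatedTopics" list mixes plain entries with named groups that
    carry their own nested "Topics" list, so both shapes are handled.
    """
    if payload.get("Type") != "D":
        return []
    options = []
    for entry in payload.get("RelatedTopics", []):
        # A plain entry is treated as a one-item group.
        for topic in entry.get("Topics", [entry]):
            if "Text" in topic and "FirstURL" in topic:
                options.append((topic["Text"], topic["FirstURL"]))
    return options


def fetch_options(query):
    """Query the live API (requires network access)."""
    with urlopen(build_ddg_url(query), timeout=10) as response:
        return disambiguation_options(json.load(response))
```

Calling `fetch_options("apple")` would return the alternatives to offer the user, or an empty list when the query is not considered ambiguous. Keeping the parsing in `disambiguation_options` separate from the network call makes it easy to check against a saved response.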
You will need to install Solr and provide your own implementation of the search platform. Feel free to use this project as a starting point for your own search engine.
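As a rough starting point, querying your own Solr installation could look like the Python sketch below. Everything specific in it is an assumption rather than part of this project: a local Solr instance on the default port, a core named `techsearch`, and documents that expose `title` and `url` fields. Adjust these to match your own setup and schema.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumptions: local Solr on the default port and a core called "techsearch".
SOLR_BASE = "http://localhost:8983/solr"
CORE = "techsearch"


def build_select_url(query, rows=10):
    """Build a URL for Solr's standard /select query handler."""
    params = {"q": query, "rows": rows, "wt": "json"}
    return f"{SOLR_BASE}/{CORE}/select?" + urlencode(params)


def extract_results(payload):
    """Pull (title, url) pairs out of a Solr select response.

    The "title" and "url" field names are assumptions about the schema;
    a missing field yields None for that position.
    """
    docs = payload.get("response", {}).get("docs", [])
    return [(doc.get("title"), doc.get("url")) for doc in docs]


def search(query, rows=10):
    """Run the query against the live Solr instance (must be running)."""
    with urlopen(build_select_url(query, rows), timeout=10) as response:
        return extract_results(json.load(response))
```

With Solr running and the crawled pages indexed, `search("uber")` would return the matching pages as a list of pairs. Depending on the schema, `title` may come back as a list rather than a string.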
This was the final project assigned to me in my Information Retrieval course.
I'm Patrick Balestra
Email: [email protected]
Twitter: @BalestraPatrick
TechSearch is available under the MIT license. See the LICENSE file for more info.