Investigation of Distributed Search Engine Based on Hadoop

Ning Chen, Chai Xiangyang

Abstract


his paper begins with a review on the research status of search engine, followed by discussion on goals of search engine, and then the principle of distributed computing is explained. Consequently the MapReduce distributed computing model and the Hadoop distributed file system (HDFS) are analyzed in detail. Finally the distributed search engine architecture is presented. On the basis of the architecture, future challenges and opportunities of the distributed search engine are highlighted.


Keywords


Search engine, Hadoop, MapReduce, Distributed file system,Architecture

Full Text:

PDF


DOI: http://doi.org/10.11591/tijee.v12i9.3833

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License