Built using:
  

Project
Downloads
Sponsors
Privacy policy

Documentation
0.90

Developers
Issue tracking
Forums
Version control

Welcome to HSearch!


Useful Links

News

January 19th, 2011: HSearch 0.90 released; Download Windows Installer! Or, Tar File!

December 7th, 2010: HSearch 0.89 released; Download!

Overview

HSearch is an open source, NoSQL Search Engine. Use it when you need real-time search on your Big Data. This project's goal is to index over 100 billion records atop your commodity hardware cluster. HSearch is an open source, distributed, multi-format, structured and unstructured content search engine built on HBase platform. As the complete index is stored in HBase table, it inherits the HBase storable capabilities.

HSearch features include:
  • Multi-XML formats
  • Record and document level search access control
  • Continuous index updation
  • Parallel indexing using multi-machines
  • Embeddable inside application
  • A REST-ful Web service gateway that supports XML
  • Auto sharding
  • Auto replication
HSearch Performance Readings for 0.90. The test bed is:
  • Indexed Wikipedia Dump ( Wiki Pages with detail information such as History, Version)
  • Machine used is an commodity class machine with total 4GB of RAM.
    Out of this HBase standalone instance has 2GB of RAM only.
Term Repository Size Found 1st Run Response2nd Run Response
usa 1.1M 64829 1161ms 600ms
india 1.1M 58377 340ms 326ms
hill 1.1M 48810 658ms 430ms
usa 5M 172883 2380ms 1258ms
india 5M 102712 557ms 490ms
hill 5M 108104 1147ms 744ms
usa 10M 283829 2810ms 1756ms
india 10M 172787 781ms 629ms
hill 10M 159029 1410ms 1076ms

Getting Started

Start by running HSearch examples, importing HSearch project from CVS server at bizosyshsearch.cvs.sourceforge.net
  1. Learn about HSearch by reading the documentation.
  2. Download HSearch from the release page.

Quick Start (MSI Installation)

This will index 50000 location information using sample data file and search for a keyword named somerset in it.
  1. Install the MSI.
  2. From the Windows Start Menu, Go to HSearch and then click on "Step-1 Start". This will create an tray icon at right bottom part of your windows with letter H.
  3. Right click on the icon and click "Start HBase". HBase will be started and a message on console will come (HBase Server started)
  4. From the Windows Start Menu, Go to HSearch and then click on "Step - 2 Index Sample Data"
  5. From the Windows Start Menu, Go to HSearch and then click on "Step - 3 Search Sample Data"

Quick Start (TAR File)

This will index 50000 location information using sample data file and search for a keyword named somerset in it.
  1. Create a folder "c:\bizosys" and copy "hsearch-0.90.tar" file into this folder.
  2. Extract the tar file here. It will create c:\bizosys\hsearch-0.90 folder.
  3. Open a command line prompt. Type
    cd C:\bizosys\hsearch-0.90\bin
  4. Start hsearch tray icon typing command. This will create an icon at right bottom part of your windows desktop shortcut tray.
    hsearch.bat
  5. Right click on the icon and click "Start HBase". HBase will be started and a message on console will come (HBase Server started)
  6. Open another window and Goto benchmark folder
    cd C:\bizosys\hsearch-0.90\benchmark
  7. Start indexing freebase location databases typing command,
    freebaseW.bat
  8. Once indexed, Look for keyword "somerset" typing command,
    query.bat

Getting Involved

HBase is an open source volunteer project. We encourage you to learn about the project and contribute your expertise. Write to me at abinash@bizosys.com

License - Apache

HSearch is available under the Apache licence. Read details at http://www.apache.org/licenses/LICENSE-2.0

Copyright © 2010 Bizosys Technologies Pvt Ltd.