|
Welcome to HSearch!
|
Useful Links
|
January 19th, 2011: HSearch 0.90 released; Download
Windows Installer! Or,
Tar File!
December 7th, 2010: HSearch 0.89 released; Download!
HSearch is an open source, NoSQL Search Engine.
Use it when you need real-time search on your Big Data.
This project's goal is to index over 100 billion records atop your commodity hardware cluster.
HSearch is an open source, distributed, multi-format, structured and unstructured content search engine
built on HBase platform.
As the complete index is stored in HBase table, it inherits the HBase storable capabilities.
HSearch features include:
- Multi-XML formats
- Record and document level search access control
- Continuous index updation
- Parallel indexing using multi-machines
- Embeddable inside application
- A REST-ful Web service gateway that supports XML
- Auto sharding
- Auto replication
HSearch Performance Readings for 0.90. The test bed is:
- Indexed Wikipedia Dump ( Wiki Pages with detail information such as History, Version)
- Machine used is an commodity class machine with total 4GB of RAM.
Out of this HBase standalone instance has 2GB of RAM only.
Term | Repository Size | Found | 1st Run Response | 2nd Run Response |
usa | 1.1M | 64829 | 1161ms | 600ms |
india | 1.1M | 58377 | 340ms | 326ms |
hill | 1.1M | 48810 | 658ms | 430ms |
usa | 5M | 172883 | 2380ms | 1258ms |
india | 5M | 102712 | 557ms | 490ms |
hill | 5M | 108104 | 1147ms | 744ms |
usa | 10M | 283829 | 2810ms | 1756ms |
india | 10M | 172787 | 781ms | 629ms |
hill | 10M | 159029 | 1410ms | 1076ms |
Start by running HSearch examples, importing HSearch project from CVS server at bizosyshsearch.cvs.sourceforge.net
- Learn about HSearch by reading the documentation.
- Download HSearch from the release page.
This will index 50000 location information using sample data file and search for a keyword named somerset in it.
- Install the MSI.
- From the Windows Start Menu, Go to HSearch and then click on "Step-1 Start". This will create an tray icon at right bottom part of your windows with letter H.
- Right click on the icon and click "Start HBase". HBase will be started and a message on console will come (HBase Server started)
- From the Windows Start Menu, Go to HSearch and then click on "Step - 2 Index Sample Data"
- From the Windows Start Menu, Go to HSearch and then click on "Step - 3 Search Sample Data"
This will index 50000 location information using sample data file and search for a keyword named somerset in it.
- Create a folder "c:\bizosys" and copy "hsearch-0.90.tar" file into this folder.
- Extract the tar file here. It will create c:\bizosys\hsearch-0.90 folder.
- Open a command line prompt. Type
cd C:\bizosys\hsearch-0.90\bin
- Start hsearch tray icon typing command. This will create an icon at right bottom part of your windows desktop shortcut tray.
hsearch.bat
- Right click on the icon and click "Start HBase". HBase will be started and a message on console will come (HBase Server started)
- Open another window and Goto benchmark folder
cd C:\bizosys\hsearch-0.90\benchmark
- Start indexing freebase location databases typing command,
freebaseW.bat
- Once indexed, Look for keyword "somerset"
typing command,
query.bat
HBase is an open source volunteer project.
We encourage you to learn about the project and contribute your
expertise. Write to me at
abinash@bizosys.com
HSearch is available under the Apache licence. Read details at http://www.apache.org/licenses/LICENSE-2.0
|