It's an open-source, distributed, column-oriented store modeled after Google' Bigtable.
Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop.
HBase was written to support very large tables, storing data in millions of rows and columns.
Here are some key features of "Apache Hadoop HBase":
· Convenient base classes for backing Hadoop MapReduce jobs with HBase tables
· Query predicate push down via server side scan and get filters
· Optimizations for real time queries
· A high performance Thrift gateway
· A REST-ful Web service gateway that supports XML, Protobuf, and binary data encoding options
· Cascading source and sink modules
· Extensible jruby-based (JIRB) shell
· Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
· No HBase single point of failure
· Rolling restart for configuration changes and minor upgrades
· Random access performance on par with open source relational databases such as MySQL
What's New in This Release: [ read full changelog ]
· Bug fix release.