Apache Hadoop Databases: HBase and Hive
This session will cover the basics of Apache Hadoop and in particular will focus on HBase and Hive, two polyglot persistence-style databases fulfilling different needs designed to be horizontally scaled across clusters of commodity hardwire.
Apache Hadoop is all about building software to support “reliable, scalable, distributed computing.” Hadoop is comprised of an impressive collection of subprojects with the common goal of providing infrastructure components for large-scale distributed computing such as a distributed file system, a map/reduce framework, two different styles of distributed database, and more.
Hadoop is comprised of a distributed file system on which HBase and Hive run; these databases are designed to be run in a distributed environment across dozens, hundreds, or thousands of servers. HBase is modeled after Google's BigTable database and is a distributed, column-oriented database designed to store billions of rows and millions of columns and accessed in real time. Hive is built as a batch-oriented, data warehousing system atop the Hadoop distributed file system (HDFS).
About Scott Leberknight
Scott is Chief Architect at Near Infinity Corporation, an enterprise software development and consulting services company based in Reston, Virginia. He has been developing enterprise and web applications for 14 years professionally, and has developed applications using Java, Ruby, Groovy, and even an iPhone application with Objective-C. His main areas of interest include alternative persistence technologies, object-oriented design, system architecture, testing, and frameworks like Spring, Hibernate, and Ruby on Rails. In addition, Scott enjoys learning new languages to make himself a better and more well-rounded developer a la The Pragmatic Programmers' advice to “learn one language per year.”
Scott holds a B.S. in Engineering Science and Mechanics from Virginia Tech, and an M. Eng. in Systems Engineering from the University of Maryland. Scott speaks at the No Fluff Just Stuff Symposiums and various other conferences. In his (sparse) spare time, Scott enjoys spending time with his wife, three children, and cat. He also tries to find time to play soccer, go snowboarding, and mountain bike whenever he can.
More About Scott »