Wait a minute! I thought Hive and Hadoop were for "big data".
Yes and no. The term "big data" is a silly term, in my opinion. In this post, I want to walk through leveraging the benefits of a Hadoop/Hive environment to work with regular, everyday datasets. Read More
Scalding-JDBC utilizes Cascading-JDBC which comes pre-built to support several relational databases. The bad news is that Oracle is not one of the default supported databases because the Oracle JDBC driver is not available on any public Maven repos. Read More
This is definitely easier than my first attempts and running Scalding on a cluster. Using Kiji and the steps from the book allows you to focus on learning and writing Scalding code rather than environment setup. Read More
Scalding is an extension to the Cascading framework that allows development in the Scala language rather than Java. Cascading and it's plumbing analogies for Hadoop development make a lot of sense. Read More
While learning a bit about Node.js and D3, I wondered if there was a way to use those technologies to visualize HBase data. Turns out, it's possible and it doesn't take much code. Read More