Previously we explained how to write user-defined functions that can be called from Hive. You can write...
Author - Walker Rowe
How to write a Hive User Defined Function (UDF) in Java
Here we show how to write user-defined functions (UDFs) in Java and call them from Hive. You can...
What is Apache HCatalog? HCatalog Explained
Here we explain what HCatalog is and why it is useful to Hadoop programmers. Basically, HCatalog...
Apache Hive Beeline Client, Import CSV File into Hive
Beeline has replaced the Hive CLI in what was formerly called HiveServer1. Now Hive is called...
Graphing Spark Data with HighCharts
Here we look at how to use HighCharts with Spark. HighCharts is a charting framework written in...
K-means Clustering with Apache Spark
Here we show a simple example of how to use k-means clustering. We will look at crime statistics...
Apache Spark: Working with Streams
In the last two posts, we explained how to read data streaming from Twitter into Apache...
Using Zeppelin with Big Data
Zeppelin is an interactive notebook. It lets you write code into a web page, execute it, and...
Spark Decision Tree Classifier
Here we explain how to use the Decision Tree Classifier with Apache Spark ML (machine learning). We...
Using Logistic Regression, Scala, and Spark
Here we explain how to do logistic regression with Apache Spark. Logistic regression (LR) is...