Big Data Analytics

Play with Spark: Building Apache Spark with Play Framework

Reading Time: < 1 minute Nowadays, Play framework is being used a lot, for building Scala Applications. It is easy to use & it is Typesafe. So, in this post, we will see how to build a Spark Application in Play 2.2.x. Although Play also uses sbt to build an application but building a Spark Application in Play is totally different. Before start building this application follow the instructions of building Continue Reading

Tutorial: How to build a Tokenizer in Spark and Scala

Reading Time: 2 minutes In our earlier blog A Simple Application in Spark and Scala, we explained how to build Spark and make a simple application using it. In this blog, we will see how to build a fast Tokenizer in Spark & Scala using sbt. Tokenization is the process of breaking a stream of text up into words, phrases, symbols, or other meaningful elements called tokens. The list of tokens Continue Reading

A Simple Application in Spark and Scala

Reading Time: < 1 minute In this blog, we will see how to build a Simple Application in Spark and Scala using sbt. Spark is a Map-Reduce like cluster computing framework, designed to make data analytics fast. In this application we will count the number of lines containing “the”. To build this application we are going to use Spark 0.9.1, Scala 2.10.3 & sbt 0.13.0. Before start building this application follow these Continue Reading