Category Archives: Batch

Twitter’s tweets analysis using Lambda Architecture


Hello Folks, In this blog i will explain  twitter’s tweets analysis with lambda architecture. So first we need to understand  what is lambda architecture,about its component and usage. According to Wikipedia, Lambda architecture is a data processing architecture designed to handle … Continue reading

Posted in Akka, akka-http, Apache Kafka, apache spark, Architecture, Batch, big data, Cassandra, Scala, Spark, Streaming | 7 Comments

Building Analytics Engine Using Akka, Kafka & ElasticSearch


In this blog , I will share my experience on building scalable, distributed and fault-tolerant  Analytics engine using Scala, Akka, Play, Kafka and ElasticSearch. I would like to take you through the journey of  building an analytics engine which was primarily … Continue reading

Posted in Akka, akka-http, Amazon, Amazon EC2, Apache Kafka, Architecture, AWS, AWS Services, Batch, Best Practices, big data, Cassandra, database, Elasticsearch, Java, Non-Blocking, NoSql, Reactive, S3, Scala, Streaming, Web | 10 Comments

Another Apache Flink tutorial, following Hortonworks’ Big Data series


Background A couple of weeks back, I was discussing with a friend of mine, on the topic of training materials on Apache Spark, available online. Of the couple of sites that I mentioned, the hadoop tutorial from Hortonworks, came up. … Continue reading

Posted in Apache Flink, Batch, Flink, http://schemas.google.com/blogger/2008/kind#post, IOT, Scala | 2 Comments