1 comment on “Structured Streaming: What is it?”

Structured Streaming: What is it?


With the advent of streaming frameworks like Spark Streaming, Flink, Storm etc. developers stopped worrying about issues related to a streaming application, like - Fault Tolerance, i.e., zero data loss, Real-time processing of data, etc. and started focussing only on solving business…

0 comments on “Streaming in Spark, Flink and Kafka”

Streaming in Spark, Flink and Kafka


There is a lot of buzz going on between when to use use spark, when to use flink, and when to use Kafka. Both spark streaming and flink provides exactly once guarantee that every record will be processed exactly once…

2 comments on “Introduction To HADOOP !”

Introduction To HADOOP !


Here I am to going to  write a blog on Hadoop! "Bigdata is not about data! The value in Bigdata [is in] the analytics. " -Harvard Prof. Gary King So the Hadoop came into Introduction! Hadoop is an open source,…

2 comments on “Another Apache Flink tutorial, following Hortonworks’ Big Data series”

Another Apache Flink tutorial, following Hortonworks’ Big Data series


BackgroundA couple of weeks back, I was discussing with a friend of mine, on the topic of training materials on Apache Spark, available online. Of the couple of sites that I mentioned, the hadoop tutorial from Hortonworks, came up. This…

0 comments on “Getting close to Apache Flink, albeit in a Träge manner – 3”

Getting close to Apache Flink, albeit in a Träge manner – 3


In the last two blogs on Flink, I hope to have been able to underline the primacy of Windows in the scheme of things of Apache Flink's streaming. I have shared my understanding of two types of Windows that can…