2 comments on “Structured Streaming: Philosophy behind it”

Structured Streaming: Philosophy behind it


In our previous blogs: Structured Streaming: What is it? & Structured Streaming: How it works? We got to know 2 major points about Structured Streaming - It is a fast, scalable, fault-tolerant, end-to-end, exactly-once stream processing API that helps users in…

1 comment on “Getting started with MongoDB”

Getting started with MongoDB


Hi everyone! in this blog, I will try to explain what is MongoDB and why to use MongoDB What is MongoDB? MongoDB is one of the powerful NoSql DataBase. It does not use the usual rows and columns that we…

2 comments on “How Spark Internally Executes A Program”

How Spark Internally Executes A Program


Hello everyone! In my previous blog, I explained the difference between RDD, DF, and DS you can find this blog Here In this blog, I will try to explain How spark internally works and what are the Components of Execution: Jobs,…

2 comments on “The curious case of Cassandra Reads”

The curious case of Cassandra Reads


In our previous blog, we discovered how Cassandra handles its write queries. Now it's time to understand how it ensures all the read requests are fulfilled. Let's first have an overall view of Cassandra. Apache Cassandra is a free and…

4 comments on “Cassandra Writes: A Mystery?”

Cassandra Writes: A Mystery?


Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. It is a peer to peer database where…

1 comment on “Knolx: Introduction to Algebird: Abstract Algebra for Analytics”

Knolx: Introduction to Algebird: Abstract Algebra for Analytics


Hello everyone, Knoldus organized a session on 6th October 2017. The topic was “Introduction To Algebird”. Many people attended and enjoyed the session. In this blog post, I am going to share the slides & video of the session. Slides: https://www.slideshare.net/knoldus/introduction-to-algebird-abstract-algebra-for-analytics https://www.slideshare.net/knoldus/introduction-to-algebird-abstract-algebra-for-analytics…

3 comments on “Apache Hadoop vs Apache Spark”

Apache Hadoop vs Apache Spark


The term Big Data has created a lot of hype already in the business world. Hadoop and Spark are both Big Data frameworks – they provide some of the most popular tools used to carry out common Big Data-related tasks.…

1 comment on “One-way & two-way streaming in a Lagom application”

One-way & two-way streaming in a Lagom application


Now a days streaming word is a buzz word and you should have heard many types of streaming till now i.e. kafka streaming, spark streaming etc etc. But in this blog we will see a new type of streaming i.e…

2 comments on “Zeppelin with Spark”

Zeppelin with Spark


Let us first start with the very first question, What is Zeppelin? It is a web-based notebook that enables interactive data analytics. Based on the concept of an interpreter that can be bound to any language or data processing backend,…

2 comments on “Apache Storm: Architecture”

Apache Storm: Architecture


Apache Storm is a distributed realtime computation system. Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing the realtime computation. Storm is simple, can be used…