Category Archives: big data

The curious case of Cassandra Reads


In our previous blog, we discovered how Cassandra handles its write queries. Now it’s time to understand how it ensures all the read requests are fulfilled. Let’s first have an overall view of Cassandra. Apache Cassandra is a free and … Continue reading

Posted in big data, Cassandra, database, NoSql, Scala | Tagged , , , , , , | 1 Comment

Cassandra Writes: A Mystery?


Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. It is a peer to peer database where … Continue reading

Posted in big data, Cassandra, database, Scala | Tagged , , , , , | 3 Comments

Knolx: Introduction to Algebird: Abstract Algebra for Analytics


Hello everyone, Knoldus organized a session on 6th October 2017. The topic was “Introduction To Algebird”. Many people attended and enjoyed the session. In this blog post, I am going to share the slides & video of the session.

Posted in Best Practices, big data, Functional Programming, knoldus, Scala | 1 Comment

Apache Hadoop vs Apache Spark


The term Big Data has created a lot of hype already in the business world. Hadoop and Spark are both Big Data frameworks – they provide some of the most popular tools used to carry out common Big Data-related tasks. … Continue reading

Posted in apache spark, big data, Scala | Tagged , , , , , | 3 Comments

One-way & two-way streaming in a Lagom application


Now a days streaming word is a buzz word and you should have heard many types of streaming till now i.e. kafka streaming, spark streaming etc etc. But in this blog we will see a new type of streaming i.e … Continue reading

Posted in Akka, Best Practices, big data, Functional Programming, github, Java, knoldus, Messages, Reactive, Scala, Streaming, Web Services | 1 Comment

Zeppelin with Spark


Let us first start with the very first question, What is Zeppelin? It is a web-based notebook that enables interactive data analytics. Based on the concept of an interpreter that can be bound to any language or data processing backend, … Continue reading

Posted in big data, Scala, Spark, Tutorial | 2 Comments

Apache Storm: Architecture


Apache Storm is a distributed realtime computation system. Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing the realtime computation. Storm is simple, can be used … Continue reading

Posted in big data, Clojure, Scala, Streaming | 2 Comments

Case Study to understand Kafka Consumer and its offsets


In this blog post, we will discuss mainly Kafka Consumer and its Offsets. We will understand this using a case study implemented in Scala. This blog post assumes that you are aware of basic Kafka terminology. CASE STUDY: The Producer … Continue reading

Posted in Apache Kafka, big data, Functional Programming, knoldus, Scala, Streaming | 4 Comments

Simple Things You Can Learn From Cassandra Nodetool (Monitor/Manage) For DC/OS


Cassandra native tool called nodetool is used for monitoring and managing cassandra cluster for dcos Continue reading

Posted in Best Practices, big data, Cassandra, cluster, NoSql | Tagged , , , , , , , , , , , , , | 4 Comments

Knolx: Getting started with Presto


Hi all, Knoldus has organized a 1-hour session on 8th September 2017. The topic was “Getting started with Presto”. Many people have joined and enjoyed the session. I am going to share the slides here. Please let me know if you … Continue reading

Posted in big data, Scala, sql | 1 Comment