Cassandra

PostgreSQL OR APACHE CASSANDRA: WHICH ONE IS THE BETTER OPTION

Reading Time: 3 minutes We are living in the 20th century the century of technologies. Because of this, we come across a lot of data in our daily life. So it is important for us to have a database that can help in maintaining a huge amount of data. Now we have many popular databases in the market like PostgreSQL, Cassandra, MySQL MongoDB, and many more. But the question Continue Reading

Apache Cassandra: CQL Commands

Reading Time: 4 minutes In previous two blogs of Apache Cassandra series, we have already explained the Basics of Apache Cassandra and How Cassandra Reads and Writes. Now here in this blog we will cover another important topic in Apache Cassandra i.e., CQL commands. So let us name this blog as “Apache Cassandra: CQL commands“. We recommend to go through the other two blogs of this series before diving Continue Reading

Indexes in Cassandra

Reading Time: 2 minutes Cassandra is a distributed database from Apache which is highly scalable and effective in managing large amounts of structured data. It provides high availability with no single point of failure. Cassandra is column oriented DB. Often used for time series data. Primary keys in Cassandra It is a primary key database which means data is persisted and organised around a cluster based on hash values(partition Continue Reading

A Quick Demo: Kafka to Flink to Cassandra

Reading Time: 3 minutes Hi Folks!! In this blog, we are going to learn how we can integrate Flink with Kafka and Cassandra to build a simple streaming data pipeline. Apache Flink is a framework and distributed processing engine. it is used for stateful computations over unbounded and bounded data streams.Kafka is a scalable, high performance, low latency platform. It allows reading and writing streams of data like a messaging system.Cassandra: A distributed and wide-column Continue Reading

Creating Data Pipeline with Spark streaming, Kafka and Cassandra

Reading Time: 3 minutes Hi Folks!! In this blog, we are going to learn how we can integrate Spark Structured Streaming with Kafka and Cassandra to build a simple data pipeline. Spark Structured Streaming is a component of Apache Spark framework that enables scalable, high throughput, fault tolerant processing of data streams.Apache Kafka is a scalable, high performance, low latency platform that allows reading and writing streams of data Continue Reading

Understanding data persistence in Lagom

Reading Time: 4 minutes When we create any microservice, or in general any service, one of the biggest task is to manage data persistence. Lagom supports various databases for doing this task. By default, Lagom uses Cassandra to persist data.

A Simple walk-through to set up a local Cassandra multi-node cluster

Reading Time: 5 minutes In our earlier blogs we have already gone through The basic Introduction to Cassandra and also tried to explore the Cassandra Reads and Writes. Today we will be discussing something apart from the in-depth theoretical knowledge of Cassandra. In one of our projects , we came through a basic requirement in which we needed to required a local Cassandra cluster for some kind of testing.  Continue Reading

Setting Up Cassandra Cluster Through Ansible

Reading Time: 3 minutes In this post, we will use Ansible to and set-up an Apache Cassandra database cluster. We will use AWS EC2 instances as the nodes for the cluster. Creating a cluster manually is a tedious task. We have to manually configure each node and each node must be correctly configured before starting the cluster.With Ansible, we can automate the task and let Ansible handle the configuration Continue Reading

The curious case of Cassandra Reads

Reading Time: 5 minutes In our previous blog, we discovered how Cassandra handles its write queries. Now it’s time to understand how it ensures all the read requests are fulfilled. Let’s first have an overall view of Cassandra. Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of Continue Reading

Cassandra Writes: A Mystery?

Reading Time: 5 minutes Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. It is a peer to peer database where each node in the cluster constantly communicates with each other to share and receive information (node status, data ranges and so on). There is no Continue Reading

Store Semantic Web Triples into Cassandra

Reading Time: 2 minutes The semantic web is the next level of  Web Searching where data is more important and it should be well defined. The semantic web is needed for making the web search more intelligent and intuitive to get the user’s requirement. You all can find some interesting point on the Semantic Web here. Triples is an atomic entity in RDF. It is composed of subject-predicate-object. It Continue Reading

DATA PERSISTENCE IN LAGOM

Reading Time: 5 minutes Are you finding it difficult to understand lagom persistence? Don’t worry because help is right here. In this blog, we will learn about lagom persistence with the help of a simple application and also discuss its theoretical aspects. Before we begin, make sure you know about Event Sourcing and CQRS. You can read about it in details from this link . Choosing a database When Continue Reading

Simple Things You Can Learn From Cassandra Nodetool (Monitor/Manage) For DC/OS

Reading Time: 4 minutes Cassandra native tool called nodetool is used for monitoring and managing cassandra cluster for dcos