nosql

Making SQL Queries with MarkLogic

Reading Time: 3 minutes MarkLogic is an Enterprise NoSQL database that brings all the features you need into one unified system. SQL is one of the many languages MarkLogic supports, so data stored in a MarkLogic database can be easily consumed by technologies that require a SQL data source. SQL queries can be sent via server-side application code or through ODBC. The view is required so Continue Reading

Overview of Couchbase: The NoSQL Database

Reading Time: 3 minutes NoSQL Database: Couchbase is a NoSQL database, so we will first look at NoSQL databases. NoSQL databases use a variety of data models for accessing and managing data, and are optimized for large data volumes, low latency, and flexible data models. Features of NoSQL Databases: Flexibility: enables faster and more iterative development. Scalability: NoSQL databases are designed to scale out by using distributed clusters. High-performance: Enable Continue Reading

Couchbase High Availability and Disaster Recovery: Part 2

Reading Time: 3 minutes In our previous blog, we learned how Couchbase achieves high availability. This post focuses on the Couchbase Disaster Recovery mechanisms that prevent data loss. Disaster Recovery: Couchbase uses the following mechanisms to prevent potential data loss due to unplanned incidents or disasters. XDCR: As discussed in the previous blog, Cross-Data Center Replication (XDCR) is a technology that lets us keep entire data centers Continue Reading

Amazon DynamoDB: Core Components

Reading Time: 4 minutes DynamoDB is part of Amazon Web Services. It is a NoSQL database that supports key-value and document data structures. In this blog, we will discuss the core components of DynamoDB. Features of DynamoDB: It is a fully managed NoSQL database. It can store and retrieve any amount of data, and can serve any amount of traffic. To maintain fast performance, it distributes data Continue Reading

The Rise Of Scanamo: Async Access For DynamoDB In Scala

Reading Time: 2 minutes Scanamo is a library for using DynamoDB from Scala in a simpler way, with less error-prone code. Now the question is “Why should anyone use it?” The answer is simple: the DynamoDB clients provided by AWS do not offer a Scala DSL, so a number of libraries are available for writing DynamoDB queries in Scala. But what makes Scanamo different from other Continue Reading

Amazon ES – Secure your cluster from anonymous users! #2

Reading Time: 5 minutes In the previous blog, we learned how to create a domain on Amazon ES and how to create an index on the cluster using curl. Now let’s look at how we can control access to an Amazon ES domain. One of the key benefits of using Amazon ES is that you can utilize AWS Identity and Access Management (IAM) to control access to your Continue Reading

The curious case of Cassandra Reads

Reading Time: 5 minutes In our previous blog, we discovered how Cassandra handles its write queries. Now it’s time to understand how it ensures all the read requests are fulfilled. Let’s first have an overall view of Cassandra. Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of Continue Reading

Cassandra Writes: A Mystery?

Reading Time: 5 minutes Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. It is a peer-to-peer database in which the nodes of the cluster constantly communicate with each other to share and receive information (node status, data ranges and so on). There is no Continue Reading

Simple Things You Can Learn From Cassandra Nodetool (Monitor/Manage) For DC/OS

Reading Time: 4 minutes Cassandra's native tool, nodetool, is used for monitoring and managing a Cassandra cluster on DC/OS.

Data modeling in Cassandra

Reading Time: 3 minutes Role of Partitioning & Clustering Keys in Cassandra: Primary and clustering keys should be among the very first things you learn about when modeling Cassandra data. In this post I will cover what the different types of primary keys are, how they can be used, what their purpose is, and how they affect your queries. Primary key: Primary keys are defined when you create Continue Reading

Business Intelligence-Data Visualization: Tableau

Reading Time: 3 minutes Spark, Big Data, NoSQL, and Hadoop are some of the most widely used, chart-topping technologies that we frequently work with at Knoldus. When these terms come up, one thing comes to mind: ‘huge data, millions or billions of records’. Knoldus developers use these terms frequently. Managing such an amount of data (managing here means storing, rectifying, normalizing, and cleaning it, and much more) is really Continue Reading

Cassandra Data Modeling – Primary, Clustering, Partition, Compound Keys

Reading Time: 5 minutes In this post we discuss the different keys available in Cassandra. The primary key concept in Cassandra is different from that in relational databases, so it is worth spending time to understand it. Let's take an example and create a student table that has student_id as its primary key column. 1) primary key create table person (student_id int primary key, fname Continue Reading