Scala

Apache Spark: Handle Corrupt/Bad Records

Reading Time: 3 minutes Most of the time writing ETL jobs becomes very expensive when it comes to handling corrupt records. And in such cases, ETL pipelines need a good solution to handle corrupted records. Because, larger the ETL pipeline is, the more complex it becomes to handle such bad records in between. Corrupt data includes: Missing information Incomplete information Schema mismatch Differing formats or data types Apache Spark: Continue Reading

couchbase_xdcr

Couchbase High Availability and Disaster Recovery: Part 1

Reading Time: 4 minutes Couchbase Server is an open-source, distributed, NoSQL, document-oriented engagement database. It is designed to support strong features like:– Flexible data model– Simple administration– Query and Analytics– Memory first architecture– High Availability and many more Couchbase strongly emphasizes reliability, high availability, and simple management. It aims to perform operations while the system remains online, without interrupting running applications. High Availability Focused on high availability, it leverages Continue Reading

Introduction to Logging in R using log4r

Reading Time: 3 minutes One of the most important aspect of an application is Logging. Since logs provide visibility into the behavior of a running app. Hence logs play a vital role in maintenance and enhancement of an application. However, most of us are already aware with the importance of logging. That’s why we add them in our applications. But one thing that we are not aware of is Continue Reading

Persistent Entity in Lagom

Reading Time: 4 minutes Lagom is an open source framework for building systems of Reactive microservices in Java or Scala. It is build on Akka and Play. It has a concept of persistent entity and in this blog, we will learn about persistent entity in Lagom in detail. First of all, before reading about persistent entity it would be better if you have some knowledge about Event Sourcing and Continue Reading

Parsing database Query with Apache Calcite

Reading Time: 3 minutes Hey there, as a technical person sometimes we have to write the query of database and that looks good but we don’t know the query we wrote was syntactically correct or not. So in this blog, we parse the database query and test it using a test case with the help of Apache Calcite. So not wasting any time lets discuss about Apache Calcite and Continue Reading

FIX Protocol : Pros and Cons

Reading Time: 2 minutes FIX stands Financial Information Exchange. The protocol is a highly scalable electronic communication protocol to facilitate real time exchange of information related to financial market. It has become the standard method of pre-trade, trade and post-trade communication. Market participants including banks, hedge funds, prime brokers etc, utilise FIX for their own trading means all to connect directly to clients. It is a full-fledged liquidity provider.

Parse database query with JSQL Parser

Reading Time: 3 minutes Hi guys, as we discussed in the previous blog that is about parsing the database query, it is also an alternative for parsing the SQL queries. So write the query of database and that looks good to you but you don’t know the query you wrote was syntactically correct or not. In this blog, we parse the database query and test it using a test Continue Reading

Demystifying gRPC: A brief history of Remote execution.

Reading Time: 6 minutes It has been a year since I’ve been exploring the gRPC framework. Most of the time, I’ve written web services in gRPC and glued them together with their implementations. Most of the time without having to explore it why and how it came into the light?  I find people relating the gRPC framework to the “Microservices” world. If you dig on the Internet, you can Continue Reading

top 7 data analytics trends

Top 7 Data Analytics and Management Trends for 2020

Reading Time: 5 minutes We live in an era of data as it lies at the heart of digital transformation. And datasets are no longer as simple as before. They have increased in volumes, velocity, complexity and above all, are coming from multiple sources. Top tech giants like Google, Netflix, Amazon, and others are crunching massive amounts of data on a daily basis to give you a personalized experience. Continue Reading

Database Normalization :: Part 2

Reading Time: 6 minutes Introduction Normalization helps one attain a good database design and thereby ensures continues efficiency of the database. Normalization, which is a process for assigning attributes to entities, offers the following advantages: There are 7 types of Normal forms: In my previous blog, Database Normalization :: Part 1 I’ve discussed about first four.In this blog, we will be looking into 4NF, 5NF and DKNF. Fourth Normal Continue Reading