Author: Jatin Demla

Introducing Druid: Realtime Fast Data Analytics Database

In this post, we are going to learn about an ultra-fast, powerful and open source tool for real-time data analysis. Its druid this time, we will discuss why druid is so fast as well as the scenario in which condition it is suitable to use and how it internally stores the data and its architecture. What is Apache Druid? Apache Druid is a distributed, high-performance Continue Reading

Introduction to Google Kubernetes

Over the past two years, Containerization has provided a lot of flexibility to the developer in which most popular container technology is docker. Container provide the developer a virtual environment for isolated process or application on the host system. Before everything else lets take a look what used to happen in past. Earlier when the number of systems for running application were limited they were Continue Reading

protecting sensitive data in docker

A step-by-step guide for protecting sensitive data in docker

Managing the password, access tokens and private keys are being tedious in the application. Any small mistakes accidentally expose all the secret information. Even storing such thing in docker images can be easily accessible one should just run the image in the interactive mode container and all your application code is available in containers. Docker provides secrets to protect all secret data. This blog explains Continue Reading

kafka with spark

Integrating Kafka With Spark Structure Streaming

Kafka is a messaging broker system which facilitates the passing of messages between producer and consumer whereas Spark Structure streaming consumes static and streaming data from various sources like kafka, flume, twitter or any other socket which can be processed and analysed using high level algorithm for machine learning and finally pushed the result out to external storage system. The main advantage of structured streaming Continue Reading

Avro Communication over TCP Sockets

Storing/Transferring object is a requirement of most applications. What if there is a need for communication between machine having incompatible architecture. Java Serialization won’t work for that. Now, if you are thinking about Serialization Framework then you are right. So, let’s start with one of the Serialization framework Apache Avro. What is Avro? Apache Avro is a language-neutral data serialization system. It’s a schema-based system Continue Reading

Knoldus Pune Careers - Hiring Freshers

Get a head start on your career at Knoldus. Join us!