Author: Ram Indukuri

Airbyte OSS Metrics in Prometheus

Reading Time: 4 minutes Airbyte is a fast-growing ELT tool that helps acquire data from multiple sources. Particularly useful in building data lakes. Airbyte offers pre-built connectors to over 300 sources and 10s of destinations and also allows custom connectors to be built quickly using language SDKs. Airbyte recently released Opentelemetry-based metrics, however, the documentation has been spotty and incomplete. You can check it out here. In this blog, Continue Reading

How to use external kafka from kubernetes ?

Reading Time: 3 minutes Kafka is very widely used messaging infrastructure. If you are building software that require using kafka from your kubernetes cluster, you can either use strimzi.io kafka operator or use your local or aws based kafka cluster. While strimzi installation is a breeze and easy to use, one particular challenge is that it is cumbersome to access the kafka topics from local. In this simple blog, Continue Reading

Realtime Supply Chains

Reading Time: 7 minutes Supply chains is a serious topic and is critical to the survival of mankind. COVID has proven that the current supply chains performed reasonably well and ensured ‘essential’ goods are delivered. But, it was also evident that even with couple of months of disruption things have become very scary. There are several trends that are emerging. First is the impact of corona virus. Reconfiguring Global Continue Reading

migration-to-cloud-databricks-cloudera

Migrating to Cloud: Inhouse Hadoop to Databricks

Reading Time: 6 minutes Migration of applications is a good thing. It forces the organization to clean up junk, that is never used. It adds a lot of innovation and new ideas to your engineering teams. It is important to build confidence in our teams that future migrations are not stressful and pushes teams to design systems to be flexible. It sends a message to vendors that you are Continue Reading

Analytics on the edge – How Apache Mesos enabled ships to crunch data

Reading Time: 5 minutes Introduction & the Problem One of our key customers, a large cruise line has ships sail with capacity running into few thousands of people on board. They are going through a successful digital transformation which includes managing full life cycle of a guest on mobile, data science-driven personalization, etc and we are fortunate to be part of the whole journey. These ships generate varieties of Continue Reading

Are Knowledge Graphs the future of Data Lakes?

Reading Time: 5 minutes Data Lakes will evolve into knowledge graphs. This article is aimed at explaining the meaning of Knowledge graphs based on semantic web and why it will eventually secure its rightful place in organizing enterprise knowledge.