fast data

Basic Anatomy of a Flink Program

Reading Time: 3 minutes Hi Folks! Hope you all are safe in the COVID-19 pandemic and learning new tools and tech while staying at home. I also have just started learning a very prominent Big Data framework for stream processing which is  Flink. Flink is a distributed framework and based on the streaming first principle, means it is a real streaming processing engine and implements batch processing as a special case. In Continue Reading

MachineX: Run ML model prediction faster with Hummingbird

Reading Time: 3 minutes In this blog, we will see how to make our machine learning model’s prediction faster with a recently open-sourced library Hummingbird. Nowadays, we can see a lot of frameworks for deploying or serving the machine learning model into production. As a result, It is a headache for a data scientist to choose between these frameworks, keeping in mind how their model either Sklearn or LightGBM Continue Reading

Re-evaluating Data Strategies to Respond in Real-Time

Reading Time: 2 minutes Fast Data is empowering organizations to respond in real-time. About 75% of organizations are already using it for at least some of their applications.

Data Lake – Build it in Phases

Reading Time: 3 minutes Data Lake – How to build a data lake and what are the phases involved in the same.

Fast Data: The New Age Analytics For Enhanced Customer Experience

Reading Time: 6 minutes Data is evolving both in terms of quality and quantity in today’s enterprises and in the past few years, changes have occurred at a much faster pace. Not long ago, Big Data was considered the next big thing for digital transformation. Technologies like Hadoop and HBase made sense as batch processing of data was the norm. But things are not the same now.  By the Continue Reading

Apache Spark

Deep Dive into Apache Spark Transformations and Action

Reading Time: 4 minutes In our previous blog of Apache Spark, we discussed a little about what Transformations & Actions are? Now we will get deeper into the topic and will understand what actually they are & how they play a vital role to work with Apache Spark? What is Spark RDD? Spark introduces the concept of an RDD (Resilient Distributed Dataset), an immutable fault-tolerant, distributed collection of objects Continue Reading

Knoldus-Clutch-AI-Big-Data-Top

Knoldus Joins Clutch’s Research of Top AI & Big Data Companies in 2018

Reading Time: 2 minutes The advent of the digital economy is a development that has changed the landscapes of every industry across the world. There is a new key ingredient for success; the best performing businesses are those with the best digital platforms, built to drive performance and bring customer interaction to new heights. At Knoldus, we are a team of developers and innovators dedicated to helping businesses reach Continue Reading

Challenges to Monitoring a Fast Data Application

Reading Time: 5 minutes In the present landscape, the buzzword is “Fast Data” and it is nothing but data that is not at rest. And since the data is not a rest, the traditional techniques of working on the data that is rest are no longer efficient and relevant. The importance of streaming has grown, as it provides a competitive advantage that reduces the time gap between data arrival Continue Reading

Can we stop talking about Big Data now?

Reading Time: 4 minutes If it was still 2012 I would have eagerly heard and responded to any conversation about Big Data. Well, it was the buzz and you had to be speaking the magic words for getting people to listen to the latest and greatest in technology. But fortunately/unfortunately, it is 2017 now and it is disappointing to note that most of the world has not moved beyond Continue Reading

2017 – Year of FAST Data

Reading Time: < 1 minute As we approach 2017, there is a strong focus on Fast Data. This is a combination of data at rest and data in motion and the speed has to be remarkably fast. In the deck that follows, we at Knoldus present to you how we have implemented a complex multi scale solution for a large bank on the Fast Data Architecture philosophy. As we partner Continue Reading

Knoldus Partners with Confluent to Power Real-Time Streams

Reading Time: 3 minutes Knoldus is pleased to announce a Consulting and System Integrator partnership with Confluent, the company founded by the creators of Apache KafkaTM Confluent, creators of the first streaming platform based on Apache KafkaTM, provides the most complete platform to build enterprise-scale streaming pipelines using Apache Kafka and simplify the development of stream processing applications. Via rapid adoption in the Fortune 500, Apache Kafka is quickly emerging Continue Reading