ML, AI and Data Engineering

MachineX: Medical Image Analyses for Malaria Detection

Reading Time: 9 minutes In this blog we will see the implementation of a neural network which will help us to detect malaria in a blood sample. In our previous blog MachineX: Malaria detection using Artificial Intelligence , we had talked about why Ai is important to make it more accurate and how. Now before we begin, I’d like to point out that I am neither a doctor nor Continue Reading

Understanding Spark’s Logical and Physical Plan in layman’s term

Reading Time: 5 minutes This blog pertains to Apache SPARK 2.x, where we will find out how Spark SQL works internally in layman’s terms and try to understand what is Logical and Physical Plan. Also we will be looking into Catalyst Optimizer. So let’s get started. First let’s see what Apache Spark is. The official definition of Apache Spark says that “Apache Spark™ is a unified analytics engine for large-scale Continue Reading

MachineX: Malaria detection using Artificial Intelligence

Reading Time: 5 minutes In this blog we will talk about why Malaria detection is important to detect early presence of parasitized cells in a thin blood smear. Introduction Malaria is a deadly, infectious mosquito-borne disease caused by Plasmodium parasites. These parasites are transmitted by the bites of infected female Anopheles mosquitoes. While we won’t get into details about the disease, there are five main types of malaria. Let’s Continue Reading

Apache Spark

Deep Dive into Apache Spark Transformations and Action

Reading Time: 4 minutes In our previous blog of Apache Spark, we discussed a little about what Transformations & Actions are? Now we will get deeper into the topic and will understand what actually they are & how they play a vital role to work with Apache Spark? What is Spark RDD? Spark introduces the concept of an RDD (Resilient Distributed Dataset), an immutable fault-tolerant, distributed collection of objects Continue Reading

Is Machine Learning In Banking Sector The Most Trending Thing Now?

Reading Time: 4 minutes Have you ever imagined a world without Banks? Just try it, and you will find it very difficult to live in a world without banks. Banks are one of the most important parts of the financial economy of any country. On the other hand Machine Learning and AI are one of the most trending technologies these days.  In this blog we will be discussing different Continue Reading

MachineX: Alphabets of PyTorch (Part 1)

Reading Time: 6 minutes Overview In this blog, you’ll get an introduction to deep learning using the PyTorch framework, we will see some basics of PyTorch. Introduction to PyTorch PyTorch is a Python machine learning package based on Torch, which is an open-source machine learning package based on the programming language Lua. Two main features: Tensor computation (like NumPy) with strong GPU acceleration Automatic differentiation for building and training Continue Reading

Diving deeper into Delta Lake

Reading Time: 6 minutes Delta Lake is an open-source storage layer that brings reliability to data lakes. It has numerous reliability features including ACID transactions, scalable metadata handling, and unified streaming and batch data processing.

Delta Lake To the Rescue

Reading Time: 4 minutes Welcome Back. In our previous blogs, we tried to get some insights about Spark RDDs and also tried to explore some new things in Spark 2.4. You can go through those blogs here: RDDs – The backbone of Apache Spark Spark 2.4: Adding a little more Spark to your code In this blog, we will be discussing something called a Delta Lake. But first, let’s Continue Reading

MachineX: Welcome to TensorFlow 2.0

Reading Time: 3 minutes With the release of Tensorflow 2.0 we have taken another step towards machine dominating human dysphoria. Just kidding!!, this debate is for big people like Elon Musk, Mark Juckerberg or Jack Ma. We are just happy that Tensorflow 2.0 has been released and it will make our life a lot easier as 2.0 is being improved with consideration of freedbacks from its users. As part Continue Reading

MachineX: Image Data Augmentation Using Keras

Reading Time: 4 minutes In this blog , we will focus on Image Data Augmentation using keras and how we can implement same. Problem When we work with image classification projects, the input which a user will give can vary in many aspects like angles, zoom and stability while clicking the picture. So we should train our model to accept and make sense of almost all types of inputs. Continue Reading

MachineX: Importance of ML/AI in Healthcare

Reading Time: 3 minutes Folks, In this blog I will going to explain the importance of ML/AI in healthcare sector.  First of all, I just want to share some statistics regarding the expenditure on healthcare by the people of different countries. Here is a list of a few BRICS and newly industrialized nations with their per capita expenditure on health. Here we can see in case of India only Continue Reading

Spark – Actions and Transformations

Reading Time: 4 minutes Hey guys, welcome to series of spark blogs, this blog being the first blog in this series we would try to keep things as crisp as possible, so let’s get started. So I recently get to start learning spark about believe me and now it has made me inquisitive about it, for a brief introduction of spark, I would say that it is a pretty Continue Reading

Tale of Apache Spark

Reading Time: 6 minutes Data is being produced extensively in today’s world and it is going to be generated more rapidly in future. 90% of total data that is produced in the world is produced in last two years only and it is estimated that in 2020 world’s total data would reach 45 ZB and data generated each day would be enough that if we try to store it Continue Reading

Knoldus Pune Careers - Hiring Freshers

Get a head start on your career at Knoldus. Join us!