Artificial intelligence

Cloud Data Loss Prevention (DLP): Part-2

Reading Time: 2 minutes Google Cloud Platform’s Data Loss Protection API provides a service that can make organizations manage sensitive data, including detecting and redaction, masking, and tokenizing such data. This can help organizations comply with regulations such as GDPR, and reduce the risk of data exposure and data breaches. Such as a name, email address, telephone number, identification number, or credit card number. In the previous blog Cloud Data Loss Continue Reading

A Simple Guide to OCR using Pytesseract

Reading Time: 2 minutes What is OCR OCR is an acronym for optical character recognition. It is a widespread technology to recognize text inside images, such as scanned documents and photos. OCR technology is used to convert virtually any kind of image containing written text (typed, handwritten, or printed) into machine-readable text data.  OCR using Pytesseract Python-tesseract is a wrapper for Google’s Tesseract OCR engine. It can read any Continue Reading

Text Data Vectorization Techniques in Natural Language Processing

Reading Time: 6 minutes Features in any Machine Learning algorithms are generally numerical data on which we can easily perform any mathematical operations. But Machine Learning algorithms cannot work on raw text data. Machine Learning algorithms can only process numerical representation in form of vector(matrix) of actual text. For converting textual data into numerical representation of features we can use the following text vectorization techniques in Natural Language Processing. Continue Reading

DBSCAN Clustering Algorithm

Reading Time: 4 minutes What is Clustering? Clustering, often known as cluster analysis, is an unsupervised machine learning task. Using a clustering algorithm entails providing the algorithm with a large amount of unlabeled data and allowing it to locate whatever groupings in the data it can. The names given to these groups are clusters. A cluster is a collection of data points that are related to one another based Continue Reading

Fundamentals of Tensorflow – Part I

Reading Time: 4 minutes What is TensorFlow? TensorFlow is an open-source end-to-end machine learning library. It is for preprocessing data, modeling data, and serving models (getting them into the hands of others). It has a comprehensive, flexible ecosystem of tools, libraries, and community resources that lets researchers push the state-of-the-art in ML. And developers easily build and deploy ML-powered applications. Installing TensorFlow TensorFlow is tested and supported on the following Continue Reading

How To Find Correlation Value Of Categorical Variables.

Reading Time: 4 minutes Hey folks, In this blog we are going to find out the correlation of categorical variables. What is Categorical Variable? In statistics, a categorical variable has two or more categories.But there is no intrinsic ordering to the categories. For example, a binary variable(such as yes/no question) is a categorical variable having two categories (yes or no), and there is no intrinsic ordering to the categories. Continue Reading

Know basics of OpenCV for Image Processing in Python

Reading Time: 5 minutes OpenCV known as Open Source Computer Vision, is a library in Python used for Computer Vision and Image Processing Tasks. It has a modular structure and includes several shared and static libraries. OpenCV can be used in languages such as Python, C++, Java etc. Some of the applications of OpenCV include Edge Detection, Face Detection, Object Detection, Face Recognition etc. Using OpenCV we can process Continue Reading

Getting Familiar with Activation Function and Its Types.

Reading Time: 7 minutes Hey Folks, In this blog we are going to discuss activation function in Artificial Neural Networks and their different types. Before going there, let’s get some idea about what is an artificial neural network? Artificial Neural Network(i.e., ANN) Artificial Neural Network refers to a biologically inspired sub-field of Artificial Intelligence modeled after the brain. ANN is a computational network based on a biological neural network Continue Reading

Is SpaCy Python NLP Any Good? Seven Ways You Can Be Certain

Reading Time: 4 minutes SpaCy is a free, open-source library for advanced Natural Language Processing (NLP) in Python. If you’re operating with plenty of text, you’ll eventually want to know more about it. For example, what’s it about? What do the phrases suggest in context? Who is doing what to whom? Which texts are just like every other? Certainly, spaCy can resolve all the problems stated above. Linguistic Features in SpaCy SpaCy goes Continue Reading

Product demand forecasting with Knime

Reading Time: 5 minutes In this blog, we are going to see, Importance of demand forecasting and how we can easily create these forecasting workflows with Knime. Market request forecasting is a basic procedure for any business, however maybe none more so than those in buyer packaged products. Stock, production, storage, delivering, showcasing – each aspect of CPG and retail organizations’ activities are influenced by accurate forecasting. Identifying shoppers’ Continue Reading

MachineX: Run ML model prediction faster with Hummingbird

Reading Time: 3 minutes In this blog, we will see how to make our machine learning model’s prediction faster with a recently open-sourced library Hummingbird. Nowadays, we can see a lot of frameworks for deploying or serving the machine learning model into production. As a result, It is a headache for a data scientist to choose between these frameworks, keeping in mind how their model either Sklearn or LightGBM Continue Reading

AI : Nature of Environments

Reading Time: 2 minutes In the previous posts, we looked at the Rational Agent and the operating environment. In this post we would try to study the nature of the environment. The environment is the Task Environment (problem) for which the Rational Agent is the solution. Any task environment is characterised on the basis of PEAS. Performance – What is the performance characteristic which would either make the agent Continue Reading

COVID-19 Detector: Detecting Corona from X-Ray

Reading Time: 4 minutes A web application using deep learning to help medical practitioners to detect COVID-19 symptoms with chest x-rays. COVID-19 disease, caused by the SARS-CoV-2 virus, was identified in December 2019 in China and declared a global pandemic by the WHO(World Health Organization) on 11 March 2020. The disease first originated in Wuhan, China and since then it has spread globally across the world affecting more than 200 countries. Coronavirus disease 2019 (COVID-19) Continue Reading