python

Is SpaCy Python NLP Any Good? Seven Ways You Can Be Certain

Reading Time: 4 minutes SpaCy is a free, open-source library for advanced Natural Language Processing (NLP) in Python. If you’re operating with plenty of text, you’ll eventually want to know more about it. For example, what’s it about? What do the phrases suggest in context? Who is doing what to whom? Which texts are just like every other? Certainly, spaCy can resolve all the problems stated above. Linguistic Features in SpaCy SpaCy goes Continue Reading

Pandas for Data Analysis

Reading Time: 4 minutes Why Pandas for data Analysis? Real ‘raw’ data needs a lot of ‘wrangling’ operations before it can be ready for dissection by a data scientist one of the popular tools for data wrangling in python is Pandas. Because of the availability of widespread packages of Pandas for almost every possible function. The library Pandas is one such package that makes life easier especially for data analysis. Through Continue Reading

Scala vs Python for Apache Spark: An In-depth Comparison

Reading Time: 5 minutes Imagine the first day of a new Apache Spark project. The project manager looks at the team and says: which one to choose, scala or python. So let’s start with “scala vs python for spark”.  You may wonder if this is a tricky question. What does the enterprise demand say? Is this like asking iOS or Android? Is there a right or wrong answer? So Continue Reading

Functional Programming with Python

Reading Time: 4 minutes What is Functional Programming? “Functional programming is a programming paradigm that treats computation as the evaluation of mathematical functions and avoids state and mutable data”. In other words, functional programming (FP) promotes code with no side effects and immutable variables. It is a declarative type of programming style. Its main focus is on “what to solve” in contrast to an imperative style where the main Continue Reading

How to install python in target host using Ansible

Reading Time: 3 minutes Ansible is an open-source automation engine that automates software provisioning, configuration management, and application deployment. It is quickly becoming the popular configuration management tool today. It lets you to control and config the target nodes from single host machine. As we know the only requirement in target machine is the modern version of python installed. But what if the python version is not installed on Continue Reading

An Introduction to Mako Templates in Python

Reading Time: 3 minutes Introduction to Mako Have you ever worked on a project in Python and thought to yourself, “Gee, it would be nice if I could dynamically generate web pages and/or text and have them directly interact with my program.” Well, you’re in luck because enter Mako templates. Mako is a template library and a Python Sever Page language, allowing content such as HTML, XML, and text Continue Reading

Marshmallow: Easy Serialization in Python

Reading Time: 2 minutes Introduction Marshmallow, stylized as “marshmallow”, is an object-relational mapping library which is used to convert objects to and from Python data types. It is often used alongside SQLAlchemy, an ORM that maps database schemas to Python objects. Marshmallow is often used to deserialize Python dicts to SQLAlchemy models and vice versa. Let’s focus on how to use Marshmallow. Creating Schemas First, we need to create Continue Reading

GitPython: How to use Git with Python

Reading Time: 4 minutes GitPython is a python library used to interact with git repositories. It is a module in python used to access our git repositories. It provides abstractions of git objects for easy access of repository data, and additionally allows you to access the git repository more directly using pure python implementation. Requirements for GitPython Python3 Git GitPython module pip and virtualenv, which come packaged with Python Continue Reading

Running Apache Airflow DAG with Docker

Reading Time: 3 minutes In this blog, we are going to run the sample dynamic DAG using docker. Before that, let’s get a quick idea about the airflow and some of its terms. What is Airflow? Airflow is a workflow engine which is responsible for managing and scheduling running jobs and data pipelines. It ensures that the jobs are ordered correctly based on dependencies and also manages the allocation Continue Reading