cleaning data

Thinking AI? Think Data First

Reading Time: 4 minutes There is a lot of interest in Machine Learning and AI. Ofcourse, a lot of it is still the level 1 of AI . This is when we are thinking about machines acting like humans. Everyone wants to jump on the bandwagon of AI. It is an amazing field and man organizations do not want to be left behind. That said, something which is ignored most of the time is the fuel, the data!

Spark – LDA : A Complete example of clustering algorithm for topic discovery.

Reading Time: 6 minutes In this blog we will be demonstrating the functionality of applying the full ML pipeline over a set of documents which in this case we are using 10 books from the internet. So lets start with first thing first.. What is Clustering ? Clustering is the task of grouping a set of objects in such a way that objects in the same group (called a Continue Reading