Blue Green Deployments: Reducing the downtime of apps


Ever heard of an "application outage"? As part of agile practice we release our work frequently, and often, when a newer version of an application is released to production, we get application outages due to issues like unexpected traffic, introduced a bug into…

Error handling in Scala: What, where and how?


error handling, monadic, functional error handling

AMPS: Empowering real time message driven applications.


Greetings!! In this blog, we will talk about AMPS, a pub-sub engine that delivers messages in real time for a subject of interest. AMPS is mainly used by financial institutions as an enterprise message bus. We will also demonstrate how we…

Introduction to Structured Streaming


Hello!! Knoldus organized a half-hour session on Structured Streaming, covering the API changes, how it differs from the earlier stream computation paradigm (DStreams), and an example API demonstration. Hope you enjoy it. Below are the slides and Video…

Sharing RDD’s states across Spark applications with Apache Ignite


Apache Ignite offers an abstraction over native Spark RDDs such that the state of RDDs can be shared across Spark jobs, workers and applications, which is not possible with native Spark RDDs. In this blog, we will walk through the…

Controlling RDD Partitions in Apache Spark


In this blog, we will discuss what RDD partitioning is, why partitioning is important, and how to create and use Spark partitioners to minimize shuffle operations across the nodes in a distributed Spark application. What is Partitioning? Partitioning is a transformation…

Build your personalized movie recommender with Scala and Spark


In this blog I will explain what a recommendation engine is in general, and how to build a personalized recommendation model using Scala and Spark's collaborative filtering algorithm. What is a Recommendation Engine? I assume you've shopped online for books…

Introduction to Java 8


The Functional Features of Java 8: Java 8 was a major release in terms of language and APIs. The language includes several ideas from functional programming, like behavior parameterization, passing lambda expressions as method arguments, processing data with stream pipelines, etc. The following presentation…

Broadcast variables in Spark, how and when to use them?


As the documentation for Spark broadcast variables states, they are immutable shared variables which are cached on each worker node in a Spark cluster. In this blog, we will demonstrate a simple use case of broadcast variables. When to use Broadcast variable?…

Aggregating Neighboring vertices with Apache Spark GraphX Library


To understand the problems addressed by "Neighborhood Aggregation", we can think of queries like: "Who has the maximum number of followers under 20 on Twitter?" In this blog, we will learn how to aggregate properties of neighboring vertices on a graph…