Category Archives: big data

Getting Introduced with Presto


Hi Folks! In today’s blog I will be introducing you to a new open source distributed Sql Query Engine – Presto. It is designed for running SQL queries over Big Data( petabytes of Data). It was designed by the people … Continue reading

Posted in big data, Scala | Tagged , , , , | Leave a comment

Connecting To Presto via JDBC


Hi Guys, In this blog we’ll be discussing about how to make a connection to presto server using JDBC, but before we get started let’s discuss what Presto is. What is Presto ? So, Presto is an open source distributed … Continue reading

Posted in big data, database, Java, Scala, sql | Tagged | Leave a comment

Introduction To HADOOP !


Here I am to going to  write a blog on Hadoop! “Bigdata is not about data! The value in Bigdata [is in] the analytics. ” -Harvard Prof. Gary King So the Hadoop came into Introduction! Hadoop is an open source, … Continue reading

Posted in Apache Flink, apache spark, big data, database, HDFS, knoldus, Scala, software, Spark, Test, testing | 2 Comments

Apache Spark : Spark Union adds up the partition of input RDDs


Some days back when I was doing union of 2 pair rdds, I found the strange behavior for the number of partitions. The output RDD got different number of partition than input Rdd. For ex: suppose rdd1 and rdd2, each … Continue reading

Posted in Agile, apache spark, Best Practices, big data, Scala | Leave a comment

Lagom Framework: The Legacy WordCount Example


What is Lagom? Lagom is an open source micro-service framework, built with Akka message-driven runtime and Play web framework and finally light bend service orchestration. Mixing all these technologies abstracts away the complexities of building, running, and managing microservice architectures. … Continue reading

Posted in big data, Microservices, Scala | Tagged , , , , | Leave a comment

Spark Cassandra Connector On Spark-Shell


Using Spark-Cassandra-Connector on Spark Shell Hi All , In this blog we will see how we can execute our spark code on spark shell using Cassandra . This is very efficient at testing or learning time , where we have … Continue reading

Posted in apache spark, big data, Cassandra, Scala, Spark | 2 Comments

Transaction Management in Cassandra


As we are all from the Sql Background and its been ages SQL rules the market , so transaction are something favorite to us . While Cassandra does not support ACID (transaction) properties but it gives you the ‘AID’ among … Continue reading

Posted in big data, Cassandra, NoSql, Scala | 4 Comments

Twitter’s tweets analysis using Lambda Architecture


Hello Folks, In this blog i will explain  twitter’s tweets analysis with lambda architecture. So first we need to understand  what is lambda architecture,about its component and usage. According to Wikipedia, Lambda architecture is a data processing architecture designed to handle … Continue reading

Posted in Akka, akka-http, Apache Kafka, apache spark, Architecture, Batch, big data, Cassandra, Scala, Spark, Streaming | 5 Comments

Cassandra Tips And Techniques


Tips and Techniques to Load Data in Cassandra Using Java – Driver .
To Implement Pagination Concepts in various Scenarios in Cassandra Continue reading

Posted in big data, Cassandra, NoSql, Scala | 1 Comment

Short Interview With SMACK Tech Stack !!!


Hello guy’s, today’s we conduct short interview with SMACK about its architecture and there uses. Let’s start with of some introduction. Interviewer: How would you describe your self ? SMACK: I am SMACK (Spark, Mesos, Akka, Cassandra and Kafka) and … Continue reading

Posted in Akka, Apache Kafka, apache spark, big data, Cassandra, Scala, Spark | Tagged , , , , , , , , , , , , | Leave a comment