MeetUp on “BlinkDB and G-OLA: Supporting Continuous Answers with Error Bars in SparkSQL”

Table of contents
Reading Time: < 1 minute

Big datasets are growing exponentially, but our needs to get quick interactive responses to our queries remain ever as important. This talk will feature an overview of various components in BlinkDB and introduce a new generalized online aggregation (G-OLA) paradigm in SparkSQL to incrementally process massive amounts of data on clusters of tens, hundreds or thousands of machines while returning approximate answers. More precisely, this new execution model enables SparkSQL to present the user with meaningful approximate results (with error bars) that are continuously refined and updated, at a speed comfortable to the user, while it crunches larger and larger fractions of the whole dataset in the background. This not only alleviates the need for pre-processing the data in advance for a wide range of queries, but also enables the users to observe the progress of a query and control its execution on the fly– enabling a smooth time/accuracy trade-off.

Knoldus is organizing an one hour session on 24th Nov 2015 at 6:00 PM. Mr. Sameer Agarwal from Databricks would give session on “BlinkDB and G-OLA”. All of you are invited to join this session

First Floor,
Above UCO Bank,
Near Rajendra Place Metro Station,  New Delhi, India

Please click here for more details.

Written by 

Ayush is the Sr. Lead Consultant @ Knoldus Software LLP. In his 10 years of experience he has become a developer with proven experience in architecting and developing web applications. Ayush has a Masters in Computer Application from U.P. Technical University, Ayush is a strong-willed and self-motivated professional who takes deep care in adhering to quality norms within projects. He is capable of managing challenging projects with remarkable deadline sensitivity without compromising code quality.