Query Optimization

Overview of Static Partitioning in Apache Hive

Reading Time: 4 minutes What is Partitioning? In simple words, we can explain Partitioning as the process of dividing something into sections or parts, with the motive of making it easily understandable and manageable. Apache Hive allows us to organize the table into multiple partitions where we can group the same kind of data together. It is used for distributing the load horizontally which also helps to increase query Continue Reading

Exploring HepPlanner for Apache Calcite

Reading Time: 3 minutes In this blog, we will see different ways to manipulate a rel node tree using a Hep planner. A basic understanding of Apache Calcite is necessary for this. Check out the homepage here https://calcite.apache.org/ What is a HepPlanner? It is a rule-based planner to transform a relational expression represented as a tree-like structure. It allows us to specify a condition to identify particular nodes of Continue Reading

Apache Calcite : Adding custom types and functions

Reading Time: 2 minutes Introduction In this blog we will introduce a custom function and type in our SQL . In the end,we want to parse,validate and convert to a relational node for a simple query like“SELECT CAST(my_custom_function(name) as my_custom_type) FROM SAMPLE” . Setting up the basics A sample schema First we need a simple table named Sample : Sample(ID int not null,NAME varchar not null) FrameworkConfig Next we Continue Reading