Data Integration

IntelliJ hacks using Scala

Reading Time: 3 minutes

Introduction: An IDE, or integrated development environment, is a software application that combines, in one place, all the tools needed for a software development project. On a more basic level, IDEs provide interfaces for users to write code, organize text groups, and automate repetitive programming tasks. Here we will talk about IntelliJ IDEA, one of the popular IDEs on the market, which has advantages over others…

PDI: An Introduction to Spoon

Reading Time: 4 minutes

Prerequisites: Basic knowledge of Big Data and ETL.

What is PDI? PDI stands for Pentaho Data Integration. It is a tool that provides ETL capabilities for managing huge and complex data ingestion pipelines. Its use cases include loading huge data sets into databases, performing simple to complex transformations on data, migrating data between different databases, and many more.

Installing PDI in your…