Tag Archives: HDFS

Smattering of HDFS

INTRODUCTION TO HDFS :- Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers.It has many similarities with existing distributed file systems. However, the differences from other distributed file … Continue reading

Posted in HDFS | Tagged , , , , , | 2 Comments

BigData Specifications – Part 1 : Configuring MySql Metastore in Apache Hive

Apache Hive is used as a data warehouse over Hadoop to provide users a way to load, analyze and query the data from various resources. Data is stored into databases or file systems like HDFS (Hadoop Distributed File System). Hive … Continue reading

Posted in Scala | Tagged , , , , , , , , , | Leave a comment