Tag Archives: HDFS

Simple Java Program to Append to a File in HDFS


In this blog, I will present a Java program that appends to a file in HDFS, using Maven as the build tool. To start, we need to add the Maven dependencies in pom.xml. … Continue reading

Posted in big data, HDFS, Java | 1 Comment

Resolving the Failure Issue of NameNode


In the previous blog, “Smattering of HDFS”, we learnt that “The NameNode is a Single Point of Failure for the HDFS Cluster”. Each cluster had a single NameNode, and if that machine became unavailable, the whole cluster would become unavailable … Continue reading

Posted in big data, HDFS, Scala | 1 Comment

Smattering of HDFS


INTRODUCTION TO HDFS: Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers. It has many similarities with existing distributed file systems. However, the differences from other distributed file … Continue reading

Posted in HDFS | 2 Comments

BigData Specifications – Part 1 : Configuring MySQL Metastore in Apache Hive


Apache Hive is used as a data warehouse over Hadoop, giving users a way to load, analyze and query data from various sources. Data is stored in databases or file systems like HDFS (Hadoop Distributed File System). Hive … Continue reading

Posted in Scala | Leave a comment