Hadoop Framework Tutorial

Hadoop – Overview
What is Big Data
What is Hadoop
Installing Hadoop in Pseudo-distributed mode
GenericOptionsParser And ToolRunner in Hadoop

HDFS
Introduction to Hadoop Distributed File System (HDFS)
Frequently Used HDFS Commands With Examples
NameNode, Secondary Namenode and Datanode in HDFS
HDFS Replica Placement Policy
How to Fix Corrupt Blocks…


Avro File Format in Hadoop

Apache Avro is a language-independent data serialization system native to Hadoop. The Apache Avro project was created by Doug Cutting, the creator of Hadoop, to increase data interoperability in Hadoop. Avro implementations are available for C, C++, C#, Java, PHP, Python, and Ruby, making it easier to interchange…
