Monday 30 December 2013

Hadoop Online Training



HADOOP ONLINE TRAINING


INTRODUCTION

  • What is Hadoop?
  • History of Hadoop
  • Building Blocks Hadoop Eco-System
  • Who is behind Hadoop?
  • What Hadoop is good for and why it is Good

HDFS

  • Configuring HDFS
  • Interacting With HDFS
  • HDFS Permissions and Security
  • Additional HDFS Tasks
  • HDFS Overview and Architecture
  • HDFS Installation
  • Hadoop File System Shell
  • File System Java API

MAPREDUCE

  • Map/Reduce Overview and Architecture
  • Installation
  • Developing Map/Red Jobs
  • Input and Output Formats
  • Job Configuration
  • Job Submission
  • Practicing Map Reduce Programs (atleast 10 Map Reduce Algorithms )

Getting Started With Eclipse IDE

  • Configuring Hadoop API on Eclipse IDE
  • Connecting Eclipse IDE to HDFS

Hadoop Streaming


Advanced MapReduce Features

  • Custom Data Types
  • Input Formats
  • Output Formats
  • Partitioning Data
  • Reporting Custom Metrics
  • Distributing Auxiliary Job Data

Distributing Debug Scripts


Using Yahoo Web Services


Pig

  • Pig Overview
  • Installation
  • Pig Latin
  • Pig with HDFS

Hive

  • HiveOverview
  • Installation
  • HiveQL
  • Hive Unstructured Data Analyzation
  • Hive Semistructured Data Analyzation

HBase

  • HBase Overview and Architecture
  • HBase Installation
  • HBase Shell
  • CRUDoperations
  • Scanning and Batching
  • Filters
  • HBase Key Design

ZooKeeper

  • Zoo Keeper Overview
  • Installation
  • Server Mantainace

Sqoop

  • Sqoop Overview
  • Installation
  • Imports and Exports

CONFIGURATION

  • Basic Setup
  • Important Directories
  • Selecting Machines
  • Cluster Configurations
  • Small Clusters: 2-10 Nodes
  • Medium Clusters: 10-40 Nodes
  • Large Clusters: Multiple Racks

Integrations


Putting it all together

  • Distributed installations
  • Best Practices

No comments:

Post a Comment