Flume and Sqoop for Ingesting Big Data

Engineers who want to port data from legacy data stores to HDFS

About Individual Course:
  • Individual course plan gives you access to this course
  • On sale: now only $10.00 each (regular price $49.00)

When you subscribe, you get:
  • Learn Subscription plan gives you access to this course and over 836 other popular courses
  • On sale: now only $39.99 per month (regular price $44.99)

Team Pricing:
  • Buy 5-9 enrollments and save 68% ($12.74 monthly)
  • Buy 10-19 enrollments and save 72% ($11.24 monthly)
  • Buy 20 or more enrollments and save 78% ($8.99 monthly)

Course Information

Import data: Flume and Sqoop play a special role in the Hadoop ecosystem. They transport data from sources that hold or produce it, such as local file systems, HTTP endpoints, MySQL, and Twitter, into data stores like HDFS, HBase, and Hive. Both tools come with built-in functionality and abstract away the complexity of moving data between these systems.

Flume: Flume Agents can transport data produced by a streaming application to data stores like HDFS and HBase. 
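A minimal Flume agent configuration along these lines wires an HTTP source to an HDFS sink through a memory channel; the agent and component names, the port, and the HDFS path are illustrative placeholders, not values taken from the course:

    # flume-http-to-hdfs.conf: HTTP source -> memory channel -> HDFS sink
    agent1.sources = src1
    agent1.channels = ch1
    agent1.sinks = sink1

    # HTTP source: the streaming application POSTs JSON events to this port
    agent1.sources.src1.type = http
    agent1.sources.src1.bind = 0.0.0.0
    agent1.sources.src1.port = 44444
    agent1.sources.src1.channels = ch1

    # Memory channel buffers events between source and sink
    agent1.channels.ch1.type = memory
    agent1.channels.ch1.capacity = 10000

    # HDFS sink: events land in date-bucketed directories
    agent1.sinks.sink1.type = hdfs
    agent1.sinks.sink1.hdfs.path = hdfs://namenode:8020/flume/events/%Y-%m-%d
    agent1.sinks.sink1.hdfs.fileType = DataStream
    agent1.sinks.sink1.hdfs.useLocalTimeStamp = true
    agent1.sinks.sink1.channel = ch1

Such an agent would then be started with something like: flume-ng agent --conf conf --conf-file flume-http-to-hdfs.conf --name agent1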

Sqoop: Use Sqoop to bulk-import data from a traditional RDBMS into Hadoop storage architectures like HDFS or Hive.
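As a sketch, with the JDBC URL, database, table, and directory names below used as placeholders rather than values from the course, a bulk import from MySQL into HDFS might look like:

    # Import the "orders" table from MySQL into an HDFS directory using 4 parallel map tasks
    sqoop import \
      --connect jdbc:mysql://dbhost:3306/sales \
      --username report \
      --password-file /user/hadoop/.mysql-password \
      --table orders \
      --target-dir /data/sales/orders \
      --num-mappers 4

Adding --hive-import to the same command would load the rows into a Hive table instead of a plain HDFS directory.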

Course Objective:

Practical implementations for a variety of sources and data stores:

  • Sources: Twitter, MySQL, Spooling Directory, HTTP
  • Sinks: HDFS, HBase, Hive

Flume features:

Flume Agents, Flume Events, Event bucketing, Channel selectors, Interceptors
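To give a flavor of these features, the fragment below extends the earlier HTTP-to-HDFS sketch (the "env" header and the second channel are assumed for illustration): it adds a timestamp interceptor, routes events with a multiplexing channel selector, and buckets events into dated HDFS directories via escape sequences in the sink path:

    # Timestamp interceptor: stamps each event so time-based path escapes resolve
    agent1.sources.src1.interceptors = ts
    agent1.sources.src1.interceptors.ts.type = timestamp

    # Multiplexing channel selector: route events by the value of their "env" header
    # (assumes channels ch1 and ch2 are both defined on the agent)
    agent1.sources.src1.selector.type = multiplexing
    agent1.sources.src1.selector.header = env
    agent1.sources.src1.selector.mapping.prod = ch1
    agent1.sources.src1.selector.default = ch2

    # Event bucketing: %Y/%m/%d groups events into per-day directories
    agent1.sinks.sink1.hdfs.path = hdfs://namenode:8020/flume/events/%Y/%m/%d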

Sqoop features:

Sqoop import from MySQL, Incremental imports using Sqoop Jobs
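For instance, an incremental import can be packaged as a saved Sqoop job so that the last imported value is remembered between runs; the connection details, table, and check column below are placeholders:

    # Create a saved job that appends only rows whose order_id exceeds the stored last-value
    sqoop job --create orders_incremental -- import \
      --connect jdbc:mysql://dbhost:3306/sales \
      --username report \
      --table orders \
      --target-dir /data/sales/orders \
      --incremental append \
      --check-column order_id \
      --last-value 0

    # Execute the job; Sqoop updates the stored last-value after each successful run
    sqoop job --exec orders_incremental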

Audience:

  • Engineers building an application with HDFS/HBase/Hive as the data store
  • Engineers who want to port data from legacy data stores to HDFS

Prerequisites:

  • Working knowledge of HDFS is a prerequisite for the course
  • The HBase and Hive examples assume a basic understanding of the HBase and Hive shells
  • HDFS is required to run most of the examples, so you will need a working HDFS installation (a quick check is shown below)
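If you want to confirm that your HDFS installation is reachable before starting, a quick check along these lines (the directory name is just an example) should complete without errors:

    hdfs dfs -mkdir -p /tmp/flume-sqoop-demo
    hdfs dfs -ls /tmp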

More Information
Subjects: Big Data
Lab Access: No
Learning Style: Self-Paced Learning
Difficulty: Intermediate
Course Duration: 2 Hours
Language: English

