HDP Analyst Data Science (HW HDP DS)

Describe supervised and unsupervised learning differences
  • Virtual Instructor-Led

    Learning Style
  • Intermediate

    Difficulty
  • 3 Days

    Course Duration
COURSE OPTIONS
TypePriceDiscounts
Individual Course
$2,295.00
/ Each
$2,295.00
Describe supervised and unsupervised learning differences

About Course

This course Provides instruction on the processes and practice of data science, including machine learning and natural language processing. Included are: tools and programming languages (Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikitlearn), the Natural Language Toolkit (NLTK), and Spark MLlib.

Course Objective:

Recognize use cases for data science on Hadoop

  • Describe the Hadoop and YARN architecture
  • Describe supervised and unsupervised learning differences
  • Use Mahout to run a machine learning algorithm on Hadoop
  • Describe the data science life cycle
  • Use Pig to transform and prepare data on Hadoop
  • Write a Python script
  • Describe options for running Python code on a Hadoop cluster
  • Write a Pig User-Defined Function in Python
  • Use Pig streaming on Hadoop with a Python script
  • Use machine learning algorithms
  • Describe use cases for Natural Language Processing (NLP)
  • Use the Natural Language Toolkit (NLTK)
  • Describe the components of a Spark application
  • Write a Spark application in Python
  • Run machine learning algorithms using Spark MLlib
  • Take data science into production

Audience:

  • Architects, software developers, analysts and data scientists who need to apply data science and machine learning on Hadoop.

Prerequisite:

  • Students must have experience with at least one programming or scripting language, knowledge in statistics and/or mathematics,
  • and a basic understanding of big data and Hadoop principles. Students new to Hadoop are encouraged to attend the HDP Overview: Apache Hadoop Essentials course.
More Information
Lab AccessNo
TechnologyHadoop
TopicsBig Data
Learning StyleVirtual Instructor-Led
DifficultyIntermediate
Course Duration3 Days
LanguageEnglish
VPA EligibleVPA Eligible
Write Your Own Review
You're reviewing:HDP Analyst Data Science (HW HDP DS)
Your Rating
Why QuickStart

Why Choose QuickStart?

  • Cognitive Learning Based Platform
  • Manager Mode and Full Learning Dashboard
  • Peer/Team Social Networking Experience
  • Bulk Pricing and Team Discouns

Concierge

Sales (866) 991-3924

Mon-Fri. 8am-6pm CST

Concierge

Chat Live With Us

Mon-Fri. 8am-6pm CST

Concierge

Ask A Learning Concierge