Scalable programming with Scala and Spark

If you are a data scientist or an analyst, you're accustomed to having various frameworks for working with information.

Self-Paced

Learning Style

Course

Learning Style

Intermediate

Difficulty

9 Hours

Course Duration

Course Info

Download PDF

Certificate

See Sample

tab
About Individual Course:
  • Individual course plan gives you access to this course
On Sale!
Now Only $10.00 Regular Price $49.00
Now Only $10.00 Regular Price $49.00
/ Each
When you subscribe, you get:
Learn Subscription plan gives you access to this course and over 903 other popular courses
On Sale!
Now Only $39.99 Regular Price $44.99
Now Only $39.99 Regular Price $44.99
/ Month
Team
Pricing
  • Buy 5-9 Enrollments And Save 68% ($12.74 monthly.)
  • Buy 10-19 Enrollments And Save 72% ($11.24 monthly.)
  • Buy 20-above Enrollments And Save 78% ($8.99 monthly.)

You have already taken demo for this course.

If you want to get access to demo again, feel free to contact our support at (855) 800-8240

If you are a data scientist or an analyst, you're accustomed to having various frameworks for working with information.

Course Information

About the course:

If you are a data scientist or an analyst, you're accustomed to having various frameworks for working with information. Python, SQL, Java, R, and so forth. With Spark, you have a solitary engine where you can find out and play with a lot of information, run algorithms of machine learning and afterward utilize a similar framework to productionize your code.

Scala: Scala is a universally useful programming language - like C++ or Java. Its accessibility of a REPL environment and the practical programming nature make it especially appropriate for a distributed computing system like Spark.

Analytics: Using Scala and Spark you can explore and analyze your information in an intelligent situation with quick feedback. The course will tell the best way to use the intensity of Dataframes and RDDs to control information easily.

Machine Learning and Data Science: Spark's built-in libraries and core functionality make it simple to actualize complex calculations like Recommendations with not too many lines of code. We'll cover an assortment of datasets and calculations including MapReduce, PageRank, and Graph datasets.

Course Objective:

Scala Programming Constructs: Traits, Classes, Closures, First Class Functions, Case Classes Currying.

Lots of cool stuff.

  • Utilizing Spark Streaming for stream preparing
  • Spark SQL and Dataframes to work with Twitter information.
  • Recommendations for Music utilizing the Audioscrobbler and Alternating Least Squares dataset
  • Utilizing the PageRank calculation with the dataset of Google web graph.
  • Working with graph information utilizing the dataset of Marvel Social system.

Spark basic and advanced features:

  • Resilient Distributed Datasets, Actions (reduce, aggregate), Transformations (map, filter, flatMap)
  • Pair RDDs, combineByKey, reduceByKey,
  • Accumulator and Broadcast variables
  • Spark for MapReduce.
  • The Java API for Spark.
  • Spark Streaming, Spark SQL, GraphX and MLlib.

Audience:

  • Specialists who need to utilize a distributed computing engine for stream processing or batch or both
  • An analyst who needs to use Spark for breaking down fascinating datasets
  • Data Scientists who need a solitary engine for modeling and analyzing information and productionizing it.

Prerequisite:

All models work without or with Hadoop. If you might want to utilize Spark with Hadoop, you'll require to have Hadoop introduced (either in cluster mode or pseudo-distributed).

Outline

More Information

More Information
Subjects App Development, Big Data
Lab Access No
Learning Style Self-Paced Learning
Learning Type Course
Difficulty Intermediate
Course Duration 9 Hours
Language English
VPA Discount VPA Discount

Reviews

Write Your Own Review
Only registered users can write reviews. Please Sign in or create an account

Contact A Learning Consultant


click here