Scalable programming with Scala and Spark

If you are a data scientist or an analyst, you're accustomed to having various frameworks for working with information.

Self-Paced

Learning Style

Course

Learning Style

Intermediate

Difficulty

9 Hours

Course Duration

Course Info

Download PDF

Certificate

See Sample

tab
About Individual Course:
  • Individual course plan gives you access to this course
On Sale!
Now Only $10.00 Regular Price $49.00
Now Only $10.00 Regular Price $49.00
/ Each
2 Learners Have Enrolled For This Course
When you subscribe, you get:
Learn Subscription plan gives you access to this course and over 841 other popular courses
On Sale!
Now Only $39.99 Regular Price $44.99
Now Only $39.99 Regular Price $44.99
/ Month
Team
Pricing
  • Buy 5-9 Enrollments And Save 68% ($12.74 monthly.)
  • Buy 10-19 Enrollments And Save 72% ($11.24 monthly.)
  • Buy 20-above Enrollments And Save 78% ($8.99 monthly.)
2 Learners Have Enrolled For This Course

You have already taken demo for this course.

If you want to get access to demo again, feel free to contact our support at (855) 800-8240

If you are a data scientist or an analyst, you're accustomed to having various frameworks for working with information.

Course Information

About the course:

If you are a data scientist or an analyst, you're accustomed to having various frameworks for working with information. Python, SQL, Java, R, and so forth. With Spark, you have a solitary engine where you can find out and play with a lot of information, run algorithms of machine learning and afterward utilize a similar framework to productionize your code.

Scala: Scala is a universally useful programming language - like C++ or Java. Its accessibility of a REPL environment and the practical programming nature make it especially appropriate for a distributed computing system like Spark.

Analytics: Using Scala and Spark you can explore and analyze your information in an intelligent situation with quick feedback. The course will tell the best way to use the intensity of Dataframes and RDDs to control information easily.

Machine Learning and Data Science: Spark's built-in libraries and core functionality make it simple to actualize complex calculations like Recommendations with not too many lines of code. We'll cover an assortment of datasets and calculations including MapReduce, PageRank, and Graph datasets.

Course Objective:

Scala Programming Constructs: Traits, Classes, Closures, First Class Functions, Case Classes Currying.

Lots of cool stuff.

  • Utilizing Spark Streaming for stream preparing
  • Spark SQL and Dataframes to work with Twitter information.
  • Recommendations for Music utilizing the Audioscrobbler and Alternating Least Squares dataset
  • Utilizing the PageRank calculation with the dataset of Google web graph.
  • Working with graph information utilizing the dataset of Marvel Social system.

Spark basic and advanced features:

  • Resilient Distributed Datasets, Actions (reduce, aggregate), Transformations (map, filter, flatMap)
  • Pair RDDs, combineByKey, reduceByKey,
  • Accumulator and Broadcast variables
  • Spark for MapReduce.
  • The Java API for Spark.
  • Spark Streaming, Spark SQL, GraphX and MLlib.

Audience:

  • Specialists who need to utilize a distributed computing engine for stream processing or batch or both
  • An analyst who needs to use Spark for breaking down fascinating datasets
  • Data Scientists who need a solitary engine for modeling and analyzing information and productionizing it.

Prerequisite:

All models work without or with Hadoop. If you might want to utilize Spark with Hadoop, you'll require to have Hadoop introduced (either in cluster mode or pseudo-distributed).

Outline

More Information

More Information
SubjectsApp Development, Big Data
Lab AccessNo
Learning StyleSelf-Paced Learning
Learning TypeCourse
DifficultyIntermediate
Course Duration9 Hours
LanguageEnglish
VPA DiscountVPA Discount

Reviews

Write Your Own Review
Only registered users can write reviews. Please Sign in or create an account

Course Expert:

Author

Tom Robertson
(Data Science Enthusiast)

Tom Robertson is a Python-based data scientist with domain knowledge in behavioral neuroscience research and non-profit development. He is currently employeed as Data Science Instructor at QuickStart Technologies. Tom is inspired by the belief that data science can improve quality of life by providing our researchers, doctors and policy-makers with new, data-driven insights.

Tom has expertise on Python, SQL, and Spark. He has worked on several libraries including but not limited to Scikit-learn, Pandas, NumPy, Matplotlib, Seaborn, SciPy, NLTK, Keras, and Tensorflow.

This Subscription Includes:

Information
Virtual instructor Led

Virtual Classroom Courses

Our virtual instructor-led courses give you access to live instructors training you with other live students in a virtual classroom environment.

200+ Virtual Classroom Courses
Information
Self-Paced

Online Self-Paced Courses

Take self-paced online courses at your convenience and own pace, with unlimited access to courses in various emerging technologies.

700+ Self-Paced Courses
Information
College

E-books blogs, case studies, ariticles

As part of informal learning, our platform will recommend E-books, whitepapers, case studies, articles, and videos. This is AI curated content closely aligned with your learning objectives.

E-books blogs, case studies,...
Information
College

College Accredited Courses

QuickStart courses are accredited by several top schools and universities, including Texas A&M and University of Phoenix. You can print out certificates and also apply them towards your degree plan with them.

College Accredited
Information
Dashboard

Full Learning Dashboard & Analytics

Access all your enrolled, completed, course statistics, and community discussions from one centralized and intuitive learning dashboard with built in analytics, course tracking, time spent, and more.

Analytics/Reporting
Information
Social

QuickStart Discussions

Engage with other learners where you can directly chat, ask questions, and socialize with other learners experts and instructors on a course subject.

Community Access Community Access
Information
Labs

Virtual Labs

Videos and lectures only go so far. Get real world, hands-on practice with virtual labs (not available for all courses).

Virtual Labs Virtual Labs
Information
Live Instructor Support

Live Instructor Mentoring & Support

Get your IT problems solved through a community of mentors, experts and peers. Get live help from experts to answer questions on course material or guidance on a project.

Mentoring & Discussions Mentoring & Discussions
Information
Dashboard

Career Paths

Start a learning pathway towards understanding and mastering your career. With QuickStart career paths, you can fully understanding and being the best in your field.

Learning Paths Learning Paths
Information
Dashboard

Informal Learning

Access to AI curated content from various content publishers which can help in self-directed learning.

Informal Learning Informal Learning
click here