Machine Learning Essentials with Scala

Machine Learning Essentials with Scala is an essentials-level, three-day hands-on course that teaches students core skills and concepts in modern ML practices. This course is geared for attendees new to machine learning who need introductory level coverage of these topics, rather than a deep dive of the math and statistics behind Machine Learning. Students will learn basic algorithms from scratch. For each machine learning concept, students will first learn about and discuss the foundations, its applicability and limitations, and then explore the implementation and use, reviewing and working with specific use cases.

Retail Price: $2,195.00

Next Date: 04/08/2024

Course Days: 3


Enroll in Next Date

Request Custom Course


Course Objectives

Working in a hands-on learning environment, led by our Machine Learning expert instructor, students will learn about and explore:

  • Popular machine learning algorithms, their applicability and limitations
  • Practical application of these methods in a machine learning environment
  • Practical use cases and limitations of algorithms

Topics Covered: This is a high-level list of topics covered in this course. Please see the detailed Agenda below

  • Machine Learning (ML) Overview
  • Machine Learning Environment
  • Machine Learning Concepts
  • Feature Engineering (FE)
  • Linear regression
  • Logistic Regression
  • Classification : SVM (Supervised Vector Machines)
  • Classification : Decision Trees & Random Forests
  • Classification : Naive Bayes
  • Clustering (K-Means)
  • Principal Component Analysis (PCA)
  • Recommendation (Collaborative filtering)

 

Course Prerequisites

This in an introductory-level course is geared for experienced Scala developers intending to start using learning about and working with basic machine learning algorithms and concepts.

Pre-Requisites:  Students should have

  • Basic Scala programming skills.  Attendees without Scala programming background may view labs as follow along exercises or team with others to complete them. (NOTE: This course is also offered in Python or R – please inquire for details)
  • Good foundational mathematics skills in Linear Algebra and Probability, to start learning about and using basic machine learning algorithms and concepts
  • Basic Linux skills, including familiarity with command-line options such as ls, cd, cp, and su

 

Course Agenda

Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We’ll work with you to tune this course and level of coverage to target the skills you need most.

Machine Learning (ML) Overview

  • Machine Learning landscape
  • Machine Learning applications
  • Understanding ML algorithms & models (supervised and unsupervised)

Machine Learning Environment

  • Introduction to Jupyter notebooks / R-Studio
  • Exercise: Getting familiar with ML environment

Machine Learning Concepts

  • Statistics Primer
  • Covariance, Correlation, Covariance Matrix
  • Errors, Residuals
  • Overfitting / Underfitting
  • Cross validation, bootstrapping
  • Confusion Matrix
  • ROC curve, Area Under Curve (AUC)
  • Exercise: Working with Basic Statistics

Feature Engineering (FE)

  • Preparing data for ML
  • Extracting features, enhancing data
  • Data cleanup
  • Visualizing Data
  • Exercise: data cleanup
  • Exercise: visualizing data

Linear regression

  • Simple Linear Regression
  • Multiple Linear Regression
  • Running LR
  • Evaluating LR model performance
  • Exercise / Use case: House price estimates

Logistic Regression

  • Understanding Logistic Regression
  • Calculating Logistic Regression
  • Evaluating model performance
  • Use case: credit card application, college admissions

Classification : SVM (Supervised Vector Machines)

  • SVM concepts and theory
  • SVM with kernel
  • Use case: Customer churn data

Classification : Decision Trees & Random Forests

  • Theory behind trees
  • Classification and Regression Trees (CART)
  • Random Forest concepts
  • Exercise / Use case: predicting loan defaults, estimating election contributions

Classification : Naive Bayes

  • Theory behind Naive Bayes
  • Running NB algorithm
  • Evaluating NB model
  • Exercise / Use case: spam filtering

Clustering (K-Means)

  • Theory behind K-Means
  • Running K-Means algorithm
  • Estimating the performance
  • Exercise / Use case: grouping cars data, grouping shopping data

Principal Component Analysis (PCA)

  • Understanding PCA concepts
  • PCA applications
  • Running a PCA algorithm
  • Evaluating results
  • Exercise / Use case: analyzing retail shopping data

Recommendation (Collaborative filtering)

  • Recommender systems overview
  • Collaborative Filtering concepts
  • Use case: movie recommendations, music recommendations

Time Permitting: Capstone Project

  • Hands-on guided workshop utilizing skills learned throughout the course
Course Dates Course Times (EST) Delivery Mode GTR
4/8/2024 - 4/10/2024 10:00 AM - 6:00 PM Virtual Enroll
6/5/2024 - 6/7/2024 10:00 AM - 6:00 PM Virtual Enroll
7/24/2024 - 7/26/2024 10:00 AM - 6:00 PM Virtual Enroll
9/11/2024 - 9/13/2024 10:00 AM - 6:00 PM Virtual Enroll
10/23/2024 - 10/25/2024 10:00 AM - 6:00 PM Virtual Enroll
12/4/2024 - 12/6/2024 10:00 AM - 6:00 PM Virtual Enroll