Introduction to R | JumpStart to R Programming

R is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has also found followers among statisticians, engineers and scientists without computer programming skills who find it easy to use. Its popularity is due to the increasing use of data mining for various goals such as set ad prices, find new drugs more quickly or fine-tune financial models. R has a wide variety of packages for data mining. It's a language that many non-programmers can easily work with, naturally extending a skill set that is common to high-end Excel users. It's the perfect tool for when the analyst has a statistical, numerical, or probabilities-based problem based on real data and they've pushed Excel past its limits.

Retail Price: $2,195.00

Next Date: Request Date

Course Days: 3


Request a Date

Request Custom Course


Learning Objectives

This course provides indoctrination in the practical use of the umbrella of technologies that are on the leading edge of data science development focused on R and related tools. Working in a hands-on learning environment, led by our expert practitioner, students will learn R and its ecosystem, and where it’s a better a tool than Excel.

This course is approximately 50% hands-on, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises. Our engaging instructors and mentors are highly experienced practitioners who bring years of current "on-the-job" experience into every classroom. Working in a hands-on learning environment, guided by our expert team, attendees will learn about and explore:

  • R Language and Mathematics
  • How to work with R Vectors
  • How to read and write data from files, and how to categorize data in factors
  • How to work with Dates and perform Date math
  • How to work with multiple dimensions and DataFrame essentials
  • Essential Data Science and how to use R with it
  • Visualization in R
  • How R can be used in Spark (Optional / Overview)

 

Audience & Pre-Requisites

Attendees for this course should have prior practical hands-on experience with another programming language. Prior exposure to working with statistics and probability, as well as hands-on working knowledge of Excel would also be helpful but is not required. We will collaborate with you to design the best solution to ensure your needs are met, whether we customize the material, or devise a different educational path to help your team best prepare for this training.

 


  1. Getting Started with R
  • Making R more friendly, R and available GUIs
  • The R environment
  • Related software and documentation
  • R and statistics
  • Using R interactively
  • An introductory session
  • Getting help with functions and features
  • R commands, case sensitivity, etc.
  • Recall and correction of previous commands
  • Executing commands from or diverting output to a file
  • Data permanency and removing objects

 

  1. Simple manipulations; numbers and vectors
  • Vectors and assignment
  • Vector arithmetic
  • Generating regular sequences
  • Logical vectors
  • Missing values
  • Character vectors
  • Index vectors; selecting and modifying subsets of a data set
  1. Objects, their modes and attributes
  • Intrinsic attributes: mode and length
  • Changing the length of an object
  • Getting and setting attributes
  • The class of an object

 

  1. Ordered and unordered factors
  • A specific example
  • The function tapply() and ragged arrays
  • Ordered factors

 

  1. Arrays and matrices
  • Arrays
  • Array indexing. Subsections of an array
  • Index matrices
  • The array() function
  • Mixed vector and array arithmetic. The recycling rule
  • The outer product of two arrays
  • Generalized transpose of an array
  • Matrix facilities
  • Matrix multiplication
  • Linear equations and inversion
  • Eigenvalues and eigenvectors
  • Singular value decomposition and determinants
  • Least squares fitting and the QR decomposition
  • Forming partitioned matrices, cbind() and rbind()
  • The concatenation function, (), with arrays
  • Frequency tables from factors

 

  1. Lists and data frames
  • Lists
  • Constructing and modifying lists
  • Concatenating lists
  • Data frames
  • Making data frames
  • attach() and detach()
  • Working with data frames
  • Attaching arbitrary lists
  • Managing the search path

 

  1. Reading data from files
  • The read.table()function
  • The scan() function
  • Accessing builtin datasets
  • Loading data from other R packages
  • Editing data

 

 

  1. Probability distributions
  • R as a set of statistical tables
  • Examining the distribution of a set of data
  • One- and two-sample tests

 

  1. Grouping, loops and conditional execution
  • Grouped expressions
  • Control statements
  • Conditional execution: if statements
  • Repetitive execution: for loops, repeat and while

 

  1. Writing your own functions
  • Simple examples
  • Defining new binary operators
  • Named arguments and defaults
  • The '...' argument
  • Assignments within functions
  • More advanced examples
  • Efficiency factors in block designs
  • Dropping all names in a printed array
  • Recursive numerical integration
  • Scope
  • Customizing the environment
  • Classes, generic functions and object orientation

 

  1. Statistical models in R
  • Defining statistical models; formulae
  • Contrasts
  • Linear models
  • Generic functions for extracting model information
  • Analysis of variance and model comparison
  • ANOVA tables
  • Updating fitted models
  • Generalized linear models
  • Nonlinear least squares and maximum likelihood models
  • Least squares
  • Maximum likelihood
  • Some non-standard models

 

  1. Graphical procedures
  • High-level plotting commands
  • The plot() function
  • Displaying multivariate data
  • Display graphics
  • Arguments to high-level plotting functions
  • Low-level plotting commands
  • Mathematical annotation
  • Hershey vector fonts
  • Interacting with graphics
  • Using graphics parameters
  • Permanent changes: The par() function
  • Temporary changes: Arguments to graphics functions
  • Graphics parameters list
  • Graphical elements
  • Axes and tick marks
  • Figure margins
  • Multiple figure environment
  • Device drivers
  • PostScript diagrams for typeset documents
  • Multiple graphics devices
  • Dynamic graphics

 

  1. Packages
  • Standard packages
  • Contributed packages and CRAN
  • Namespaces

 



Sorry! It looks like we haven’t updated our dates for the class you selected yet. There’s a quick way to find out. Contact us at 502.265.3057 or email info@training4it.com


Request a Date