Overview
This Data Science training helps you take a step toward becoming a data scientist who works mainly in R. By the end of the course, you will have good exposure to building predictive models with machine learning on your own.
Objectives
At the end of the Intro to Data Science training course, participants will be able to
Prerequisites
There is no specific prerequisite for the course; however, exposure to core Java and statistics will be beneficial.
Course Outline
- Introduction to Statistics
- Types of Statistics
- Measures of central tendency
- Measures of dispersion
- Visualization Techniques
- Bias, Skews, Percentiles and Ranges
- Probability
- Bayes' Theorem and Decision Trees
- R packages
- Understanding Vectors in R
- Data Manipulation Techniques
- R functions
- Basic YARN and MapReduce
- Pig
- Hive
- Core Spark
- Resilient Distributed Datasets (RDDs)
- RDD operations
- Spark Cluster Managers
- Spark Abstractions