Data Analytics with R Certification Training


Data Analytics with R Certification Training

Data Analytics with R training will help you gain expertise in R Programming, Data Manipulation, Exploratory Data Analysis, Data Visualization, Data Mining, Regression, Sentiment Analysis and using R Studio for real life case studies on Retail, Social Media.


Instructor-led Data Analytics with R live online classes





Sep 11th SAT & SUN (5 WEEKS) Weekend Batch SOLD OUT Timings – 07:00 AM to 10:00 AM (IST)
Dec 11th SAT & SUN (5 WEEKS) Weekend Batch ⚡FILLING FAST Timings – 08:30 PM to 11:30 PM (IST)
Mar 19th SAT & SUN (5 WEEKS) Weekend Batch Timings – 07:00 AM to 10:00 AM (IST)

Introduction to Data Analytics

Learning Objectives: This module introduces you to some of the important keywords in R like Business Intelligence, Business Analytics, Data and Information. You can also learn how R can play an important role in solving complex analytical problems. This module tells you what is R and how it is used by the giants like Google, Facebook, Bank of America, etc. Also, you will learn use of ‘R’ in the industry, this module also helps you compare R with other software in analytics, install R and its packages.


  • Introduction to terms like Business Intelligence
  • Business Analytics
  • Data
  • Information
  • How information hierarchy can be improved/introduced
  • Understanding Business Analytics and R
  • Knowledge about the R language, its community and ecosystem
  • Understand the use of ‘R’ in the industry
  • Compare R with other software in analytics
  • Install R and the packages useful for the course
  • Perform basic operations in R using command line
  • Learn the use of IDE R Studio and Various GUI
  • Use the ‘R help’ feature in R
  • Knowledge about the worldwide R community collaboration

Introduction to R Programming

Learning Objectives: This module starts from the basics of R programming like datatypes and functions. In this module, we present a scenario and let you think about the options to resolve it, such as which datatype should one to store the variable or which R function that can help you in this scenario. You will also learn how to apply the ‘join’ function in SQL.


  • The various kinds of data types in R and its appropriate uses
  • The built-in functions in R like: seq(), cbind (), rbind(), merge()
  • Knowledge on the various subsetting methods
  • Summarize data by using functions like: str(), class(), length(), nrow(), ncol()
  • Use of functions like head(), tail(), for inspecting data
  • Indulge in a class activity to summarize data
  • Deploy package to perform SQL join in R

Data Manipulation in R

Learning Objectives: In this module, we start with a sample of a dirty data set and perform Data Cleaning on it, resulting in a data set, which is ready for any analysis. Thus using and exploring the popular functions required to clean data in R.


  • The various steps involved in Data Cleaning
  • Functions used in Data Inspection
  • Tackling the problems faced during Data Cleaning
  • Uses of the functions like grepl(), grep(), sub()
  • Coerce the data
  • Uses of the apply() functions

Data Import Techniques in R

Learning Objectives: This module tells you about the versatility and robustness of R which can take-up data in a variety of formats, be it from a csv file to the data scraped from a website. This module teaches you various data importing techniques in R.


  • Import data from spreadsheets and text files into R
  • Import data from other statistical formats like sas7bdat and spss
  • Packages installation used for database import
  • Connect to RDBMS from R using ODBC and basic SQL queries in R
  • Basics of Web Scraping

Exploratory Data Analysis

Learning Objectives: In this module, you will learn that exploratory data analysis is an important step in the analysis. EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis. You will also learn about the various tasks involved in a typical EDA process.


  • Understanding the Exploratory Data Analysis(EDA)
  • Implementation of EDA on various datasets
  • Boxplots
  • Whiskers of Boxplots
  • Understanding the cor() in R
  • EDA functions like summarize(), llist()
  • Multiple packages in R for data analysis
  • The Fancy plots like the Segment plot, HC plot in R

Data Visualization in R

Learning Objectives: In this module, you will learn that visualization is the USP of R. You will learn the concepts of creating simple as well as complex visualizations in R.


  • Understanding on Data Visualization
  • Graphical functions present in R
  • Plot various graphs like tableplot, histogram, Boxplot
  • Customizing Graphical Parameters to improvise plots
  • Understanding GUIs like Deducer and R Commander
  • Introduction to Spatial Analysis

Data Mining: Clustering Techniques

Learning Objectives: This module lets you know about the various Machine Learning algorithms. The two Machine Learning types are Supervised Learning and Unsupervised Learning and the difference between the two types. We will also discuss the process involved in ‘K-means Clustering’, the various statistical measures you need to know to implement it in this module.


  • Introduction to Data Mining
  • Understanding Machine Learning
  • Supervised and Unsupervised Machine Learning Algorithms
  • K-means Clustering

Data Mining: Association Rule Mining & Collaborative filtering

Learning Objectives: In this module, you will learn how to find the associations between many variables using the popular data mining technique called the “Association Rule Mining”, and implement it to predict buyers’ next purchase. You will also learn a new technique that can be used for recommendation purpose called “Collaborative Filtering”. Various real-time based scenarios are shown using these techniques in this module.


  • Association Rule Mining
  • User Based Collaborative Filtering (UBCF)
  • Item Based Collaborative Filtering (IBCF)

Linear and Logistic Regression

Learning Objectives: This module touches the base of ‘Regression Techniques’. Linear and logistic regression is explained from the basics with the examples and it is implemented in R using two case studies dedicated to each type of Regression discussed.


  • Linear Regression
  • Logistic Regression

Anova and Sentiment Analysis

Learning Objectives: This module tells you about the Analysis of Variance (Anova) Technique. The algorithm and various aspects of Anova have been discussed in this module. Additionally, this module also deals with Sentiment Analysis and how we can fetch, extract and mine live data from Twitter to find out the sentiment of the tweets.


  • Anova
  • Sentiment Analysis

Data Mining: Decision Trees and Random Forest

Learning Objectives: This module covers the concepts of Decision Trees and Random Forest. The algorithm for creation of trees and classification of decision trees and the various aspects like the Impurity function Gini Index, Pruning, Entropy etc are extensively taught in this module. The algorithm of Random Forests is discussed in a step-wise approach and explained with real-life examples. At the end of the class, these concepts are implemented on a real-life data set.


  • Decision Tree
  • The 3 elements for classification of a Decision Tree
  • Entropy
  • Gini Index
  • Pruning and Information Gain
  • Bagging of Regression and Classification Trees
  • Concepts of Random Forest
  • Working of Random Forest
  • Features of Random Forest

Project Work

Learning Objectives: This module discusses various concepts taught throughout the course and their implementation in a project.


  • Analyze census data to predict insights on the income of the people, based on the factors like: age, education, work-class, occupation using Decision Trees, Logistic Regression and Random Forest
  • Analyze the Sentiment of Twitter data, where the data to be analyzed is streamed live from twitter and sentiment analysis is performed on the same

About the Course

CertAdda’s Data Analytics with R training course is specially designed to provide the requisite knowledge and skills to become a successful analytics professional. It covers concepts of Data Manipulation, Exploratory Data Analysis, etc before moving over to advanced topics like the Ensemble of Decision trees, Collaborative filtering, etc.

Course Objectives

After the completion of the CertAdda’s Data Analytics with R course, you should be able to:

  • Understand concepts around Business Intelligence and Business Analytics
  • Explore Recommendation Systems with functions like Association Rule Mining , user-based collaborative filtering and Item-based collaborative filtering among others
  • Apply various supervised machine learning techniques
  • Perform Analysis of Variance (ANOVA)
  • Learn where to use algorithms – Decision Trees, Logistic Regression, Support Vector Machines, Ensemble Techniques etc
  • Use various packages in R to create fancy plots
  • Work on a real-life project, implementing supervised and unsupervised machine learning techniques to derive business insightsWhy learn
  • Data Analytics with R
  • The Data Analytics with R training certifies you in mastering the most popular Analytics tool. “R” wins on Statistical Capability, Graphical capability, Cost, rich set of packages and is the most preferred tool for Data Scientists

Who should go for this Course?

This course is meant for all those students and professionals who are interested in working in analytics industry and are keen to enhance their technical skills with exposure to cutting-edge practices. This is a great course for all those who are ambitious to become ‘Data Analysts’ in near future. This is a must learn course for professionals from Mathematics, Statistics or Economics background and interested in learning Business Analytics.

What are the pre-requisites for this Course?

The pre-requisites for learning ‘Data Analytics with R’ includes basic statistics knowledge. We provide a complimentary course “Statistics Essentials for R” to all the participants who enroll for the Data Analytics with R Training. This course helps you brush up your statistics skills.

What are the system requirements for this course?

Your system should have minimum 4GB RAM and i3 processor or above.

How will I execute the Practicals?

For doing your practicals, you would need to install R set-up or R-Studio on your system. The step-wise installation guides for setting up the environment on various operating systems are present in the LMS. In case you come across any doubt, the 24*7 support team will promptly assist you.

Which Case-Studies will be part of the course?

Towards the end of the Course, you will be working on a live project. You can choose any of the following as your Project work:

  • Project #1: Sentiment Analysis of Twitter Data
    Industry: Social Media
    Description: A sports gear company is planning to brand themselves by putting their company logo on the jersey of an IPL team. We assume that any team which is more popular on twitter will give a good ROI. So, we evaluate two different teams of IPL based on their social media popularity and the team which is more popular on twitter will be chosen for brand endorsement. The data to be analyzed is streamed live from twitter and sentiment analysis is performed on the same. The final output involves a comparable visualization plot of both the teams, so that the clear winner can be seen. The following insights need to be calculated:

    • Setup connection with twitter using twitter package. And perform authentication using handshake function.
    • Import tweets from the official twitter handle of the two teams using SearchTwitter function.
    • Prepare a sentiment function in R, which will take the arguments and find its negative or positive score.
    • Score against each tweet should be calculated.
    • Compare the scores of both the teams and visualize it.
  • Project #2: Census Data Analysis
    Industry: Government Dataset
    Description: Analyze the census data and predict whether the income exceeds $50K per year. Follow end to end modelling process involving:

    • Perform Exploratory Data Analysis and establish hypothesis of the data.
    • Test for Multi col-linearity, handle outliers and treat missing data.
    • Create training and validation data sets using Stratified Random Sampling (SRS) of data.
    • Fit Classification model on training set (Logistic Regression/Decision Tree)
    • Perform validation of the models (ROC curve, Confusion Matrix)
    • Evaluate and freeze the final model.
  • Additional Resources:
    Here is the list of few additional case studies that you will get at edureka for deeper understanding of R applications.

    • Study#1: Market Basket Analysis
      Industry: Retail – CPG
      Description: Market Basket Analysis is done to see if there are combinations of products that frequently co-occur in transactions. The analysis gives clues as to what a customer might have bought if the idea had occurred to them. This is done using the “Association Rules” on real-time data. In this case study, you shall understand various methods for finding useful associations in large data sets using statistical performance measures. You will also learn how to manage the peculiarities of working with transaction data.
      Data-set: The data set used here is from a grocery super store with 9835 rows of free flowing data without any labels.
    • Study#2: Strategic Customer Segmentation for Retail Business
      Industry: E-Commerce, Retail
      Description: In this case study, we will consider the dataset from a UK-based online retail business for the last two years. The objective of this case study is to do customer segmentation in this data set.
      For this exercise, we are going to use customer’s recency, frequency and monetary (RFM) values. From these three derived values, we will segment entire customer base and will generate insights on the data set provided to do customer segmentation using RFM Model based Clustering Analysis.
      Data-set: Comprises 0.5 million records and 8 variables. Each record is for one online order placed by the customer.
    • Study#3: Pricing Analytics and Price Elasticity
      Industry: Retail
      Description: A retailer is planning to sell a new type of cheese in some of its stores. This is a pilot project for the retailer & based on the data collected during this pilot phase, retailer wants to understand a few things.
      To promote sales of cheese, the retailer is planning for two different types of in-store advertisement:

      • Cheese as a natural product
      • Cheese as a family caring product

      Now the retailer wants to know:

      • Which in-store advertisement theme is better and giving better sales of cheese in the store?
      • How the sales of cheese is reacting to its price change i.e. price elasticity?
      • What is the impact of the price changes of other products in the same store (e.g. Ice-cream & Milk) on the sales of cheese i.e. cross-price elasticity.
      • What should be the best price of cheese to maximize the sales and then do sales forecast.

      Data-set: The data set used in this case study will have the following columns:

      • Price of Cheese
      • Sales of Cheese
      • Advertising method for cheese (either as a natural product or as a family product)
      • Price of Ice cream
      • Price of Milk
    • Study#4: Clustering Application using Shiny
      Industry: Consumer Packaged Goods
      Description: Shiny turn your analyses into interactive web applications, it is a web application framework for R. The data set that we are using in this case study relates to the clients of a wholesale distributor. It comprises, the annual spending in monetary units (m.u.) on diverse product categories. With this data we want to create a web based shiny application which can segment customers of wholesale distributor based upon the parameter passed thru ui.r
      Data-set: The data set used in this case study has 440 rows of data and has the following attributes in columns –

      • Channel
      • Region
      • Fresh
      • Milk
      • Grocery
      • Frozen
      • Detergents_Paper
      • Delicatessen

What if I miss a class?

You will never lose any lecture. You can choose either of the two options: View the recorded session of the class available in your LMS or You can attend the missed session, in any other live batch.

Will I Get Placement Assistance?

To help you in this endeavor, we have added a resume builder tool in your LMS. Now, you will be able to create a winning resume in just 3 easy steps. You will have unlimited access to use these templates across different roles and designations. All you need to do is, log in to your LMS and click on the “create your resume” option.

Can I Attend a Demo Session before Enrolment?

We have limited number of participants in a live session to maintain the Quality Standards. So, unfortunately participation in a live class without enrolment is not possible. However, you can go through the sample class recording and it would give you a clear insight about how are the classes conducted, quality of instructors and the level of interaction in the class.

Who are the Instructor at Edureka?

All our instructors are working professionals from the Industry and have at least 10-12 yrs of relevant experience in various domains. They are subject matter experts and are trained by Edureka for providing online training so that participants get a great learning experience.

What if I have more queries?

Just give us a CALL at +91 8178510474 / +91 9967920486 OR email at