Data Mining Using R – Training Course

Data Mining using R (DR201)

This training course covers data mining using R, which is essentially about exploring data. You will learn advanced R programming methods that help you summarise and visualise your data and carry out machine learning.

In this Data Mining using R training course (DR201) you will learn how to produce general summary statistics and how to explore your data visually using graphical methods, including advanced R graphs. You will learn how to rearrange your data using cross tabulation and contingency tables to help spot potential patterns. The course also covers more advanced R tools for machine learning: cluster analysis (unsupervised machine learning) and regression analysis (supervised machine learning).
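As a flavour of the numerical summaries and cross tabulation mentioned above, here is a minimal sketch in base R. It is only an illustration, not course material: it assumes the built-in mtcars data set, and the object names mpg_by_cyl and cyl_gear are made up for this example.

```r
# Built-in example data (an assumption for this sketch): 32 cars
data(mtcars)

# General summary statistics for two of the variables
print(summary(mtcars[, c("mpg", "hp")]))

# Aggregating data: mean miles per gallon for each cylinder count
mpg_by_cyl <- aggregate(mpg ~ cyl, data = mtcars, FUN = mean)
print(mpg_by_cyl)

# Cross tabulation (contingency table): cylinders against gears
cyl_gear <- xtabs(~ cyl + gear, data = mtcars)
print(cyl_gear)
```

The aggregate() and xtabs() functions both take a formula, so the same pattern carries over to other grouping variables.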

R, the statistical programming language, is a good choice for data mining because it has many powerful tools for data exploration. Our training course will give you the skills you need to find patterns and relationships, giving you key insights into your data.

Data Mining Course Overview

The Data Mining using R course will give you the skills you need to explore data. This R training course provides an overview of methods of data exploration and covers some of the more important techniques in data analysis, i.e. looking for patterns and meaning in your data.

Who should attend

Anyone who needs to analyse data will find this course useful. The methods demonstrated in this data mining course are applicable to multiple disciplines.

What you will learn/course outline

This Data Mining Course covers the following topics:

  • Summarising data numerically.
    • Summary statistics.
    • Aggregating data.
  • Tabulating data.
    • Frequency (contingency) tables.
    • Cross tabulation.
  • Graphical summary of data.
    • Summary charts.
    • Exploratory charts.
  • Unsupervised Machine Learning.
    • Hierarchical clustering.
    • K-means analysis.
    • Principal Co-ordinates analysis.
  • Supervised Machine Learning.
    • Regression analysis.
    • Curvilinear models.
    • Non-Gaussian models.
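
To give an idea of what the machine learning topics look like in practice, the following is a minimal sketch using only base R. It is an assumption-laden illustration rather than the course's own material: the built-in iris and warpbreaks data sets, the choice of three clusters, and all object names are picked purely for this example.

```r
# Built-in example data (assumptions for this sketch)
data(iris)
data(warpbreaks)

# Standardise the four numeric measurements before clustering
measurements <- scale(iris[, 1:4])
distances <- dist(measurements)

# Unsupervised: hierarchical clustering, cut into three groups
hc <- hclust(distances, method = "average")
hc_groups <- cutree(hc, k = 3)

# Unsupervised: k-means with three centres (seed fixed for repeatability)
set.seed(42)
km <- kmeans(measurements, centers = 3, nstart = 10)

# Unsupervised: principal co-ordinates analysis (classical scaling)
pco <- cmdscale(distances, k = 2)

# Supervised: simple linear regression
fit_linear <- lm(Petal.Width ~ Petal.Length, data = iris)

# Supervised: curvilinear model using a quadratic term
fit_curve <- lm(Petal.Width ~ Petal.Length + I(Petal.Length^2), data = iris)

# Supervised: non-Gaussian model, here a Poisson GLM for count data
fit_pois <- glm(breaks ~ wool + tension, family = poisson, data = warpbreaks)

print(table(hc_groups, km$cluster))
print(summary(fit_pois))
```

The number of clusters and the model formulae would normally be chosen by inspecting your own data; the values here are placeholders.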

Prerequisites

You need to know the basics of working with R, but knowledge of statistics or analytics is not essential for this training course.

What to bring

You’ll simply need a computer and the R program. A spreadsheet is also helpful, but not essential. Online courses are delivered using the Zoom software, which is available for Windows or Macintosh computers. Support material is supplied online, usually via Google Classroom.

You can get R from the R-Project website. We also recommend using RStudio, a “wrapper” for R that has many additional features to help you “drive” R.

How Long does the Data Mining Course Last?

The duration of this Data Mining using R training course is two days (starting at 9.30am and ending around 4.30pm).

Since COVID-19, online courses have become more popular. All our data training courses are available online, which means we can be flexible about dates. We can also arrange training at our main training centre in London, or at your workplace. At the moment there are no set dates for courses, but you can check dates here to see details of forthcoming events.

Our main training centre is at 107, Cheapside, London EC2V 6DN. This is right in the heart of London and is close to Bank and St. Paul’s underground stations.

Related Courses

The Data Mining using R course (DR201) is designed to help you explore your data more effectively, using the powerful Open Source program R. Other R-based courses include:

  • Beginning R (DR101) – A foundation course in using R: the statistical programming language.
  • Data Visualisation using R (DR202) – Advanced data visualisation using R (2-day).


My Publications

I have written several books on ecology and data analysis:

  • An Introduction to R: Data Analysis and Visualisation – £35.00
  • Beginning R: The Statistical Programming Language – £26.99
  • Statistics for Ecologists Using R and Excel – £34.99
  • The Essential R Reference – £44.99
  • Community Ecology – £39.99
  • Managing Data Using Excel – £24.99

Register your interest for our Training Courses

We run training courses in data management, visualisation and analysis using Excel and R: The Statistical Programming Environment. Courses will be held at one of our training centres in London. Alternatively we can come to you and provide the training at your workplace. Training Courses are also available via an online platform.



