GenEpi-BioTrain - Virtual training 14 - R data analysis and visualisation for beginners

 R is an open source software, a widely used programming language for statistical data analysis and a powerful tool for visualisation. The aim of this virtual training is to introduce participants to data analysis and plotting in R. 

The training is divided into four sessions:

  1. In the first session, participants will be introduced to the Rstudio environment and will acquire a strong understanding of basic data types and structures in R. 
  2. The second session will focus on downloading libraries, importing data and basic commands and concepts in manipulating dataframes using base R and tidyverse packages. 
  3. The third session will introduce participants to functions and a number of plotting methods for different kinds of data visualisation. 
  4. The fourth session will summarize content learned in the previous three sessions by conducting a small analysis of a dataset using the previously taught tools.
Note that you will need to have both R and RStudio preinstalled and running on your PC. You will receive more information on how to install both programs once you are enroled.


Duration: Four half-days of 3-4 hours duration, with interactive classes and small assignments in between, during the week 17-20 February 2025.


Location: Online - Find the link after enrolment.


Audience: Public health microbiologists, epidemiologist, or beginners in bioinformatics interested in learning about R programming for data visualisation.

Learning objectives:  

    • Install and understand the R programming language and the Rstudio shell.
    • Import and export data to R.
    • Manipulate data objects in R.
    • Visualise data using base plotting in R.
    • Understand how different visualisations aid in interpretation of data.
    • Conduct simple analysis of resistance increase over time OR exploratory data analysis of WHO tuberculosis dataset.

Participation: This course is open for public enrolment.