This course is aimed at the intermediate R developer who wants to learn how to do useful data analysis tasks in R. It will focus on “modern R”, specifically using the tidyverse collection of packages which are designed for data science.
Data analysis is a huge topic and we couldn’t possibly cover it all in one short course. The purpose of this workshop is to give you an introduction to some of the most useful tools and to demonstrate some of the most common problems that surface.
This workshop assumes you have knowledge of R equivalent to that covered in Beginning R and Intermediate R. This workshop provides the background necessary to follow Text and Sentiment Analysis in R.
You can jump ahead to any chapter:
For the purposes of this workshop we will be using RStudio. If you haven’t installed and got RStudio working, then please follow the instructions in the Intermediate R workshop.
In previous courses, you’ve used the Rscript
command line program to execute scripts and the RStudio console to run interactively. This course will use another tool called R Notebooks (or R Markdown Notebooks) to run your R code. It operates like a standard RStudio console interactive session, with the addition of allowing you to intersperse your code with blocks of text to explain what you’re doing and embed output such as graphs directly into the page.
To get started, launch RStudio. In the menu, select “File | New File | R Notebook”. This will open a new R Notebook, which should look somethign like this;