Note: This doesn’t cover absolutely everything in the textbook, rather puts more emphasis on things that are: worth explaining, nuanced, difficult, has appeared in the midterm. If you’d like to suggest additions, fill out this form (Or if you find mistakes): https://docs.google.com/forms/d/e/1FAIpQLScOjou48ZukL9gTWY2rbFMWwo81kulsJ7-LEYTjh-iEGFcoHQ/viewform?usp=sharing
Link to Textbook / Link to More Formulas
What does it mean when code is reproducible?
The first thing this course teaches us to do is read a simple .csv (comma separated variable) file, but we need to first import a library (tidyverse) which will allow us to read data (this library allows us to do much more than just read data)
library(tidyverse)
read_data <- read_csv(data = "____.csv")
print(marathon_small, n = 50)
# prints n amount of rows of the dataset
# specifying the number of rows to be printed **IS** a valid argument for print
Below is an illustration of how to use the filter function in R, to manipulate a data frame and only keep rows with the specified value of a column variable.