# these make comments
3+5
## [1] 8
palmerpenguins
is a dataset called
penguins# we are looking at the penguin data
# to load the package we use the command library()
library(palmerpenguins)
penguins
## # A tibble: 344 × 8
## species island bill_length_mm bill_depth_mm flipper_length_mm body_mass_g
## <fct> <fct> <dbl> <dbl> <int> <int>
## 1 Adelie Torgersen 39.1 18.7 181 3750
## 2 Adelie Torgersen 39.5 17.4 186 3800
## 3 Adelie Torgersen 40.3 18 195 3250
## 4 Adelie Torgersen NA NA NA NA
## 5 Adelie Torgersen 36.7 19.3 193 3450
## 6 Adelie Torgersen 39.3 20.6 190 3650
## 7 Adelie Torgersen 38.9 17.8 181 3625
## 8 Adelie Torgersen 39.2 19.6 195 4675
## 9 Adelie Torgersen 34.1 18.1 193 3475
## 10 Adelie Torgersen 42 20.2 190 4250
## # … with 334 more rows, and 2 more variables: sex <fct>, year <int>
# add the data set to our environment
data_penguins <- penguins
install.packages("")
to installlibrary()
# to get more info about something use ?
?palmerpenguins
# mean(data_penguins)
# didn't work because it is nonsense
# to get a variable from a data set use $
bills_length <- data_penguins$bill_length_mm
# this produces a vector (a single column or row from a data frame/table/matrix)
#mean(bills_length)
# still doesn't work because of NA values
mean(bills_length,na.rm = TRUE)
## [1] 43.92193
# the average bill length of all penguins in the data set is 43.92 mm
# getting a specific value is like playing battleship
data_penguins[5,2]
## # A tibble: 1 × 1
## island
## <fct>
## 1 Torgersen
data_penguins[7,4]
## # A tibble: 1 × 1
## bill_depth_mm
## <dbl>
## 1 17.8
# to get a row
data_penguins[3,]
## # A tibble: 1 × 8
## species island bill_length_mm bill_depth_mm flipper_length_… body_mass_g sex
## <fct> <fct> <dbl> <dbl> <int> <int> <fct>
## 1 Adelie Torge… 40.3 18 195 3250 fema…
## # … with 1 more variable: year <int>
First thing we will do is install a new package. In the CONSOLE, run
the appropriate code to install the package
fivethirtyeight
.
fivethirtyeight
package (which you should have
just installed), locally# code to load the package
biopics
full of
data about biographical picture movies. Choose a an appropriate name and
save this data frame to your local environment.# saving data frame to environment
head()
on the data set. What
does the head()
function do?# See head() of biopic data
Type your answer in words here or write your answer as a comment in the code chunk
box_office
in the data frame represents.# putting a ? at the beginning tells you more about the data
?biopics
## No documentation for 'biopics' in specified packages and libraries:
## you could try '??biopics'
Type your answer in words here or write your answer as a comment in the code chunk
Type your answer to the question here
number_of_subjects
:type_of_subjects
:box_office
:number_of_subjects
column to your local
environment.# saving number_of_subjects data locally
table()
function.# use table() on the local variable you made in 7.
barplot(table())
on the variable
you made in 7.# use barplot(table()) on the local variable you made in 7.