-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathfoundations_ls02_using_rstudio.qmd
388 lines (215 loc) · 15.7 KB
/
foundations_ls02_using_rstudio.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
---
title: 'Using RStudio'
output:
html_document:
number_sections: true
toc: true
toc_float: true
css: !expr here::here("global/style/style.css")
highlight: kate
editor_options:
markdown:
wrap: 100
canonical: true
chunk_output_type: console
---
```{r echo = F, message = F, warning = F}
source(here::here("global/functions/misc_functions.R"))
knitr::opts_chunk$set(class.source = "tgc-code-block")
```
------------------------------------------------------------------------
## Learning objectives
1. You can identify and use the following tabs in RStudio: Source, Console, Environment, History, Files, Plots, Packages, Help and Viewer.
2. You can modify RStudio's interface options to suit your needs.
## Introduction
Now that you have access to R & RStudio, let's go on a quick tour of the RStudio interface, your digital home for a long time to come.
We will cover a lot of territory quickly. Do not panic. You are not expected to remember it all this. Rather, you will see these topics again and again throughout the course, and you will naturally assimilate them that way.
You can also refer back to this lesson as you progress.
The goal here is simply to make you aware of the tools at your disposal within RStudio.
------------------------------------------------------------------------
To get started, you need to open the RStudio application:
- If you are working with RStudio Cloud, go to [rstudio.cloud](https://rstudio.cloud), log in, then click on the "r_intro" project that you created in the last lesson. (If you do not see this, simply create a new R project using the "New Project" icon at the top right).
- If you are working on your local computer, go to your applications folder and double click on the RStudio icon. Or you search for this application from your Start Menu (Windows), or through Spotlight (Mac).
## The RStudio panes
By default, RStudio is arranged into four window panes.
If you only see three panes, open a new script with `File > New File > R Script` . This should reveal one more pane.
![](images/new_r_script.jpg){width="451" height="71"}
Before we go any further, we will rearrange these panes to improve the usability of the interface.
To do this, in the RStudio menu at the top of the screen, select `Tools > Global Options` to bring up RStudio's options. Then under `Pane Layout`, adjust the pane arrangement. The arrangement we recommend is shown below.
![](images/rstudio_pane_layout.png){width="403"}
At the top left pane is the Source tab, and at the top right pane, you should have the Console tab.
Then at the bottom left pane, no tab options should checked---this section should be left empty, with the drop-down saying just "TabSet".
Finally, at the bottom right pane, you should check the following tabs: Environment, History, Files, Plots, Packages, Help and Viewer.
------------------------------------------------------------------------
Great, now you should have an RStudio window that looks something like this:
![](images/rstudio_four_panes_rearranged.jpg){width="488"}
------------------------------------------------------------------------
The top-left pane is where you will do most of the coding. Make this larger by clicking on its maximize icon:
![](images/maximize_icon.jpg){width="353"}
------------------------------------------------------------------------
Note that you can drag the bar that separates the window panes to resize them.
![](images/drag_to_resize_panes.png){width="231"}
------------------------------------------------------------------------
Now let's look at each of the RStudio tabs one by one. Below is a summary image of what we will discuss:
![](images/rstudio_panes_detail.jpg)
### Source/Editor
![](images/rstudio_editor.png){width="469"}
The source or editor is where your R "scripts" go. A script is a text document where you write and save code.
Because this is where you will do most of your coding, it is important that you have a lot of visual space. That is why we rearranged the RStudio pane layout above---to give the Editor more space.
Now let's see how to use this Editor.
------------------------------------------------------------------------
First, **open a new script** under the File menu if one is not yet open: `File > New File > R Script`. In the script, type the following:
```{r eval = F}
print("excited for R!")
```
To **run code**, place your cursor anywhere in the code, then hit `Command` + `Enter` on macOS, or `Control` + `Enter` on Windows.
This should send the code to the Console and run it.
------------------------------------------------------------------------
You can also **run multiple lines at once**. To try this, add a second line to your script, so that it now reads:
```{r eval = F}
print("excited for R!")
print("and RStudio!")
```
Now drag your cursor to highlight both lines and press `Command`/`Control` + `Enter`.
To **run the entire script**, you can use `Command`/`Control` + `A` to select all code, then press `Command`/`Control` + `Enter`. Try this now. Deselect your code, then try to the shortcut to select all.
::: {.callout-note title='Side Note'}
There is also a 'Run' button at the top right of the source panel ( ![](images/run_icon.jpg){width="37"} ), with which you can run code (either the current line, or all highlighted code). But you should try to use the keyboard shortcut instead.
:::
------------------------------------------------------------------------
To **open the script in a new window**, click on the third icon in the toolbar directly above the script.
![](images/rstudio_pop_out_icon.png){width="329"}
To put the window back, click on the same button on the now-external window.
------------------------------------------------------------------------
Next, **save the script.** Hit `Command`/`Control` + `S` to bring up the Save dialog box. Give it a file name like "rstudio_intro".
- If you are working with RStudio cloud, the file will be saved in your project folder.
- If you are working on your local computer, save the file in an easy-to-locate part of your computer, perhaps your desktop. (Later on we will think about the "proper" way to organize and store scripts).
------------------------------------------------------------------------
You can **view data frames** (which are like spreadsheets in R) in the same pane. To observe this, type and run the code below on a new line in your script:
```{r eval = F}
View(women)
```
Notice the uppercase "V" in `View()`.
![](images/rstudio_view_data_frame.png){width="234"}
`women` is the name of a dataset that comes loaded with R. It gives the average heights and weights for American women aged 30--39.
You can click on the "x" icon to the right of the "women" tab to close this data viewer.
------------------------------------------------------------------------
### Console
The *console*, at the bottom left, is where **code is executed**. You can type code directly here, but it will not be saved.
Type a random piece of code (maybe a calculation like `3 + 3`) and press 'Enter'.
![](images/rstudio_console.png){width="594"}
If you place your cursor on the last line of the console, and you press the **up arrow**, you can go back to the last code that was run. Keep pressing it to cycle to the previous lines.
To run any of these previous lines, press *Enter*.
### Environment
![](images/environment_tab.png){width="527"}
At the top right of the RStudio Window, you should see the **Environment** tab.
The Environment tab shows datasets and other objects that are loaded into R's working memory, or "workspace".
To explore this tab, let's import a dataset into your environment from the web. Type the code below into your script and run it:
```{r eval = F}
ebola_data <- read.csv("https://tinyurl.com/ebola-data-sample")
```
::: {.callout-note title='Side Note'}
You don't need to understand exactly what the code above is doing for now. We just want to quickly show you the basic features of the Environment pane; we'll look at data importing in detail later.
Also, if you do not have active internet access, the code above will not run. You can skip this section and move to the "History" tab.
:::
You have now imported the dataset and stored it in an *object* named `ebola_data`. (You could have named the object anything you want.)
Now that the dataset is stored by R, you should be able to see it in the Environment pane. If you click on the blue drop-down icon beside the object's name in the Environment tab to reveal a summary.
![](images/rstudio_explore_object_dropdown.png){width="351"}
Try clicking directly on the `ebola_data` dataset from the Environment tab. This opens it in a 'View' tab.
------------------------------------------------------------------------
You can **remove an object from the workspace** with the `rm()` function. Type and run the following in a new line on your R script.
```{r eval = F}
rm(ebola_data)
```
Notice that the `ebola_data` object no longer shows up in your environment after having run that code.
The broom icon, at the top of the Environment pane can also be used to clear your workspace.
![](images/broom_icon.png)
To practice using it, try re-running the line above that imports the Ebola dataset, then clear the object using the broom icon.
### History
Next, the **History** tab shows previous commands you have run.
![](images/history_tab.png){width="470"}
You can click a line to highlight it, then send it to the console or to your script with the "To Console" and "To Source" icons at the top of this tab.
To select multiple lines, use the "Shift-click" method: click the first item you want to select, then hold down the "Shift" key and click the last item you want to select.
Finally, notice that there is a search bar at the top right of the History pane where you can search for past commands that you have run.
### Files
Next, the **Files** tab. This shows the files and folders in the folder you are working in.
![](images/rstudio_files.png)
The tab allows you to interact with your computer's file system.
Try playing with some of the buttons here, to see what they do. You should try at least the following:
- Make a new folder
- Delete that folder
- Make a new R Script
- Rename that script
### Plots
Next, the **Plots** tab. This is where figures that are generated by R will show up. Try creating a simple plot with the following code:
```{r eval = F}
plot(women)
```
![](images/rstudio_plots.png)
That code creates a plot of the two variables in the `women` dataset. You should see this figure in the Plots tab.
Now, test out the buttons at the top of this tab to explore what they do. In particular, try to export a plot to your computer.
### Packages
Next, let's look at the **Packages** tab.
![](images/packages_tab.png){width="419"}
Packages are collections of R code that extend the functionality of R. We will discuss packages in detail in a future lesson.
For now, it is important to know that to use a package, you need to *install* then *load* it. Packages need to be installed only once, but must be loaded in each new R session.
All the package names you see (in blue font) are packages that are installed on your system. And packages with a checkmark are packages which are *loaded* in the current session.
You can install a package with the Install button of the Packages tab.
![](images/rstudio_install_package_icon.png){width="348"}
But it is better to install and load packages with R code, rather than the Install button. Let's try this. Type and run the code below to install the {highcharter} package.
```{r eval = F, message = F}
install.packages("highcharter")
library(highcharter)
```
The first line installs the package. The second line *loads* the package from your package library.
Because you only need to install a package once, you can now remove the installation line from your script.
------------------------------------------------------------------------
Now that the {highcharter} package has been installed and loaded, you can use the functions that come in the package. To try this, type and run the code below:
```{r}
highcharter::hchart(women$weight)
```
This code uses the `hchart()` *function* from the {highcharter} package to plot an interactive histogram showing the distribution of weights in the `women` dataset.
(Of course, you may not yet know what a function is. We'll get to this soon.)
### Viewer
Notice that the histogram above shows up in a **Viewer** tab. This tab allows you to preview HTML files and interactive objects.
### Help
Lastly, the **Help** tab shows the documentation for different R objects. Try typing out and running each line below to see what this documentation looks like.
```{r eval = F}
?hchart
?women
?read.csv
```
![](images/example_help_page.png){width="451"}
Help files are not always very easy to understand for beginners, but with time they will become more useful.
## RStudio options
RStudio has a number of useful options for changing it's look and functionality. Let's try these. You may not understand all the changes made for now. That's fine.
In the RStudio menu at the top of the screen, select `Tools > Global Options` to bring up RStudio's options.
- Now, under `Appearance`, choose your ideal theme. (We like the "Crimson Editor" and "Tomorrow Night" themes.)
![](images/rstudio_themes.png){width="530"}
- Under `Code > Display`, check "Highlight R function calls". What this does is give your R *functions* a unique color, improving readability. You will understand this later.
- Also under `Code > Display`, check "Rainbow parentheses". What this does is make your "nested parentheses" easier to read by giving each pair a unique color.
![](images/options_highlight_function_rainbow_parentheses.jpg){width="446"}
![](images/highlight_r_function_calls.jpg){width="460" height="66"}
![](images/rainbow_parentheses.jpg){width="361"}
- Finally under `General > Basic`, **uncheck** the box that says **"Restore .RData into workspace at startup"**. You don't want to restore any data to your workspace (or *environment)* when you start RStudio. Starting with a clean workspace each time is less likely to lead to errors.
This also means that you never want to **"save your workspace to .RData on exit"**, so set this to **Never**.
## Command palette
The Rstudio command palette gives instant, searchable access to many of the RStudio menu options and settings that we have seen so far.
The palette can be invoked with the keyboard shortcut `Ctrl` + `Shift` + `P` (`Cmd` + `Shift` + `P` on macOS).
It's also available on the *Tools* menu (*Tools* -\> *Show Command Palette*).
![](images/command_palette.jpg){width="527"}
Try using it to:
- Create a new script (Search "new script" and click on the relevant option)
- Rename a script (Search "rename" and click on the relevant option)
## Wrapping up
Congratulations! You are now a new citizen of RStudio.
Of course, you have only scratched the surface of RStudio functionality. As you advance in your R journey, you will discover new features, and you will hopefully grow to love the wonderful integrated development environment (IDE) that is RStudio. One good place to start is the official RStudio IDE [cheatsheet](https://thegraphcourses.org/wp-content/uploads/2022/03/rstudio-IDE-cheatsheet.pdf).
Below is one section of that sheet:
![](images/rstudio_cheatsheet.png)
See you in the next lesson!
## Further resources
1. [23 RStudio Tips, Tricks, and Shortcuts](https://www.dataquest.io/blog/rstudio-tips-tricks-shortcuts/)
## References
Some material in this lesson was adapted from the following sources:
- "Rstudio Cheatsheets." *RStudio*, <https://www.rstudio.com/resources/cheatsheets/.>
- "Chapter 1 Getting Started: Data Skills for Reproducible Research." *Chapter 1 Getting Started \| Data Skills for Reproducible Research*, <https://psyteachr.github.io/reprores-v2/intro.html.>
`r tgc_license()`