Investigating UN Meeting Records

How to:

Obtain the metadata from United Nations digital library search
Use rvest and download.files to scrape the pdfs from the search results
Convert the pdfs to txt files
Read them into R
Split each file up by speaker
Use regular expression pattern matching to extract the speakers' names and organisations
Use quanteda to create a dictionary to see which speakers/organisations are talking about a topic of interest the most

I had to do this for a project, so I thought I'd share my code to save someone else the pain of having to figure this out from scratch.

Find the guide here (it's an R Markdown file). I've set the working directories in the Markdown file to make it work if you clone the whole respository to your ~/Desktop and run the code; the pdf and txt folders are empty and ready to receive downloads of the pdf files/converted txt files. If you'd like to see the nicely rendered html result of the Markdown file, you'll need to first clone/download the repository.

I've included some random search results in the results.csv file for you to experiment with.

This will be quite a detailed walkthrough; for a more advanced alternative try this guide by Dr Pablo Barberá.

Many thanks to Dr Pablo Barberá and Dr Gokhan Ciflikli for their invaluable help and advice.

Feedback would be greatly appreciated!

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
pdfs		pdfs
txt		txt
.DS_Store		.DS_Store
.RData		.RData
.Rhistory		.Rhistory
.gitattributes		.gitattributes
README.md		README.md
References		References
UN Digital Library Search Results.Rmd		UN Digital Library Search Results.Rmd
UN_Digital_Library_Search_Results.html		UN_Digital_Library_Search_Results.html
bib.xml		bib.xml
results.csv		results.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Investigating UN Meeting Records

About

Releases

Packages

Languages

thelautiff/UN_meeting_records

Folders and files

Latest commit

History

Repository files navigation

Investigating UN Meeting Records

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages