These installation instructions accompany the replication package for the ICSE 2020 paper "How to Not Get Rich: An Empirical Study of Donations in Open Source" (preprint here) by Cassandra Overney, Jens Meinicke, Christian Kästner, and Bogdan Vasilescu.
Cassandra Overney, Jens Meinicke, Christian Kästner, Bogdan Vasilescu. 2020.
How to Not Get Rich: An Empirical Study of Donations in Open Source.
In Proceedings of 42nd International Conference on Software Engineering,
Seoul, Republic of Korea, May 23–29, 2020 (ICSE ’20), 13 pages. DOI: 10.1145/3377811.3380410
The artifact is organized in two publicly available Docker containers (via Docker Hub), making it as easy as possible to interact with, without the need for complicated system setup.
The first container (cmustrudel/oss-donations:jupyter
) starts a Jupyter Notebook pre-loaded with the necessary packages and data. There are 4 notebook files inside the container, that reproduce Figures 1, 5, and 6 in the paper.
The second container (cmustrudel/oss-donations:rstudio-regression
) starts RStudio pre-loaded with the necessary packages and data. The R script that loads inside the container on startup reproduces all the regression modeling and time series analysis results in the paper.
The data
folder contains the following CSV files
asking_group_npm_gh.csv
: metrics for GitHub projects that ask for donationsasking_group_npm.csv
: metrics for npm projects that ask for donationsasking_money_sample.csv
: a sample of projects that receive a lot of donations, used in qualitative analysisasking_no_money_sample.csv
: a sample of projects that don't receive any donations, used in qualitative analysisasking_some_money_sample.csv
: a sample of projects that receive some donations, used in qualitative analysiscommercial_npm_control.csv
: metrics for npm projects that are owned by commercial organizations and don't ask for donationshighest_download_count_npm_control.csv
: metrics for npm projects that have the highest download counts and don't ask for donationsrandom_gh_control.csv
: metrics for a random sample of GitHub projects that don't ask for donationsrandom_npm_control.csv
: metrics for a random sample of npm projects that don't ask for donationsrdd_ask_funding_date_monthly_earning.csv
: monthly earning data for projects asking for donationsrdd_ask_funding_date.csv
: time series data for projects asking for donationsrdd_get_funding_date_monthly_earning.csv
: monthly earning data for projects receiving donationsrdd_get_funding_date.csv
: time series data for projects receiving donations
See detailed instructions for how to access and use the artifact here. In a nutshell, you only need to install Docker and interact independently with either/both of the two Docker containers.
The documentation explaining how to access and interact with the artifact is available in our GitHub repository, as well as here.
The artifact is availble under MIT License.