Skip to content

cfeng1/github_data_research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Github Data Research

Research on GitHub collaboration patterns.

Environment

In the repo directory, run

conda env create -f binder/environment.yml 
conda activate githubdata
jupyter labextension install @jupyter-widgets/jupyterlab-manager
conda install ipykernel
python3 -m ipykernel install --user --name=githubdata
conda deactivate

Package environment:

conda-pack

To remove kernel in jupyter

jupyter kernelspec list
jupyter kernelspec uninstall unwanted-kernel

Data Collection

Folder "data_collection" stores queries that collects data in the sample.

  • Start from PyPI repositories (based on Valiev, Vasilescu, Herbsleb (2018)), track commit history of all authors including contributions to those that are not in PyPI.

  • Complementary data: unreleased Python data, also need to track all author's contribution history

Exploratory Data Analysis

Folder for notebooks and results for exploratory data analysis (reduced-form).

Model Estimation

Folder for structural model and related estimation codes.

Counterfactual Analysis

Folder for codes of counterfactual Analysis

About

Research on GitHub collaboration patterns

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages