
UNICEF AI4D Poverty Mapping Project




📜 Description

The UNICEF AI4D Poverty Mapping Project aims to develop open datasets and machine learning (ML) models for poverty estimation across nine countries in Southeast Asia (SEA).

We also aim to open-source all the scripts, experiments, and other artifacts used to develop these datasets and models, so that others can replicate our work, as well as collaborate on and extend it for their own use cases.

This project is part of Thinking Machines' overall push for open science through the AI4D (AI for Development) Research Bank, which aims to accelerate the development and adoption of effective machine learning (ML) models for development across Southeast Asia.

We also publish our interim (work-in-progress) and final notebooks here.

Data Access and Downloads

Due to the sensitive nature of the data and the DHS program's terms of use, we cannot provide the raw DHS data.

You can, however, request access to the raw data yourself on the DHS website. In that case, you can use GeoWrangler's DHS processing utilities to help perform this pre-processing.

The notebooks assume that the DHS Stata and Shape Files are located in data/dhs/<iso-country-code>/, where <iso-country-code> is the two-letter ISO country code.
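For example, for the Philippines (ISO code ph), the layout would look roughly like the sketch below. The inner folder names follow the DHS program's usual naming scheme but are placeholders here; use whatever names your approved DHS download actually contains.

data/
└── dhs/
    └── ph/
        ├── PHHR71DT/   # household recode (Stata files)
        └── PHGE71FL/   # cluster GPS coordinates (shapefiles)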

The only other data access requirement is the EOG Nightlights Data, which requires registering for an account. The nightlights download uses these credentials (username and password) to fetch the data automatically.
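As a minimal sketch (the variable names below are our own convention for this example, not something the notebooks require), you can keep the EOG credentials in environment variables so they never end up hard-coded in a notebook:

export EOG_USER="your-eog-username"        # account registered on the EOG website
export EOG_PASSWORD="your-eog-password"    # keep this out of version control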

All the other datasets used in this project are publicly available, and the notebooks provide all the code necessary to download as well as cache the data.

Due to the size of the downloaded datasets, please make sure you have enough disk space (at least 40-50 GB) to accommodate all the datasets used in building the models.
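To check the free space on the drive holding your project directory before starting the downloads (plain coreutils, nothing project-specific):

df -h .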



⚙️ Local Setup for Development

This repo assumes the use of conda/mamba for simplicity in installing GDAL.

Requirements

  1. Python 3.9
  2. make
  3. mamba/conda

🐍 One-time Set-up

Run this the very first time you are setting up the project on a machine, to set up a local Python environment for this project.

  1. Install mamba for your environment if you don't have it yet.
wget "https://github.com/conda-forge/miniforge/releases/latest/download/Mambaforge-$(uname)-$(uname -m).sh"
bash Mambaforge-$(uname)-$(uname -m).sh
  2. Create a local conda env and activate it. This will create a conda env folder in your project directory.
make conda-env
conda activate ./env
  3. Run the one-time set-up make command.
make setup
  4. To test if the setup was successful, run the tests.
make test

You should get a message that the tests passed.

🐍 Testing

To run automated tests, simply run make test.
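make test wraps the repo's test runner; assuming that runner is pytest (a common choice for this kind of setup), you can also run a single test module while iterating. The file name below is a placeholder:

pytest tests/test_my_module.py -v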

📦 Dependencies

Over the course of development, you will likely introduce new library dependencies. This repo uses pip-tools to manage the Python dependencies.

There are two main files involved:

  • requirements.in - contains the high-level requirements; this is what we should edit when adding/removing libraries
  • requirements.txt - contains the exact list of Python libraries (including dependencies of the main libraries) your environment needs to run the repo code; compiled from requirements.in

When you add new Python libraries, please do the following (see the walk-through below):

  1. Add the library to the requirements.in file. You may optionally pin the version if you need a particular version of the library.

  2. Run make requirements to compile a new version of the requirements.txt file and update your python env.

  3. Commit both the requirements.in and requirements.txt files so other devs can get the updated list of project requirements.

Note: When you are the one updating your Python env to pick up library changes from other devs (reflected in an updated requirements.txt file), simply run pip-sync requirements.txt.
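As a concrete walk-through (the library name and version pin are purely illustrative), adding a new dependency looks like this:

echo "geopandas<1.0" >> requirements.in    # 1. declare the high-level requirement
make requirements                          # 2. recompile requirements.txt and update your env
git commit -am "Add geopandas dependency"  # 3. commit both requirements files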

📜 Documentation

We are using Quarto to maintain the UNICEF AI4D Poverty Mapping documentation site.

Here are some quick tips for running Quarto and updating the doc site:

  • Install Quarto (the site was built with v1.2.247; the Debian installer is shown, so adjust for your platform):
sudo dpkg -i quarto-1.2.247-linux-amd64.deb
  • Preview the site locally:
quarto preview --port 4444 --no-browser
  • Update the site (must have maintainer role):
quarto publish gh-pages --no-browser
  • Pro-tip: If you are using VS Code as your code editor, install the Quarto extension to make editing/previewing the doc site a lot smoother.
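When you only changed one page, quarto render can rebuild that file on its own, which is faster than re-rendering the whole site (the path below is a placeholder for whichever page you edited):

quarto render index.qmd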
