-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
geoJson data for umbrella projects and their subsidiaries #27
Comments
@cc50liu, umbrella projects and related projects are going to have a fairly different relationship than the project/location relationship of previously geocoded datasets such as the World Bank one you mentioned. The short version is that umbrella projects should be dropped from most uses of the geospatial features, and any non-umbrella projects referenced by an umbrella project should be treated as individual unrelated projects. The long version: Umbrella projects are provided for additional context on agreements and financial arrangements that are in many practical ways separate from actual project activities. The two examples of umbrella projects are:
And the actual documentation describing umbrella projects for more context:
The effective duplication of some project information across the umbrella projects and connected non-umbrella projects is a core reason for distinguishing between the two. If you aggregate the total commitment value of a set of connect umbrella and non-umbrella projects you would potentially be doubling the actual value of the projects. As a side note, given that umbrella project geospatial features are typically not useful and potentially even inaccurate (relative to what ends up being actually implemented) we will likely revisit whether umbrella projects should be included at all in the geospatial dataset. The last potential scenario I will mention is if you are linking your analysis of the geospatial features to broader policy/etc analysis in which, for example, you might need to consider the relationship or timing of umbrella projects (initial agreements), actual project implementation, and/or other umbrella projects (debt relief). There is no formal lookup for linking umbrella projects to non-umbrella projects, but some of our team has written some code to parse these relationships out for their own analysis (though it may not cover all umbrella projects). If you end up needing to head down this road, I can put you in touch with my colleagues. |
Is there a data source that contains a structured representation of the one-to-many relationship between umbrella projects and their subsidiary projects?
The AidData WorldBank dataset handles one-to-many relationship situations by providing separate files; for example, the
locations.csv
file can have multiple locations for each project record found in theprojects.csv
file. Is there something comparable to that in the China dataset where I can get a list of all the subsidiary project ids below an umbrella project?I'm asking because I notice that some umbrella projects have geoJson data associated with them, and I would like to confirm that their sub-projects also have specific geoJson data before I exclude the umbrella projects from my analysis. Here are some project ids of umbrella projects that have geoJson data:
41552
53002
30171
53184
56910
1713
1557
Is reading the text descriptions the only way to identify their subsidiary projects so I can verify the subsidiaries also have geoJson data? Or, is there some structured data available that would facilitate this? Many thanks.
The text was updated successfully, but these errors were encountered: