Skip to content

Commit

Permalink
Minor changes to docs
Browse files Browse the repository at this point in the history
  • Loading branch information
kzollove committed Dec 6, 2023
1 parent 914b823 commit b8c04fb
Show file tree
Hide file tree
Showing 4 changed files with 126 additions and 8 deletions.
109 changes: 109 additions & 0 deletions docs/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -478,6 +478,115 @@ <h1><strong>Meeting Schedule</strong></h1>
meeting link</a>)</p>
<p><br></p>
</div>
<div id="site-map" class="section level1">
<h1><strong>Site Map</strong></h1>
<p>This site contains the following sections:</p>
<p><br></p>
<div id="background" class="section level3">
<h3><strong><a href="background.html">Background</a></strong></h3>
<table>
<colgroup>
<col width="40%" />
<col width="60%" />
</colgroup>
<tbody>
<tr class="odd">
<td><a href="background.html#Problem_Space">Problem Space</a></td>
<td align="left">High level summary of working group mission</td>
</tr>
<tr class="even">
<td><a href="background.html#Goals">Goals</a></td>
<td align="left">Goals of working group</td>
</tr>
<tr class="odd">
<td><a href="background.html#Notable_Challenges">Notable
Challenges</a></td>
<td align="left">Core challenges of effort</td>
</tr>
<tr class="even">
<td><a
href="background.html#PublicationsPresentation">Publications/Presentations</a></td>
<td align="left">Links to some relevant publications &amp;
presentations</td>
</tr>
</tbody>
</table>
</div>
<div id="development-effort" class="section level3">
<h3><strong>Development Effort</strong></h3>
<table>
<colgroup>
<col width="40%" />
<col width="60%" />
</colgroup>
<tbody>
<tr class="odd">
<td><strong><a href="development.html">Development Overview</a></strong>
<br> <a href="development.html#Purpose_and_Goals">Purpose and Goals</a>
<br> <a href="development.html#Context">Context</a> <br> <a
href="development.html#Scope">Scope</a> <br> <a
href="development.html##What_we_need">What we need</a> <br> <a
href="development.html#Project_Management">Project Management</a></td>
<td align="left">Overview of current development effort</td>
</tr>
<tr class="even">
<td><strong><a href="progressMap.html">Progress Map</a></strong> <br> <a
href="progressMap.html#Miro_Map">Miro Map</a> <br> <a
href="progressMap.html#Roadmap">Roadmap</a></td>
<td align="left">Miro Map of completed and outstanding work within scope
(with links)</td>
</tr>
<tr class="odd">
<td><strong><a href="githubProject.html">Github Project
Orientation</a></strong> <br> <a
href="githubProject.html#Approach">Approach</a> <br> <a
href="githubProject.html#Architecture">Architecture</a> <br> <a
href="githubProject.html#Project_Views">Project Views</a> <br> <a
href="githubProject.html#Approach">Example Walkthrough</a></td>
<td align="left">Documentation about navigating and understanding the
Github Project and approach</td>
</tr>
<tr class="even">
<td><strong><a href="gettingInvolved.html">Getting Involved</a></strong>
<br> <a
href="gettingInvolved.html#join_collaboration_channels">Collaboration
Channels</a> <br> <a
href="gettingInvolved.html#Review_Project_Documentation">Review Project
Documentation</a></td>
<td align="left">Suggestions and links for getting started in the
effort!</td>
</tr>
</tbody>
</table>
</div>
<div id="conventions" class="section level3">
<h3><strong><a href="conventions.html">Conventions</a></strong></h3>
<table>
<colgroup>
<col width="40%" />
<col width="60%" />
</colgroup>
<tbody>
<tr class="odd">
<td><a href="conventions.html#Diagnostic">Diagnostic</a></td>
<td align="left">Overview of current development effort</td>
</tr>
<tr class="even">
<td><a href="conventions.html#Treatment">Treatment</a></td>
<td align="left">Overview of current development effort</td>
</tr>
</tbody>
</table>
<hr />
</div>
<div id="model" class="section level3">
<h3><strong><a href="model.html">Model</a></strong></h3>
<hr />
</div>
<div id="tools" class="section level3">
<h3><strong><a href="tooling.html">Tools</a></strong></h3>
</div>
</div>



Expand Down
2 changes: 1 addition & 1 deletion rmd/_site.yml
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@ navbar:
href: background.html
- text: "Data Models"
icon: fa-database
href: gaia-datamodels.html
href: datamodels.html
- text: "Gaia"
icon: fa-globe
menu:
Expand Down
File renamed without changes.
23 changes: 16 additions & 7 deletions rmd/gaia-intro.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -8,24 +8,33 @@ output:
collapsed: false
---

# **OHDSI GIS Gaia Overview**
# **OHDSI GIS Gaia**

**Gaia** refers to the amalgamation of infrastructure, software, standards, tools, and the overall workflow that the OHDSI GIS Workgroup has developed to assist researchers with integrating place-based datasets into their patient-based health database and subsequent analyses.
**Gaia** refers to the amalgamation of infrastructure, software, standards, tools, and the overall workflow that the OHDSI GIS Workgroup has developed to assist researchers with integrating place-based datasets into their patient-centric health database and perform subsequent analyses.

**Gaia** includes multiple major elements:
- gaiaCatalog: a functional metadata catalog containing references to publicly-hosted geospatial datasets and instructions for their download and standardization
- gaiaCore: a Postgis database for managing harmonized data sources, the dockerized DeGauss geocoding tool, and gaiaR, an R package for managing interactions between gaiaCore, gaiaCatalog, and any of the Gaia "extensions"
- Extensions: a broad suite of software packages that are powered by gaiaCore. The most relevant of these packages is gaiaOhdsi, an R package that contains operations specific to interacting with an OMOP CDM or external OHDSI software. Other example of extensions are the gaiaVis tools which provide a set of visualizations for data in gaiaCore.
- GaiaCatalog: a functional metadata catalog containing references to publicly-hosted geospatial datasets and instructions for their download and standardization
- GaiaCore: a Postgis database for managing harmonized data sources, the dockerized DeGauss geocoding tool, and an R package.
- Extensions: a broad suite of software packages providing functionality for interacting with GaiaCore. The most relevant of these extensions is gaiaOhdsi, an R package that contains operations specific to interacting with an OMOP CDM or external OHDSI software. Another example extension are the gaiaVis tools which provide a set of visualizations for data in GaiaCore.

## Purpose

What is the purpose of Gaia? Why are we doing all of this?

Gaia provides a standardized, automated, reproducible, and easily shareable means for integrating place-based datasets into a database of longitudinal patient health data.

The *simplest case* for Gaia is a single researcher looking to leverage place-based data. After standing up a local or cloud instance of gaiaCore, any researcher now has access to a wealth of curated sources of geospatial data ranging from environmental toxin data to one of many Social Determinants of Health Indexes derived from the US Census data. Instead of the countless hours of work typical to munging multiple disparate geospatial datasets, the researcher can simply use the functions from the gaiaR package to load datasets into their Postgis database all in a harmonized geospatial data format. They've now quickly enabled datasets across many domains, years, and regions in a single Postgis database to which they connect using the software of their choice and begin performing ad hoc exploratory data analyses, creating visualizations, or even powering their own geospatial applications.
## The Case for Gaia

Taking this scenario a step further, a researcher with an established OMOP CDM database may wish to incorporate a subset of geospatial variables into their CDM database alongside their patient health data. The steps necessary to perform this ingestion, which requires geocoding of patient address and a spatiotemporal join, are all handled by gaiaCore and the gaiaOhdsi extension. Thehe DeGauss geocoder, a lightweight geocoder that operates fully locally to ensure that patient information is not transmitted, is easily utilized through a gaiaR wrapper. standardized spatiotemporal joins from the gaiaOhdsi extension relate patient addresses to polygon, line, or point geometries. By transforming the place-based data into patient-level information, it is now ready to be inserted into the CDM extension table "exposure_occurrence". The DDL and insert scripts for this table are also contained in the gaiaOhdsi extension. Once the data has been added to the CDM, it can be used to create cohort definitions, develop predictive models, and generally utilized by all relevant external OHDSI tooling.
### Simplest case

The *simplest case* for Gaia is a single researcher looking to leverage place-based data. After standing up a local or cloud instance of GaiaCore, any researcher now has access to a wealth of curated sources of geospatial data ranging from environmental toxin data to one of many Social Determinants of Health Indexes derived from the US Census data. Instead of the countless hours of work typical to munging multiple disparate geospatial datasets, the researcher can simply use the functions from the gaiaR package to load datasets into their Postgis database all in a harmonized geospatial data format. They've now quickly enabled datasets across many domains, years, and regions in a single Postgis database to which they connect using the software of their choice and begin performing ad hoc exploratory data analyses, creating visualizations, or even powering their own geospatial applications.


### Using GaiaCore with an OMOP-shaped database

Taking this scenario a step further, a researcher with an established OMOP CDM database may wish to incorporate a subset of geospatial variables into their CDM database alongside their patient health data. The steps necessary to perform this ingestion, which requires geocoding of patient address and a spatiotemporal join, are all handled by GaiaCore and the gaiaOhdsi extension. Thehe DeGauss geocoder, a lightweight geocoder that operates fully locally to ensure that patient information is not transmitted, is easily utilized through a gaiaR wrapper. standardized spatiotemporal joins from the gaiaOhdsi extension relate patient addresses to polygon, line, or point geometries. By transforming the place-based data into patient-level information, it is now ready to be inserted into the CDM extension table "exposure_occurrence". The DDL and insert scripts for this table are also contained in the gaiaOhdsi extension. Once the data has been added to the CDM, it can be used to create cohort definitions, develop predictive models, and generally utilized by all relevant external OHDSI tooling.

### Federated networks and research

Finally, Gaia enables standardized and reproducible workflows for federated data networks and studies. The process highlighted above to retrieve and harmonize geospatial datasets, perform spatiotemporal joins to transform place-based data to person-level information, and insert person-level information into an OMOP CDM and define cohorts, is fully reproducible. Each step of the process contains detailed, structured metadata focused on provenance of source data and rationale for transformation methods. By scripting and containerizing an entire Gaia workflow, the process of pairing place-based data, often handled using undocumented ad-hoc methods unique to single sites, can be packaged and shipped across an entire network with minimal effort.

0 comments on commit b8c04fb

Please sign in to comment.