Skip to content

A reference guide for accessing and using data in the cloud.

Notifications You must be signed in to change notification settings

eosdis-nasa/cloud-data-resource

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 

Repository files navigation

EOSDIS Cloud Data Resource

Welcome to the EOSDIS Cloud Data Resource page! This document serves to act as a guide and resource to users working in the cloud. This page is meant to serve as an amalgamation of NASA's resources for using data from and in the cloud. This includes sources from NASA, NASA partners and other affiliates.

Why Use the Cloud?

NASA EOSDIS is evolving to use a commercial cloud to ingest, archive, process, distribute, and manage the anticipated large volumes of new mission data. Placing the EOSDIS archive collectively in the cloud will, for the first time, place NASA Earth Observing data “close to compute” and improve management and accessibility of these data while also expediting science discovery for data users. Having EOSDIS data in the cloud will not change the existing user experience of interacting with these data, but it will offer new methods of access not otherwise possible with on-premises platforms.

Earthdata Cloud Primer for Amazon Web Services

The Cloud Primer* is a series of tutorial PDFs to help walk a user through the steps needed to do analysis in the cloud. The Primer focuses on doing so through AWS (via EC2 and S3) as our data archives are stored in AWS. Users can take best advantage of data speeds and compute resources by using these Amazon services which "sit" directly next to the data.

The tutorial is broken up into 9 parts:

01 — Why Use the Cloud?
02 — Understanding and Managing Costs in the AWS Cloud
03 — Create a Basic Elastic Cloud Computer (EC2) Instance
04 — Connect to an AWS EC2 Instance - Windows and PuTTY
05 — Connect to an AWS EC2 Instance - Mac OS X
06 — Create Cloud Storage Using AWS Single Storage Service (S3) Buckets
07 — Access AWS-hosted S3 Earthdata
08 — Bring Your Own Script to the Cloud
09 — Glossary and Acronyms Explained\

*Note: The Cloud Primer is currently in the process of being updated. If you have questions, comments or concerns, please feel free to open an issue.

Openscapes

Openscapes started in 2018 as a Mozilla fellowship to "help other environmental science teams harness the power of collaborative coding and foster a scientific culture that is more welcoming, open, and efficient". It is now a NASA ROSES funded program meant to support researchers migrating workflows to the cloud by working with mentors from the EOSDIS (NASA's Earth Observing System Data and Information System) DAACs (Distributed Active Archive Centers) to help support the specific needs of the NASA community.

Five EOSDIS DAACs are participating in the first cohort: Atmospheric Science Data Center (ASDC), Goddard Earth Sciences Data and Information Services Center (GES DISC), Land Processes DAAC (LP DAAC), National Snow and Ice Data Center DAAC (NSIDC), and Physical Oceanography DAAC (PO.DAAC). Each DAAC selected up to three mentors, who are already helping researchers utilize the Earthdata Cloud.

The Openscapes Github page has phenomenal resources for connecting with mentors, presentations, and the NASA Earthdata Cloud Cookbook which is a great resource to answer questions about if cloud analysis is for you and how to and several how-to's for those less software-inclined. They also have a workshop at AGU 2023: Enabling Analysis in the Cloud Using NASA Earth Science Data!

DAAC Specific Resources

Several DAACs have guides on how they have migrated to the cloud and how you as a user can take advantage of their cloud data. Although these resources are largely specific to that particular DAAC, much of the information is useful in any context.

NASA Earthdata Cloud Data Access Guide

The NASA Earthdata Cloud Data Access Guide from the National Snow and Ice Data Center (NSIDC) is a step-by-step guide for accessing ICESat products in the cloud. It walks through finding the data via Earthdata Search, saving the generated S3 links for cloud use, creating temporary AWS S3 credentials, and using the data from an S3 bucket in an EC2 instance.

LAADS DAAC in the Cloud

The LAADS DAAC in the Cloud from the Level-1 and Atmospheric Archive & Distribution System DAAC (LAADS) includes benefits for moving into the cloud, a video instruction course for using S3 direct access and Earthdata Search in the cloud, some frequently asked questions and a migration to the cloud timeline.

PO.DAAC Cloud Data Resources

PO.DAAC has two cloud data resources - a written slideshow guide detailing how to access data and utilize it in the cloud, as well as a recorded webinar detailing a step-by-step walkthrough of using PO.DAAC data hosted in Earthdata Cloud that assumes zero knowledge of both AWS services and Earthdata Cloud.

Questions? Comments?

If you have any questions or comments regarding using NASA Earthdata in the cloud, or want to request any resources, please feel free to open a Github issue. If you don't have a Github account or are unsure of how to open an issue, you can always email me at nicholas.l.doty@nasa.gov.

Hits

About

A reference guide for accessing and using data in the cloud.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published