From 6d93bbefc4336fc2eea15567205223d403fba8a8 Mon Sep 17 00:00:00 2001 From: Justin Hiemstra Date: Wed, 18 Dec 2024 20:12:01 +0000 Subject: [PATCH 1/3] Add 'pelican enables researchers to...' blurb in index page --- docs/pages/about-pelican.mdx | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/docs/pages/about-pelican.mdx b/docs/pages/about-pelican.mdx index cb9fad833..48209fbc2 100644 --- a/docs/pages/about-pelican.mdx +++ b/docs/pages/about-pelican.mdx @@ -5,11 +5,16 @@ import ImageRow from "@/components/ImageRow"; Pelican is an open-source software platform for building data federations that works by connecting a broad range of data repositories under a unified architecture. Whether data lives on a POSIX filesystem, in S3, or behind an HTTP server, Pelican aims to bring this data together and simplify its access by abstracting away the need to know where it comes from. **Pelican's goals are to**: -- Enable researchers to access data from wherever it lives wherever they need it -- without having to learn multiple backend technologies. This access could take place in a Jupyter notebook, a campus cluster, or from national-scale computing infrastructure like the [OSPool](https://osg-htc.org/services/open_science_pool.html). -- Enable repositories and storage providers to make their data accessible to a broad range of users while maintaining control over how their data is accessed and by whom +- Enable access to data wherever its needed, regardless of where it comes from -- without having to learn multiple backend technologies. This access could take place in a Jupyter notebook, a campus cluster, or from national-scale computing infrastructure like the [OSPool](https://osg-htc.org/services/open_science_pool.html). +- Support Open Science initiatives by supporting inter-disciplinary data sharing - Encourage and support [FAIR](https://www.go-fair.org/fair-principles/) data practices - Allow computing providers to stage data on-site as it's needed +**Pelican enables researchers to**: +- Integrate their data with national-scale computing infrastructure, with a focus on easy setup, distributed data caching and object delivery efficiency +- Make their data accessible to a broad range of users while maintaining control over how their data is accessed and by whom +- Aggregate disparate data repositories like S3, Globus and Posix under a common namespace + The flagship federation underpinned by Pelican is called the [Open Science Data Federation](https://osdf.osg-htc.org/) (OSDF), which serves a variety of large scientific collaborations across more than fifty data providers and approximately two dozen caches located throughout the world, often at points of presence within the global Research and Education networks such as ESNet and Internet2. ## Core Concepts and Terminology From c8697925f4e464ffb7277658514074cda50187a6 Mon Sep 17 00:00:00 2001 From: Justin Hiemstra <75916364+jhiemstrawisc@users.noreply.github.com> Date: Thu, 16 Jan 2025 10:05:35 -0600 Subject: [PATCH 2/3] Update docs/pages/about-pelican.mdx Co-authored-by: Andrew Owen <60940495+aowen-uwmad@users.noreply.github.com> --- docs/pages/about-pelican.mdx | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/pages/about-pelican.mdx b/docs/pages/about-pelican.mdx index 48209fbc2..82dc96023 100644 --- a/docs/pages/about-pelican.mdx +++ b/docs/pages/about-pelican.mdx @@ -5,7 +5,7 @@ import ImageRow from "@/components/ImageRow"; Pelican is an open-source software platform for building data federations that works by connecting a broad range of data repositories under a unified architecture. Whether data lives on a POSIX filesystem, in S3, or behind an HTTP server, Pelican aims to bring this data together and simplify its access by abstracting away the need to know where it comes from. **Pelican's goals are to**: -- Enable access to data wherever its needed, regardless of where it comes from -- without having to learn multiple backend technologies. This access could take place in a Jupyter notebook, a campus cluster, or from national-scale computing infrastructure like the [OSPool](https://osg-htc.org/services/open_science_pool.html). +- Enable access to data wherever it is needed, regardless of where it comes from -- without having to learn multiple backend technologies. This access could take place in a Jupyter notebook, a campus cluster, or from national-scale computing infrastructure like the [OSPool](https://osg-htc.org/services/open_science_pool.html). - Support Open Science initiatives by supporting inter-disciplinary data sharing - Encourage and support [FAIR](https://www.go-fair.org/fair-principles/) data practices - Allow computing providers to stage data on-site as it's needed From a4e69faa6b0189881349c7a122f45291e4d38299 Mon Sep 17 00:00:00 2001 From: Justin Hiemstra Date: Thu, 16 Jan 2025 16:08:32 +0000 Subject: [PATCH 3/3] Use 'coalesce' instead of 'aggregate' to describe combining storage repos under a common namespace --- docs/pages/about-pelican.mdx | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/pages/about-pelican.mdx b/docs/pages/about-pelican.mdx index 82dc96023..c02cf9e51 100644 --- a/docs/pages/about-pelican.mdx +++ b/docs/pages/about-pelican.mdx @@ -13,7 +13,7 @@ Pelican is an open-source software platform for building data federations that w **Pelican enables researchers to**: - Integrate their data with national-scale computing infrastructure, with a focus on easy setup, distributed data caching and object delivery efficiency - Make their data accessible to a broad range of users while maintaining control over how their data is accessed and by whom -- Aggregate disparate data repositories like S3, Globus and Posix under a common namespace +- Coalesce disparate data repositories like S3, Globus and Posix under a common namespace The flagship federation underpinned by Pelican is called the [Open Science Data Federation](https://osdf.osg-htc.org/) (OSDF), which serves a variety of large scientific collaborations across more than fifty data providers and approximately two dozen caches located throughout the world, often at points of presence within the global Research and Education networks such as ESNet and Internet2.