From 83c9fd5f8210575492fdf7267fa5e8ff86e38ac7 Mon Sep 17 00:00:00 2001 From: Dan King Date: Tue, 10 Oct 2023 10:47:16 -0400 Subject: [PATCH 1/4] add Hail to the out-of-core community list --- web/pandas/community/ecosystem.md | 15 +++++++++++++++ 1 file changed, 15 insertions(+) diff --git a/web/pandas/community/ecosystem.md b/web/pandas/community/ecosystem.md index e6a6b3a8531ca..cfc2dd8798f5b 100644 --- a/web/pandas/community/ecosystem.md +++ b/web/pandas/community/ecosystem.md @@ -512,6 +512,21 @@ wasted). - ``vaex.from_pandas`` - ``vaex.to_pandas_df`` +### [Hail Query](https://hail.is/) + +An out-of-core, preemptible-safe, distributed, dataframe library serving +the genetics community. Hail Query ships with on-disk data formats, +in-memory data formats, an expression compiler, a query planner, and a +distributed sort algorithm all designed to accelerate queries on large +matrices of genome sequencing data. + +It is often easiest to use Pandas to manipulate the summary statistics or +other small aggregates produced by Hail. For this reason, Hail provides +native import to and export from Pandas DataFrames: + +- [`Table.from_pandas`](https://hail.is/docs/0.2/hail.Table.html#hail.Table.from_pandas) +- [`Table.to_pandas`](https://hail.is/docs/0.2/hail.Table.html#hail.Table.to_pandas) + ## Data cleaning and validation ### [pyjanitor](https://github.com/pyjanitor-devs/pyjanitor) From 2abd4e14846264abbba709cf40d855e298e0e74a Mon Sep 17 00:00:00 2001 From: Dan King Date: Thu, 19 Oct 2023 10:18:27 -0400 Subject: [PATCH 2/4] Pandas to pandas Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com> --- web/pandas/community/ecosystem.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/web/pandas/community/ecosystem.md b/web/pandas/community/ecosystem.md index cfc2dd8798f5b..ea589be3fe469 100644 --- a/web/pandas/community/ecosystem.md +++ b/web/pandas/community/ecosystem.md @@ -520,7 +520,7 @@ in-memory data formats, an expression compiler, a query planner, and a distributed sort algorithm all designed to accelerate queries on large matrices of genome sequencing data. -It is often easiest to use Pandas to manipulate the summary statistics or +It is often easiest to use pandas to manipulate the summary statistics or other small aggregates produced by Hail. For this reason, Hail provides native import to and export from Pandas DataFrames: From 95741f5a487122a0629841aa3372b66686784f75 Mon Sep 17 00:00:00 2001 From: Dan King Date: Thu, 19 Oct 2023 11:13:12 -0400 Subject: [PATCH 3/4] Pandas to pandas Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com> --- web/pandas/community/ecosystem.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/web/pandas/community/ecosystem.md b/web/pandas/community/ecosystem.md index ea589be3fe469..8881387834b22 100644 --- a/web/pandas/community/ecosystem.md +++ b/web/pandas/community/ecosystem.md @@ -522,7 +522,7 @@ matrices of genome sequencing data. It is often easiest to use pandas to manipulate the summary statistics or other small aggregates produced by Hail. For this reason, Hail provides -native import to and export from Pandas DataFrames: +native import to and export from pandas DataFrames: - [`Table.from_pandas`](https://hail.is/docs/0.2/hail.Table.html#hail.Table.from_pandas) - [`Table.to_pandas`](https://hail.is/docs/0.2/hail.Table.html#hail.Table.to_pandas) From 897ce0653fd01b8d67c1c13ba00295733c772250 Mon Sep 17 00:00:00 2001 From: Dan King Date: Thu, 26 Oct 2023 22:04:45 -0400 Subject: [PATCH 4/4] Update ecosystem.md --- web/pandas/community/ecosystem.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/web/pandas/community/ecosystem.md b/web/pandas/community/ecosystem.md index 8881387834b22..58c5da67bcd74 100644 --- a/web/pandas/community/ecosystem.md +++ b/web/pandas/community/ecosystem.md @@ -524,8 +524,8 @@ It is often easiest to use pandas to manipulate the summary statistics or other small aggregates produced by Hail. For this reason, Hail provides native import to and export from pandas DataFrames: -- [`Table.from_pandas`](https://hail.is/docs/0.2/hail.Table.html#hail.Table.from_pandas) -- [`Table.to_pandas`](https://hail.is/docs/0.2/hail.Table.html#hail.Table.to_pandas) +- [`Table.from_pandas`](https://hail.is/docs/latest/hail.Table.html#hail.Table.from_pandas) +- [`Table.to_pandas`](https://hail.is/docs/latest/hail.Table.html#hail.Table.to_pandas) ## Data cleaning and validation