What do the Hydra docs mean by "DuckDB tables"? #440
-
https://docs.hydra.so/components/duckdb
My data is not big enough (< 100GB) to be very interested in reading Parquet direct from S3, except maybe as a way to pre-load it into the db ... I want it on local SSD for fastest possible queries. When I tested standalone duckdb from Python it seemed like it was faster to query against data that had first been loaded into a table via So I was hoping to do the same via pg_duckdb I couldn't find other mentions of "DuckDB tables" concept in the docs here - is that a thing? or it just means any tables in a db with To get the behaviour I want can I do a regular |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
That would be any table that is You can copy / cache parquet files from S3 to local SSD using the |
Beta Was this translation helpful? Give feedback.
That would be any table that is
using duckdb
, which currently are either MotherDuck tables or temporary tables.You can copy / cache parquet files from S3 to local SSD using the
duckdb.cache
function which will then use the SSD copy automatically after it's downloaded. We just added some more functions to allow management of the cache, so you can expect to see that shipped in a future release.