[CT-676] Set default table type to parquet (instead of implicit default text) #363

Closed
iknox-fa opened this issue May 20, 2022 · 4 comments
Labels: bug (Something isn't working), Stale

Comments

@iknox-fa
Contributor

Per this discussion (https://dbt-labs.slack.com/archives/C017GDLAF7D/p1652996315344739), it may make sense to set the default table type to delta.

iknox-fa added the bug and triage labels on May 20, 2022
github-actions bot changed the title from "Set default table type to delta" to "[CT-676] Set default table type to delta" on May 20, 2022
@jtcohen6
Contributor

Probably parquet, given that not everyone uses dbt-spark with access to the delta file format. We actually have code already that's meant to do this, but I suspect that "adapter-specific configs" are not being resolved correctly in dbt-core:

file_format: str = "parquet"

In the meantime, we could set that default here instead:

{%- set file_format = config.get('file_format', validator=validation.any[basestring]) -%}

The default file format in the dbt-databricks plugin is indeed set to delta, since users are guaranteed access to it in the Databricks Runtime, and it gets them some extra functionality.

@lostmygithubaccount

@iknox-fa @jtcohen6 Anything further here? Should we keep this issue open, remove the triage label and move it to the backlog, or something else?

@jtcohen6
Contributor

jtcohen6 commented Aug 9, 2022

Let's:

  • Set the default to parquet, by changing the config.get logic in spark__file_format_clause
  • Make sure we have a ticket to track the fact that AdapterSpecific configs don't really work as intended. Most recently captured in [CT-634] Node configs: tech debt dbt-core#5236
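The first item above amounts to giving the file-format lookup a fallback value, so that the `using <format>` clause is always emitted instead of being left out (per the issue title, an omitted clause means an implicit default of text). Below is a minimal Python sketch of that fallback logic. It is not the actual Jinja macro: `file_format_clause`, `model_config`, and `DEFAULT_FILE_FORMAT` are hypothetical names used only for illustration, and the exact argument that the real `config.get` takes for its default is an assumption here.

```python
from typing import Any, Mapping, Optional

# Assumption: "parquet" is the default proposed in this thread.
DEFAULT_FILE_FORMAT = "parquet"


def file_format_clause(
    model_config: Mapping[str, Any],
    default: Optional[str] = DEFAULT_FILE_FORMAT,
) -> str:
    """Return the `using <format>` clause, or '' if no format resolves.

    Hypothetical stand-in for the lookup done in spark__file_format_clause.
    """
    # Fall back to the default only when the model sets no file_format.
    file_format = model_config.get("file_format", default)
    if file_format is None:
        # No clause at all: the table type is left to the engine's default.
        return ""
    if not isinstance(file_format, str):
        raise TypeError(f"file_format must be a string, got {type(file_format).__name__}")
    return f"using {file_format}"


# A model without an explicit file_format picks up the parquet default.
print(file_format_clause({}))
# An explicit setting still wins over the default.
print(file_format_clause({"file_format": "delta"}))
```

Passing `default=None` reproduces the behavior this issue describes, where no clause is emitted for models that do not set `file_format`.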

jtcohen6 removed the triage label on Aug 9, 2022
jtcohen6 changed the title from "[CT-676] Set default table type to delta" to "[CT-676] Set default table type to parquet (instead of implicit default text)" on Aug 9, 2022
@github-actions
Contributor

github-actions bot commented Feb 7, 2023

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.


3 participants