Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CT-3567] [Feature] unit testing support format: sql #9405

Closed
3 tasks done
Tracked by #8283
graciegoheen opened this issue Jan 18, 2024 · 2 comments · Fixed by #9873
Closed
3 tasks done
Tracked by #8283

[CT-3567] [Feature] unit testing support format: sql #9405

graciegoheen opened this issue Jan 18, 2024 · 2 comments · Fixed by #9873
Assignees
Labels
enhancement New feature or request user docs [docs.getdbt.com] Needs better documentation
Milestone

Comments

@graciegoheen
Copy link
Contributor

graciegoheen commented Jan 18, 2024

Is this your first time submitting a feature request?

  • I have read the expectations for open source contributors
  • I have searched the existing issues, and I could not find an existing issue for this feature
  • I am requesting a straightforward extension of existing dbt functionality, rather than a Big Idea better suited to a discussion

Describe the feature

We should allow support to set format: sql for unit test definition:

  - name: test_valid_email_address
    model: dim_wizards 
    given: 
      - input: ref('stg_wizards')
        format: sql
        rows: |
          select 1 as wizard_id, cool@example.com as email, example.com as email_top_level_domain union all 
          select 2 as wizard_id, cool@unknown.com as email, unknown.com as email_top_level_domain union all 
          select 3 as wizard_id, badgmail.com as email, gmail.com as email_top_level_domain union all 
          select 4 as wizard_id, missingdot@gmailcom as email, gmail.com as email_top_level_domain
      - input: ref('top_level_email_domains')
        format: sql
        rows: |
          select example.com as tld union all,
          select gmail.com as tld
    expect: 
      format: sql
      rows: |
        select 1 as wizard_id, true as is_valid_email_address union all
        select 2 as wizard_id, false as is_valid_email_address union all
        select 3 as wizard_id, false as is_valid_email_address union all
        select 4 as wizard_id, false as is_valid_email_address 

We should also allow folks to also use sql fixutres:

  - name: test_valid_email_address
    model: dim_wizards 
    given: 
      - input: ref('stg_wizards')
        format: sql
        fixture: wizard_emails_input
      - input: ref('top_level_email_domains')
        format: sql
        fixture: tld_input
    expect: 
      format: sql
      fixture: wizard_emails_output

Anything else?

  • One of the main motivations for this is this would automatically allow support for complex data types (if you want to use type variant/json/etc. just use format: sql) [CT-2998] [SPIKE] Handle unit testing of JSON and ARRAY data types #8423
  • We would not allow for folks to use jinja in this SQL, the reasons being:
    • we don't think folks should {{ ref() }} a model from their static input data (we don't want to consider DAG order, we don't wan't to point to data that could change, etc.)
    • this would mean folks couldn't use cross-db macros in their SQL (we're ok with that sacrifice)
    • this would mean folks couldn't use macros in their SQL

Scope

  • Allow for SQL fixtures, but throw error with jinja present.
@graciegoheen graciegoheen added enhancement New feature or request triage labels Jan 18, 2024
@github-actions github-actions bot changed the title [Feature] unit testing support format: sql [CT-3567] [Feature] unit testing support format: sql Jan 18, 2024
@martynydbt martynydbt added this to the v1.8 milestone Feb 26, 2024
@MichelleArk MichelleArk added the user docs [docs.getdbt.com] Needs better documentation label Apr 3, 2024
@MichelleArk
Copy link
Contributor

Caveats for documentation:

  • if you use format: sql you have to specify all of the columns, but you can write out the exact SQL you want to pass (can use whatever datatypes you want)
  • jinja is not supported in SQL fixtures for unit tests

@FishtownBuildBot
Copy link
Collaborator

Opened a new issue in dbt-labs/docs.getdbt.com: dbt-labs/docs.getdbt.com#5289

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request user docs [docs.getdbt.com] Needs better documentation
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants