Granular Key Assertions #106

gowerc · 2024-07-10T17:06:26Z

Closes #76

github-actions · 2024-07-10T17:09:53Z

Code Coverage Summary

Filename                Stmts    Miss  Cover    Missing
--------------------  -------  ------  -------  --------------------------
R/ascii_tables.R          105       8  92.38%   10, 148, 158, 163-166, 211
R/cast_variables.R         49       0  100.00%
R/diffdf.R                209      18  91.39%   373-390, 417
R/generate_keyname.R       10       1  90.00%   16
R/identify.R              152       8  94.74%   283-290
R/is_different.R           52       0  100.00%
R/issuerows.R              40       0  100.00%
R/issues.R                 17       1  94.12%   51
R/misc_functions.R         34       2  94.12%   9, 13
R/print.R                  20       0  100.00%
TOTAL                     688      38  94.48%

Results for commit: 1acb505

Minimum allowed coverage is 80%

♻️ This comment has been updated with latest results

github-actions · 2024-07-10T17:10:42Z

Unit Tests Summary

1 files 13 suites 6s ⏱️
52 tests 51 ✅ 1 💤 0 ❌
578 runs 571 ✅ 7 💤 0 ❌

Results for commit 1acb505.

♻️ This comment has been updated with latest results.

kieranjmartin

I am wondering, if we are changing this, whether we might also tell the user what the two different modes/classes are?

I think it would be much cleaner to have something like

"Error, key1 is numeric in BASE and character in COMPARE" rather than mentioning modes at all, which the user might not understand.

Similarly

"Error, key1 has class 'A' in BASE and class 'B' in COMPARE"

R/diffdf.R

gowerc · 2024-07-16T16:18:08Z

> I am wondering, if we are changing this, whether we might also tell the user what the two different modes/classes are?
>
> I think it would be much cleaner to have something like
>
> "Error, key1 is numeric in BASE and character in COMPARE" rather than mentioning modes at all, which the user might not understand.
>
> Similarly
>
> "Error, key1 has class 'A' in BASE and class 'B' in COMPARE"

I'm not sure... like I get the appeal of this, but in practice it's kinda messy. If there are several variables with mismatches, or if it's a variable with lots of classes, it is likely difficult to print this in a nice way; ideally you'd probs then want to print a table, but that just feels like overkill. My gut feeling is that what we have here is more than enough to inform the user on what to look into to fix the error.
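To make the trade-off concrete, here is a minimal sketch of the per-variable message style proposed above. The function name `describe_class_mismatch` and the example data are hypothetical and are not part of diffdf; the point is only to show how a multi-class variable lengthens the message.

```r
# Hypothetical helper (not diffdf code): build one message line per key
# whose class vector differs between the two datasets.
describe_class_mismatch <- function(base, compare, keys) {
    lines <- character(0)
    for (key in keys) {
        # A variable can carry several classes, so collapse them into one string
        cls_base <- paste(class(base[[key]]), collapse = "/")
        cls_comp <- paste(class(compare[[key]]), collapse = "/")
        if (!identical(cls_base, cls_comp)) {
            lines <- c(
                lines,
                sprintf(
                    "%s has class '%s' in BASE and class '%s' in COMPARE",
                    key, cls_base, cls_comp
                )
            )
        }
    }
    lines
}

base <- data.frame(key1 = c(1, 2))
compare <- data.frame(
    key1 = as.POSIXct(c("2024-01-01", "2024-01-02"), tz = "UTC")
)
msgs <- describe_class_mismatch(base, compare, "key1")
print(msgs)
```

With a single key this stays readable; with several keys, each carrying multiple classes, the output grows to one long line per key, which is the concern raised in this comment.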

kieranjmartin · 2024-07-18T15:33:56Z

GitHub doesn't have threaded replies :(

I think my main issue with the error message as it stands is that mode may not mean anything to the user right now. Maybe we could even add an explanation: "A and B have different modes (different types of data, e.g. numeric and character)"

or maybe even just say they have different classes, as that will also be true I think, and class is better understood, and we error if they don't match on class anyway?

gowerc · 2024-07-25T14:40:47Z

@kieranjmartin - Can you remember why we ended up using mode instead of typeof? From a very quick surface scan it almost feels like typeof is more granular and thus more accurate. I also imagine messages of "are a different type" would be more meaningful than "different mode".

EDIT ---

Useful breakdown of where they differ from each other: link. Some other commenters there seem to be implying that typeof and class are the most important and that mode is pretty much value-less.

Overall I'm not sure how I want to proceed with this. I really don't want to say "different types" or "different classes" if we are comparing modes because, whilst many users won't know the difference, that doesn't change the fact that it would be wrong / misleading. Potentially we could just add a "(see ?mode)" to point users in the right direction?
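As a rough illustration of the "(see ?mode)" idea, the sketch below checks key variables for matching modes and appends the pointer to the error. The function name `assert_keys_same_mode` and the message wording are assumptions made for this example; they are not taken from the diffdf source.

```r
# Hypothetical assertion (not diffdf's implementation): error if any key
# variable has a different mode in the two datasets.
assert_keys_same_mode <- function(base, compare, keys) {
    base_mode <- vapply(base[keys], mode, character(1))
    comp_mode <- vapply(compare[keys], mode, character(1))
    bad <- keys[base_mode != comp_mode]
    if (length(bad) > 0) {
        stop(
            "The following keys have different modes in BASE and COMPARE ",
            "(see ?mode): ",
            paste0("'", bad, "'", collapse = ", "),
            call. = FALSE
        )
    }
    invisible(TRUE)
}

base <- data.frame(id = c(1, 2), stringsAsFactors = FALSE)
compare <- data.frame(id = c("1", "2"), stringsAsFactors = FALSE)

# Capture the error message rather than letting the script stop
res <- tryCatch(
    assert_keys_same_mode(base, compare, "id"),
    error = function(e) conditionMessage(e)
)
print(res)
```

The message names only the offending keys and leaves the explanation of "mode" to the help page, which keeps it short even when several keys mismatch.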

kieranjmartin · 2024-07-25T15:23:32Z

> Potentially we could just add a "(see ?mode)" to point users in the right direction ?

Yes, maybe this is the best course forwards actually, just so they have some guidance as to what it means, as I don't think mode is a commonly used term (and honestly I would have to look it up to remember exactly what it means). I do not recall why we did not use type; I think mode is perhaps more pedantic than type, and we wanted to be that pedantic? Although that list seems to say not.

gowerc · 2024-07-25T15:45:04Z

Yeah, I just ran the diffdf tests using type instead of mode and a load of the factor comparison code falls over. I think this is because factors are stored as ints, which have type "integer", whilst reals have type "double", yet they both have a mode of "numeric"; that is to say, type appears to be more fussy than mode. I'm guessing at the time we thought this level of comparison didn't matter, though I feel inclined to disagree now (especially given that we have "strict_factor" to opt in and out of such pedantic comparisons, which can be used to mask this).

I think I would propose switching to typeof instead, though perhaps for v2 as it's likely to introduce some backwards-incompatible changes.
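The factor/real distinction described here can be checked directly in base R. The snippet below is a small standalone sketch (the variable names are illustrative only) that tabulates `mode()`, `typeof()` and the first element of `class()` for a factor, an integer vector and a double vector.

```r
# Base R only: compare how mode(), typeof() and class() describe
# a factor, an integer vector and a double vector.
x_factor <- factor(c("a", "b"))
x_int <- c(1L, 2L)
x_dbl <- c(1, 2)

describe <- function(x) {
    c(mode = mode(x), typeof = typeof(x), class = class(x)[1])
}

summary_tbl <- rbind(
    factor = describe(x_factor),
    integer = describe(x_int),
    double = describe(x_dbl)
)
print(summary_tbl)
```

All three share the mode "numeric", while `typeof()` separates the factor and the integer vector ("integer") from the double vector ("double"), so a check based on `typeof()` rejects factor-versus-double comparisons that a check based on `mode()` lets through.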

kieranjmartin

Approved, but maybe add (see ?mode) to the error message

granular assertions

af375ce

gowerc requested a review from kieranjmartin July 10, 2024 17:06

gowerc assigned kieranjmartin Jul 10, 2024

kieranjmartin reviewed Jul 16, 2024

moved function

f7a35df

fix spelling

69c77b4

Merge branch 'master' into 76-better-errors

b2b4055

kieranjmartin previously approved these changes Jul 26, 2024

added help text

1acb505

gowerc dismissed kieranjmartin’s stale review via 1acb505 July 26, 2024 12:42

gowerc merged commit 0fa69c2 into master Jul 26, 2024
23 checks passed

gowerc deleted the 76-better-errors branch July 26, 2024 13:50
