AthenaLinker: Unexpected output running predict
when one input is empty
#2478
Unanswered
alanakilleen
asked this question in
Q&A
Replies: 2 comments
-
Here is a repro for Splink Version 4. I tested against Again, Sample Code
Expected Output
Actual Output
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Not sure if this repros for other back ends, but after confirming the version 4 repro for Athena, I have created a bug for this issue: #2496. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
We have an recurring job that runs
predict
link_only
in Athena. Sometimes, the incoming dataset is empty.For version
3.9.9
, that would produce an empty result set, which made sense.After upgrading to Splink 3 version
3.9.10+
, the output is unexpected. It appears that in this case the non-empty input dataset is linked to itself?Sample Code
Note that
<bucket name>
should be replaced with an S3 bucket.Expected Output
Actual Output
Beta Was this translation helpful? Give feedback.
All reactions