Expose similarity matching item pairs in Python library (aka crosswalk table) #62
Labels: enhancement, good first issue, help wanted
Task
Make a function called generate_crosswalk_table(all_questions, similarity, threshold), which takes the output of match_instruments and returns the pairs that match above a threshold.
Description
The web UI allows users to see the matching item pairs above a given threshold.
Can we make the Python library also return the matching pairs above a threshold? This is called the crosswalk table.
A crosswalk table contains the same information that currently comes back in the similarity matrix, just in a different format.
It is a long-format data frame that shows each matching pair of questions above a certain threshold, along with their respective IDs, question texts, and match scores. Here's an example structure:
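A minimal sketch of what the requested function could look like, to make the long format concrete. This is not the library's implementation. Everything beyond the proposed signature is an assumption: the `Item` class is a hypothetical stand-in for whatever question objects match_instruments returns, the column names are illustrative, the similarity matrix is assumed to be square and indexed in the same order as `all_questions`, and "above a threshold" is read as "greater than or equal to". The scores in the example are made-up toy values.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """Hypothetical stand-in for a question object (not the library's class)."""
    item_id: str
    text: str


def generate_crosswalk_table(all_questions, similarity, threshold):
    """Return one row per pair of questions whose score is >= threshold.

    `similarity` is assumed to be a square matrix (nested lists or a 2-D
    array) where similarity[i][j] is the score between all_questions[i]
    and all_questions[j].
    """
    rows = []
    n = len(all_questions)
    for i in range(n):
        # Start at i + 1 so each unordered pair appears once and
        # self-matches on the diagonal are skipped.
        for j in range(i + 1, n):
            score = float(similarity[i][j])
            if score >= threshold:
                first, second = all_questions[i], all_questions[j]
                rows.append({
                    "item1_id": first.item_id,
                    "item1_text": first.text,
                    "item2_id": second.item_id,
                    "item2_text": second.text,
                    "match_score": score,
                })
    return rows


# Toy input: three items and a made-up symmetric similarity matrix.
questions = [
    Item("gad_1", "Feeling nervous, anxious or on edge"),
    Item("gad_2", "Not being able to stop or control worrying"),
    Item("phq_1", "Little interest or pleasure in doing things"),
]
similarity = [
    [1.0, 0.8, 0.3],
    [0.8, 1.0, 0.4],
    [0.3, 0.4, 1.0],
]

crosswalk = generate_crosswalk_table(questions, similarity, threshold=0.7)
for row in crosswalk:
    print(row)
```

The list of dicts can be passed straight to `pandas.DataFrame(crosswalk)` to get the long-format data frame. Whether pairs from the same instrument should be excluded, and whether negative scores should be compared by absolute value, is left open here.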
See also equivalent issue in R: harmonydata/harmony_r#4