Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ad-hoc recognizers #677

Merged
merged 14 commits into from
May 5, 2021
Merged

Ad-hoc recognizers #677

merged 14 commits into from
May 5, 2021

Conversation

omri374
Copy link
Contributor

@omri374 omri374 commented Apr 22, 2021

Adding the option to add recognizers (regex/deny-list) only in the context of one request.
This allows user to define their own recognizers without having to write code nor redeploy the service, similar to the capability in V1.
Main difference from V1 is that there is no persistency of these recognizers and each request would have to contain the list of recognizers it needs for PII identification. The V1 recognizer store capability is in our future roadmap.

Added:

  • ad_hoc_recognizers parameter to AnalyzerEngine.analyze
  • docstring
  • unit tests
  • Parsing of the recognizers json in AnalyzerRequest
  • app.py update
  • e2e tests for both deny-list and regex based recognizers
  • Documentation (under adding recognizers)
  • api-doc.yaml (swagger spec)

@omri374 omri374 requested review from SharonHart, navalev, shiranr, balteravishay and itye-msft and removed request for SharonHart April 22, 2021 10:34
navalev
navalev previously approved these changes Apr 25, 2021
Copy link
Contributor

@navalev navalev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks great! Not sure about the name though, but can be discusssed offline

@SharonHart SharonHart linked an issue Apr 25, 2021 that may be closed by this pull request
@omri374
Copy link
Contributor Author

omri374 commented May 4, 2021

@navalev and @SharonHart please re-review. Added some tests and did some better exception handling.

@omri374
Copy link
Contributor Author

omri374 commented May 4, 2021

@navalev any suggestions for the name?

@navalev
Copy link
Contributor

navalev commented May 4, 2021

@navalev any suggestions for the name?

for lack of better name, lets use ad-hoc :)

navalev
navalev previously approved these changes May 4, 2021
@omri374 omri374 merged commit d5d9551 into main May 5, 2021
@omri374 omri374 deleted the feature/ad_hoc_recos branch May 5, 2021 07:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

How to run 'ad-hoc' regex patterns or word lists in v2?
3 participants