Version 0.2.0
- Removed notebooks (pseudonymization)
- Removed redundant classes FakerSpan, FakerSpanResult and updated code to use Span and InputSample respectively; changed SentenceFaker to inherit from Faker instead of using composition
- Removed functions from_faker_span, from_faker_spans_result and convert_faker_spans from InputSample, as faker spans are now Spans, so there is no need for translation
- Removed PresidioDataGenerator; use PresidioSentenceFaker instead
- Removed support for CRF models
- Removed the FlairTrainer class; please refer to the official Flair documentation for training Flair models
- Removed CRF as the package used is no longer maintained
- Improved evaluation notebooks: Notebook 4 shows a vanilla Presidio evaluation, notebook 5 shows a more customized Presidio with improved accuracy (#103)
- Removed the Pseudonymization notebook as there is a more advanced approach within Presidio (#103)
- Added the ability to use generic entities and skip words (#103)
- Added the ability to do faster batch predict (#103)
- Added sample_id to be able to reproduce the full sample (#103)
- Fixed issue with hospital provider networking (#103)
- Fixed translation of InputSample tags (#88)
- Fixed the Presidio wrapper to call predict with a language parameter (#79)
- Updated all classes inheriting from BaseModel, as the predict signature has changed (it now accepts **kwargs) (#92)
- Added Poetry instead of setup.py (#91)
- Renamed UsDriverLicenseProvider.driver_license to us_driver_license (#90)
- Removed redundant classes FakerSpan, FakerSpanResult and updated code to use Span and InputSample respectively instead (#72)
- Changed SentenceFaker to inherit from Faker instead of using composition (#72)
- Simplified the use of SentenceFaker in the default option (RecordGenerator is instantiated if records are passed, otherwise a SpanGenerator is instantiated) (#72)
- Updates to unit tests to support this change (#72)
- Updates to poetry to include the config in setup.cfg, setup.py, and pytest.ini (#72)
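
The predict signature change (#92) and the language parameter fix (#79) affect anyone who subclasses a model wrapper. The sketch below illustrates the general pattern only: ToyBaseModel, KeywordModel, and the NAMES table are hypothetical stand-ins written for this example, not classes from the package, and the real BaseModel may take different positional arguments. It shows how a predict method that accepts **kwargs can receive an optional language argument without breaking callers that omit it.

```python
from abc import ABC, abstractmethod
from typing import List


class ToyBaseModel(ABC):
    """Hypothetical stand-in for a base model class (not the package's BaseModel)."""

    @abstractmethod
    def predict(self, tokens: List[str], **kwargs) -> List[str]:
        """Return one tag per token; optional settings arrive through **kwargs."""


class KeywordModel(ToyBaseModel):
    """Tags tokens found in a per-language name list as PERSON, everything else as O."""

    # Illustrative data only.
    NAMES = {"en": {"alice", "bob"}, "es": {"lucía", "mateo"}}

    def predict(self, tokens: List[str], **kwargs) -> List[str]:
        # Read the optional language; fall back to English when it is not passed.
        language = kwargs.get("language", "en")
        names = self.NAMES.get(language, set())
        return ["PERSON" if token.lower() in names else "O" for token in tokens]


model = KeywordModel()
tags_en = model.predict(["Alice", "met", "Mateo"])                 # default language
tags_es = model.predict(["Alice", "met", "Mateo"], language="es")  # explicit language
print(tags_en)  # ['PERSON', 'O', 'O']
print(tags_es)  # ['O', 'O', 'PERSON']
```

Because unknown keyword arguments are simply ignored by this toy subclass, a caller can pass extra options without every model having to declare them. Whether the package's own models ignore or validate unknown keywords is not stated in these notes.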