Update dependency charset-normalizer to v2.1.0 #11

renovate · 2022-07-12T02:02:43Z

This PR contains the following updates:

Package	Change	Age	Adoption	Passing	Confidence
charset-normalizer	`==2.0.4` -> `==2.1.0`

Release Notes

ousret/charset_normalizer

`v2.1.0`

Compare Source

Added

Output the Unicode table version when running the CLI with --version (PR #194)

Changed

Re-use decoded buffer for single byte character sets from @nijel (PR #175)
Fixing some performance bottlenecks from @deedy5 (PR #183)

Fixed

Workaround potential bug in cpython with Zero Width No-Break Space located in Arabic Presentation Forms-B, Unicode 1.1 not acknowledged as space (PR #175)
CLI default threshold aligned with the API threshold from @oleksandr-kuzmenko (PR #181)

Removed

Support for Python 3.5 (PR #192)

Deprecated

Use of backport unicodedata from unicodedata2 as Python is quickly catching up, scheduled for removal in 3.0 (PR #194)

`v2.0.12`

Compare Source

Fixed

ASCII miss-detection on rare cases (PR #170)

`v2.0.11`

Compare Source

Added

Explicit support for Python 3.11 (PR #164)

Changed

The logging behavior have been completely reviewed, now using only TRACE and DEBUG levels (PR #163 #165)

`v2.0.10`

Compare Source

Fixed

Fallback match entries might lead to UnicodeDecodeError for large bytes sequence (PR #154)

Changed

Skipping the language-detection (CD) on ASCII (PR #155)

`v2.0.9`

Compare Source

Changed

Moderating the logging impact (since 2.0.8) for specific environments (PR #147)

Fixed

Wrong logging level applied when setting kwarg explain to True (PR #146)

`v2.0.8`

Compare Source

Changed

Improvement over Vietnamese detection (PR #126)
MD improvement on trailing data and long foreign (non-pure latin) data (PR #124)
Efficiency improvements in cd/alphabet_languages from @adbar (PR #122)
call sum() without an intermediary list following PEP 289 recommendations from @adbar (PR #129)
Code style as refactored by Sourcery-AI (PR #131)
Minor adjustment on the MD around european words (PR #133)
Remove and replace SRTs from assets / tests (PR #139)
Initialize the library logger with a NullHandler by default from @nmaynes (PR #135)
Setting kwarg explain to True will add provisionally (bounded to function lifespan) a specific stream handler (PR #135)

Fixed

Fix large (misleading) sequence giving UnicodeDecodeError (PR #137)
Avoid using too insignificant chunk (PR #137)

Added

Add and expose function set_logging_handler to configure a specific StreamHandler from @nmaynes (PR #135)
Add CHANGELOG.md entries, format is based on Keep a Changelog (PR #141)

`v2.0.7`

Compare Source

Added

Add support for Kazakh (Cyrillic) language detection (PR #109)

Changed

Further, improve inferring the language from a given single-byte code page (PR #112)
Vainly trying to leverage PEP263 when PEP3120 is not supported (PR #116)
Refactoring for potential performance improvements in loops from @adbar (PR #113)
Various detection improvement (MD+CD) (PR #117)

Removed

Remove redundant logging entry about detected language(s) (PR #115)

Fixed

Fix a minor inconsistency between Python 3.5 and other versions regarding language detection (PR #117 #102)

`v2.0.6`

Compare Source

Fixed

Unforeseen regression with the loss of the backward-compatibility with some older minor of Python 3.5.x (PR #100)
Fix CLI crash when using --minimal output in certain cases (PR #103)

Changed

Minor improvement to the detection efficiency (less than 1%) (PR #106 #101)

`v2.0.5`

Compare Source

Changed

The project now comply with: flake8, mypy, isort and black to ensure a better overall quality (PR #81)
The BC-support with v1.x was improved, the old staticmethods are restored (PR #82)
The Unicode detection is slightly improved (PR #93)
Add syntax sugar __bool__ for results CharsetMatches list-container (PR #91)

Removed

The project no longer raise warning on tiny content given for detection, will be simply logged as warning instead (PR #92)

Fixed

In some rare case, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection (PR #95)
Some rare 'space' characters could trip up the UnprintablePlugin/Mess detection (PR #96)
The MANIFEST.in was not exhaustive (PR #78)

Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.

If you want to rebase/retry this PR, click this checkbox.

This PR has been generated by Mend Renovate. View repository job log here.

Update dependency charset-normalizer to v2.1.0

c92a158

renovate bot force-pushed the renovate/charset-normalizer-2.x branch from 99477de to c92a158 Compare July 12, 2022 02:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update dependency charset-normalizer to v2.1.0 #11

Update dependency charset-normalizer to v2.1.0 #11

renovate bot commented Jul 12, 2022

Update dependency charset-normalizer to v2.1.0 #11

Are you sure you want to change the base?

Update dependency charset-normalizer to v2.1.0 #11

Conversation

renovate bot commented Jul 12, 2022

Release Notes

Added

Changed

Fixed

Removed

Deprecated

Fixed

Added

Changed

Fixed

Changed

Changed

Fixed

Changed

Fixed

Added

Added

Changed

Removed

Fixed

Fixed

Changed

Changed

Removed

Fixed

Configuration