Update dependency charset-normalizer to v2.1.0 #11

Status: Open, 1 commit to merge into base branch main

@renovate renovate bot commented Jul 12, 2022

This PR contains the following updates:

Package: charset-normalizer
Change: ==2.0.4 -> ==2.1.0

Release Notes

ousret/charset_normalizer

v2.1.0

Added
  • Output the Unicode table version when running the CLI with --version (PR #​194)
Changed
Fixed
  • Work around a potential bug in CPython: Zero Width No-Break Space (U+FEFF), located in Arabic Presentation Forms-B and present since Unicode 1.1, is not acknowledged as a space (PR #175)
  • CLI default threshold aligned with the API threshold from @​oleksandr-kuzmenko (PR #​181)
Removed
  • Support for Python 3.5 (PR #​192)
Deprecated
  • Use of the unicodedata backport from unicodedata2, as Python is quickly catching up; scheduled for removal in 3.0 (PR #194)
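
The two Unicode-related entries in this release can be inspected with the standard library alone. The snippet below is a minimal sketch and not charset-normalizer's own code: it reads the Unicode table version bundled with the interpreter (that the CLI reports this same value with --version is an assumption) and shows how CPython classifies U+FEFF.

```python
import unicodedata

# Version of the Unicode database bundled with this interpreter.
# Assumption: this is the kind of value the CLI prints with --version.
unicode_version = unicodedata.unidata_version

# U+FEFF ZERO WIDTH NO-BREAK SPACE lives in Arabic Presentation Forms-B.
zwnbsp = "\ufeff"
name = unicodedata.name(zwnbsp)
category = unicodedata.category(zwnbsp)  # "Cf": a format character
treated_as_space = zwnbsp.isspace()      # CPython does not count it as a space

print(unicode_version, name, category, treated_as_space)
```

Because `str.isspace()` returns False here, code that relies on it to skip whitespace needs to treat U+FEFF as a special case.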

v2.0.12

Fixed
  • ASCII misdetection in rare cases (PR #170)

v2.0.11

Added
  • Explicit support for Python 3.11 (PR #​164)
Changed
  • The logging behavior has been completely reviewed and now uses only the TRACE and DEBUG levels (PR #163 #165)
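
For readers unfamiliar with a TRACE level, which Python's logging module does not define out of the box, here is a minimal sketch of how a library might register one below DEBUG. The logger name and the numeric value 5 are assumptions made for illustration, not confirmed details of charset-normalizer.

```python
import io
import logging

TRACE = 5  # assumed value; anything below logging.DEBUG (10) would do
logging.addLevelName(TRACE, "TRACE")

logger = logging.getLogger("example_detector")  # hypothetical logger name
logger.setLevel(TRACE)
logger.propagate = False  # keep the demo independent of root logger config

buffer = io.StringIO()
handler = logging.StreamHandler(buffer)
handler.setFormatter(logging.Formatter("%(levelname)s:%(message)s"))
logger.addHandler(handler)

# Very chatty details go to TRACE, summaries go to DEBUG.
logger.log(TRACE, "trying code page %s", "cp1252")
logger.debug("best match so far: %s", "utf_8")

output = buffer.getvalue()
print(output, end="")
```

With the logger set to DEBUG instead of TRACE, only the second message would be emitted.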

v2.0.10

Fixed
  • Fallback match entries might lead to UnicodeDecodeError for large byte sequences (PR #154)
Changed
  • Skipping the language-detection (CD) on ASCII (PR #​155)

v2.0.9

Changed
  • Moderating the logging impact (since 2.0.8) for specific environments (PR #​147)
Fixed
  • Wrong logging level applied when setting kwarg explain to True (PR #​146)

v2.0.8

Changed
  • Improvement over Vietnamese detection (PR #​126)
  • Mess detection (MD) improvement on trailing data and long foreign (non-pure Latin) data (PR #124)
  • Efficiency improvements in cd/alphabet_languages from @adbar (PR #122)
  • Call sum() without an intermediary list, following PEP 289 recommendations, from @adbar (PR #129)
  • Code style as refactored by Sourcery-AI (PR #131)
  • Minor adjustment to the MD around European words (PR #133)
  • Remove and replace SRTs from assets / tests (PR #​139)
  • Initialize the library logger with a NullHandler by default from @​nmaynes (PR #​135)
  • Setting the kwarg explain to True temporarily adds a specific stream handler, bounded to the function's lifespan (PR #135)
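
The last two entries describe a common logging pattern for libraries: stay silent by default, and attach a handler only while a caller asks for an explanation. The following is a sketch of that pattern under stated assumptions; the logger name, the `detect` function and its `stream` parameter are hypothetical and do not reproduce the library's API.

```python
import io
import logging

# Hypothetical library logger; silent by default thanks to NullHandler.
logger = logging.getLogger("example_library")
logger.addHandler(logging.NullHandler())
logger.propagate = False  # keeps this demo independent of root logger config


def detect(payload: bytes, explain: bool = False, stream=None) -> str:
    """Toy stand-in for a detection call; only the logging setup matters."""
    handler = None
    previous_level = logger.level
    if explain:
        # Temporary handler, attached only for the duration of this call.
        handler = logging.StreamHandler(stream)  # stream=None means stderr
        handler.setFormatter(logging.Formatter("%(levelname)s | %(message)s"))
        logger.addHandler(handler)
        logger.setLevel(logging.DEBUG)
    try:
        logger.debug("received %d bytes", len(payload))
        return "ascii" if payload.isascii() else "unknown"
    finally:
        # Always restore the logger, even if detection raises.
        if handler is not None:
            logger.removeHandler(handler)
            logger.setLevel(previous_level)


captured = io.StringIO()
detect(b"hello", explain=True, stream=captured)
print(captured.getvalue(), end="")
```

Restoring the logger in a `finally` block is what keeps the extra handler bounded to the function call, so later calls without `explain` stay quiet.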
Fixed
  • Fix large (misleading) sequence giving UnicodeDecodeError (PR #​137)
  • Avoid using chunks that are too insignificant (PR #137)
Added

v2.0.7

Added
  • Add support for Kazakh (Cyrillic) language detection (PR #​109)
Changed
  • Further improve inferring the language from a given single-byte code page (PR #112)
  • Vainly trying to leverage PEP263 when PEP3120 is not supported (PR #​116)
  • Refactoring for potential performance improvements in loops from @​adbar (PR #​113)
  • Various detection improvements (MD+CD) (PR #117)
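
As background for the PEP 263 entry above: PEP 263 lets a Python source file declare its encoding in a comment on the first or second line, and PEP 3120 makes UTF-8 the default when no declaration is present. The sketch below uses the regular expression given in PEP 263 to read such a declaration; the function name is hypothetical and this is not how the library itself is implemented.

```python
import re
from typing import Optional

# Pattern given in PEP 263 for the encoding declaration comment.
CODING_RE = re.compile(rb"^[ \t\f]*#.*?coding[:=][ \t]*([-_.a-zA-Z0-9]+)")


def declared_encoding(source: bytes) -> Optional[str]:
    """Return the encoding declared in the first two lines, or None."""
    for line in source.splitlines()[:2]:
        match = CODING_RE.match(line)
        if match:
            return match.group(1).decode("ascii")
    return None


sample = b"#!/usr/bin/env python\n# -*- coding: latin-1 -*-\nprint('x')\n"
print(declared_encoding(sample))
```

The standard library offers `tokenize.detect_encoding` for the same purpose, which also handles a leading byte order mark.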
Removed
  • Remove redundant logging entry about detected language(s) (PR #​115)
Fixed
  • Fix a minor inconsistency between Python 3.5 and other versions regarding language detection (PR #​117 #​102)

v2.0.6

Fixed
  • Unforeseen regression with the loss of backward compatibility with some older minor versions of Python 3.5.x (PR #100)
  • Fix CLI crash when using --minimal output in certain cases (PR #​103)
Changed
  • Minor improvement to the detection efficiency (less than 1%) (PR #​106 #​101)

v2.0.5

Changed
  • The project now complies with flake8, mypy, isort and black to ensure better overall quality (PR #81)
  • Backward-compatibility (BC) support with v1.x was improved; the old staticmethods are restored (PR #82)
  • The Unicode detection is slightly improved (PR #​93)
  • Add syntax sugar __bool__ for results CharsetMatches list-container (PR #​91)
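
The `__bool__` entry means a results container can be tested directly in an `if` statement. A minimal sketch of the idea follows; the `Matches` class is a hypothetical stand-in and not the library's CharsetMatches implementation.

```python
from typing import List


class Matches:
    """Hypothetical results container; illustrates __bool__ only."""

    def __init__(self, results: List[str]) -> None:
        self._results = list(results)

    def __bool__(self) -> bool:
        # Truthy when at least one candidate encoding was found.
        return len(self._results) > 0

    def best(self) -> str:
        return self._results[0]


found = Matches(["utf_8", "cp1252"])
empty = Matches([])

if found:
    print("best guess:", found.best())
if not empty:
    print("no match")
```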
Removed
  • The project no longer raises a warning on tiny content given for detection; it is simply logged as a warning instead (PR #92)
Fixed
  • In some rare cases, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection (PR #95)
  • Some rare 'space' characters could trip up the UnprintablePlugin/Mess detection (PR #​96)
  • The MANIFEST.in was not exhaustive (PR #​78)
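
To illustrate the multi-byte issue from the first fix above: slicing encoded bytes at a fixed offset can split a character, which then fails to decode or decodes as noise. The sketch below reproduces the problem with UTF-8 and shows one way to avoid it using the standard library's incremental decoder. This is a swapped-in technique for illustration and not the fix applied in the library; `decode_in_chunks` is a hypothetical helper.

```python
import codecs

payload = "héllo wörld".encode("utf-8")

# A naive fixed-size cut lands inside the two-byte sequence for "é".
naive_chunk = payload[:2]  # b"h\xc3": lead byte without its continuation
try:
    naive_chunk.decode("utf-8")
    naive_ok = True
except UnicodeDecodeError:
    naive_ok = False


def decode_in_chunks(data: bytes, size: int) -> str:
    """Decode fixed-size chunks while carrying partial characters over."""
    decoder = codecs.getincrementaldecoder("utf-8")()
    parts = []
    for start in range(0, len(data), size):
        parts.append(decoder.decode(data[start:start + size]))
    parts.append(decoder.decode(b"", final=True))  # flush, raises if truncated
    return "".join(parts)


print(naive_ok, decode_in_chunks(payload, 2))
```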

Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, click this checkbox.

This PR has been generated by Mend Renovate. View repository job log here.

@renovate renovate bot force-pushed the renovate/charset-normalizer-2.x branch from 99477de to c92a158 Compare July 12, 2022 02:03