Releases: rspeer/python-ftfy

v6.3.1

26 Oct 00:51
Version number updates only.

v6.3.0

11 Oct 04:54
  • Switched packaging from poetry to uv.
  • Uses modern Python packaging exclusively (no setup.py).
  • Added support for mojibake in Windows-1257 (Baltic).
  • Detects mojibake for "Ü" in an uppercase word, such as "ZURÜCK".
  • Expanded a heuristic that notices improbable punctuation.
  • Fixed a false positive involving two concatenated strings, one of which began with the § sign.
  • Rewrote chardata.py to be more human-readable and debuggable, instead of being full of keysmash-like character sets.
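For readers unfamiliar with the term, the kind of Windows-1257 mojibake mentioned above can be reproduced with the standard library alone. This is only an illustrative sketch: it does not call ftfy, the sample word is our own choice, and ftfy's actual detection relies on heuristics rather than a blind round trip.

```python
# Illustration only: standard-library codecs, not ftfy's own logic.
original = "žuvis"  # sample Lithuanian word, chosen for this example

# Mojibake appears when UTF-8 bytes are decoded with the wrong codec,
# here Windows-1257 (Python's "cp1257").
garbled = original.encode("utf-8").decode("cp1257")

# If no bytes were lost along the way, reversing both steps restores the text.
restored = garbled.encode("cp1257").decode("utf-8")

print(repr(garbled))
print(restored == original)
```

The reversal only works when every byte survives the wrong decoding; some byte values are undefined in Windows-1257, which is one reason a dedicated tool is useful.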

See CHANGELOG.md for the full changelog.

Trusted Publishing is now supposed to create these releases on GitHub at the same time that it publishes to PyPI, following the user guide. It didn't, but it's supposed to.

I think I've fixed the problem (upgrading to sigstore/gh-action-sigstore-python@v3.0.0 from the broken v2.1.1), and maybe future releases really will be as simple as pushing a tag.

v6.2.3

06 Aug 01:36

See CHANGELOG.md for version changes.

Can you tell that I'm creating these releases manually? I've set up a GitHub action that publishes to PyPI, which is reasonably well documented, but I can't find one that creates a release here on GitHub with the Python package included. Please let me know (or make a PR) if you know how.

v6.0.3

23 Aug 21:02

Updates in 6.0.x:

  • New function: ftfy.fix_and_explain() can describe all the transformations that happen when fixing a string. This is similar to what ftfy.fixes.fix_encoding_and_explain() did in previous versions, but it can fix more than the encoding.
  • fix_and_explain() and fix_encoding_and_explain() are now in the top-level ftfy module.
  • Changed the heuristic entirely. ftfy no longer needs to categorize every Unicode character, but only characters that are expected to appear in mojibake.
  • Because of the new heuristic, ftfy will no longer have to release a new version for every new version of Unicode. It should also run faster and use less RAM when imported.
  • The heuristic ftfy.badness.is_bad(text) can be used to determine whether there appears to be mojibake in a string. Some users were already using the old function sequence_weirdness() for that, but this one is actually designed for that purpose.
  • Instead of a pile of named keyword arguments, ftfy functions now take in a TextFixerConfig object. The keyword arguments still work, and become settings that override the defaults in TextFixerConfig.
  • Added support for UTF-8 mixups with Windows-1253 and Windows-1254.
  • Overhauled the documentation: https://ftfy.readthedocs.org
  • Requires Python 3.6 or later.

v5.5.1

12 Mar 21:18

See CHANGELOG.md for release notes.