Skip to content

Releases: openzim/python-scraperlib

2.0.0

06 Dec 15:12
Compare
Choose a tag to compare
  • Using pylibzim v2.1.0 (using libzim 8.1.0)

Added

  • [libzim] Entry.get_redirect_entry()
  • [libzim] Item.get_indexdata() to implement custom IndexData per entry (writer)
  • [libzim] Archive.media_count

Changed

  • [libzim] Archive.article_count updated to match scraperlib's version
  • Archive.article_counter now deprecated. Now returns Archive.article_count
  • Archive.media_counter now deprecated. Now returns Archive.media_count

Removed

  • [libzim] lzma compression algorithm

1.8.0

03 Oct 11:18
Compare
Choose a tag to compare

Added

  • download.get_session() to build a new requests Session

Changed

  • download.stream_file() accepts a session param to use instead of creating one

1.7.0

03 Oct 11:15
Compare
Choose a tag to compare

Added

  • zim.Creator now supports ignore_duplicates: bool parameter to
    prevent duplicates from raising exceptions
  • zim.Creator.add_item, zim.Creator.add_redirect and zim.Creator.add_item_for
    now supports a duplicate_ok: bool parameter to prevent an exception
    should this item/redirect be a duplicate

1.6.3

03 Oct 11:15
Compare
Choose a tag to compare

Added

  • download.stream_file() supports passing headers (scrapers were already using it)

1.6.2

03 Oct 11:14
Compare
Choose a tag to compare

Changed

  • Fixed filesystem.get_content_mimetype() crashing on non-guessable byte stream

1.6.1

03 Oct 11:14
Compare
Choose a tag to compare

Changed

  • Wider range of accepted lxml dependency version as 4.9.1 fixes a security issue

1.6.0

11 Jun 10:39
Compare
Choose a tag to compare

Added

  • Archive.get_metadata_item() to retrieve full item instead of just value

Changed

  • Using pylibzim v1.1.0 (using libzim 7.2.1)
    • Adding duplicate entries now raises RuntimeError
    • filesize is fixed for larger ZIMs