
Releases: cligs/textbox

Almost Summer Release

28 May 14:48
1edeb51

In this release, the existing collections have been updated to be more consistent and to improve validation. The main changes are:

  • The keywords in the text classification section of the TEI header have been normalized and hierarchized in all the collections.
  • A TEI keyword list and a Schematron file controlling term values have been created for each collection.
  • The schemas for TEI master files and annotated versions have been merged into a single, common schema for all TEI files. See also https://github.com/cligs/reference where the schema files are hosted.
  • In the schema and TEI files, a CLiGS namespace has been introduced for:
    • a CLiGS-specific attribute, importance, used to indicate the importance of each genre assignment when a text has several different assignments
    • non-TEI sentence- and word-level attributes resulting from NLP annotations with FreeLing
  • Folder names have been adjusted to be consistent for all collections in the textbox.
  • For details, see the version history in the "next" branch: https://github.com/cligs/textbox/commits/next.
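
As a rough illustration of how the namespaced importance attribute might be read from a TEI header, here is a minimal Python sketch using only the standard library. Several things in it are assumptions and not taken from the release: the CLiGS namespace URI is a placeholder (the actual URI is defined in the schema files in cligs/reference), the sample header, the term type, the term values and the numeric importance values are invented for the example, and the function name genre_assignments is hypothetical.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
# Assumption: placeholder URI only. Look up the real CLiGS namespace
# in the schema files hosted at https://github.com/cligs/reference.
CLIGS_NS = "https://example.org/ns/cligs"

# Hypothetical header fragment; term types and values are invented.
SAMPLE = f"""<TEI xmlns="{TEI_NS}" xmlns:cligs="{CLIGS_NS}">
  <teiHeader>
    <profileDesc>
      <textClass>
        <keywords>
          <term type="genre" cligs:importance="1">historical</term>
          <term type="genre" cligs:importance="2">sentimental</term>
          <term type="form">prose</term>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
</TEI>"""


def genre_assignments(xml_text):
    """Return (term text, importance) pairs for every TEI term element
    that carries the namespaced importance attribute."""
    root = ET.fromstring(xml_text)
    # ElementTree addresses namespaced names as "{uri}localname".
    attr = f"{{{CLIGS_NS}}}importance"
    pairs = []
    for term in root.iter(f"{{{TEI_NS}}}term"):
        value = term.get(attr)
        if value is not None:
            pairs.append((term.text, value))
    return pairs


if __name__ == "__main__":
    for text, importance in genre_assignments(SAMPLE):
        print(importance, text)
```

Terms without the attribute are skipped, so the same loop can run over headers that mix weighted genre assignments with other keywords.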

Advent Release

05 Dec 06:53

This new release contains:

  • A new collection of Italian Short Stories and Novellas (1880s-1920s)
  • A new collection of Italian Novels (1850 and 1890)
  • A new collection of Spanish Short Stories from 1880-1940
  • 15 new novels in the Corpus of Spanish Novels from 1880-1940

Spring is coming release.

10 Mar 21:51

This release notably includes the following changes:

  • The "romancesportuguesas" collection has been added
  • The "theatre17" collection has been added
  • Some unsuitable texts have been removed (as a result, this release is not backwards compatible with previous releases)
  • Some of the ZIP versions have been removed (they are hard to keep up to date)
  • For details, see the version history here: https://github.com/cligs/textbox/commits/next

New Year release.

20 Jan 13:54

The CLiGS "textbox" contains collections of literary texts written in Spanish and French and encoded according to the Guidelines of the Text Encoding Initiative. This release makes four collections available.