Piculet

Piculet is a module for extracting data from XML or HTML documents using XPath queries. It consists of a single source file with no dependencies other than the standard library. If available, it will make use of the lxml package for improved performance and better XPath support.

Piculet is used for the parsers of the Cinemagoer project.

Getting started

Piculet works with Python 3.8 and later versions. You can install it using pip:

pip install piculet

Installing Piculet creates a script named piculet which can be used to invoke the command line interface:

$ piculet -h
usage: piculet [-h] [--version] [--html] -s SPEC [document]

For example, say you want to extract some data from the file shining.html. An example specification is given in movie.json. Download both of these files and run the command:

$ piculet -s movie.json shining.html

Getting help

The documentation is available on: https://piculet.readthedocs.io/

The source code can be obtained from: https://github.com/uyar/piculet

License

Piculet is released under the LGPL license, version 3 or later. Read the included LICENSE.txt file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 876 Commits
docs		docs
examples		examples
tests		tests
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
CHANGES.rst		CHANGES.rst
LICENSE.txt		LICENSE.txt
README.rst		README.rst
piculet.py		piculet.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Piculet

Getting started

Getting help

License

About

Releases

Packages

Contributors 2

Languages

License

uyar/piculet

Folders and files

Latest commit

History

Repository files navigation

Piculet

Getting started

Getting help

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages