This repository is a binding of Any2Json for the Python language.
In today's data-driven landscape, semi-structured documents pose a significant challenge for organizations. These documents come in diverse formats with little standardization, and they often require specialized skills to manipulate and analyze. Any2Json is a framework that addresses this challenge. It combines dedicated algorithms with machine learning techniques and gives you control over the data extraction process through tweakable, repeatable settings. Automating extraction saves time and reduces errors, which is particularly beneficial for industries that handle large volumes of such documents.

The framework also integrates with machine learning workflows, opening up possibilities for data enrichment and predictive modeling. Because it relies on deterministic algorithms, it is well suited to preparing data for training in a predictable and reproducible manner. Aligned with the data-as-a-service paradigm, it offers a scalable and efficient way to manage semi-structured data, expanding the toolkit of data services available to organizations.
Visit our full documentation to learn more about how it works, try our tutorials, and find a full list of plugins and models.
- The Java Development Kit (JDK), version 17.
- Python 3.8.2 or above.
- Pip 20.0.2 or above.
- Poetry 1.7.1 or above.
- Just 1.24.0 or above.
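As a quick sanity check before installing, the requirements above can be partly verified from Python. This is a minimal sketch, not part of PyAny2Json: the helper names are hypothetical, and it assumes the tools are exposed on your PATH as `java`, `pip`, `poetry`, and `just`. It only checks that each tool is present and that the interpreter is recent enough; it does not check the versions of the external tools.

```python
import shutil
import sys

# Minimum Python version taken from the requirements list above.
MIN_PYTHON = (3, 8, 2)

# Assumption: the tools are on PATH under these command names.
# Adjust them if your system uses different names.
REQUIRED_TOOLS = ("java", "pip", "poetry", "just")


def python_is_recent_enough(version_info=None, minimum=MIN_PYTHON):
    """Return True if the given (or current) Python version meets the minimum."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:3]) >= minimum


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names of the tools that cannot be found on PATH."""
    return [name for name in tools if which(name) is None]


if __name__ == "__main__":
    if not python_is_recent_enough():
        print("Python 3.8.2 or above is required.")
    for name in missing_tools():
        print(f"Not found on PATH: {name}")
```

If the script prints nothing, the interpreter is recent enough and all four commands were found. Tool versions still need to be checked by hand.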
Run the following command line:
pip install pyany2json
Before using the package, you need to download the JAR files into your project. Run the following command line:
python pyany2json/setup.py
To run tutorial 1:
python examples/tutorial1.py
The following links provide background information, walk you through some implementation details, and then give step-by-step instructions for getting the most out of Any2Json:
- Using PyAny2Json: here.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.
- Romuald Rousseau, romuald.rousseau@servier.com
- 0.1.1
- 0.1.0
    - Initial Release