This repo is a demo for handling hierarchical data tables that span multiple pages without repeating the header. It uses LangChain, semantic chunking, Azure Document Intelligence, and Azure AI Search. Insert your own file and try the lab.
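To make the problem concrete, here is a minimal pure-Python sketch of what "a table continuing onto later pages without a header" means. It is only an illustration, not the notebook's method: the function name `merge_table_fragments`, the list-of-rows input format, and the rule that a blank first cell inherits the previous parent value are all assumptions made for this example.

```python
def merge_table_fragments(pages):
    """Merge per-page table fragments into one list of records.

    Assumptions (hypothetical, for illustration only):
    - `pages` is a list of pages, each page a list of rows, each row a list of strings.
    - Only the first page carries the header row.
    - A blank first cell means "same parent group as the previous row".
    """
    if not pages or not pages[0]:
        return []
    header = pages[0][0]
    merged = []
    parent = ""
    for page_index, page in enumerate(pages):
        # Skip the header on the first page; later pages have no header to skip.
        rows = page[1:] if page_index == 0 else page
        for row in rows:
            if not row:
                continue
            # Carry the parent value forward across rows and page breaks.
            parent = row[0] or parent
            merged.append(dict(zip(header, [parent, *row[1:]])))
    return merged


if __name__ == "__main__":
    sample_pages = [
        [["Region", "City", "Sales"], ["North", "Oslo", "10"], ["", "Bergen", "7"]],
        [["", "Tromso", "3"], ["South", "Rome", "12"]],  # continuation page, no header
    ]
    for record in merge_table_fragments(sample_pages):
        print(record)
```

In the sample, the row for Tromso sits on the second page with no header and no region, yet it is still resolved to the `North` group because both the header and the parent value are carried forward.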
Before running the notebook, make sure you have the following installed:
- Python: Install the latest version of Python from the official website.
- Jupyter Notebook: Install Jupyter Notebook using the following command in PowerShell:
pip install jupyter
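If you want to confirm the prerequisites before going further, a small check along these lines may help. It is a sketch under assumptions: the minimum Python version `(3, 9)` is a placeholder (the README only asks for the latest release), and the `notebook` module is used as a stand-in for "Jupyter Notebook is installed".

```python
import importlib.util
import sys

# Hypothetical minimum version; adjust to whatever the notebook actually needs.
MIN_PYTHON = (3, 9)


def check_prerequisites(min_python=MIN_PYTHON, modules=("notebook",)):
    """Return a list of problems found; an empty list means every check passed."""
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted} or newer is required.")
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append(f"Module '{name}' was not found; it may need to be installed with pip.")
    return problems


if __name__ == "__main__":
    for line in check_prerequisites() or ["All prerequisites found."]:
        print(line)
```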
To replace multi_page_table.pdf with your own document, follow these steps:
- Open the existing document.
- Locate the multi_page_table section.
- Replace the multi_page_table.pdf path with the path to your own document, and update any references to the file name.
- Save the changes.
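When swapping in your own file, it can help to validate the path before running the rest of the notebook. The sketch below assumes the document location is held in a plain string; `DEFAULT_DOCUMENT` and `resolve_document` are hypothetical names for this example and may not match what the notebook uses.

```python
from pathlib import Path

# File name taken from this README; replace it with the path to your own document.
DEFAULT_DOCUMENT = "multi_page_table.pdf"


def resolve_document(path=DEFAULT_DOCUMENT):
    """Return the absolute path to the document, or raise if the file is missing."""
    doc = Path(path).expanduser().resolve()
    if not doc.is_file():
        raise FileNotFoundError(f"Document not found: {doc}")
    return doc
```

Calling `resolve_document("my_report.pdf")` early in the notebook surfaces a wrong path immediately, instead of partway through processing.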
- Clone the repository:
git clone <repository_url>
- Navigate to the project directory:
cd <project_directory>
- Create a virtual environment (optional but recommended):
python -m venv venv
- Activate the virtual environment:
.\venv\Scripts\activate
- Install the required dependencies:
pip install -r requirements.txt
- Install Jupyter Notebook (if it is not already installed):
pip install jupyter
- Launch Jupyter Notebook:
jupyter notebook
- In your web browser, navigate to the notebook file (lab.ipynb) and open it.
- Run the notebook cells one by one to execute the demo, replacing the document location with the path to your own file where needed.
That's it! You have successfully run the demo Python notebook in Windows PowerShell.