This project is an Information Retrieval (IR) System built with Spring Boot. It provides functionality for processing, analyzing, and retrieving information from documents, particularly PDFs.
If you find this code useful, please consider starring this repository!
- /src: Source code directory
  - /main/java: Java source files
  - /main/resources: Configuration files and resources
  - /test: Test files
- /pdfs: Directory for PDF documents
- /output: Directory for generated output files
- /.mvn, mvnw, mvnw.cmd: Maven wrapper files
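As a rough illustration of how an application might pick up documents from the PDF directory listed above, here is a minimal sketch in plain Java. The class `PdfDirectoryScanSketch` and its method `listPdfs` are hypothetical names and are not taken from this repository. The sketch also assumes the directory is read non-recursively and that files are identified by their `.pdf` extension.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical helper, not part of this repository. */
final class PdfDirectoryScanSketch {

    /** Returns the regular files directly inside dir whose names end in .pdf (any case), sorted by path. */
    static List<Path> listPdfs(Path dir) throws IOException {
        // Files.list holds an open directory handle, so close it with try-with-resources.
        try (Stream<Path> entries = Files.list(dir)) {
            return entries
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString()
                            .toLowerCase(Locale.ROOT).endsWith(".pdf"))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the directory defaults to "pdfs" relative to the working directory.
        Path dir = Path.of(args.length > 0 ? args[0] : "pdfs");
        if (!Files.isDirectory(dir)) {
            System.err.println("Not a directory: " + dir);
            return;
        }
        for (Path pdf : listPdfs(dir)) {
            System.out.println(pdf);
        }
    }
}
```

Run from the project root, this prints one path per PDF found in the directory. The actual application may locate its input files differently.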
- Java 24
- Maven
- Clone the repository
- Build the project using Maven:
./mvnw clean install
or on Windows:
mvnw.cmd clean install
- Run the application as a Spring Boot app:
./mvnw spring-boot:run
- Document processing and analysis
- Information retrieval capabilities
- PDF handling functionality using Apache PDFBox
- Web-based interface using Thymeleaf templates
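This README does not describe how retrieval is implemented, so the following is only a sketch of one common technique, not the project's actual method. It shows a tiny in-memory inverted index with boolean AND search, using only the Java standard library. The class `InvertedIndexSketch` and its methods are hypothetical names. The document text is assumed to have been extracted from the PDFs already.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical helper, not part of this repository: a tiny in-memory inverted index. */
final class InvertedIndexSketch {

    // term -> sorted set of names of the documents that contain the term
    private final Map<String, Set<String>> postings = new HashMap<>();

    /** Splits text into lowercase ASCII alphanumeric terms. */
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String raw : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!raw.isEmpty()) {
                terms.add(raw);
            }
        }
        return terms;
    }

    /** Registers every term of the given text under the document name. */
    void add(String docName, String text) {
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docName);
        }
    }

    /** Returns the documents that contain all query terms (boolean AND), sorted by name. */
    List<String> search(String query) {
        Set<String> result = null;
        for (String term : tokenize(query)) {
            Set<String> docs = postings.getOrDefault(term, Set.of());
            if (result == null) {
                result = new TreeSet<>(docs);   // first term: start from its posting list
            } else {
                result.retainAll(docs);         // later terms: intersect
            }
        }
        return result == null ? List.of() : new ArrayList<>(result);
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add("intro.pdf", "Spring Boot makes web applications simple");
        index.add("search.pdf", "Information retrieval with an inverted index");
        index.add("both.pdf", "Information retrieval in a Spring Boot application");

        System.out.println(index.search("spring boot"));           // [both.pdf, intro.pdf]
        System.out.println(index.search("information retrieval")); // [both.pdf, search.pdf]
    }
}
```

An inverted index keeps lookups proportional to the number of query terms rather than the number of documents. Ranking schemes such as TF-IDF can be layered on top of the same term-to-documents map, but none is shown here.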
This project is developed using:
- Spring Boot 3.5.0
- Thymeleaf for server-side templating
- Apache PDFBox 2.0.27 for PDF processing
- Java 24
Spring Boot DevTools is included for a smoother development experience, with features such as:
- Automatic restarts when code changes
- LiveReload support that automatically refreshes the browser
- Enhanced development-time property defaults