Integrates AWS Bedrock's multimodal capabilities (Claude 3) into the Docling framework for generating image descriptions within document processing pipelines.
-
Updated
Apr 28, 2025 - Python
Integrates AWS Bedrock's multimodal capabilities (Claude 3) into the Docling framework for generating image descriptions within document processing pipelines.
A serverless solution to streamline ESG compliance using AI-driven automation. Built with the AWS CDK (Python), Amazon Textract, Amazon Bedrock, and other AWS services to process and analyse compliance reports.
pRISM is a repository that combines Retrieval-Augmented Generation (RAG) with a multi-LLM voting approach to create accurate and reliable AI-generated outputs. It integrates multiple language models, including Mistral, Claude 3.5, and OpenAI, to enhance performance through advanced consensus techniques
Distributed GCS-GCS multilingual PDF processing service built for horizontal scaling and concurrency, can be deployed using docker compose for voluminous processing
AI-powered invoice processing system using Google Document AI - Automated AP workflows with CI/CD pipeline for enterprise finance operations
Customized LangChain Azure Document Intelligence loader for table extraction and summarization
A fast, flexible API for extracting text from PDFs and images using smart file detection and OCR—perfect for automating your document workflows.
Add a description, image, and links to the document-processing-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the document-processing-pipeline topic, visit your repo's landing page and select "manage topics."