🧵 Needle in a Haystack 🧵

Needle in a Haystack is a project that leverages the Microsoft Florence-2 API to process images and generate an interactive HTML gallery. It performs tasks such as caption generation and object detection, annotating images with bounding boxes and labels for detected objects. The results are formatted into a JSON structure and presented alongside annotated images. Using Bootstrap for styling and D3.js for visualization, the gallery includes a treemap to represent label occurrences, enabling users to filter images by detected objects.

📂 Project Structure 📂

haystack/
    .vscode/
        extensions.json
    annotated/
    benchmark/
        1080p/
        1440p/
        4K/
        720p/
    benchmark.py
    image_data.json
    images/
    server.py
    start.py
    viewer.html

requirements.txt

setup.py

⚙️ Setup ⚙️

📋 Prerequisites 📋

Python 3.8 or higher
pip (Python package installer)
Florence2 (https://pinokio.computer/item?uri=https://github.com/pinokiofactory/florence2)

📥 Installation 📥

Clone the repository:

git clone https://github.com/yourusername/needle-in-haystack.git

Install the required Python packages:

python setup.py

This will generate the

requirements.txt

file and install all dependencies listed in it.

🚀 Usage 🚀

📥 Installing Pinokio and Downloading Florence2 Model 📥

Before processing images, you need to install Pinokio and download the Florence2 Model.

Install Pinokio:
- If Pinokio is not already installed, you can install it here:
  
  https://program.pinokio.computer/#/?id=windows
Download the Florence2 Model:
- After installing Pinokio, download the Florence2 Model:
  
  https://pinokio.computer/item?uri=https://github.com/pinokiofactory/florence2
- This will download and set up the Florence2 Model in the appropriate directory.

🖼️ Processing Images 🖼️

To process images and generate the annotated results first start pinokio and launch Florence2, then run:

cd haystack
python start.py

This script will process images in the images/ directory, generate captions and object detection results, and save the annotated images in the annotated/ directory. The results will be saved in image_data.json and viewer.html.

🖥️ Starting the Server 🖥️

To start the server and open the interactive HTML gallery in your browser, run:

python server.py

You will be prompted to open the browser automatically. Type y for yes or n for no. If yes, the viewer.html will launch in a browser. This file was created in the previous step.

📊 Benchmarking Models 📊

To benchmark different Florence models on your images, run:

python benchmark.py

This script will benchmark the models on images in the benchmark/ directory and display system stats in real-time and give you recommendations based on your computer's performance.

📄 Project Files 📄

`start.py`

This script processes images using the Microsoft Florence-2 API, generates captions and object detection results, and saves the annotated images and results.

`server.py`

This script starts an HTTP server to serve the interactive HTML gallery. It logs GET and POST requests and can open the gallery in the browser automatically.

`benchmark.py`

This script benchmarks different Florence models on images in the benchmark/ directory. It displays system stats in real-time and summarizes the benchmark results.

`viewer.html`

This is the interactive HTML gallery generated by start.py. It displays the annotated images and allows users to filter images by detected objects using a treemap.

`image_data.json`

This file contains the results of the image processing, including captions and object detection results.

requirements.txt

This file lists the Python packages required for the project.

setup.py

This script generates the

requirements.txt

file and installs the required Python packages.

`extensions.json`

This file contains recommended extensions for Visual Studio Code.

🤝 Contributing 🤝

Contributions are welcome! Please open an issue or submit a pull request on GitHub.

📜 License 📜

This project is licensed under the MIT License. See the LICENSE file for details.

👤 Author 👤

Daniel Penrod

Feel free to explore the object detection results on your images and filter the gallery by detected objects. What will you find? 🕵️‍♂️

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
haystack		haystack
.gitignore		.gitignore
benchmark1.png		benchmark1.png
benchmark2.png		benchmark2.png
readme.md		readme.md
requirements.txt		requirements.txt
server.png		server.png
setup.py		setup.py
start.png		start.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧵 Needle in a Haystack 🧵

📂 Project Structure 📂

⚙️ Setup ⚙️

📋 Prerequisites 📋

📥 Installation 📥

🚀 Usage 🚀

📥 Installing Pinokio and Downloading Florence2 Model 📥

🖼️ Processing Images 🖼️

🖥️ Starting the Server 🖥️

📊 Benchmarking Models 📊

📄 Project Files 📄

`start.py`

`server.py`

`benchmark.py`

`viewer.html`

`image_data.json`

`extensions.json`

🤝 Contributing 🤝

📜 License 📜

👤 Author 👤

About

Releases

Packages

Languages

galactic-plane/needle-in-haystack

Folders and files

Latest commit

History

Repository files navigation

🧵 Needle in a Haystack 🧵

📂 Project Structure 📂

⚙️ Setup ⚙️

📋 Prerequisites 📋

📥 Installation 📥

🚀 Usage 🚀

📥 Installing Pinokio and Downloading Florence2 Model 📥

🖼️ Processing Images 🖼️

🖥️ Starting the Server 🖥️

📊 Benchmarking Models 📊

📄 Project Files 📄

start.py

server.py

benchmark.py

viewer.html

image_data.json

extensions.json

🤝 Contributing 🤝

📜 License 📜

👤 Author 👤

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`start.py`

`server.py`

`benchmark.py`

`viewer.html`

`image_data.json`

`extensions.json`

Packages