- Introduction
- Methodology
- Repository Organization
- Requirements
- Accessing Data
- Usage Instructions
- Related Resources
- License
- Archiving
- Contact
This repository contains the code corresponding to the Seeing the Words project conducted jointly at the Eep Talstra Centre for Bible and Computer (ETCBC) and the Computer Science department, Vrije Universiteit Amsterdam. The aim of the project was to generate images by using biblical text as prompts. The generated images were analyzed using different tools with the results stored in a CSV file. The corresponding dataset is called the Visio Divina Dataset (VVD) and it is made available on the DANS SSH Data Station.
The canonical citation for this project is below:
- bibtex: TODO.
- APA: TODO.
Here, we provide a summary of the methodology of this project. The following biblical texts were used.
- Adam and Eve's Expulsion of Paradise (Genesis 4:23-24)
- The Tower of Babel (Genesis 11:1-9)
- Binding of Isaac (Genesis 22:9-14)
- The Last Supper (Mark 14:12-25)
- Moses Found (Exodus 2:5-9).
The following Text2Image (T2I) generators were used.
- DALL E
- Midjourney
- Stable Diffusion (various versions)
There are in total 7,116 images generated.
We then performed different means to evaluate the generated images. Details are provided in the paper.
The repository is organized as follows:
.
├── data/
│ ├──prompts/
│ │ └── <prompt text files>
│ ├──data_csv/
│ └── <data_csv_files>
│ ├── images/
│ │ └── <image files links>
├── src/
│ └──
│
├── models/
│
└── README.md
- The folder data/data_csv/ contains the CSV file with relevant analytical and evaluation results.
- The folder data/images/ contains image links to the generated images that are hosted on ImageKit.
- The folder data/prompts/ contains the (pre-processed) biblical text used to generate images.
- All the code are in the folder ./src/.
- The models used to evaluate generated images are available upon request.
To access and work with this data, you will need the following:
- Python 3.6+
- Pandas (for CSV file handling)
- PIL or OpenCV (for image handling)
- analyzeppl for analysis
- import the imagekit before image processing as follows:
from imagekitio import ImageKit
Install the dependencies with:
pip install pandas
pip install imagekitio
The set of images is named Visio Divina Dataset (VVD). There are multiple ways one can access the data.
The dataset can be downloaded directly from the DANS SSH Data station To make sure the script runs correctly, you need to unzip the images and put them under the directory ./data/images/ folder.
The images are hosted on ImageKit and can be viewed directly on ImageKit by following the link.
The data.csv file can be found in the ./data/data_csv/ folder. It contains the analysis result.
Example to load the CSV in Python:
import pandas as pd
data = pd.read_csv('data/data_csv/prompt_0/data.csv')
Make sure to check the CSV headers for proper column names and data types.
To load and display images hosted on ImageKit in Python:
from imagekitio import ImageKit
Example image path
magekit = ImageKit(
private_key='your private_key',
public_key='public_kRnwmhVYMoavSB9Eqsc0gEc8rKw=',
url_endpoint = "https://ik.imagekit.io/seeingthewords/'
)
image_url = imagekit.url({
"path": "/default-image.jpg"
}
)
Load and display the image:
image = Image.open(image_path)
image.show()
- Clone the repository to your local machine.
- Install the necessary packages (see the instructions above).
- Use the examples above to load the CSV data and images as needed.
- Run any analysis or scripts provided in the project.
- Specify the license under which the project is shared.
-
The VR Exhibition: link.
-
The code generates the Visio Divina Dataset. It has been made available on the DANS SSH Data Station.
-
Please check the webpage of ETCBC for other related resources: webpage.
The code and data have been archived at the Vrije Universiteit by Shuai Wang. You can download the code, data, intermediate results, selected samples for manual process and analysis, and the paper on YODA: link to be added.
The code and results included in this repository can be used for free to generate images and perform further analysis. All that has been included in this repository is released under the GNU GPL v3 license.
- Prof. Willem van Peursen: e-mail, VU Research portal (the main researcher).
- Hidde Makimei: e-mail, Linkedin, GitHub (the main developer).
- Shuai Wang: e-mail, website, LinkedIn, Github (data steward).
To follow the latest development of the project, please check the following pages: