Seeing the Words

This repository contains the code for the Seeing the Words project, conducted jointly by the Eep Talstra Centre for Bible and Computer (ETCBC) and the Computer Science department of the Vrije Universiteit Amsterdam. The aim of the project was to generate images using biblical texts as prompts. The generated images were analyzed with several tools, and the results are stored in a CSV file. The corresponding dataset is called the Visio Divina Dataset (VVD) and is available on the DANS SSH Data Station.

The canonical citation for this project is below:

bibtex: TODO.
APA: TODO.

Methodology

Here, we provide a summary of the methodology of this project. The following biblical texts were used.

Adam and Eve's Expulsion from Paradise (Genesis 3:23-24)
The Tower of Babel (Genesis 11:1-9)
Binding of Isaac (Genesis 22:9-14)
The Last Supper (Mark 14:12-25)
Moses Found (Exodus 2:5-9)

The following Text2Image (T2I) generators were used.

DALL-E
Midjourney
Stable Diffusion (various versions)

In total, 7,116 images were generated.

We then evaluated the generated images using several methods. Details are provided in the paper.

Repository Organization

The repository is organized as follows:

.
├── data/
│   ├── prompts/
│   │   └── <prompt text files>
│   ├── data_csv/
│   │   └── <data_csv_files>
│   └── images/
│       └── <image file links>
├── src/
├── models/
└── README.md

The folder data/data_csv/ contains the CSV file with relevant analytical and evaluation results.
The folder data/images/ contains image links to the generated images that are hosted on ImageKit.
The folder data/prompts/ contains the (pre-processed) biblical text used to generate images.
All the code is in the folder ./src/.
The models used to evaluate generated images are available upon request.
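
As a quick sanity check after cloning, the layout described above can be verified with a few lines of Python. This is only a sketch: the helper name `missing_dirs` is hypothetical and not part of the repository, and `models/` is left out of the check because the models are only available upon request.

```python
from pathlib import Path

# Folders named in the layout above, relative to the repository root.
# models/ is omitted on purpose: the models are available upon request only.
EXPECTED_DIRS = ("data/prompts", "data/data_csv", "data/images", "src")


def missing_dirs(repo_root="."):
    """Return the expected folders that do not exist under repo_root."""
    root = Path(repo_root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_dirs(".")
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("All expected folders are present.")
```

Run it from the repository root. An empty result means every folder listed above was found.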

Requirements

To access and work with this data, you will need the following:

Python 3.6+
Pandas (for CSV file handling)
PIL or OpenCV (for image handling)
analyzeppl for analysis
imagekitio (the ImageKit SDK); import it before image processing as follows:

    from imagekitio import ImageKit

Install the dependencies with:

    pip install pandas
    pip install imagekitio
    pip install pillow

Accessing Data

The set of images is named Visio Divina Dataset (VVD). There are multiple ways one can access the data.

1. Download the entire dataset.

The dataset can be downloaded directly from the DANS SSH Data Station. To make sure the scripts run correctly, unzip the images and place them in the ./data/images/ folder.
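
The unzip step can also be scripted. The sketch below uses only the Python standard library. The function name `extract_images` and the archive name `images.zip` are assumptions for illustration, so substitute the name of the file you actually downloaded.

```python
import zipfile
from pathlib import Path


def extract_images(archive_path, target_dir="data/images"):
    """Unzip archive_path into target_dir and return the number of files extracted."""
    target = Path(target_dir)
    # Create data/images/ (and any missing parents) if it is not there yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
        # Count regular files only, not directory entries.
        return sum(1 for info in archive.infolist() if not info.is_dir())


# Example call (hypothetical archive name):
# count = extract_images("images.zip")
# print(f"Extracted {count} files into data/images/")
```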

2. Browse the dataset on ImageKit

The images are hosted on ImageKit and can be viewed there directly by following the link.

3. Check the data analysis results

The data.csv file can be found in the ./data/data_csv/ folder. It contains the analysis results.

Example to load the CSV in Python:

    import pandas as pd

    data = pd.read_csv('data/data_csv/prompt_0/data.csv')

Make sure to check the CSV headers for proper column names and data types.
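
One way to check the headers and data types is sketched below. The column names `image_id`, `generator` and `score` are invented stand-ins, not the real headers of data.csv, and `describe_columns` is a hypothetical helper. Apply it to the DataFrame loaded from the actual file.

```python
import io

import pandas as pd


def describe_columns(frame):
    """Map each column name to the name of its pandas dtype."""
    return {name: str(dtype) for name, dtype in frame.dtypes.items()}


# Stand-in data with hypothetical column names; replace this with
# pd.read_csv('data/data_csv/prompt_0/data.csv') to inspect the real file.
sample = pd.read_csv(
    io.StringIO("image_id,generator,score\n1,Midjourney,0.5\n2,DALL-E,0.75\n")
)

for name, dtype in describe_columns(sample).items():
    print(f"{name}: {dtype}")
```

Columns that should be numeric but show up with a text dtype usually point to stray characters or missing-value markers in the file.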

4. Display some images

To load and display images hosted on ImageKit in Python:

    from imagekitio import ImageKit

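Create the client and build the URL of an example image:

    # Replace the placeholder with your own private key.
    imagekit = ImageKit(
        private_key='your private_key',
        public_key='public_kRnwmhVYMoavSB9Eqsc0gEc8rKw=',
        url_endpoint='https://ik.imagekit.io/seeingthewords/'
    )

    # Build the URL for an image path on the endpoint.
    image_url = imagekit.url({"path": "/default-image.jpg"})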


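Load and display the image:

    from io import BytesIO
    from urllib.request import urlopen
    from PIL import Image

    # Download the image bytes from image_url (built above) and open them with PIL.
    image = Image.open(BytesIO(urlopen(image_url).read()))
    image.show()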
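
If you only need plain image URLs and have no private key at hand, the URL can also be assembled by hand, since an ImageKit URL is the endpoint followed by the image path. This is a sketch under the assumption that the images are publicly readable. The helper `build_image_url` is hypothetical and not part of the ImageKit SDK.

```python
from urllib.parse import quote

# Endpoint taken from the example above.
URL_ENDPOINT = "https://ik.imagekit.io/seeingthewords/"


def build_image_url(path, endpoint=URL_ENDPOINT):
    """Join the endpoint and an image path into one URL, escaping unsafe characters."""
    # quote() keeps "/" as-is and percent-encodes characters such as spaces.
    return endpoint.rstrip("/") + "/" + quote(path.lstrip("/"))


print(build_image_url("/default-image.jpg"))
```

The resulting URL can be opened in a browser or fetched in the same way as `image_url` above.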

Usage Instructions

Clone the repository to your local machine.
Install the necessary packages (see the instructions above).
Use the examples above to load the CSV data and images as needed.
Run any analysis or scripts provided in the project.
Check the License section before reusing or redistributing the code and results.

Related Resources

The VR Exhibition: link.
The code generates the Visio Divina Dataset. It has been made available on the DANS SSH Data Station.
Please check the webpage of ETCBC for other related resources: webpage.

Archiving

The code and data have been archived at the Vrije Universiteit by Shuai Wang. You can download the code, data, intermediate results, selected samples for manual process and analysis, and the paper on YODA: link to be added.

License

The code and results included in this repository can be used for free to generate images and perform further analysis. All that has been included in this repository is released under the GNU GPL v3 license.

Contact

Prof. Willem van Peursen: e-mail, VU Research portal (the main researcher).
Hidde Makimei: e-mail, LinkedIn, GitHub (the main developer).
Shuai Wang: e-mail, website, LinkedIn, GitHub (data steward).

To follow the latest development of the project, please check the following pages:

The GitHub page of ETCBC: GitHub.
The official website of the ETCBC: homepage.
