Image Captioning in Web VR

Description / Rationale

This is small project, which shows the use of image captioning (machine learning task; model used: https://huggingface.co/nlpconnect/vit-gpt2-image-captioning) as used in Web VR. It was inspired by similar project created by Misslivirose titled Scene Reader which shows image captioning with Three.js and Microsoft Azure service.

Instructions

To see image captioning at work, click on camera icon. On every click image of the scene with caption will be generated. In order to see the magic happen, try to find answer to the riddle.

Tools Used

The project uses A-Frame at its core with Hugging Face API.

Credits

3D model of the room was created by Francesco Coldesina, and taken from Sketchfab.com

Demo

To see the application at work: Demo application

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
3d		3d
decoder		decoder
img		img
js		js
LICENSE		LICENSE
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioning in Web VR

Description / Rationale

Instructions

Tools Used

Credits

Demo

About

Releases

Packages

Languages

License

akbartus/WebVR-Captioning

Folders and files

Latest commit

History

Repository files navigation

Image Captioning in Web VR

Description / Rationale

Instructions

Tools Used

Credits

Demo

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages