RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling

Paper, Video, Project Page, and Talk

npm install
npm run spacy-install
node server.js
npm run start

Migrate color tracking, hand tracking, and speech recognition to the server side or web worker
Add a CSV reader to read mapping information from CSV files
Replace WebSpeech API with more robust and accurate speech recognition API such as Whisper.cpp
Semi-Automated suggestions for generated contents or content retrieval

Customizability of the presentation
Semantic cues from the real-time speech (not just nouns) ... and more on future work

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
public		public
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
server.js		server.js
spacy-listen.py		spacy-listen.py