Edith is an assistant system that recognize objects or text through point-detecting and give auditory feedback. This name comes from the movie “Spider-Man: Far From Home”, in which iron man Tony Stark sent spider man Peter Parker a smart glasses and Edith is the name of the artificial tactical intelligence system deployed in that glasses.
For Edith' users, we developed 3 modes, aiming at helping different groups of people:
- Object Recognition Mode: https://youtu.be/Uj8FFHap59U
- Text Recognition Mode: https://youtu.be/ui5UoAk-Maw
- Accessibility Mode: https://youtu.be/xdyWIbBVSe0
Here is a brief introduction of our work:
- Developed a smart assistant system on Linux(RPI) and Windows(PC) with three modes (Object-recognition mode/Text-recognition mode/Assistive mode) that gives real-time auditory feedback.
- Utilized pyttsx3 for voice synthesis with MP3 format and speak out the live-voice according to text information on RPi.
- Applied MediaPipe and OpenCV for gesture recognition to wake the recognition.
- Used YOLOv5, PyTorch and PaddleOCR to train object recognition model and character recognition model with Python, achieved accuracy of 94.1% and 97.4% respectively.
- Applied Socket and Multithreading to established communication between Edge-devices and PC for image collection, and live-voice response.
Thanks for reading.