Kyaru-Detector

Is Kyaru in this media? An AI-driven character recognition project.

Disclaimer

This project, as of now, is more like a notebook listing my tests. I want it to list what I've tried and what is yet to be done. I am trying to do everything myself, and I will surely make mistakes. All those mistakes will be documented. Do not take this repository as a functioning solution to this problem.

Context

Recognizing a character is no simple feat. For example, let's say you have a model that can recognize the character when they look at you. What happens when the character looks elsewhere? When the character moves, and the animation gets "distorted"? Or when you only see their back? Or only a part of the character, not including the entire face? What if the art style is very different, as in fan art?

The answer's simple: the model won't recognize the character if it's been trained solely on detecting faces in a certain style.

In this project, we will try to build a complete pipeline which will recognize Kyaru. We will try to detect her not only by her face, but by all the characteristics of her character design. We will even try to add voice recognition.

First approach: Face detection

Introduction

When I think of "character recognition", my mind goes towards "detection" first. If I want my project to be able to detect Kyaru, I need it to be able to detect characters first, and then to decide if the character is Kyaru or not.
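
The detect-then-decide idea can be sketched as two pluggable stages. This is only an illustration under my own assumptions: `Box`, `find_kyaru`, and the two callables are hypothetical names, not code from this repository.

```python
from typing import Callable, List, Sequence, Tuple

# (x, y, width, height) of a detected character or face, in pixels.
Box = Tuple[int, int, int, int]


def find_kyaru(
    image: object,
    detect: Callable[[object], Sequence[Box]],
    is_kyaru: Callable[[object, Box], float],
    threshold: float = 0.5,
) -> List[Tuple[Box, float]]:
    """Stage 1 proposes boxes, stage 2 scores each one.

    Returns the boxes whose score is at or above `threshold`,
    together with that score.
    """
    hits = []
    for box in detect(image):
        score = is_kyaru(image, box)
        if score >= threshold:
            hits.append((box, score))
    return hits
```

Keeping the two stages separate means the detector (a cascade, a trained model, ...) can be swapped without touching the "Kyaru or not" classifier, and vice versa.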

One of the easiest and most cost-efficient ways to do this is nagadomi's lbpcascade_animeface [1]. The cascade works well and as intended: it extracts the faces of characters.

However, while I truly respect nagadomi's work, I feel like I should train a model myself. Nagadomi's cascade was trained 9 years ago, so training a new one could result in a better model.

Training a new cascade could be considered a good idea. However, because traincascade has been removed since OpenCV 4.0, I am thinking of exploring another lead.
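
For reference, running the cascade with OpenCV could look like the sketch below. It assumes `opencv-python` is installed and that `lbpcascade_animeface.xml` has been downloaded next to the script. The `detectMultiScale` values are illustrative rather than tuned, and `pad_box` is a hypothetical helper I added so that crops keep a bit of hair and ears around the face.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


def pad_box(box: Box, img_w: int, img_h: int, ratio: float = 0.2) -> Box:
    """Grow a box by `ratio` on every side, clamped to the image bounds."""
    x, y, w, h = box
    dx, dy = int(w * ratio), int(h * ratio)
    x0, y0 = max(0, x - dx), max(0, y - dy)
    x1, y1 = min(img_w, x + w + dx), min(img_h, y + h + dy)
    return (x0, y0, x1 - x0, y1 - y0)


def detect_faces(image_path: str,
                 cascade_path: str = "lbpcascade_animeface.xml") -> List[Box]:
    """Return padded face boxes found by the LBP cascade."""
    import cv2  # imported here so pad_box stays usable without OpenCV

    cascade = cv2.CascadeClassifier(cascade_path)
    if cascade.empty():
        raise FileNotFoundError(f"could not load cascade: {cascade_path}")

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(f"could not read image: {image_path}")

    # The cascade works on a grayscale, contrast-equalized image.
    gray = cv2.equalizeHist(cv2.cvtColor(image, cv2.COLOR_BGR2GRAY))
    faces = cascade.detectMultiScale(
        gray, scaleFactor=1.1, minNeighbors=5, minSize=(24, 24)
    )

    img_h, img_w = gray.shape
    return [pad_box(tuple(int(v) for v in face), img_w, img_h) for face in faces]
```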

First step

I have trained a model to detect faces. Model and notebooks are available here.

Second step

I'm going to use this face detector to extract faces and create a new dataset with two classes: "Kyaru" and "Not Kyaru". We'll feed those extracts to an InceptionV3 and see how it performs.
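
A minimal sketch of what that binary classifier could look like with Keras, assuming TensorFlow is installed. The folder names `kyaru` and `not_kyaru`, the frozen backbone, the dropout rate, and the optimizer are all assumptions of mine for illustration, not choices already made in this project.

```python
IMAGE_SIZE = (299, 299)  # InceptionV3's default input resolution
# Order matters: index 0 is the negative class, index 1 is "kyaru".
CLASS_NAMES = ["not_kyaru", "kyaru"]


def build_kyaru_classifier(train_base: bool = False):
    """InceptionV3 backbone with one sigmoid output: P(crop is Kyaru)."""
    import tensorflow as tf  # heavy import, kept local to the function

    base = tf.keras.applications.InceptionV3(
        include_top=False,
        weights="imagenet",
        input_shape=IMAGE_SIZE + (3,),
        pooling="avg",
    )
    base.trainable = train_base  # frozen by default: only the new head learns

    inputs = tf.keras.Input(shape=IMAGE_SIZE + (3,))
    # InceptionV3 expects pixels scaled from [0, 255] to [-1, 1].
    x = tf.keras.layers.Rescaling(scale=1.0 / 127.5, offset=-1.0)(inputs)
    x = base(x, training=False)
    x = tf.keras.layers.Dropout(0.2)(x)
    outputs = tf.keras.layers.Dense(1, activation="sigmoid")(x)

    model = tf.keras.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model


def load_crops(root: str, batch_size: int = 32):
    """Load face crops from root/not_kyaru/ and root/kyaru/ (hypothetical layout)."""
    import tensorflow as tf

    return tf.keras.utils.image_dataset_from_directory(
        root,
        label_mode="binary",
        class_names=CLASS_NAMES,  # forces label 1 to mean "kyaru"
        image_size=IMAGE_SIZE,
        batch_size=batch_size,
    )
```

Training would then be something like `build_kyaru_classifier().fit(load_crops("crops/train"), epochs=5)`, where the path and epoch count are placeholders.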

//

Might as well train on a small dataset of full Kyarus to see how it turns out...

Second approach

Even though the first approach is not finished yet, that does not prevent me from taking a break and trying to think differently about the problem.

The previous approach is great because it helps detect faces. But what about everything else? A character is not only a face; it is also a complex blend of attributes.

Here's an example of what defines Kyaru:

Green eyes
Long grey hair
Grey cat ears with white tips
Characteristic outfit
Voice Actress

There are multiple approaches we want to check out:

Multi-Label Classification: We can try to create a model which describes what it sees, and then compare its output with the set of attributes corresponding to Kyaru.
Simple Detection model: We are currently detecting faces, but can we expand the annotation to the whole character? This is something we must try out.
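
To make the multi-label idea more concrete, here is a rough sketch of the comparison step. It assumes some tagging model (not written yet) returns a score per attribute. The tag spellings, the 0.5 threshold, and the function names are hypothetical. The voice actress is left out because it is not a visual attribute.

```python
from typing import Dict, Set

# Visual attributes from the list above; the tag spellings are made up.
KYARU_ATTRIBUTES: Set[str] = {
    "green_eyes",
    "long_grey_hair",
    "grey_cat_ears_white_tips",
    "characteristic_outfit",
}


def predicted_tags(scores: Dict[str, float], threshold: float = 0.5) -> Set[str]:
    """Keep the tags whose predicted score is at or above the threshold."""
    return {tag for tag, score in scores.items() if score >= threshold}


def kyaru_match(
    scores: Dict[str, float],
    reference: Set[str] = KYARU_ATTRIBUTES,
    threshold: float = 0.5,
) -> float:
    """Fraction of the reference attributes found among the predicted tags."""
    if not reference:
        return 0.0
    found = predicted_tags(scores, threshold) & reference
    return len(found) / len(reference)


# Example with made-up model output: 3 of the 4 reference attributes are found.
example_scores = {
    "green_eyes": 0.9,
    "long_grey_hair": 0.8,
    "grey_cat_ears_white_tips": 0.3,
    "characteristic_outfit": 0.7,
    "sword": 0.95,
}
print(kyaru_match(example_scores))  # 0.75
```

Tags outside the reference set (like `sword` here) are ignored by this score, so it only measures how many of Kyaru's attributes were seen, not how many unrelated ones were.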

Bibliography

[1] nagadomi's lbpcascade_animeface blog post, 2011

[2] Z. Lan, K. Maeda, T. Ogawa and M. Haseyama, "Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration," in IEEE Access, vol. 11, pp. 35447-35456, 2023, doi: 10.1109/ACCESS.2023.3265728.
