
Improve the face alignment performance in detect_faces() #1409

Open · wants to merge 1 commit into master
Conversation

huulockt

Tickets

#1244
#1406

What has been done

With this PR, the detect_faces() logic has been modified when alignment is enabled:

  • The original image is passed directly to the detect function, so the detection results are not affected by the alignment flag.
  • Only the face region is aligned, rather than the entire image, which improves alignment speed.
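The face-region-only alignment described above can be sketched roughly as follows. This is a minimal illustration of the idea, not the actual deepface implementation; the function name, the nearest-neighbour rotation, and the simplified sign conventions are all assumptions:

```python
import numpy as np

def align_face_region(img, facial_area, left_eye, right_eye):
    """Rotate only the cropped face region so the eye line becomes horizontal.

    facial_area: {"x", "y", "w", "h"} in original-image coordinates.
    left_eye / right_eye: (x, y) landmarks in original-image coordinates.
    """
    x, y, w, h = (facial_area[k] for k in ("x", "y", "w", "h"))
    face = img[y:y + h, x:x + w]

    # Angle of the eye line relative to the horizontal axis
    theta = np.arctan2(right_eye[1] - left_eye[1], right_eye[0] - left_eye[0])

    # Nearest-neighbour rotation about the crop centre (a stand-in for
    # cv2.warpAffine; sign conventions simplified for illustration).
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.indices((h, w))
    src_x = np.cos(theta) * (xs - cx) - np.sin(theta) * (ys - cy) + cx
    src_y = np.sin(theta) * (xs - cx) + np.cos(theta) * (ys - cy) + cy
    src_x = np.clip(np.round(src_x).astype(int), 0, w - 1)
    src_y = np.clip(np.round(src_y).astype(int), 0, h - 1)
    return face[src_y, src_x]
```

The point is that the array being rotated is the small `face` crop, not the full image, so the per-face cost depends only on the crop size.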

Sorry I didn't add a unit test case for #1244 as I promised. I think this bug can only be detected visually, since it’s hard to test automatically. But if an automated test is needed, I’d suggest using template matching algorithms.
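The template-matching idea could be sketched with a plain normalized cross-correlation score. This is a minimal numpy illustration (rather than, say, cv2.matchTemplate); the function name and the 0.9 threshold in the comment are assumptions:

```python
import numpy as np

def normalized_correlation(a, b):
    """Normalized cross-correlation score in [-1, 1] for equal-size patches."""
    a = a.astype(float) - a.mean()
    b = b.astype(float) - b.mean()
    denom = np.sqrt((a ** 2).sum() * (b ** 2).sum())
    return float((a * b).sum() / denom) if denom else 0.0

# An alignment regression test could then assert that the aligned output of a
# deliberately rotated input correlates highly with a known-good reference crop:
#     assert normalized_correlation(aligned, reference) > 0.9
```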

How to test

make lint && make test

@serengil
Owner

When I run both the current design and your change of the detection module on this image, I get these results:

current design:

opencv detected 5 faces
ssd detected 3 faces
dlib detected 4 faces
mtcnn detected 6 faces
retinaface detected 7 faces
yunet detected 4 faces
yolov8 detected 8 faces
centerface detected 6 faces

your change:

opencv detected 5 faces
ssd detected 4 faces
dlib detected 4 faces
mtcnn detected 5 faces
retinaface detected 6 faces
yunet detected 4 faces
yolov8 detected 7 faces
centerface detected 7 faces

You can also find the detection results in the attached screenshot (2024-12-24 11:01).

As you can see, with your change most of the detectors fail to find faces close to the image boundaries.

I am sharing my test code here:

import matplotlib.pyplot as plt

from deepface import DeepFace

img_path = "dataset/selfie-many-people.jpg"

detector_backends = [
    "opencv",
    "ssd",
    "dlib",
    "mtcnn",
    "retinaface",
    "yunet",
    "yolov8",
    "centerface",
]

for detector_backend in detector_backends:
    face_objs = DeepFace.extract_faces(
        img_path=img_path,
        detector_backend=detector_backend,
        # expand_percentage=0,
    )
    print(f"{detector_backend} detected {len(face_objs)} faces")
    fig = plt.figure(figsize=(10, 10))
    for face_obj in face_objs:
        face = face_obj["face"]
        plt.imshow(face)
        plt.axis("off")
        plt.show()

TL;DR: the current design can detect faces close to the image boundaries, but your change cannot.

@huulockt
Author

Thanks for the feedback! I'm busy with Christmas right now, but I’ll check it carefully soon. For now, here are my thoughts:

  • Each model is pretrained on its own dataset, so there are always some constraints on what it can detect.
  • Lightweight detectors like SSD, YuNet, and CenterFace can struggle with large images. In my design, not adding a border before detection helps, so these models tend to perform better.
  • Detection results mainly depend on the threshold. If we want to make it easier for users, we could run a proper benchmark to suggest optimal thresholds. Otherwise, users would need to find an appropriate threshold themselves for their specific models (as I'm doing).

P/S: Merry Christmas! Hope you enjoy the holiday season. 🎄

@serengil
Owner

Of course, take your time. I hope you understand my concern: while improving runtime performance, I don't want to decrease detection performance. An enhancement should offer the same accuracy or better. Here are my comments:

  • Each model is pretrained on its own dataset, so there are always some constraints on what it can detect -> This is independent of the model, because with the current design retinaface and mtcnn detect more faces. So your change caused this.
  • Lightweight detectors like SSD, YuNet, and CenterFace can struggle with large images. In my design, not adding a border before detection helps, so these models tend to perform better -> right, ssd and centerface outperform the existing design.
  • Detection results mainly depend on the threshold -> again, we used the same threshold for retinaface and mtcnn, but the new design misses some faces. So that must be related to the detection logic you proposed.

@huulockt force-pushed the enhance-aligment-performance branch from 5a0eea1 to 421ef9e on December 25, 2024 at 22:17
@huulockt
Author

Actually, the detection results in my design are the same as the current design when the align flag is turned off. Here’s the first solution I came up with: We can keep the current border-adding step before detection, but combine it with my proposal to only apply alignment to the face region. I implemented this in the last commit. However, when I tested it myself, I couldn’t figure out why mtcnn still returns 5 faces in both my design and the current design. Could you please run the test code again with mtcnn?

Moreover, the above solution doesn’t fully preserve the detection improvements observed with models like ssd, centerface, and yunet(*). I propose adding these models to a skip-border-addition list, and the code would look like this:
if align is True and model_name not in skip_list:

Additionally, for further clarification: in my design, yolov8 can detect all 7 faces, but faces near the border return outer-eye coordinates of (0, 0), which affects alignment. The current design didn't have this problem because of the border, so please let me know if you want to add yolov8 to the skip-list too.

What do you think of these ideas?

(*) For yunet, with a threshold of 0.8, the current design detects 4 faces, while my design detects 6. Perhaps we could consider lowering the threshold for better results.
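The skip-list idea above could look roughly like the following. This is a hypothetical sketch, not deepface API: the names `SKIP_BORDER_MODELS`, `pad_with_border`, and `prepare_detector_input`, and the default border size, are all assumptions:

```python
import numpy as np

# Lightweight detectors that (per the discussion above) do better without padding
SKIP_BORDER_MODELS = {"ssd", "centerface", "yunet"}

def pad_with_border(img, border):
    # Constant-pad all four sides (stand-in for the existing border step)
    return np.pad(img, ((border, border), (border, border), (0, 0)),
                  mode="constant")

def prepare_detector_input(img, model_name, align, border=50):
    """Return (detector_input, offset); offset maps coords back to the original."""
    if align and model_name not in SKIP_BORDER_MODELS:
        return pad_with_border(img, border), border
    return img, 0
```

With this shape, detectors in the skip-list see the raw image, everything else keeps the border, and the returned offset lets detection coordinates be translated back.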

@serengil
Owner

serengil commented Dec 25, 2024

So, we are still adding borders to image. Why should we merge this PR then? I cannot see any reasonable improvements.

@huulockt
Author

The main improvement in my design lies in how the alignment input is chosen, regardless of whether borders are added or not. Currently, the entire image is used as the input for alignment, and this process is repeated n times, once for each detected face.

In my design, only the facial area is used as the input for each alignment operation. Since the entire image is significantly larger than the facial area, this optimization saves a considerable amount of processing time—especially when multiple faces are present in the image.
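A back-of-the-envelope comparison illustrates the saving. The image and face sizes below are made-up numbers, and pixel count is only a rough proxy for the cost of a warp:

```python
# Hypothetical sizes: a 4000x3000 photo containing 7 faces of ~200x200 pixels
H, W = 3000, 4000          # full image
h, w = 200, 200            # typical detected face crop
n_faces = 7

full_image_pixels = n_faces * H * W   # current design: warp the whole image per face
face_crop_pixels = n_faces * h * w    # proposed: warp only each face crop

print(full_image_pixels // face_crop_pixels)  # -> 300
```

Under these assumed sizes, the crop-only approach touches about 300 times fewer pixels per alignment pass, and the gap grows with image size and face count.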

I hope this explanation clarifies the benefits of the proposed changes.
