Tesis de maestría / master thesis

Towards a real-time lightweight facial reconstruction model

Abstract

3D facial reconstruction algorithms are highly effective for diverse uses, including facial recognition, virtual reality, and medical imaging. Yet, the intricacy and computational demands of these methods, coupled with the limited availability of datasets, have confined their use to a specific set of researchers and experts. Furthermore, in response to the demand for resource-efficient solutions, the development of lightweight processes has become a key area of research in computer vision. These models aim to find an equilibrium between model size, computational demands, and accuracy. They offer advantages like efficient use of resources, quicker inference times, and enhanced accessibility. Particularly for 3D facial reconstruction models, lightweight architectures open up possibilities for deployment on less powerful hardware, given that these techniques typically depend on high-performance processors like NVIDIA graphics cards. This thesis presents an overview of 3D face creation, followed by state-of-the-art methods which were analyzed in a comparative table, offering an survey of the fundamental characteristics of each method. As well as that, a benchmark comparison among various leading lightweight models in a facial reconstruction framework, aiming to decrease its computational complexity to enable testing on a mobile device. A quantitative evaluation, such as its losses over the training and testing stages, the inference speed achieved and an evaluation in cutting-edge datasets were presented. Additionally, an analysis on the qualitative aspect, for example, the 3D pose or depth estimation. Those aspects were the base to select a lightweight backbone. Finally, an user interface was developed using Python and Kivy. The model was runned on a constrained-device, such as a single-core of a commercial laptop, to examine its performance. EfficientNetLite was determined as a suitable replacement for the current backbone, since its characteristics and scores obtained over several examinations presented a similar behavior to MobileNet-V1, the default backbone of the facial reconstruction model selected.

Description

https://orcid.org/0000-0001-6451-9109

Collections

Loading...

Document viewer

Select a file to preview:
Reload

logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

Licencia