Use this URL to cite or link to this record in EThOS: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.786108
Title: Looking deep at people : towards understanding and generating humans in images with deep learning
Author: Bem, Rodrigo Andrade de
ISNI: 0000 0004 7971 5761
Awarding Body: University of Oxford
Current Institution: University of Oxford
Date of Award: 2018
Availability of Full Text:
Access from EThOS:
Full text unavailable from EThOS.
Abstract:
Understanding and generating people in images and videos is a long-standing goal in computer vision. Over the last decades, the research community has devoted significant effort to these tasks, motivated by a large number of potential applications, such as surveillance, human-machine interaction, action and behaviour recognition, motion capture, video reenactment, and computer graphics animation. The effort is also driven by the difficulty of the problems themselves, which arises, for instance, from the endless combinations of environments, visual appearances, and postures in which humans can appear in images. The high dimensionality of the human body, the inherent noise of visual data, and the ill-posed nature of the problems are further relevant issues. Nonetheless, meaningful advances in the field have recently been achieved using deep learning. This thesis pursues further advances towards understanding and generating people in visual data through the development of new discriminative and generative deep learning methods. The main contributions are: i) a deep learning framework for 2D human pose estimation, which allows for mean-field inference over part-based models; ii) a conditional deep generative model that achieves state-of-the-art results on generating images of humans conditioned on body posture; and iii) a structured semi-supervised deep generative model that jointly performs pose estimation and image generation, understanding and generating people in images in a single framework.
Supervisor: Torr, Philip H. S.
Sponsor: CAPES Foundation ; Ministry of Education of Brazil
Qualification Name: Thesis (Ph.D.)
Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.786108
DOI: Not available
Keywords: Deep learning ; Artificial intelligence ; Computer engineering ; Machine learning ; Computer vision ; Computer science ; Human body analysis