Cattle face generation using an adversarial network with bounding boxes of facial subregions

Silva, Gabriel Rezende da

Please use this identifier to cite or link to this item: https://repositorio.ufjf.br/jspui/handle/ufjf/19401

Files in This Item:

File	Description	Size	Format
gabrielrezendedasilva.pdf	PDF/A	10.95 MB	Adobe PDF	View/Open

Type:	Dissertação
Title:	Cattle face generation using an adversarial network with bounding boxes of facial subregions
Author:	Silva, Gabriel Rezende da
First Advisor:	Vieira, Marcelo Bernardes
Co-Advisor:	Maciel, Luiz Maurílio da Silva
Referee Member:	Souza, Jairo Francisco de
Referee Member:	Giraldi, Gilson Antonio
Resumo:	A coleta de dados no âmbito da pecuária de precisão é um sério desao. A escassez de conjuntos de dados de imagens anotadas para problemas relacionados à pecuária de precisão limita o potencial de algoritmos de aprendizado profundo. As informações aprendidas pelas Redes Adversárias Generativas (GANs) podem ser empregadas para melhorar o desempenho de tarefas de aprendizado de máquina, como classicação. Subregiões especícas da região facial de bovinos, como focinho, orelhas e olhos, são indicadores signicativos de dor, desconforto e estresse nos animais, além de serem utilizadas para a identicação individual. Nesse contexto, o presente trabalho propõe uma nova arquitetura que reforça a localização espacial de sub-regiões faciais de interesse. O método proposto integra módulos de caixa delimitadora com uma GAN conhecida na literatura, denominada BigGAN. A hipótese é que a incorporação de informações de localização espacial, fornecidas por caixas delimitadoras de sub-regiões de interesse, como focinho, orelhas e olhos, pode aprimorar a geração de imagens sintéticas de faces de bovinos. Assim, propõe-se uma nova conguração desses módulos, baseada no provimento de múltiplos objetos em uma cena. Os módulos são acoplados em diferentes profundidades no gerador, formando uma pirâmide de resolução, e em um nível especíco no discriminador. Também propõe-se a utilização de caixas multi-escalas para representar as sub-regiões de interesse. A premissa é suavizar as áreas de transição entre os objetos e o plano de fundo, ao mesmo tempo em que reforça as características dos objetos. Para treinar o modelo, ltrou-se um conjunto de dados pré-anotados para a tarefa de detecção, adaptando-o à tarefa de geração. O novo conjunto de dados inclui 9▷495 faces de bovinos e um subconjunto de anotações de caixas delimitadoras. Para avaliar a viabilidade da proposta, realizou-se uma análise extensiva, tanto quantitativa quanto qualitativa. Os resultados experimentais mostraram ganhos proporcionados pela inserção de informações adicionais de sub-regiões de interesse via módulos de caixa delimitadora. A capacidade de inserir arbitrariamente a posição das sub-regiões permite possíveis aplicações como a geração facial personalizada pelo usuário.
Abstract:	Data collection in precision livestock farming is a serious challenge. The scarcity of annotated image datasets for precision livestock problems limits the potential of deep learning algorithms. The information learned by the Generative Adversarial Networks (GANs) can be employed to improve the performance of machine learning tasks, such as classication. Specic subregions of the facial cattle region, such as the muzzle, ears, and eyes, are signicant indicators of pain, discomfort, and stress in animals, as well as being used for individual identication. In this context, in the present work we develop a new architecture that enhances the spatial localization of facial interest subregions. The proposed method combines bounding box modules with a GAN known in the literature, named BigGAN. The hypothesis is that the incorporation of spatial location information, provided by interest subregions bounding boxes, such as the muzzle, ears, and eyes, can improve the generation of synthetic cattle faces images. Thus, we proposed a new conguration of these modules, based on providing multiple objects in a scene. The modules are attached at dierent depths in the generator, forming a resolution pyramid, and at a specic level in the discriminator. We also propose to use a multiscale boxes to represent the interest subregions. The premise is to smooth out the transitions between the objects and the background, while reinforcing the objects’ features. To train the model, we ltered a pre-annotated dataset for the detection task and adapted it for the generation task. The new dataset includes 9, 495 cattle faces and a subset of bounding box annotations. To assess the feasibility of the proposal, we conducted extensive analysis, both quantitative and qualitative. The experimental results showed improvements provided by the inclusion of additional information for the interest subregions via bounding box modules. The capacity to arbitrarily insert the subregions’ position allows for possible applications such as facial generation designed by user.
Keywords:	Geração facial de bovinos Aprendizado profundo Redes adversárias generativas Módulos de caixa delimitadora Cattle facial generation Deep learning Generative adversarial networks Bounding box modules
CNPq:	CNPQ::CIENCIAS EXATAS E DA TERRA
Language:	eng
Country:	Brasil
Publisher:	Universidade Federal de Juiz de Fora (UFJF)
Institution Initials:	UFJF
Department:	ICE – Instituto de Ciências Exatas
Program:	Programa de Pós-graduação em Ciência da Computação
Access Type:	Acesso Aberto Attribution-NonCommercial-ShareAlike 3.0 Brazil
Creative Commons License:	http://creativecommons.org/licenses/by-nc-sa/3.0/br/
URI:	https://repositorio.ufjf.br/jspui/handle/ufjf/19401
Issue Date:	27-Sep-2024
Appears in Collections:	Mestrado em Ciência da Computação (Dissertações)

Show full item record Recommend this item

This item is licensed under a Creative Commons License