Visualización e interpretación de redes neuronales convolucionales mediante dropout espacial

Hernández Hernández, Abián

View/Open

Export Citations

Date

2020

URI

http://riull.ull.es/xmlui/handle/915/21778

Abstract

El glaucoma es una de las principales causas de la ceguera total a nivel mundial, por lo que su identificación a tiempo es crucial. En los últimos años con el avance de las tecnologías, en concreto con el avance de las técnicas de computación y Deep Learnig, se ha abierto otra posible vía para generar una herramienta que facilite el diagnóstico del glaucoma. En esta línea de investigación es en la que trabaja un grupo de profesores de la ULL entre los que se encuentra el tutor de este Trabajo de Fin de Grado. El objetivo principal del mismo es estudiar en qué partes de una retinografía se fija una red neuronal entrenada para clasificar si el ojo está sano o no, y ver si existe coincidencia entre esas regiones de la imagen y estructuras anatómicas o sectores del disco óptico que son relevantes para los especialistas en sus diagnósticos. Con ese fin, se han utilizado métodos de visualización tipo CAM: Grad-CAM, Grad-CAM++ y Score-CAM, para generar mapas de calor de la imagen de entrada que resalten las zonas de interés. Además de estos métodos, se introduce uno nuevo desarrollado por el tutor, el cual selecciona un conjunto mínimo de pixels de la imagen que constituyen la información fundamental a preservar por la red neuronal para poder hacer la predicción sin que la probabilidad asignada a la clase correspondiente se vea apenas alterada. Se han evaluado sus prestaciones y se ha visto que puede ser una alternativa muy interesante a los otros métodos.

Glaucoma is one of the leading causes of total blindness worldwide, so its early identification is crucial. In recent years, with the advancement of technologies, specifically with the advancement of computing techniques and Deep Learnig, another possible way has been opened to generate a tool that facilitates the diagnosis of glaucoma. It is in this line of research that a group of professors from the ULL works, among whom is the tutor of this Final Degree Project. The main objective is to study in which parts of a retinography a trained neural network is fixed to classify whether the eye is healthy or not, and to see if there is a coincidence between those regions of the image and anatomical structures or sectors of the optic disc that are relevant to specialists in their diagnoses. To this end, CAM-type visualization methods: Grad-CAM, Grad-CAM ++ and Score-CAM have been used to generate heat maps of the input image that highlight the areas of interest. In addition to these methods, a new one developed by the tutor is introduced, which selects a minimum set of pixels of the image that constitute the fundamental information to be preserved by the neural network in order to be able to make the prediction without the probability assigned to the corresponding class look barely altered. Its benefits have been evaluated and it has been seen that it can be a very interesting alternative to the other methods.

Licencia Creative Commons (Reconocimiento-No comercial-Sin obras derivadas 4.0 Internacional)

Except where otherwise noted, this item's license is described as Licencia Creative Commons (Reconocimiento-No comercial-Sin obras derivadas 4.0 Internacional)