Explaining Neural Networks Through Concept Extraction

Project Overview

This research project dedicated to unraveling the complexities of deep neural networks (DNNs) through the lens of explainable artificial intelligence (XAI), focusing specifically on the concept extraction process. This project addresses the critical need for transparency and understandability in DNNs, particularly in industrial applications where reliability and decision-making accuracy are paramount. The research centers on developing innovative methods for concept extraction, localization, and learning. These methodologies aim to clarify what DNNs learn during training, transforming the often opaque ‘black box’ of neural networks into a more interpretable and trustworthy system. This is especially relevant in scenarios where convolutional neural networks (CNNs) are employed for crucial tasks like industrial quality control, and understanding their decision-making process is vital for ensuring both safety and efficiency.

Key to this project is the creation of global explanations for trained neural network models through advanced concept extraction techniques. The research focuses into the development of novel algorithms like the Scale-Preserving Automatic Concept Extraction (SPACE) and Extracting Concepts with Local Aggregated Descriptors (ECLAD). These approaches are designed to not only identify but also accurately locate relevant concepts within images, a significant leap forward in understanding and validating CNNs’ predictions. The application of these techniques in Industry 4.0 scenarios has demonstrated their efficacy in providing clear, actionable insights into the CNNs’ decision mechanisms. The project showcases its utility across various industrial use cases, from material design and manufacturing to maintenance, proving that these advanced concept extraction methods can effectively bridge the gap between raw data processing and human-interpretable knowledge, thereby enhancing the credibility and applicability of machine learning in the industrial domain.

Andres Felipe Posada Moreno
Andres Felipe Posada Moreno
Postdoc Researcher in eXplainable Artificial Intelligence (XAI) and Foundation Models

My research interests include eXplainable Artificial Intelligence (XAI), Foundation Models, and applied AI.