Explainability in Graph Data Science: Interpretability, replicability, and reproducibility of community detection

By: Selin Aviyente and Abdullah Karaaslanli

In many modern data science problems, data are represented by a graph (network), e.g., social, biological, and communication networks. Over the past decade, numerous signal processing and machine learning (ML) algorithms have been introduced for analyzing graph structured data. With the growth of interest in graphs and graph-based learning tasks in a variety of applications, there is a need to explore explainability in graph data science. In this article, we aim to approach the issue of explainable graph data science, focusing on one of the most fundamental learning tasks, community detection, as it is usually the first step in extracting information from graphs. A community is a dense subnetwork within a larger network that corresponds to a specific function. Despite the success of different community detection methods on synthetic networks with strong modular structure, much remains unknown about the quality and significance of the outputs of these algorithms when applied to real-world networks with unknown modular structure. Inspired by recent advances in explainable artificial intelligence (AI) and ML, in this article, we present methods and metrics from network science to quantify three different aspects of explainability, i.e., interpretability, replicability, and reproducibility, in the context of community detection.
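As a minimal illustration of the evaluation setting described above, the sketch below, assuming Python with the networkx and scikit-learn packages, generates a synthetic network with planted, strongly modular communities and scores a detected partition against the known ground truth; the planted partition generator, the greedy modularity algorithm, and the normalized mutual information score are illustrative choices rather than methods prescribed here.

    # Illustrative sketch: evaluating community detection on a synthetic
    # network whose ground-truth community structure is known.
    import networkx as nx
    from networkx.algorithms import community
    from sklearn.metrics import normalized_mutual_info_score

    # Planted partition graph: 4 groups of 32 nodes, dense within groups,
    # sparse between groups (a strong modular structure).
    G = nx.planted_partition_graph(l=4, k=32, p_in=0.3, p_out=0.02, seed=42)

    # Nodes are numbered consecutively within each planted group,
    # so the ground-truth label of node v is v // 32.
    true_labels = [v // 32 for v in G.nodes()]

    # Detect communities with greedy modularity maximization
    # (one of many possible algorithms).
    detected = community.greedy_modularity_communities(G)
    pred_labels = [0] * G.number_of_nodes()
    for c_id, nodes in enumerate(detected):
        for v in nodes:
            pred_labels[v] = c_id

    # Agreement with ground truth; close to 1 for strongly modular graphs.
    print("NMI:", normalized_mutual_info_score(true_labels, pred_labels))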

Introduction

Modern data analysis involves large sets of structured data, where the structure carries critical information about the nature of the data. Graphs are typically used as mathematical tools to describe such structure. Graphs are ubiquitous in the real world, representing objects and their relationships in varied domains, including social networks, e-commerce networks, biological networks, traffic networks, and brain networks [1]. As a result, numerous signal processing and ML tasks have been extended to graph-structured data, e.g., graph signal processing (GSP), graph topology inference, node classification, link prediction, community detection, and supervised learning with graphs [2]. Among these tasks, community detection is fundamental for uncovering links between structure and function in complex networks. The community detection problem is challenging, in part, because it is not well posed. For this reason, researchers have proposed a variety of definitions of what constitutes a community and an array of algorithms corresponding to these definitions [3]. While the success of these algorithms has been quantified for synthetic networks with ground truth information, it is much harder to evaluate the accuracy, significance, and meaning of the community structure obtained for real networks. For these results to be useful in a variety of scientific and technological studies, the community detection algorithms and their outputs need to be made transparent.
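As a small illustration of how different community definitions can lead to different results on the same data, the sketch below, assuming Python with networkx, applies two algorithms built on different notions of a community (greedy modularity maximization and label propagation, chosen here purely for illustration) to a classic benchmark social network and reports the number of communities and the modularity of each partition.

    # Illustrative sketch: two community detection methods applied to the
    # same real network can return noticeably different partitions.
    import networkx as nx
    from networkx.algorithms import community

    G = nx.karate_club_graph()  # classic small social network, 34 nodes

    # Two algorithms built on different notions of what a community is.
    part_modularity = list(community.greedy_modularity_communities(G))
    part_labelprop = list(community.label_propagation_communities(G))

    print("greedy modularity:", len(part_modularity), "communities,",
          "Q =", round(community.modularity(G, part_modularity), 3))
    print("label propagation:", len(part_labelprop), "communities,",
          "Q =", round(community.modularity(G, part_labelprop), 3))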

Over the past decade, the explainability of data-driven methods, e.g., AI and ML, has been a focus of research in the ML and data mining communities. While the ML community has mostly focused on describing how black boxes work, the data mining community is more interested in explaining the decisions of such models, even without understanding how the opaque decision systems work internally. Recent survey articles on the topic offer a multitude of terminologies, such as interpretability, accountability, responsibility, transparency, comprehensibility, accuracy, and understandability, to evaluate different dimensions of explainability [4], [5]. Along with these different terminologies, a variety of methods, including black-box input–output analysis, sensitivity analysis, saliency maps, attention heat maps, and approximation of the predictions using simple proxy models, have been introduced [4].
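As an example of the last of these techniques, the sketch below, assuming Python with scikit-learn, approximates the predictions of a black-box classifier with a shallow decision tree acting as a simple proxy model; the specific models and the synthetic data set are illustrative choices only.

    # Illustrative sketch: approximating a black-box model's predictions
    # with a simple, interpretable proxy model.
    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.tree import DecisionTreeClassifier, export_text
    from sklearn.metrics import accuracy_score

    X, y = make_classification(n_samples=1000, n_features=8, random_state=0)

    # "Black box": an ensemble whose internal logic is hard to read directly.
    black_box = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

    # Proxy: a shallow tree trained to mimic the black box's outputs
    # (not the true labels), so its rules describe the black box's behavior.
    proxy = DecisionTreeClassifier(max_depth=3, random_state=0)
    proxy.fit(X, black_box.predict(X))

    # Fidelity: how closely the proxy reproduces the black-box predictions.
    print("fidelity:", accuracy_score(black_box.predict(X), proxy.predict(X)))
    print(export_text(proxy))  # human-readable rules of the proxy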
