An Empirical Study of the Effect of Noise Models on Centrality Metrics

Soumya Sarkar, Abhishek Karn, Animesh Mukherjee, Sanjukta Bhowmick

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

An important yet little studied problem in network analysis is the effect of the presence of errors in creating the networks. Errors can occur both due to the limitations of data collection techniques and the implicit bias during modeling the network. In both cases, they lead to changes in the network in the form of additional or missing edges, collectively termed as noise. Given that network analysis is used in many critical applications from criminal identification to targeted drug discovery, it is important to evaluate by how much the noise affects the analysis results. In this paper, we present an empirical study of how different types of noise affect real-world networks. Specifically, we apply four different noise models to a suite of nine networks, with different levels of perturbations to test how the ranking of the top-k centrality vertices changes. Our results show that deletion of edges has less effect on centrality than the addition of edges. Nevertheless, the stability of the ranking depends on all three parameters: the structure of the network, the type of noise model used, and the centrality metric to be computed. To the best of our knowledge, this is one of the first extensive studies to conduct both longitudinal (across different networks) and horizontal (across different noise models and centrality metrics) experiments to understand the effect of noise in network analysis.

Original languageEnglish (US)
Title of host publicationSpringer Proceedings in Complexity
PublisherSpringer
Pages3-21
Number of pages19
DOIs
StatePublished - Jan 1 2019
Externally publishedYes

Publication series

NameSpringer Proceedings in Complexity
ISSN (Print)2213-8684
ISSN (Electronic)2213-8692

Fingerprint

Centrality
Electric network analysis
Empirical Study
Metric
Network Analysis
Model
Ranking
Drug Discovery
Deletion
Experiments
Horizontal
Perturbation
Evaluate
Modeling
Experiment

Keywords

  • Accuracy of analysis
  • Centrality metrics
  • Noise models in networks

ASJC Scopus subject areas

  • Applied Mathematics
  • Modeling and Simulation
  • Computer Science Applications

Cite this

Sarkar, S., Karn, A., Mukherjee, A., & Bhowmick, S. (2019). An Empirical Study of the Effect of Noise Models on Centrality Metrics. In Springer Proceedings in Complexity (pp. 3-21). (Springer Proceedings in Complexity). Springer. https://doi.org/10.1007/978-3-030-14683-2_1

An Empirical Study of the Effect of Noise Models on Centrality Metrics. / Sarkar, Soumya; Karn, Abhishek; Mukherjee, Animesh; Bhowmick, Sanjukta.

Springer Proceedings in Complexity. Springer, 2019. p. 3-21 (Springer Proceedings in Complexity).

Research output: Chapter in Book/Report/Conference proceedingChapter

Sarkar, S, Karn, A, Mukherjee, A & Bhowmick, S 2019, An Empirical Study of the Effect of Noise Models on Centrality Metrics. in Springer Proceedings in Complexity. Springer Proceedings in Complexity, Springer, pp. 3-21. https://doi.org/10.1007/978-3-030-14683-2_1
Sarkar S, Karn A, Mukherjee A, Bhowmick S. An Empirical Study of the Effect of Noise Models on Centrality Metrics. In Springer Proceedings in Complexity. Springer. 2019. p. 3-21. (Springer Proceedings in Complexity). https://doi.org/10.1007/978-3-030-14683-2_1
Sarkar, Soumya ; Karn, Abhishek ; Mukherjee, Animesh ; Bhowmick, Sanjukta. / An Empirical Study of the Effect of Noise Models on Centrality Metrics. Springer Proceedings in Complexity. Springer, 2019. pp. 3-21 (Springer Proceedings in Complexity).
@inbook{db88c2715a5c44b9aadce59f2d4f3940,
title = "An Empirical Study of the Effect of Noise Models on Centrality Metrics",
abstract = "An important yet little studied problem in network analysis is the effect of the presence of errors in creating the networks. Errors can occur both due to the limitations of data collection techniques and the implicit bias during modeling the network. In both cases, they lead to changes in the network in the form of additional or missing edges, collectively termed as noise. Given that network analysis is used in many critical applications from criminal identification to targeted drug discovery, it is important to evaluate by how much the noise affects the analysis results. In this paper, we present an empirical study of how different types of noise affect real-world networks. Specifically, we apply four different noise models to a suite of nine networks, with different levels of perturbations to test how the ranking of the top-k centrality vertices changes. Our results show that deletion of edges has less effect on centrality than the addition of edges. Nevertheless, the stability of the ranking depends on all three parameters: the structure of the network, the type of noise model used, and the centrality metric to be computed. To the best of our knowledge, this is one of the first extensive studies to conduct both longitudinal (across different networks) and horizontal (across different noise models and centrality metrics) experiments to understand the effect of noise in network analysis.",
keywords = "Accuracy of analysis, Centrality metrics, Noise models in networks",
author = "Soumya Sarkar and Abhishek Karn and Animesh Mukherjee and Sanjukta Bhowmick",
year = "2019",
month = "1",
day = "1",
doi = "10.1007/978-3-030-14683-2_1",
language = "English (US)",
series = "Springer Proceedings in Complexity",
publisher = "Springer",
pages = "3--21",
booktitle = "Springer Proceedings in Complexity",

}

TY - CHAP

T1 - An Empirical Study of the Effect of Noise Models on Centrality Metrics

AU - Sarkar, Soumya

AU - Karn, Abhishek

AU - Mukherjee, Animesh

AU - Bhowmick, Sanjukta

PY - 2019/1/1

Y1 - 2019/1/1

N2 - An important yet little studied problem in network analysis is the effect of the presence of errors in creating the networks. Errors can occur both due to the limitations of data collection techniques and the implicit bias during modeling the network. In both cases, they lead to changes in the network in the form of additional or missing edges, collectively termed as noise. Given that network analysis is used in many critical applications from criminal identification to targeted drug discovery, it is important to evaluate by how much the noise affects the analysis results. In this paper, we present an empirical study of how different types of noise affect real-world networks. Specifically, we apply four different noise models to a suite of nine networks, with different levels of perturbations to test how the ranking of the top-k centrality vertices changes. Our results show that deletion of edges has less effect on centrality than the addition of edges. Nevertheless, the stability of the ranking depends on all three parameters: the structure of the network, the type of noise model used, and the centrality metric to be computed. To the best of our knowledge, this is one of the first extensive studies to conduct both longitudinal (across different networks) and horizontal (across different noise models and centrality metrics) experiments to understand the effect of noise in network analysis.

AB - An important yet little studied problem in network analysis is the effect of the presence of errors in creating the networks. Errors can occur both due to the limitations of data collection techniques and the implicit bias during modeling the network. In both cases, they lead to changes in the network in the form of additional or missing edges, collectively termed as noise. Given that network analysis is used in many critical applications from criminal identification to targeted drug discovery, it is important to evaluate by how much the noise affects the analysis results. In this paper, we present an empirical study of how different types of noise affect real-world networks. Specifically, we apply four different noise models to a suite of nine networks, with different levels of perturbations to test how the ranking of the top-k centrality vertices changes. Our results show that deletion of edges has less effect on centrality than the addition of edges. Nevertheless, the stability of the ranking depends on all three parameters: the structure of the network, the type of noise model used, and the centrality metric to be computed. To the best of our knowledge, this is one of the first extensive studies to conduct both longitudinal (across different networks) and horizontal (across different noise models and centrality metrics) experiments to understand the effect of noise in network analysis.

KW - Accuracy of analysis

KW - Centrality metrics

KW - Noise models in networks

UR - http://www.scopus.com/inward/record.url?scp=85065828640&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85065828640&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-14683-2_1

DO - 10.1007/978-3-030-14683-2_1

M3 - Chapter

T3 - Springer Proceedings in Complexity

SP - 3

EP - 21

BT - Springer Proceedings in Complexity

PB - Springer

ER -