Spectrum-Based Statistical Methods for Directed Graphs with Applications in Biological Data

  • Victor Chavauty Villela USP
  • Eduardo Silva Lira USP
  • André Fujita USP

Resumo


Graphs often model complex phenomena in diverse fields, such as social networks, connectivity among brain regions, or protein-protein interactions. However, standard computational methods are insufficient for empirical network analysis due to randomness. Thus, a natural solution would be the use of statistical approaches. A recent paper by Takahashi et al. suggested that the graph spectrum is a good fingerprint of the graph’s structure. They developed several statistical methods based on this feature. These methods, however, rely on the distribution of the eigenvalues of the graph being real-valued, which is false when graphs are directed. In this paper, we extend their results to directed graphs by analyzing the distribution of complex eigenvalues instead. We show the strength of our methods by performing simulations on artificially generated groups of graphs and finally show a proof of concept using concrete biological data obtained by Project Tycho.

Palavras-chave: Network Correlation, Graph Statistics, EcoG

Referências

Alon, U.: An Introduction to Systems Biology: Design Principles of Biological Circuits. Chapman & Hall/CRC Mathematical and Computational Biology (2006)

Barabasi, A.L., Oltvai, Z.N.: Network biology: understanding the cell’s functional organization. Nat. Rev. Genet. 5, 101–113 (2004)

Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science 286, 509–512 (1999)

Bullmore, E., Sporns, O.: Complex brain networks: graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 10, 186–198 (2009)

Csardi, G., Nepusz, T.: The igraph software package for complex network research. Interjournal Complex Systems, 1695 (2006)

Duong, T.: ks: Kernel density estimation and kernel discriminant analysis for multivariate data in r. Journal of Statistical Software 21(7), 1–16 (2007)

Erdős, P., Rényi, A.: On random graphs i. Publ. Math. Debrecen 6, 290–297 (1959)

Freeman, L.: A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977)

Fujita, A., C.vidal, M., Y.Takahashi, D.: A statistical method to distinguish functional brain networks. Frontiers in Neuroscience 11 (2017)

Fujita, A., Silva Lira, E., De Siqueira Santos, S.: A semi-parametric statistical test to compare complex networks. Journal of Complex Networks 8 (2020)

Fujita, A., Takahashi, D.Y., Balardin, J.B., Vidal, M.C., Sato, J.R.: Correlation between graphs with an application to brain network analysis. Computational Statistics & Data Analysis 109, 76–92 (2017)

Lees-miller, J., Hammersley, J., Wilson, R., Yasumasa Takahashi, D., Fujita, A., Bisol Balardin, J., Calebe Vidal, M., Ricardo Sato, J.: Correlation between graphs with an application to brain network analysis. Computational Statistics & Data Analysis 109, 76–92 (2017)

MacKay, D.J.: Information Theory, Inference, and Learning Algorithms. Cambridge University Press, first edn. (2003)

Ramos, T.C., Mourão-Miranda, J., Fujita, A.: Spectral density-based clustering algorithms for complex networks. Frontiers in Neuroscience 17 (2023)

Ribeiro, A., Vidal, M., Sato, J., Fujita, A.: Granger causality among graphs and application to functional brain connectivity in autism spectrum disorder. Entropy 23, 1204 (2021)

Sameshima, K., Baccala, L.: Methods in brain connectivity inference through multivariate time series analysis (2016)

Santos, S.S., Fujita, A.: statGraph: statistical methods for graphs (2017). https://www.cran.r-project.org/package=statGraph

Scott, J.: Social Network Analysis. Sage, Newcastle upon Tyne (2012)

Watts, D., Strogatz, S.: Collective dynamics of ’small-world’ networks. Nature 393, 440–442 (1998)

Yanagawa, T., Chao, Z.C., Hasegawa, N., Fujii, N.: Large-scale information flow in conscious and unconscious states: an ecog study in monkeys. PloS one 8(11), e80845 (2013)
Publicado
13/06/2023
VILLELA, Victor Chavauty; LIRA, Eduardo Silva; FUJITA, André. Spectrum-Based Statistical Methods for Directed Graphs with Applications in Biological Data. In: SIMPÓSIO BRASILEIRO DE BIOINFORMÁTICA (BSB), 16. , 2023, Curitiba/PR. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 46-57. ISSN 2316-1248.