InfoFlow: A Distributed Algorithm to Detect Communities According to the Map Equation

Park K. Fung

Resumen

Formidably sized networks are becoming more and more common, including in social sciences, biology, neuroscience, and the technology space. Many network sizes are expected to challenge the storage capability of a single physical computer. Here, we take two approaches to handle big networks: first, we look at how big data technology and distributed computing is an exciting approach to big data storage and processing. Second, most networks can be partitioned or labeled into communities, clusters, or modules, thus capturing the crux of the network while reducing detailed information, through the class of algorithms known as community detection. In this paper, we combine these two approaches, developing a distributed community detection algorithm to handle big networks. In particular, the map equation provides a way to identify network communities according to the information flow between nodes, where InfoMap is a greedy algorithm that uses the map equation. We develop discrete mathematics to adapt InfoMap into a distributed computing framework and then further develop the mathematics for a greedy algorithm, InfoFlow, which has logarithmic time complexity, compared to the linear complexity in InfoMap. Benchmark results of graphs up to millions of nodes and hundreds of millions of edges confirm the time complexity improvement, while maintaining community accuracy. Thus, we develop a map equation based community detection algorithm suitable for big network data processing.

Palabras claves

graph - community detection - big data

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 3 Parte: 3 (2019)

MATERIAS

INFRAESTRUCTURA

REVISTAS SIMILARES

Future Internet
Water
ISPRS International Journal of Geo-Information

DOI

https://doi.org/10.3390/bdcc3030042

Artículos similares

Study on Spatio-Temporal Indexing Model of Geohazard Monitoring Data Based on Data Stream Clustering Algorithm

Acceso

Jiahao Li, Weiwei Song, Jianglong Chen, Qunlan Wei and Jinxia Wang

Yunnan Province, residing in the eastern segment of the Qinghai?Tibet Plateau and the western part of the Yunnan?Guizhou Plateau, faces significant challenges due to its intricate geological structures and frequent geohazards. These pose monumental risks... ver más

Revista: ISPRS International Journal of Geo-Information

Best BiCubic Method to Compute the Planimetric Misregistration between Images with Sub-Pixel Accuracy: Application to Digital Elevation Models

Acceso

Serge Riazanoff, Axel Corseaux, Clément Albinet, Peter A. Strobl, Carlos López-Vázquez, Peter L. Guth and Takeo Tadono

In recent decades, an important number of regional and global digital elevation models (DEMs) have been released publicly. As a consequence, researchers need to choose between several of these models to perform their studies and to use these DEMs as thir... ver más

Revista: ISPRS International Journal of Geo-Information

Onboard Distributed Trajectory Planning through Intelligent Search for Multi-UAV Cooperative Flight

Acceso

Kunfeng Lu, Ruiguang Hu, Zheng Yao and Huixia Wang

Trajectory planning and obstacle avoidance play essential roles in the cooperative flight of multiple unmanned aerial vehicles (UAVs). In this paper, a unified framework for onboard distributed trajectory planning is proposed, which takes full advantage ... ver más

Revista: Drones

Distributed Average Consensus Algorithms in d-Regular Bipartite Graphs: Comparative Study

Acceso

Martin Kenyeres and Jozef Kenyeres

Consensus-based data aggregation in d-regular bipartite graphs poses a challenging task for the scientific community since some of these algorithms diverge in this critical graph topology. Nevertheless, one can see a lack of scientific studies dealing wi... ver más

Revista: Future Internet

An Improved Spanning Tree-Based Algorithm for Coverage of Large Areas Using Multi-UAV Systems

Acceso

Jan Chleboun, Thulio Amorim, Ana Maria Nascimento and Tiago P. Nascimento

In this work, we propose an improved artificially weighted spanning tree coverage (IAWSTC) algorithm for distributed coverage path planning of multiple flying robots. The proposed approach is suitable for environment exploration in cluttered regions, whe... ver más

Revista: Drones

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas