Sketch-based Community Detection via Representative Node Sampling

التفاصيل البيبلوغرافية
العنوان: Sketch-based Community Detection via Representative Node Sampling
المؤلفون: Andre Beckus, Mahlagha Sedghi, George K. Atia
المصدر: ICPR
بيانات النشر: IEEE, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Degree (graph theory), business.industry, Computer science, Complete graph, Sampling (statistics), 020206 networking & telecommunications, 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, Sketch, Stochastic block model, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), Artificial intelligence, Data mining, Cluster analysis, business, computer, 0105 earth and related environmental sciences, Clustering coefficient
الوصف: This paper proposes a sketch-based approach to the community detection problem which clusters the full graph through the use of an informative and concise sketch. The reduced sketch is built through an effective sampling approach which selects few nodes that best represent the complete graph and operates on a pairwise node similarity measure based on the average commute time. After sampling, the proposed algorithm clusters the nodes in the sketch, and then infers the cluster membership of the remaining nodes in the full graph based on their aggregate similarity to nodes in the partitioned sketch. By sampling nodes with strong representation power, our approach can improve the success rates over full graph clustering. In challenging cases with large node degree variation, our approach not only maintains competitive accuracy with full graph clustering despite using a small sketch, but also outperforms existing sampling methods. The use of a small sketch allows considerable storage savings, and computational and timing improvements for further analysis such as clustering and visualization. We provide numerical results on synthetic data based on the homogeneous, heterogeneous and degree corrected versions of the stochastic block model, as well as experimental results on real-world data.
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::941ab1083d05b953cf4b7865d2c5d807
https://doi.org/10.1109/icpr48806.2021.9413202
حقوق: CLOSED
رقم الأكسشن: edsair.doi...........941ab1083d05b953cf4b7865d2c5d807
قاعدة البيانات: OpenAIRE