Community Detection in Who-Calls-Whom Social Networks
Authors/Creators
- 1. Computer Science and Engineering Department, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
- 2. BioSense Institute, University of Novi Sad, Novi Sad, Serbia
- 3. Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
Description
Mobile phone service providers collect large volumes of data
all over the globe. Taking into account that significant information is
recorded in these datasets, there is a great potential for knowledge discov-
ery. Since the processing pipeline contains several important steps, like
data preparation, transformation, knowledge discovery, a holistic app-
roach is required in order to avoid costly ETL operations across different
heterogeneous systems. In this work, we present a design and implemen-
tation of knowledge discovery from CDR mobile phone data, using the
Apache Spark distributed engine. We focus on the community detec-
tion problem which is extremely challenging and it has many practical
applications. We have used Apache Spark with the Louvain community
detection algorithm using a cluster of machines, to study the scalability
and efficiency of the proposed methodology. The experimental evaluation
is based on real-world mobile phone data.