Distributed Generation of Billion-node Social Graphs with Overlapping Community Structure.
In social community detection, graphs with a reference community structure are commonly used to evaluate accuracy. This paper introduces a method for generating large random social graphs with realistic community structure. The resulting graphs exhibit several recently discovered properties of social community structure that run counter to conventional wisdom: dense community overlaps, superlinear growth of the number of edges inside a community with its size, and a power-law distribution of user-community memberships. Further, the method is distributable by design and showed near-linear scalability in the Amazon EC2 cloud using an Apache Spark implementation.
5th Workshop on Complex Networks, CompleNet 2014, Bologna, Italy. Studies in Computational Intelligence Volume 549, 2014, pp. 199-208.
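
The properties listed in the abstract can be illustrated with a small single-machine sketch. It is not the paper's algorithm or its Spark implementation. It is a toy affiliation-style generator under assumed choices: each user draws a number of community memberships from a truncated power law, and each pair of members inside a community of size s is linked with probability min(1, alpha / s^gamma) with gamma < 1, so the expected number of internal edges grows roughly as s^(2 - gamma), which is superlinear in s. The function names, parameters (`membership_exponent`, `max_memberships`, `alpha`, `gamma`) and their default values are hypothetical.

```python
import itertools
import random


def sample_power_law(rng, exponent, lo, hi):
    """Draw an integer k in [lo, hi] with P(k) proportional to k ** -exponent."""
    values = list(range(lo, hi + 1))
    weights = [k ** -exponent for k in values]
    return rng.choices(values, weights=weights, k=1)[0]


def generate_graph(n_users, n_communities, seed=0,
                   membership_exponent=2.5, max_memberships=5,
                   alpha=2.0, gamma=0.5):
    """Toy generator; all parameters are illustrative assumptions."""
    rng = random.Random(seed)

    # Step 1: user-community memberships with a power-law count per user.
    communities = [set() for _ in range(n_communities)]
    upper = min(max_memberships, n_communities)
    for user in range(n_users):
        k = sample_power_law(rng, membership_exponent, 1, upper)
        for c in rng.sample(range(n_communities), k):
            communities[c].add(user)

    # Step 2: edges inside each community. Communities are processed
    # independently of one another, so this loop could be split across workers.
    edges = set()
    for members in communities:
        size = len(members)
        if size < 2:
            continue
        # With gamma < 1 the expected edge count p * size * (size - 1) / 2
        # grows faster than linearly in the community size.
        p = min(1.0, alpha / size ** gamma)
        for u, v in itertools.combinations(sorted(members), 2):
            if rng.random() < p:
                edges.add((u, v))
    return communities, edges


if __name__ == "__main__":
    communities, edges = generate_graph(n_users=1000, n_communities=50, seed=42)
    print(len(edges), "edges across", len(communities), "communities")
```

Because users may belong to several communities, overlaps arise directly from step 1, and the returned `communities` list serves as the reference structure against which a detection algorithm's output could be compared.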