Journal article Open Access

Public Perceptions on Organised Crime, Mafia, and Terrorism: A Big Data Analysis based on Twitter and Google Trends

Panos Kostakos

Jaishankar, K

Public perceptions enable crime and motivate government policy on law and order; however, there has been limited empirical research on serious crime perceptions in social media. Recently, open source data—and ‘big data’—have enabled researchers from different fields to develop cost-effective methods for opinion mining and sentiment analysis. Against this backdrop, the aim of this paper is to apply state-of-the-art tools and techniques for assembly and analysis of open source data. We set out to explore how non-discursive behavioural data can be used as a proxy for studying public perceptions of serious crime. The data collection focused on the following three conversational topics: organised crime, the mafia, and terrorism. Specifically, time series data of users’ online search habits (over a ten-year period) were gathered from Google Trends, and cross-sectional network data (N=178,513) were collected from Twitter. The collected data contained a significant amount of structure. Marked similarities and differences in people’s habits and perceptions were observable, and these were recorded. The results indicated that ‘big data’ is a cost-effective method for exploring theoretical and empirical issues vis-à-vis public perceptions of serious crime.

Files (1.6 MB)
Name Size
1.6 MB Download
  • Allum, F. (2006). Camorristi, politicians, and businessmen: the transformation of organized crime in post-war Naples. Leeds: Northern Universities Press. Allum, F. Longo, F., Irrera, D., & Kostakos, P. (eds). (2010). Defining and defying organized crime: discourse, perceptions and reality. London: Routledge. Anderson, A., Goel, S., Huber, G., Malhotra, N., & Watts, D. J. (2014). Political Ideology and Racial Preferences in Online Dating. Sociological Science, 1, 128-40. Andriani, P., & McKelvey, B. (2009). Perspective-from Gaussian to Paretian thinking: causes and implications of power laws in organizations. Organization Science, 20(6), 1053-1071. Arsovska, J. & Kostakos P. (2010). The social perception of organized crime in the Balkans: a world of diverging views? In F. Allum, F. Longo, D. Irrera, & P. Kostakos (Eds.), Defining and defying organized crime: discourse, perceptions and reality (pp. 113-131). London, Routledge. Arsovska, J., & Michilli, A. (2015). Perceptions of Ethnic Albanians in New York City and the Role of Stereotypes in Fostering Social Exclusion and Criminality. The European Review of Organised Crime, 2(1), 24-48. Bovenkerk, F. (1998). Organized crime and ethnic minorities: is there a link? Transnational Organized Crime, 4(3), 109-126. Bovenkerk, F. Siegel, D., & Zaitch, D. (2003). Organized crime and ethnic reputation manipulation. Crime, Law and Social Change, 39(1), 23-38. Bryman, A. (2012). Social research methods. Oxford: Oxford University Press. Cha, M., Kwak, H., Rodriguez, P., Yeol Ahnt, Y., & Moon, S. (2007). I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system. Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, 1-14. Chambliss, W. J. (1971). Vice, corruption, bureaucracy, and power. Wis. L. Rev, 4, 1150-1173. Chin, K. L. (2000). Chinatown gangs: Extortion, enterprise, and ethnicity. Oxford: Oxford University Press. Choi, H., & Varian, H. (2012). Predicting the Present with Google Trends. Economic Record, 88(s1), 2-9. Covington, S., & Bloom, B. (2003). Gendered justice: Women in the criminal justice system. In B. Bloom (Ed.). Gendered Justice: Addressing Female Offenders (pp. 3-23). Durham, NC: Carolina Academic Press. Daniele, V., & Marani, U. (2011). Organized crime, the quality of local institutions and FDI in Italy: A panel data analysis. European Journal of Political Economy, 27(1), 132-142. Daugherty, T. Eastin, M. S., Bright, L. F., & Chu S. C. (2011). Expectancy-Value: Identifying Relationships Associated with Consuming User-Generated Content. In Burns, N. M., Daugherty, T., & Eastin, M. (eds). Handbook of Research on Digital Media and Advertising: User-Generated Content Consumption (pp. 146-160). Hershey, PA, USA: IGI Global. Décary-Hétu, D., & Aldridg, J. (2015). Sifting through the Net: Monitoring of Online Offenders by Researchers. The European Review of Organised Crime, 2(2), 122-141. Eagle, N., Pentland, A., Lazer, D. (2009). Inferring friendship network structure by using mobile phone data. Proceedings of the National Academy of Sciences, 106(36), 15274-15278. Ferrer-i-Cancho, R., & Elvevåg, B. (2010). Random texts do not exhibit the real Zipf's law-like rank distribution. PLoS ONE, 5(3), e9411. Fond, G., Gaman, A., Brunel, L., Haffen, E., & Llorca, P. M. (2015). Google Trends®: Ready for real-time suicide prevention or just a Zeta-Jones effect? An exploratory study. Psychiatry Research, 228(3), 913-917. Kostakos, V., & Ferreira, D. (2015). The Rise of Ubiquitous Instrumentation. Frontiers in ICT, October 25. Retrieved from (accessed 20 October 2015). Gerber, M. S. (2014). Predicting crime using Twitter and kernel density estimation Decision Support Systems, 61, 115-125. Girardin, F., Calabrese, F., Fiore, F. D., Ratti, C., Blat, J. (2008). Digital footprinting: Uncovering tourists with user-generated content. Pervasive Computing, IEEE, 7(4), 78-85. Gottschalk, P. (2013). Limits to Corporate Social Responsibility: The Case of Gjensidige Insurance Company and Hells Angels Motorcycle Club. Corporate Reputation Review, 16(3), 177-186. Hand, E. (2011). Culturomics: Word play. Nature, 474(7352), 436-440. Hill, D. (2010). A critical mass of corruption: why some football leagues have more match-fixing than others. International Journal of Sports Marketing & Sponsorship, 11(3), 221-235. Howard, P. N., Duffy, A., Freelon, D., Hussain, M. M., Mari, W., Mazaid, M. (2011). Opening closed regimes: what was the role of social media during the Arab Spring? Project on Information Technology and Political Islam. Department of Communication, University of Washington. Ianni, F. (1974). Black Mafia: Ethnic succession in organized crime. New York: Simon and Schuster. Jerry, J., Steven, S. & Ralph, R. (2014). Assessing the success factors of organized crime groups: Intelligence challenges for strategic thinking. Policing: an international journal of police strategies & management, 37(1), 206-227. Kallus, N. (2014). Predicting crowd behavior with big public data. Proceedings of the companion publication of the 23rd international conference on World wide web companion, 625-30. Kleemans, E. R., & Van de Bunt, H. G. (1999). The social embeddedness of organized crime. Transnational Organized Crime, 5(1), 19-36. Kleemans, E. R., & Van de Bunt, H. G. (2008). Organised crime, occupations and opportunity. Global Crime, 9(3), 185-197. Kostakos, V., Juntunen, T., Goncalves, J., Hosio, S. and Ojala, T. (2013). Where am I? Location archetype keyword extraction from urban mobility patterns. PLoS ONE, 8(5), e63980. Kostakos, V., Nicolai, T., Yoneki, E., O'Neill, E., Kenn, H., & Crowcroft, J. (2009). Understanding and measuring the urban pervasive infrastructure. Personal and Ubiquitous Computing, 13(5), 355-364. Krumm, J., Davies, N., & Narayanaswami, C. (2008). User-generated content. IEEE Pervasive Computing, 4, 10-11. Liang, Y., Zheng, X., Zeng, D. D., Zhou, X., Leischow, S. J., & Chung, W. (2015). Characterizing Social Interaction in Tobacco-Oriented Social Networks: An Empirical Analysis. Scientific Reports, 5, Article number: 10060. Lieberman, E., Michel, J., Jackson, J., Tang, T., & Nowak, M. A. (2007). Quantifying the evolutionary dynamics of language. Nature, 449(7163), 713-716. Liu, Y., Kostakos, V., Li, H. (2015). Climatic Effects on Planning Behavior, PLoS ONE, 10(6), e0131954. Makin, D. A., & Morczek, A. L. (2015). The Dark Side of Internet Searches: A Macro Level Assessment of Rape Culture. International Journal of Cyber Criminology, 9(1), 1. McGlone, M. S. (2005). Contextomy: The art of quoting out of context. Media, Culture & Society, 27(4), 511-522. Mcillwain, J. S. (1997). From tong war to organized crime: revising the historical perception of violence in Chinatown. Justice Quarterly, 14(1), 25-52. Mendoza, A. A., (2015). Nociones de justicia, legalidad y legitimidad de las normas entre jóvenes de cincopaíses de América Latina. Sociedade e Estado, 30(1), 75-97. Michel, J-B, Shen, Y., Aiden, A., Veres, V. and Gray, M., The Google Books Team, Pickett, J., Hoiberg, D., Clancy, D., Norvig, P., Orwant, J., Pinker, S., Nowak, M., & Lieberman, A. E. (2011). Quantitative Analysis of Culture Using Millions of Digitized Books. Science, 14, 176-82. Newman, M. E. J. (2005). Power laws, Pareto distributions and Zipf's law. Contemporary Physics, 46(5), 323-351. Obgar, J. (1999). Slouching toward Bork: The Culture Wars and Self-Criticism in Hip-Hop Music. Journal of Black Studies, 30(2), 164-183. Ouimet, M., & Montmagny-Grenier, C. (2014). Homicide and Violence—International and Cross-National Research. The Construct Validity of the Results Generated by the World Homicide Survey. International Criminal Justice Review, 24(3), 222-234. Paoli, L. (2003). Mafia brotherhoods: Organized crime, Italian style. Oxford: Oxford University Press. Preis, T., Moat, H. S., & Stanley, H. E. (2013). Quantifying trading behavior in financial markets using Google Trends. Scientific reports, 3, 1684. Pruss, S. B. (2014). The German Medias Portrayal of Ethnic Organised Crime and Its Implications. The European Review of Organised Crime, 1(2), 97-118. Reynolds, D. (2011). Manipulating perceived risk to deter and disrupt counterfeiters. Journal of Financial Crime, 18(1), 105-118. Sakaki, T., Okazaki, M., & Matsuo, Y. (2010). Earthquake shakes Twitter users: real-time event detection by social sensors. Proceedings of the 19th international conference on World Wide Web, 851-60. Sarno, F. (2014). Italian mafias in Europe: between perception and reality. A comparison of press articles in Spain, Germany and the Netherlands. Trends in Organized Crime, 17(4), 313-341. Schneider, P. T., & Schneider, J.C. (2003). Reversible destiny: Mafia, antimafia, and the struggle for Palermo. California: University of California Press. Seifter, A., Schwarzwalder, A., Geis, K., & Aucott, J. (2010). The utility of Google Trends for epidemiological research: Lyme disease as an example. Geospatial health, 4(2), 135-137. Shen, A., Antonopoulos, G. A., & Papanicolaou, G. (2013). Chinas stolen children: internal child trafficking in the Peoples Republic of China. Trends in organized crime, 16(1), 31-48. Smith, M.A., Rainie, L., Shneiderman, B., & Himelboim, I. (2014). Mapping twitter topic networks: From polarized crowds to community clusters. Washington: Pew Research Center. Smith, D. (1975).The Mafia Mystique. London: Hutchinson. Louis, C., & Zorlu, G. (2012). Can Twitter predict disease outbreaks? BMJ, 344:e2353. Sullivan, D. (2013). Google still world's most popular search engine by far, but share of unique searchers dips slightly. Search Engine Land, February 11. Retrieved from Sung, H-E (2004). State failure, economic failure, and predatory organized crime: A comparative analysis. Journal of Research in Crime and Delinquency, 41(2), 111-129. Sutter, C. J., Webb, J. W., Kistruck, G. M., & Bailey, A. V. (2013). Entrepreneurs' responses to semi-formal illegitimate institutional arrangements. Journal of Business Venturing, 28(6), 743-758. Tilley, N., & Hopkins, M. (2008). Organized crime and local businesses. Criminology and Criminal Justice, 8(4), 443-459. Travaglino, G. A., Abrams, D., Randsley de Moura, G., & Russo, G. (2015). That is how we do it around here: Levels of identification, masculine honor, and social activism against organized crime in the south of Italy. European Journal of Social Psychology, 45(3), 342-348. United Nations (2015). United Nations News Centre - UN projects 40% of world will be online by year end, 4.4 billion will remain unconnected. October 26, 2015, Retrieved from Van Dijk, V. J. (2007). Mafia markers: assessing organized crime and its impact upon societies. Trends in organized crime, 10(4), 39-56. Van Dijk, J. J. M., Mayhew, P., & Killias, M. (1990). Experiences of crime across the world: key findings from the 1989 international crime survey. Boston: Kluwer Law and Taxation Publishers. Vosen, S., & Schmidt, T. (2011). Forecasting private consumption: survey-based indicators vs. Google trends. Journal of Forecasting, 30(6), 565-578. Wang, X., Brown, D. E., & Gerber, M. S. (2012). Spatio-temporal modeling of criminal incidents using geographic, demographic, and Twitter-derived information. 2012 IEEE International Conference on Intelligence and Security Informatics (ISI), 36-41. Williams, M., & Levi, M. (2012). Perceptions of the eCrime controllers: Modelling the influence of cooperation and data source factors. Security Journal, 28, 252–27. Young, A. B., & Allum, F. (2012). A comparative study of British and German press articles on organised crime (1999-2009). Crime, Law and Social Change, 58(2), 139-157. Zipf, G. K. (1949). Human behavior and the principle of least effort. Cambridge: Addison-Wesley Press. Farmer, D. (2013). Google Search scratches its brain 500 million times a day, Cnet, May 13, Retrieved from: Yamaguchi, Y., Takahashi, T., Amagasa, T., & Kitagawa, H. (2010). TURank: Twitter User Ranking Based on User-Tweet Graph Analysis. Web Information Systems Engineering – WISE 2010, 11th International Conference Proceedings, 240-253. Zubiaga, A., Spina, D., Martínez, R., & Fresno, V. (2014). Real-time classification of twitter trends. Journal of the Association for Information Science and Technology, 66(3), 462-473.

All versions This version
Views 444445
Downloads 253253
Data volume 401.5 MB401.5 MB
Unique views 421422
Unique downloads 232232


Cite as