
Published October 10, 2018 | Version v1
Conference paper Open

Towards Data-Driven Generation of Visualizations for Automatically Generated News Articles

  • 1. University Helsinki Institute for Information Technology


A feature news story is often accompanied by illustrations and other visuals, such as timelines, line charts, pie charts, or images. In this article, we present a largely data-driven and domain-independent approach for generating visualizations to accompany automatically generated news articles. We demonstrate the feasibility of our approach by applying it to statistical data on crime in Finland. The practical implementation demonstrates how the automatically generated visualizations add information and interactivity to the news articles. We further illustrate how the presented approach transfers readily to other domains with structured numerical datasets.



Additional details


NewsEye: A Digital Investigator for Historical Newspapers (grant 770299), European Commission

