VisStoryMaker: supporting non-expert analysts in visually exploring datasets and communicating insights with visual annotations and data stories
Resumo
Due to data production and availability growth, professionals in several disciplines have been facing an increasing need to explore and understand data, obtain insights, and communicate them effectively. Many visualization systems have been developed commercially and within the research community to support non-expert analysts. We can consider at least three challenges these tools aim to face: support the selection of appropriate visualizations and decide on the visual mappings, extract and communicate factual information from the visualizations, and use visualizations in data-rich narratives. In response to these challenges, we developed VisStoryMaker, a visualization tool that supports both exploration and communication about data. To aid users in exploring and understanding data, VisStoryMaker recommends visualizations through system-generated questions and data facts. To support communicating about data, the system recommends visual annotations of data facts and provides a story-building module, allowing analysts to use the generated charts and facts as a blueprint for a data story. We have conducted empirical studies to compare VisStoryMaker's features with existing applications: chart recommendations with Voyager~2, storytelling construction with Flourish, and data facts and chart annotations with Tableau. Our findings indicate that the system-generated questions and data facts supported non-expert analysts in exploratory analysis. They perceived visual data facts annotations as useful and supported them in raising hypotheses about the data, understanding data, and leading to insights, thus enhancing data analysis. Participants perceived the visual annotations and StoryMaker as helpful in organizing the system-generated pieces of information and incorporating them into comprehensive narratives and presentations.
Referências
S Arevalo Arboleda and A Dewan. 2016. Unveiling storytelling and visualization of data. 14th SC@ RUG 2016-2017 2017 (2016), 38–42.
AE Bartz. 1999. Basic statistical concepts (4th ed.). Upper Saddle River, NJ: Merrill, New York, NY, USA.
Scott Bateman, Regan L. Mandryk, Carl Gutwin, Aaron Genest, David McDine, and Christopher Brooks. 2010. Useful junk? the effects of visual embellishment on comprehension and memorability of charts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI ’10). Association for Computing Machinery, New York, NY, USA, 2573–2582. DOI: 10.1145/1753326.1753716
Virginia Braun and Victoria Clarke. 2012. Thematic analysis. In APA handbook of research methods in psychology, Vol 2: Research designs: Quantitative, qualitative, neuropsychological, and biological., Harris Cooper, Paul M. Camic, Debra L. Long, A. T. Panter, David Rindskopf, and Kenneth J. Sher (Eds.). American Psychological Association, Washington, 57–71. DOI: 10.1037/13620004 tex.ids= BraunClarke2012ThematicAnalysis.
Alberto Cairo. 2012. The Functional Art: An introduction to information graphics and visualization. New Riders, California.
Joseph Campbell. 2008. The hero with a thousand faces. Vol. 17. New World Library, USA.
Stuart Card, Jock Mackinlay, and Ben Shneiderman. 1999. Readings in Information Visualization: Using Vision to Think. Morgan Kaufmann, San Francisco, CA, USA.
Stephen M. Casner. 1991. Task analytic approach to the automated design of graphic presentations. ACM Trans. on Graphics 10, 2 (April 1991), 111–151. DOI: 10.1145/108360.108361
Roger Clark. 2017. Convenience Sample. In The Blackwell Encyclopedia of Sociology. JohnWiley & Sons, Ltd, USA, 1–2. DOI: 10.1002/9781405165518.wbeosc131.pub2
William S. Cleveland and Robert McGill. 1984. Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods. J. Amer. Statist. Assoc. v. 79 , n. 387 (Sept. 1984), p. 531–554. DOI: 10.1080/01621459.1984.10478080 Publisher: Taylor & Francis.
Chris Crawford. 2004. Interactive Storytelling. In The Video Game Theory Reader. Routledge, New York, NY, USA. Num Pages: 15.
Zhe Cui, Sriram Karthik Badam, M Adil Yalçin, and Niklas Elmqvist. 2019. DataSite: Proactive visual data exploration with computation of insight-based recommendations. Information Visualization 18, 2 (April 2019), 251–267. DOI: 10.1177/1473871618806555 Publisher: SAGE Publications.
Edward E. Cureton. 1956. Rank-biserial correlation. Psychometrika 3 (1956), 287– 290. DOI: 10.1007/BF02289138 Place: Germany Publisher: Springer.
Fred D. Davis. 1989. Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology. MIS Quarterly 13, 3 (1989), 319–340. DOI: 10.2307/249008 Publisher: Management Information Systems Research Center, University of Minnesota.
Taissa Abdalla Filgueiras de Sousa and Simone Diniz Junqueira Barbosa. 2014. Recommender system to support chart constructions with statistical data. In International Conference on Human-Computer Interaction. Springer, Springer International Publishing, Cham, 631–642.
Victor Dibia and Cagatay Demiralp. 2019. Data2Vis: Automatic Generation of Data Visualizations Using Sequence-to-Sequence Recurrent Neural Networks. IEEE Computer Graphics and Applications 39, 5 (Sept. 2019), 33–46. DOI: 10.1109/MCG.2019.2924636
G. Dove and S. Jones. 2012. Narrative Visualization: Sharing Insights into Complex Data. In Interfaces and Human Computer Interaction (IHCI 2012). Paper presented at the Interfaces and Human Computer Interaction (IHCI 2012), Portugal, 21–23. [link]
Micheline Elias, Marie-Aude Aufaure, and Anastasia Bezerianos. 2013. Storytelling in Visual Analytics Tools for Business Intelligence. In Human-Computer Interaction – INTERACT 2013, Paula Kotzé, Gary Marsden, Gitte Lindgaard, Janet Wesson, and Marco Winckler (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 280–297.
Stephen Few. 2009. Now You See it: Simple Visualization Techniques for Quantitative Analysis. Analytics Press, Oakland,CA,USA.
Ana Figueiras. 2014. How to Tell Stories Using Visualization. In 2014 18th International Conference on Information Visualisation. IEEE Institute of Electrical and Electronics Engineers, Paris, France, 18–18. DOI: 10.1109/IV.2014.78
Ana Raquel de Ponte Figueiras. 2016. How to tell stories using visualization: strategies towards Narrative Visualization. In 1st Joint Conference and Exhibition Fostering Science & Innovation Ecosystems. ProQuest Dissertations Publishing, Lisboa, Portugal, 18.
Nahum Gershon and Ward Page. 2001. What storytelling can do for information visualization. Commun. ACM 44, 8 (2001), 31–37.
David Gotz and Zhen Wen. 2009. Behavior-driven visualization recommendation. In Proc. of Intelligent User Interfaces (New York, USA) (IUI ’09). ACM, New York, NY, USA, 315–324. DOI: 10.1145/1502650.1502695
Lars Grammel, Melanie Tory, and Margaret-Anne Storey. 2010. How Information Visualization Novices Construct Visualizations. IEEE Trans. on Visualization and Computer Graphics v. 16 , n. 6 (Nov. 2010), p. 943–952. DOI: 10.1109/TVCG.2010.164
Jeffrey Heer and Ben Shneiderman. 2012. Interactive Dynamics for Visual Analysis. Queue v. 10 , n. 2 (Feb. 2012), p. 30–55. DOI: 10.1145/2133416.2146416.
Eric Horvitz. 1999. Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems (New York, NY, USA, 1999-05-01) (CHI ’99). Association for Computing Machinery, New York, NY, USA, 159–166. DOI: 10.1145/302979.303030
Kevin Hu, Michiel A. Bakker, Stephen Li, Tim Kraska, and César Hidalgo. 2019. VizML: A Machine Learning Approach to Visualization Recommendation. In Proc. CHI Conf. on Human Factors in Computing Systems. ACM, New York, USA, 1–12. DOI: 10.1145/3290605.3300358
Kevin Hu, Diana Orghian, and César Hidalgo. 2018. DIVE: A Mixed-Initiative System Supporting Integrated Data Exploration Workflows. In Proceedings of the Workshop on Human-In-the-Loop Data Analytics (HILDA’18). Association for Computing Machinery, New York, NY, USA, 1–7. DOI: 10.1145/3209900.3209910
Jessica Hullman and Nick Diakopoulos. 2011. Visualization Rhetoric: Framing Effects in Narrative Visualization. IEEE Trans. on Visualization and Computer Graphics v. 17 , n. 12 (Dec. 2011), p. 2231–2240. DOI: 10.1109/TVCG.2011.255
Jessica Hullman, Nicholas Diakopoulos, and Eytan Adar. 2013. Contextifier: automatic generation of annotated stock visualizations. In Proc. SIGCHI Conf. on Human Factors in Computing Systems. ACM, New York, USA, 2707–2716. DOI: 10.1145/2470654.2481374
Hwiyeon Kim, Juyoung Oh, Yunha Han, Sungahn Ko, Matthew Brehmer, and Bum Chul Kwon. 2019. Thumbnails for Data Stories: A Survey of Current Practices. In 2019 IEEE Visualization Conference (VIS). IEEE, Vancouver, BC, Canada, 116–120. DOI: 10.1109/VISUAL.2019.8933773
Younghoon Kim and Jeffrey Heer. 2018. Assessing Effects of Task and Data Distribution on the Effectiveness of Visual Encodings. Computer Graphics Forum v. 37 , n. 3 (2018), p. 157–167. DOI: 10.1111/cgf.13409
Andy Kirk. 2016. Data Visualisation: A Handbook for Data Driven Design. SAGE, UK.
Cole Nussbaumer Knaflic. 2019. Storytelling with Data: Let’s Practice! John Wiley & Sons, USA.
N. Kong and M. Agrawala. 2012. Graphical Overlays: Using Layered Elements to Aid Chart Reading. IEEE Transactions on Visualization and Computer Graphics 18, 12 (2012), 2631–2638. DOI: 10.1109/TVCG.2012.229
Robert Kosara and Jock Mackinlay. 2013. Storytelling: The next step for visualization. Computer 46, 5 (2013), 44–50.
Jonathan Lazar, Jinjuan Heidi Feng, and Harry Hochheiser. 2017. Research Methods in Human-Computer Interaction, Second Edition (2 edition ed.). Morgan Kaufmann, Cambridge, MA.
Bongshin Lee, Nathalie Henry Riche, Petra Isenberg, and Sheelagh Carpendale. 2015. More than telling a story: Transforming data into visually shared stories. IEEE computer graphics and applications 35, 5 (2015), 84–90.
Doris Jung-Lin Lee, Vidya Setlur, Melanie Tory, Karrie Karahalios, and Aditya Parameswaran. 2022. Deconstructing Categorization in Visualization Recommendation: A Taxonomy and Comparative Study. , 4225-4239 pages. DOI: 10.1109/TVCG.2021.3085751
Raul de Araújo Lima and Simone Diniz Junqueira Barbosa. 2020. A QuestionOriented Visualization Recommendation Approach for Data Exploration. In Proceedings of the International Conference on Advanced Visual Interfaces (Salerno, Italy) (AVI ’20). Association for Computing Machinery, New York, NY, USA, Article 43, 5 pages. DOI: 10.1145/3399715.3399849
Y. Luo, X. Qin, C. Chai, N. Tang, G. Li, and W. Li. 2020. Steerable Self-driving Data Visualization. IEEE Trans. on Knowledge and Data Engineering 34, 1 (April 2020), 475—-490. DOI: 10.1109/TKDE.2020.2981464
Kwan-Liu Ma, Isaac Liao, Jennifer Frazier, Helwig Hauser, and Helen-Nicole Kostis. 2011. Scientific storytelling using visualization. IEEE Computer Graphics and Applications 32, 1 (2011), 12–19.
Jock Mackinlay. 1986. Automating the Design of Graphical Presentations of Relational Information. ACM Trans. Graph. 5, 2 (April 1986), 110–141. DOI: 10.1145/22949.22950
Jock Mackinlay, Pat Hanrahan, and Chris Stolte. 2007. Show Me: Automatic Presentation for Visual Analysis. IEEE Trans. on Visualization and Computer Graphics 13, 6 (Nov. 2007), 1137–1144. DOI: 10.1109/TVCG.2007.70594.
Sean McKenna, Nathalie Henry Riche, Bongshin Lee, Jeremy Boy, and Miriah Meyer. 2017. Visual narrative flow: Exploring factors shaping data visualization story reading experiences. Computer Graphics Forum 36, 3 (2017), 377–387.
Dominik Moritz, Chenglong Wang, Greg L. Nelson, Halden Lin, Adam M. Smith, Bill Howe, and Jeffrey Heer. 2019. Formalizing Visualization Design Knowledge as Constraints: Actionable and Extensible Models in Draco. IEEE Trans. on Visualization and Computer Graphics 25, 1 (Jan. 2019), 438–448. DOI: 10.1109/TVCG.2018.2865240
Tamara Munzner. 2014. Visualization Analysis and Design. CRC Press, Boca , FL, USA.
Arpit Narechania, Arjun Srinivasan, and John Stasko. 2021. NL4DV: A Toolkit for Generating Analytic Specifications for Data Visualization from Natural Language Queries. IEEE Transactions on Visualization and Computer Graphics 27, 2 (2021), 369–379. DOI: 10.1109/TVCG.2020.3030378
Daniel B Perry, Bill Howe, Alicia MF Key, and Cecilia Aragon. 2013. VizDeck: Streamlining exploratory visual analytics of scientific data.
Steven Pinker. 1990. A theory of graph comprehension. Artificial intelligence and the future of testing 1 (1990), 73–126.
Georges Polti. 1917. The thirty-six dramatic situations. Editor Company, Boston, USA.
Donghao Ren, Matthew Brehmer, Bongshin Lee, Tobias Höllerer, and Eun Kyoung Choe. 2017. Chartaccent: Annotation for data-driven storytelling. In 2017 IEEE Pacific Visualization Symposium (PacificVis). IEEE, IEEE, Seoul, South Korea, 230–239.
Donghao Ren, Matthew Brehmer, Bongshin Lee, Tobias Höllerer, and Eun Kyoung Choe. 2017. ChartAccent: Annotation for data-driven storytelling. In 2017 IEEE Pacific Visualization Symposium (PacificVis). IEEE, Seoul, South Korea, 230–239. DOI: 10.1109/PACIFICVIS.2017.8031599 ISSN: 2165-8773.
Nathalie Henry Riche, Christophe Hurter, Nicholas Diakopoulos, and Sheelagh Carpendale. 2018. Data-Driven Storytelling. CRC Press, Boca Raton. GoogleBooks-ID: bnxTDwAAQBAJ.
Ariane M. B. Rodrigues, Gabriel D. J. Barbosa, Raul de A. Lima, Dieinison J. F. Braga, Hélio Lopes, and Simone D. J. Barbosa. 2020. Revisiting Visualization Task Taxonomies: Specifying Functions for the Data Transformations Stage. In Human-Computer Interaction. Design and User Experience. Springer International Publishing, New York, USA, 655–671. DOI: 10.1007/978-3-030-490591_48
Ariane M. B. Rodrigues, Gabriel D. J. Barbosa, Hélio Lopes, and Simone D. J. Barbosa. 2019. Comparing the Effectiveness of Visualizations of Different Data Distributions. In Conf. on Graphics, Patterns and Images (SIBGRAPI). IEEE, New York, USA, 84–91. DOI: 10.1109/SIBGRAPI.2019.00020 ISSN: 2377-5416.
María Teresa Rodríguez, Sérgio Nunes, and Tiago Devezas. 2015. Telling Stories with Data Visualization. In Proceedings of the 2015 Workshop on Narrative & Hypertext (Guzelyurt, Northern Cyprus) (NHT ’15). Association for Computing Machinery, New York, NY, USA, 7–11. DOI: 10.1145/2804565.2804567
Bahador Saket, Alex Endert, and Cagatay Demiralp. 2019. Task-Based Effectiveness of Basic Visualizations. IEEE Trans. on Visualization and Computer Graphics v. 25 , n. 7 (July 2019), p. 2505–2512. DOI: 10.1109/TVCG.2018.2829750.
Arvind Satyanarayan and Jeffrey Heer. 2014. Authoring narrative visualizations with ellipsis. Computer Graphics Forum 33, 3 (2014), 361–370.
Arvind Satyanarayan, Dominik Moritz, Kanit Wongsuphasawat, and Jeffrey Heer. 2016. Vega-lite: A grammar of interactive graphics. IEEE transactions on visualization and computer graphics 23, 1 (2016), 341–350.
Edward Segel and Jeffrey Heer. 2010. Narrative Visualization: Telling Stories with Data. IEEE Trans. on Visualization and Computer Graphics v. 16 , n. 6 (Nov. 2010), p. 1139–1148. DOI: 10.1109/TVCG.2010.179
E. Segel and J. Heer. 2010. Narrative Visualization: Telling Stories with Data. IEEE Transactions on Visualization and Computer Graphics 16, 6 (2010), 1139– 1148. DOI: 10.1109/TVCG.2010.179
Arjun Srinivasan, Steven M. Drucker, Alex Endert, and John Stasko. 2019. Augmenting Visualizations with Interactive Data Facts to Facilitate Interpretation and Communication. IEEE Trans. on Visualization and Computer Graphics 25, 1 (Jan. 2019), 672–681. DOI: 10.1109/TVCG.2018.2865145
Christina Stoiber, Sonja Radkohl, Florian Grassinger, Daniela Moitzi, Holger Stitz, Eva Goldgruber, Dominic Girardi, and Wolfgang Aigner. 2023. Authoring tool for Data Journalists integrating Self-Explanatory Visualization Onboarding Concept for a Treemap Visualization. In Proceedings of the 15th Biannual Conference of the Italian SIGCHI Chapter (CHItaly ’23). Association for Computing Machinery, New York, NY, USA, 1–14. DOI: 10.1145/3605390.3605394
C. Stolte, D. Tang, and P. Hanrahan. 2002. Polaris: a system for query, analysis, and visualization of multidimensional relational databases. IEEE Trans. on Visualization and Computer Graphics 8, 1 (Jan. 2002), 52–65. DOI: 10.1109/2945.981851
Chao Tong, Richard Roberts, Rita Borgo, Sean Walton, Robert S Laramee, Kodzo Wegba, Aidong Lu, Yun Wang, Huamin Qu, Qiong Luo, et al. 2018. Storytelling and visualization: An extended survey. Information 9, 3 (2018), 65.
Edward R. Tufte. 2001. The Visual Display of Quantitative Information (2nd ed.). Graphics Press, Cheshire, Connecticut, USA.
Manasi Vartak, Sajjadur Rahman, Samuel Madden, Aditya Parameswaran, and Neoklis Polyzotis. 2015. SeeDB: efficient data-driven visualization recommendations to support visual analytics. Proceedings of the VLDB Endowment 8, 13 (Sept. 2015), 2182–2193. DOI: 10.14778/2831360.2831371
Frank Wilcoxon. 1992. Individual Comparisons by Ranking Methods. In Breakthroughs in Statistics: Methodology and Distribution, Samuel Kotz and Norman L. Johnson (Eds.). Springer, New York, NY, 196–202. DOI: 10.1007/9781-4612-4380-9_16
Wesley Willett, Jeffrey Heer, Joseph Hellerstein, and Maneesh Agrawala. 2011. CommentSpace: structured support for collaborative visual analysis. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI ’11). Association for Computing Machinery, New York, NY, USA, 3131–3140. DOI: 10.1145/1978942.1979407
Wita Wojtkowski and W Gregory Wojtkowski. 2002. Storytelling: its role in information visualization. In European Systems Science Congress, Vol. 5. Citeseer, Emerald Group Publishing Limited, USA, 1–5.
Kanit Wongsuphasawat, Dominik Moritz, Anushka Anand, Jock Mackinlay, Bill Howe, and Jeffrey Heer. 2015. Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations. IEEE Trans. on Visualization and Computer Graphics 22, 1 (Jan. 2015), 649–658. DOI: 10.1109/TVCG.2015.2467191.
Kanit Wongsuphasawat, Zening Qu, Dominik Moritz, Riley Chang, Felix Ouk, Anushka Anand, Jock Mackinlay, Bill Howe, and Jeffrey Heer. 2017. Voyager 2: Augmenting Visual Analysis with Partial View Specifications. In Proc. CHI Conf. on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 2648–2659. DOI: 10.1145/3025453.3025768