SelfBI: On-demand user decision making using web data

  • Manoela Camila Barbosa da Silva Federal University of São Carlos (UFSCar)
  • Sahudy Montenegro González Federal University of São Carlos (UFSCar)

Abstract


Data from Web sources, such as social media, tend to be volatile to be stored in the DW, making them a good option for situational data. Situational data are useful for decision-making queries at a particular time and situation, and can be discarded after analysis. This article describes an architecture that aims to integrate situational data from social media to user queries at the right time; this is, when the user needs them for decision making. The focus of the work is (1) the ETL process for obtaining situational data in real time; and (2) to propose an OLAP operator capable of integrating these data into user queries results.
Keywords: Data Warehouse, Transient Data, Data Integration, ETL, OLAP

References

Abello, A., Romero, O., Pedersen, T. B., Berlanga, R., Nebot, V., Aramburu, M. J., and Simitsis, A. (2015). Using Semantic Web Technologies for Exploratory OLAP: A Survey. IEEE Transactions on Knowledge and Data Engineer, 27:571–588.

Abelló, A., Darmont, J., Etcheverry, L., Golfarelli, M., Mazón, J.-N., Naumann, F., Pedersen, T., Rizzi, S. B., Trujillo, J., Vassiliadis, P., and Vossen, G. (2013). Fusion cubes: Towards Self-Service Business Intelligence. International Journal of Data Warehousing and Mining, 9:66–88.

Benker, T. (2013). A Hybrid OLAP & OLTP Architecture Using Non-Relational Data Components. Enterprise Modelling and Information Systems Architectures, 222 of Lecture Notes in Informatics:41–57.

Etcheverry, L. and Vaisman, A. A. (2012). Enhancing OLAP Analysis with Web Cubes. Lecture Notes in Computer Science, 7295:469–483.

González, S. and Berbel, T. (2014). Considering unstructured data for OLAP: a feasibility study using a systematic review. Revista de Sistemas de Informação da FSMA, 14.

Mansmann, S., Rehman, N. U., Weiler, A., and Scholl, M. H. (2014). Discovering OLAP dimensions in semi-structured data. Information Systems, 44:120–133.

Thollot, R., Kuchmann-beauger, N., and aude Aufaure, M. (2012). Semantics and Usage Statistics for Multi-Dimensional Query Expansion. In Proceedings of International Conference of Database Systems for Advanced Applications, pages 250–260.

Thomsen, C., Pedersen, T. B., and Lehner, W. (2008). RiTE: Providing On-Demand Data for Right-Time Data Warehousing. Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on, pages 456–465.
Published
2016-10-04
DA SILVA, Manoela Camila Barbosa; GONZÁLEZ, Sahudy Montenegro. SelfBI: On-demand user decision making using web data. In: BRAZILIAN SYMPOSIUM ON DATABASES (SBBD), 31. , 2016, Salvador/BA. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2016 . p. 235-240. ISSN 2763-8979. DOI: https://doi.org/10.5753/sbbd.2016.24334.