Um Estudo sobre Arquiteturas e Metadados em Data Lakes (A Study on Architectures and Metadata in Data Lakes)
Abstract
The large amount of data generated today through the Internet and by organizations requires new approaches to manage it. Data lakes are repositories for large volumes of data and have emerged as an alternative for managing heterogeneous data and their metadata. This work presents a study of architectures and metadata management approaches for data lake management systems.
References
Beheshti, A., Benatallah, B., Nouri, R., and Tabebordbar, A. (2018). CoreKG: A Knowledge Lake Service. Proc. VLDB Endow., 11(12):1942–1945.
Diamantini, C. et al. (2021). An Approach to Extracting Topic-guided Views from the Sources of a Data Lake. Inf. Syst. Frontiers, 23(1):243–262.
Hai, R., Quix, C., and Jarke, M. (2021). Data Lake Concept and Systems: A Survey. CoRR, abs/2106.09592.
Halevy, A. Y. et al. (2016). Goods: Organizing Google’s Datasets. In International Conference on Management of Data, pages 795–806. ACM.
Hashem, I. A. T. et al. (2016). The Role of Big Data in Smart City. Int. J. Inf. Manag., 36(5):748–758.
Nargesian, F. et al. (2019). Data Lake Management: Challenges and Opportunities. Proc. VLDB Endow., 12(12):1986–1989.
Ravat, F. and Zhao, Y. (2019a). Data Lakes: Trends and Perspectives. In Database and Expert Systems Applications, volume 11706 of LNCS, pages 304–313. Springer.
Ravat, F. and Zhao, Y. (2019b). Metadata Management for Data Lakes. In ADBIS Short Papers and Workshops, volume 1064 of Communications in Computer and Information Science, pages 37–44. Springer.
Sawadogo, P. N., Kibata, T., and Darmont, J. (2019). Metadata Management for Textual Documents in Data Lakes. In International Conference on Enterprise Information Systems, pages 72–83. SciTePress.
Published: 2022-06-27
How to Cite
RODRIGUES, Jéssica Xafranski; MELLO, Ronaldo dos Santos. Um Estudo sobre Arquiteturas e Metadados em Data Lakes. In: REGIONAL DATABASE SCHOOL (ERBD), 17., 2022, Lages/SC. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022. p. 131-134. ISSN 2595-413X. DOI: https://doi.org/10.5753/erbd.2022.223535.
