A Unified Data Model for Blockchains


The popularity of blockchains has been steadily rising since its inception in 2009. Its original focus was the creation of a distributed trustless system for a digital currency. Bitcoin was the first widely used blockchain. However, many more have been further created: Ethereum, Solana, Hyperledger Fabric, to name a few. Due to it, blockchain networks and its properties are being scrutinised by data scientists. This topic is known as blockchain analysis. However, analysis ́ tasks are hard to be accomplished over multiple blockchains. Even though most have similar concepts, each one has its own data modeled differently. As a consequence, scientists end up needing to do rework in order to apply similar analytics and algorithms in more than one blockchain network. In fact, blockchain data modeling is an open issue. This master’s thesis aims at proposing an unified model for blockchain data. For the best of our knowledge, there is no similar work in the literature.

Palavras-chave: blockchain, data model, unified


Akcora, C. G. et al. (2018). Blockchain Data Analytics. Intelligent Informatics, page 4.

Bartoletti, M. et al. (2017). A General Framework for Blockchain Analytics. In Workshop on Scalable and Resilient Infrastructures for Distributed Ledgers, pages 1-6.

Bragagnolo, S. et al. (2018). Ethereum Query Language. In International Workshop on Emerging Trends in Software Engineering for Blockchain, pages 1-8.

Chen, W. et al. (2018). Detecting Ponzi Schemes on Ethereum: Towards Healthier Blockchain Technology. In World Wide Web conference, pages 1409-1418.

Han, J. et al. (2019). Enabling SQL-query Processing for Ethereum-based Blockchain Systems. In Int. Conference on Web Intelligence, Mining and Semantics, pages 1-7.

Huang, H. et al. (2021). A Survey of State-of-the-art on Blockchains: Theories, Modelings, and Tools. ACM Computing Surveys, 54(2):1-42.

Li, Y. et al. (2017). EtherQL: A Query Layer for Blockchain System. In International Conference on Database Systems for Advanced Applications, pages 556-567. Springer.

Linoy, S. et al. (2019). Scalable Privacy-preserving Query Processing over Ethereum Blockchain. In International Conference on Blockchain, pages 398-404. IEEE.

Mello, R. d. S. (2002). Uma Abordagem Bottom-up para a Integração Semântica de Esquemas XML. PhD thesis, Universidade Federal do Rio Grande do Sul.

Morzy, T., Wojciechowski, M., and Zakrzewicz, M. (1999). Pattern-oriented Hierarchical Clustering. In East European Conference on Advances in Databases and Information Systems, pages 179-190. Springer.

Nick, J. D. (2015). Data-driven De-anonymization in Bitcoin. Master’s thesis, ETH-Zurich.

Peng, Z. et al. (2019). VQL: Providing Query Efficiency and Data Authenticity in Blockchain Systems. In International Conference on Data Engineering Workshops, pages 1-6. IEEE.

Pratama, F. A. and Mutijarsa, K. (2018). Query Support for Data Processing and Analysis on Ethereum Blockchain. In Int. Symposium on Electronics and Smart Devices (ISESD), pages 1-5. IEEE.

Przytarski, D., Stach, C., Gritti, C., and Mitschang, B. (2021). Query Processing in Blockchain Systems: Current State and Future Challenges. Future Internet, 14(1):1.

Trihinas, D. (2019). Datachain: A Query Framework for Blockchains. In International Conference on Management of Digital EcoSystems, pages 134-141.

Turner, A. B., McCombie, S., and Uhlmann, A. J. (2020). Analysis Techniques for Illicit Bitcoin Transactions. Frontiers Comput. Sci., 2:600596.

Vo, H. T., Kundu, A., and Mohania, M. K. (2018). Research Directions in Blockchain Data Management and Analytics. In Extending Database Technology (EDBT), pages 445-448.

Zhou, E. et al. (2019). Ledgerdata Refiner: A Powerful Ledger Data Query Platform for Hyperledger Fabric. In International Conference on Internet of Things: Systems, Management and Security (IOTSMS), pages 433-440. IEEE.
MEYER, João Vicente; MELLO, Ronaldo dos Santos. A Unified Data Model for Blockchains. In: WORKSHOP DE TESES E DISSERTAÇÕES (WTDBD) - SIMPÓSIO BRASILEIRO DE BANCO DE DADOS (SBBD), 37. , 2022, Búzios. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022 . p. 112-118. DOI: https://doi.org/10.5753/sbbd_estendido.2022.21852.