Empirical Evaluation of Private Comparison Techniques Applied in Entity Resolution

  • Thiago Pereira da Nóbrega State University of Paraíba / Federal University of Campina Grande
  • Carlos Eduardo Santos Pires Federal University of Campina Grande
  • Tiago Brasileiro Araújo Federal University of Campina Grande

Abstract


Privacy-Preserving Record Linkage (PPRL) consists in identifying which records in two or more databases correspond to the same entity. In this context, different private data comparison techniques have been used, e.g., Bloom filters. However, Bloom filters do not perform well when numeric data or dates are compared. This work aims to evaluate if Homomorphic Asymmetric Cryptography (HAC) can improve the accuracy of the comparison involving non-textual private data. The results indicate that the use of HAC in non-textual data comparison can improve the accuracy of PPRL
Keywords: Privacy-Preserving Record Linkage, Precision Analysis, Homomorphic Asymmetric Cryptography

References

Agarwal, S. and Trachtenberg, A. (2006). Approximating the number of differences between remote sets. In IEEE Information Theory Workshop, pages 217–221. IEEE.

Christen, P. (2012). Data Matching. Springer Berlin Heidelberg, Berlin, Heidelberg.

Parmar, P., B. Padhar, S., N. Patel, S., I. Bhatt, N., and H. Jhaveri, R. (2014). Survey of Various Homomorphic Encryption algorithms and Schemes. International Journal of Computer Applications, 91(8):26–32.

Pita, R., Pinto, C., Melo, P., Silva, M., Barreto, M., and Rasella, D. (2015). A Spark-based workflow for probabilistic record linkage of healthcare data. CEUR Workshop Proceedings, 1330:17–26.

Randall, S. M., Ferrante, A. M., Boyd, J. H., Bauer, J. K., and Semmens, J. B. (2014). Privacy-preserving record linkage on large real world datasets. Journal of Biomedical Informatics, 50:205–212.

Schmidlin, K., Clough-Gorr, K. M., Spoerri, A., and group, f. S. N. C. s. (2015). Privacy Preserving Probabilistic Record Linkage (P3RL): a novel method for linking existing health-related data and maintaining participant confidentiality. BMC Medical Research Methodology, 15(1):46.

Schnell, R., Bachteler, T., and Reiher, J. (2009). Privacy-preserving record linkage using Bloom filters. BMC Med Inform.Decis.Mak., 9:41.

Tran, K.-n., Vatsalan, D., and Christen, P. (2013). GeCo. Proceedings of the 22nd ACM international conference on Conference on information & knowledge management - CIKM ’13, pages 2473–2476.

Vatsalan, D., Christen, P., and Verykios, V. S. (2013). A taxonomy of privacy-preserving record linkage techniques. Information Systems, 38(6):946–969.
Published
2016-10-04
DA NÓBREGA, Thiago Pereira; PIRES, Carlos Eduardo Santos; ARAÚJO, Tiago Brasileiro. Empirical Evaluation of Private Comparison Techniques Applied in Entity Resolution. In: BRAZILIAN SYMPOSIUM ON DATABASES (SBBD), 31. , 2016, Salvador/BA. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2016 . p. 121-126. ISSN 2763-8979. DOI: https://doi.org/10.5753/sbbd.2016.24315.