Top-k application with Privacy on Public Data of COVID-19 in the State of Ceará
Abstract
In this article we propose a differentially private reverse top-k query. Our strategy allows the researcher to obtain less frequent data according to his search criteria, with a high guarantee of privacy of the individuals who contributed with the personal data in the original database. We apply our strategy on public data for COVID-19 in the State of Ceará. Our experimental results show that the result of the proposed top-k query returns a high degree of similarity to the result of a conventional top-k query.
References
Dwork, C. (2011). Differential privacy.Encycl. Cryptography and Security, pages 338–340.
Dwork, C., McSherry, F., Nissim, K., and Smith, A. (2006). Calibrating noise to sensiti-vity in private data analysis. In Theory of cryptography conference, pages 265–284.
Dwork, C., Roth, A., et al. (2014). The algorithmic foundations of differential privacy. Foundations and Trends in Theoretical Computer Science, 9(3-4):211–407.
McSherry, F. and Talwar, K. (2007). Mechanism design via differential privacy. In FOCS,volume 7, pages 94–103.
McSherry, F. D. (2009). Privacy integrated queries: an extensible platform for privacy-preserving data analysis. In ACM SIGMOD Int. Conf. on Management of data, pages19–30.
Narayanan, A. and Shmatikov, V. (2006). How to break anonymity of the netflix prize dataset. arXiv preprint cs/0610105. SUS(2020).
Boletim epidemiológico novo coronavírus(covid-19). https://bit.ly/32yFY7a. Acessado em 19-06-2020.
Vlachou, A., Doulkeridis, C., Kotidis, Y., and Nørv ̊ag, K. (2010). Reverse top-k queries. In International Conference on Data Engineering, pages 365–376.