Relational Databases versus Search Engines: A Performance Comparison for Storing and Querying DICOM Metadata
Resumo
The Digital Imaging and Communications in Medicine (DICOM) standard adopts files as individual, self-contained repositories for the storage of a mixed of alphanumerical and binary content regarding radiological images. Usually, groups of DICOM files are hierarchically organized in studies and series, physically disposed into filesystem directory trees. Despite its simplicity in storing content, ordinary filesystems do not provide index capabilities allowing searches by content – restricting access by directory names and file names. To surpass such limitation, Picture Archiving and Communication Systems (PACSs) often adopt Relational Database Management Systems (RDBMSs) as metadata repositories, benefiting from its general-purposed index structures. An alternative approach, not quite explored, considers the adoption of search engines as metadata catalogs, aiming to minimize the search time by exploring the engine’s index optimizations. In order to evaluate the performance on managing DICOM metadata, this work compares relational database instances to a search engine in terms of storage space, storage time, and query time. Results show that, in the best case, the search engine is slightly slower in storing content; however, it requires 69% less disk space for the same dataset. For queries, in turn, the search engine performs up to 8.3 times faster in retrieving groups of tags.
Referências
Bastiao Silva, L. A., Beroud, L., Costa, C., and Oliveira, J. L. (2014). Medical imaging archiving: A comparison between several nosql solutions. In Biomedical and Health Informatics (BHI), 2014 IEEE-EMBS International Conference on, pages 65–68.
Bidgood Jr,W. D., Horii, S. C., Prior, F.W., and Van Syckle, D. E. (1997). Understanding and using dicom, the data interchange standard for biomedical imaging. Journal of the American Medical Informatics Association, 4(3):199–212.
Clark, K. et al. (2013). The cancer imaging archive (tcia): Maintaining and operating a public information repository. Journal of Digital Imaging, 26:1045–1057.
Codd, E. F. (1970). A relational model of data for large shared data banks. Communications of the ACM, 13(6):377–387.
Croft, B., Metzler, D., and Strohman, T. (2009). Search Engines: Information Retrieval in Practice. Pearson.
Elmasri, R. and Navathe, S. B. (2010). Fundamentals of Database Systems. Pearson.
Faggioni, L., Neri, E., Castellana, C., Caramella, D., and Bartolozzi, C. (2011). The future of pacs in healthcare enterprises. European Journal of Radiology, 78:253–258.
Huang, H. K. (2011). Short history of pacs. part i: Usa. European Journal of Radiology, 78:163–176.
Lemke, H. U. (2011). Short history of pacs. part ii: Europe. European Journal of Radiology, 78:177–183.
Mansoori, B., Erhard, K. K., and Sunshine, J. L. (2012). Picture archiving and communication system (pacs) - implementation, integration and benefits in an integrated health system. Academic Radiology, 19(2):229–235.
Mildenberger, P., Eichelberg, M., and Martin, E. (2002). Introduction to the dicom standard. European Radiology, 12(4):920–927.
National Electrical Manufacturers Association (2015a). Dicom ps3.4 2015c - service class specifications. Date last accessed 2016-01-31.
National Electrical Manufacturers Association (2015b). Dicom ps3.5 2015c - data structures and encoding. Date last accessed 2016-02-03.
Pianykh, O. S. (2012). Digital Imaging and Communications in Medicine (DICOM) – A Practical Introduction and Survival Guide. Springer.
Rascovsky, S. J., Delgado, J. A., Sanz, A., Calvo, V. D., and Castrillón, G. (2012). Informatics in radiology: Use of couchdb for document-based storage of dicom objects. RadioGraphics, 32(3):913–927.
Savaris, A., Härder, T., and Wangenheim, A. v. (2014). Dcmdsm: a dicom decomposed storage model. Journal of the American Medical Informatics Association, 21(5):917–924.
Singh, R., Chubb, L., Pantanowitz, L., and Parwani, A. (2011). Standardization in digital pathology: Supplement 145 of the dicom standards. Journal of Pathology Informatics, 2(1):23.
Sumathi, S. and Esakkirajan, S. (2007). Fundamentals of Relational Database Management Systems. Springer.
The Cancer Imaging Archive Team (2015). Lidc-idri. Date last accessed 2016-02-02.