Ontology-Based XML Content Dissemination
Abstract
As the Internet and distributed systems evolve, a new paradigm adds content dissemination to XML query engines. A query is still evaluated over the stored data, but it is also registered in the system. Registered queries are then evaluated over incoming data, and the documents that satisfy them are disseminated back to the users. In this context, this paper proposes applying ontologies to improve the performance of content-based dissemination systems. Our initial experimental evaluation shows that such a solution is viable and offers a considerable advantage over state-of-the-art techniques.
