Ontology-Based XML Content Dissemination

  • Mirella M. Moro UFMG
  • Renata Galante UFRGS
  • Deise de B. Saccol UNIPAMPA
  • Bernadette F. Loscio UFC

Abstract


As Internet and distributed systems evolve, a new paradigm aggregates the concept of content dissemination to XML query engines. The query is still evaluated over the stored data, but it is also registered into the system. Then, those queries will also be evaluated over the incoming data such that the documents that satisfy them are disseminated back to the users. In such a context, this paper proposes, that ontologies be applied in order to improve the performance of content-based dissemination systems. Our initial experimental evaluation shows that such solution is viable and exhibits considerable advantage over the state-of-the-art techniques.

References

Aumueller, D., Do, H. H., Massmann, S., and Rahm, E. (2005). Schema and ontology matching with coma++. In Proc. of SIGMOD Conference, pages 906–908.

Berglund, A., et. al (2007). XML Path Language (XPath) 2.0. In W3C Recommendation, [link].

Boyd, M., et. al (2004). Automed: A bav data integration system for heterogeneous data sources. In Proc. of CAiSE, pages 82–97.

Broekstra, J., Ehrig, M., and Haase, P. (2003). A metadata model for semantics-based peer-to-peer systems. In Work. on Semantics in Peer-to-Peer and Grid Computing.

Costa, M., et. al. (2005). Vigilante: End-to-End Containment of Internet Worms. In Proc. of SOSP.

Diao, Y., Rizvi S. and Franklin, M. J. (2004). “Towards an Internet-Scale XML Dissemination Service”, In: Proc. of VLDB, p. 612-623.

Levenshtein, V. (1996). Binary codes capable of correcting deletions, insertions, and reversals. Cybernetics and Control Theory, 10(8):707 – 710.

Li, G., Hou, S., and Jacobsen, H.-A. (2007). Routing of XML and XPath Queries in Data Dissemination Networks. In Proc. of ICDE, pages 1400–1404.

Madhavan, J., Bernstein, P. A., and Rahm, E. (2001). Generic schema matching with cupid. In Proc. of VLDB, pages 49–58.

Maedche, A. and Staab, S. (2002) Measuring similarity between ontologies. In Proc. of European Conference on Knowledge Acquisition and Management, 2002. p.251-263.

Maedche, A., et. al (2002). Mafra - a mapping framework for distributed ontologies. In Proc. of EKAW, pages 235–250.

C. D. Manning and Schutze, H. (1999). Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press.

Meghini, C. and Spyratos, N. (2007). Computing Intensions of Digital Library Collections. In Proc. of ICFCA, pages 66–81.

Moro, M.M., Bakalov, P., and Tsotras, V. J. (2007). Early Profile Pruning on XML-aware Publish/Subscribe Systems. In Proc. of VLDB, pages 866–877.

Saccol, D. B., Edelweiss, N., Galante, R. M., Mello, M. R. (2008a). Managing Application Domains in P2P Systems. In: IRI 2008 - IEEE International Conference on Information Reuse and Integration 2008, July 13-15, 2008, Las Vegas, USA.

Saccol, D. B., Noll, R. P., Edelweiss, N., and Galante, R.M. (2008b). An Ontology-based Approach for Semantic Interoperability in P2P Systems. In Proc. of ICEIS.

Silva, R. da, Stasiu, R., Orengo, V. M., Heuser, C. A. (2007). Measuring quality of similarity functions in approximate data matching. Journal of Informetrics, 1(1): 35–46.

Snoeren, A. C., Conley, K., and Gifford, D. K. (2001). Mesh-Based Content Routing using XML. In Proc. of SOSP, pages 160–173.

Vagena, Z., Moro, M. M., and Tsotras, V. J. (2007). RoXSum: Leveraging Data Aggregation and Batch Processing for XML Routing. In Proc. of ICDE, pages 1466–1470.

Xu, L. and Embley, D.W. (2006). A composite approach to automating direct and indirect schema mappings. Inf. Syst., 31(8):697–732.

Zhu, Y. and Hu, Y. (2007). Ferry: A P2P-Based Architecture for Content-Based Publish/Subscribe Services. IEEE Trans. Parallel Distrib. Syst., 18(5):672–685.
Published
2009-07-20
MORO, Mirella M.; GALANTE, Renata; SACCOL, Deise de B.; LOSCIO, Bernadette F.. Ontology-Based XML Content Dissemination. In: INTEGRATED SOFTWARE AND HARDWARE SEMINAR (SEMISH), 36. , 2009, Bento Gonçalves/RS. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2009 . p. 153-167. ISSN 2595-6205.