The Computer Journal Advance Access first published online on July 28, 2007
This version published online on September 4, 2007
The Computer Journal, doi:10.1093/comjnl/bxm051
| ||||||||||||||||||||||||||||||||||||||||||||||||||
Similarity of XML-Schema Elements: A Structural and Information Content Approach
IASI-CNR, Viale Manzoni 30, I-00185 Rome, Italy
* Corresponding author: formica{at}iasi.cnr.it
Received 11 April 2006; revised 7 June 2007
EXtensible Markup Language (XML)-Schemas are the emerging standards for describing and validating semi-structured documents across the Internet, due to the rich set of modeling constructors, types and constraints they provide. Semantic similarity is growing in importance in different settings, such as digital libraries, heterogeneous databases and, in particular, the Semantic Web. The focus of this paper is the definition of a method for determining semantic similarity of XML-Schema elements in the presence of type hierarchies. Such a method has been defined by combining and revisiting: (i) the information content approach, and (ii) a method for comparing the structural components of type declarations, inspired by the maximum weighted matching problem in bipartite graphs.
Key Words: Semantic Web XML-Schemas type hierarchies information content similarity reasoning
The originally published version of this paper was incorrect.