Track: XML and Web Data
Paper Title:
Multiway SLCA-based Keyword Search in XML Data
Authors:
Abstract:
Keyword search for smallest lowest common ancestors (SLCAs)
in XML data has recently been
proposed as a meaningful way to identify interesting data nodes in
XML data whose subtrees contain an input set of keywords. In this
paper, we generalize this useful search paradigm to support keyword
search beyond the traditional AND semantics to include both
AND and OR boolean operators as well.
We first analyze properties of the
LCA computation and propose more efficient algorithms to solve the
traditional keyword search problem (with only AND semantics). We
then extend our approach to handle general keyword search involving
combinations of AND and OR boolean operators. The
effectiveness of our new algorithms is demonstrated with a
comprehensive experimental performance study.