JISE


  [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22]


Journal of Information Science and Engineering, Vol. 31 No. 2, pp. 433-454


Improving the Time and Space Performance for Processing Keyword Queries on XML Databases


YA-HUI CHANG AND WEI-HSI LEE
Department of Computer Science and Engineering
National Taiwan Ocean University
Keelung, 202 Taiwan
E-mail: yahui@ntou.edu.tw

 


    For an XML database, it is important to automatically reason meaningful answers for a given keyword query. Particularly, the MaxMatch algorithm is proposed to output contributors, which represent more (or equal number of) keywords than those contained by their sibling nodes. Such concept can distinguish the importance of sibling nodes and has attracted a lot of attention. In this paper, we intend to improve the time and space performance of the original approach. We first propose the LevelPrune algorithm, which only needs to construct the smallest necessary intermediate tree and can also reduce the times of processing nodes. We then extend the idea of constructing trees level by level to optimize physical representations, and design the corresponding algorithm LevelPrune+. According to the experimental results, our proposed algorithms outperform other stateof- the-art approaches by an order of magnitude in some cases. Moreover, the index size of LevelPrune+ is significantly reduced compared with that of the original approach. 


Keywords: XML database, query processing, keyword search, time performance, space performance

  Retrieve PDF document (JISE_201502_05.pdf)