ApproXQL: Design and Implementation of an Approximate Pattern Matching Language for XML

Torsten Schlieder
Institut für Informatik
Freie Universität Berlin
Takustr. 9, D-14195 Berlin
email: schlied@inf.fu-berlin.de

Report B 01-02
Mai 2001

Abstract
We introduce the simple query language approXQL, which supports hierarchical, Boolean-connected query patterns. The interpretation of approXQL queries is founded on cost-based query transformations: The total cost of a sequence of transformations measures the similarity between a query and the data and is used to rank the results. We describe in detail the implementation of the approXQL query processor, which uses an expanded query representation and sophisticated indexes to compute all results of a query in polynomial - typically sublinear - time with respect to the database size.

Get the report here or by anonymous ftp:
Server: fubinf.inf.fu-berlin.de
File: pub/reports/tr-b-01-02.ps.gz