An approximate search engine for structural databases

JTL Wang, X Wang, D Shasha, BA Shapiro… - ACM SIGMOD …, 2000 - dl.acm.org
JTL Wang, X Wang, D Shasha, BA Shapiro, K Zhang, Q Ma, Z Weinberg
ACM SIGMOD Record, 2000dl.acm.org
When a person interested in a topic enters a keyword into a Web search engine, the
response is nearly instantaneous (and sometimes overwhelming). The impressive speed is
due to clever inverted index structures, caching, and a domain-independent knowledge of
strings. Our project seeks to construct algorithms, data structures, and software that
approach the speed of keyword-based search engines for queries on structural databases.
A structural database is one whose data objects include trees, graphs, or a set of interrelated …
When a person interested in a topic enters a keyword into a Web search engine, the response is nearly instantaneous (and sometimes overwhelming). The impressive speed is due to clever inverted index structures, caching, and a domain-independent knowledge of strings. Our project seeks to construct algorithms, data structures, and software that approach the speed of keyword-based search engines for queries on structural databases.
A structural database is one whose data objects include trees, graphs, or a set of interrelated labeled points in two, three, or higher dimensional space. Examples include databases holding (i) protein secondary and tertiary structure, (ii) phylogenetic trees, (iii) neuroanatomical networks, (iv) parse trees, (v) molecular diagrams, and (vi) XML documents. Comparison queries on such databases require solving variants of the graph isomorphism or subisomorphism problems (for which all known algorithms are exponential), so we have explored a large heuristic space.
ACM Digital Library