DOI: 10.1145/1008992.1008998
Article

Using temporal profiles of queries for precision prediction

Published: 25 July 2004

Abstract

A key missing component in information retrieval systems is self-diagnostic tests to establish whether the system can provide reasonable results for a given query on a document collection. If we can measure properties of a retrieved set of documents which allow us to predict average precision, we can automate the decision of whether to elicit relevance feedback, or modify the retrieval system in other ways. We use meta-data attached to documents in the form of time stamps to measure the distribution of documents retrieved in response to a query, over the time domain, to create a temporal profile for a query. We define some useful features over this temporal profile. We find that using these temporal features, together with the content of the documents retrieved, we can improve the prediction of average precision for a query.
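As a rough illustration of the idea in the abstract, the sketch below builds a temporal profile from the time stamps of retrieved documents and computes one candidate feature over it. Everything specific here is an assumption made for illustration, not something the abstract states: the function names, the use of daily bins, weighting documents by retrieval score, the linear smoothing with weight `lam`, and the choice of KL divergence from the collection's temporal distribution as the feature.

```python
import math
from collections import defaultdict
from datetime import date


def temporal_profile(retrieved, days):
    """Spread retrieval-score mass over days (hypothetical helper).

    retrieved: list of (date, score) pairs for the top-ranked documents.
    days: ordered list of every day covered by the collection.
    Returns a dict mapping each day to a probability; values sum to 1.
    """
    total = sum(score for _, score in retrieved)
    mass = defaultdict(float)
    for day, score in retrieved:
        mass[day] += score / total
    return {day: mass.get(day, 0.0) for day in days}


def collection_profile(doc_dates, days):
    """Fraction of all collection documents published on each day."""
    counts = defaultdict(int)
    for day in doc_dates:
        counts[day] += 1
    return {day: counts.get(day, 0) / len(doc_dates) for day in days}


def smooth(profile, background, lam=0.9):
    """Linear interpolation with the background (lam is an assumed value)."""
    return {d: lam * profile[d] + (1 - lam) * background[d] for d in profile}


def kl_divergence(p, q):
    """KL(p || q) in nats; assumes q[d] > 0 wherever p[d] > 0."""
    return sum(p[d] * math.log(p[d] / q[d]) for d in p if p[d] > 0)


# Toy collection: five days, two documents per day (made-up data).
days = [date(2004, 7, d) for d in range(1, 6)]
background = collection_profile([d for d in days for _ in range(2)], days)

# A "bursty" query whose top documents cluster on two days,
# and a "flat" query whose top documents are spread evenly.
bursty = [(date(2004, 7, 3), 2.0), (date(2004, 7, 3), 1.0), (date(2004, 7, 4), 1.0)]
flat = [(d, 1.0) for d in days]

kl_bursty = kl_divergence(smooth(temporal_profile(bursty, days), background), background)
kl_flat = kl_divergence(smooth(temporal_profile(flat, days), background), background)
print(f"bursty: {kl_bursty:.3f}  flat: {kl_flat:.3f}")
```

In this toy setup the bursty query diverges from the background while the flat query does not. A feature of this kind could then be passed, alongside content-based features, to whatever learner is used to predict average precision.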

    Published In

    SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
    July 2004
    624 pages
    ISBN:1581138814
    DOI:10.1145/1008992

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. clarity
    2. language models
    3. precision prediction
    4. time

    Qualifiers

    • Article

    Conference

    SIGIR04

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Cited By

    • (2024) Naturalistic Digital Behavior Predicts Cognitive Abilities. ACM Transactions on Computer-Human Interaction 31:3 (1-32). DOI: 10.1145/3660341. Online publication date: 7-May-2024
    • (2020) Event-Related Query Classification with Deep Neural Networks. Companion Proceedings of the Web Conference 2020 (324-330). DOI: 10.1145/3366424.3382183. Online publication date: 20-Apr-2020
    • (2018) Location estimation of non-geo-tagged tweets. Evolutionary Intelligence. DOI: 10.1007/s12065-018-0163-3. Online publication date: 14-Aug-2018
    • (2018) Query Performance Prediction and Classification for Information Search Systems. Web and Big Data (277-285). DOI: 10.1007/978-3-319-96890-2_23. Online publication date: 19-Jul-2018
    • (2017) Exploiting Query’s Temporal Patterns for Query Autocompletion. Mathematical Problems in Engineering 2017:1. DOI: 10.1155/2017/7490879. Online publication date: 23-Mar-2017
    • (2017) Summary generation using geo-coordinates and temporal data in microblogging environment. 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT) (1-5). DOI: 10.1109/ICCCNT.2017.8204049. Online publication date: Jul-2017
    • (2017) Towards Exploiting Social Networks for Detecting Epidemic Outbreaks. Global Journal of Flexible Systems Management 18:1 (61-71). DOI: 10.1007/s40171-016-0148-y. Online publication date: 11-Jan-2017
    • (2017) An Evaluation of CMIP5 GCM Simulations over the Athabasca River Basin, Canada. River Research and Applications 33:5 (823-843). DOI: 10.1002/rra.3136. Online publication date: 28-Feb-2017
    • (2017) High‐resolution projections of 21st century climate over the Athabasca River Basin through an integrated evaluation‐classification‐downscaling‐based climate projection framework. Journal of Geophysical Research: Atmospheres 122:5 (2595-2615). DOI: 10.1002/2016JD026158. Online publication date: 3-Mar-2017
    • (2016) Challenges in Detecting Epidemic Outbreaks from Social Networks. 2016 30th International Conference on Advanced Information Networking and Applications Workshops (WAINA) (69-74). DOI: 10.1109/WAINA.2016.111. Online publication date: Mar-2016