It is my great pleasure to welcome you to CIKM 2007 -- the 16th ACM Conference on Information and Knowledge Management. The Organizing Committee assumed the role of affirming CIKM as the premier conference at the confluence of information retrieval, databases and knowledge management.
This is the second time the conference is taking place outside the USA. Lisbon, Portugal, is hosting the 2007 event under joint organization between ACM and the University of Lisbon. This is happening at a time when Portugal is holding the presidency of the European Union and many other events are taking place around the city. Lisbon, also known as the city of the explorers, was the departing point for many of the voyages of discovery, and the first true world city, the capital of an empire spreading over all continents. This makes it an ideal place for convening scientists from around the world to exchange new ideas concerning searching, managing and exploring information.
The Program Co-chairs Alberto Laender, Deborah McGuinness and Ricardo Baeza-Yates rightfully deserve to be commended for assembling a superb Program Committee, which worked very hard to provide excellent peer feedback to the authors in a very short time. Thanks also to Bj�rn Olstad and �ystein Haug Olsen, who chaired the Industrial Track.
We had a record number of submissions (above 700 abstracts!) and accepted 86 of the reviewed papers (512) for presentation as full papers (17%). Because we had so many good papers, we decided to accept 49 as short papers for joint presentation in a poster session, and extended the usual limit from two to four pages.
Parallel linkage
We study the parallelization of the (record) linkage problem - i.e., to identify matching records between two collections of records, A and B. One of main idiosyncrasies of the linkage problem, compared to Database join, is the fact that once two ...
Structure-based inference of xml similarity for fuzzy duplicate detection
Fuzzy duplicate detection aims at identifying multiple representations of real-world objects stored in a data source, and is a task of critical practical relevance in data cleaning, data mining, or data integration. It has a long history for relational ...
A strategy for allowing meaningful and comparable scores in approximate matching
The goal of approximate data matching is to assess whether two distinct data instances represent the same real world object. This is usually achieved through the use of a similarity function, which returns a score that defines how similar two data ...
Cited By
-
Chen L, Gao S and Cao X (2017). Research on real-time outlier detection over big data streams, International Journal of Computers and Applications, 10.1080/1206212X.2017.1397388, 42:1, (93-101), Online publication date: 2-Jan-2020.
-
SanMiguel P and S�daba T (2017). Nice to be a fashion blogger, hard to be influential: An analysis based on personal characteristics, knowledge criteria, and social factors, Journal of Global Fashion Marketing, 10.1080/20932685.2017.1399082, 9:1, (40-58), Online publication date: 2-Jan-2018.
- Proceedings of the sixteenth ACM conference on Conference on information and knowledge management