skip to main content
10.1145/2339530.2339767acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
demonstration

UFIMT: an uncertain frequent itemset mining toolbox

Published: 12 August 2012 Publication History

Abstract

In recent years, mining frequent itemsets over uncertain data has attracted much attention in the data mining community. Unlike the corresponding problem in deterministic data, the frequent itemset under uncertain data has two different definitions: the expected support-based frequent itemset and the probabilistic frequent itemset. Most existing works only focus on one of the definitions and no comprehensive study is conducted to compare the two different definitions. Moreover, due to lacking the uniform implementation platform, existing solutions for the same definition even generate inconsistent results. In this demo, we present a demonstration called as UFIMT (underline Uncertain Frequent Itemset Mining Toolbox) which not only discovers frequent itemsets over uncertain data but also compares the performance of different algorithms and demonstrates the relationship between different definitions. In this demo, we firstly present important techniques and implementation skills of the mining problem, secondly, we show the system architecture of UFIMT, thirdly, we report an empirical analysis on extensive both real and synthetic benchmark data sets, which are used to compare different algorithms and to show the close relationship between two different frequent itemset definitions, and finally we discuss some existing challenges and new findings.

References

[1]
C. Aggarwal, Y. Li, J. Wang, and J. Wang. Frequent pattern mining with uncertain data. In KDD'09.
[2]
T. Bernecker, H.-P. Kriegel, M. Renz, F. Verhein, and A. Z�fle. Probabilistic frequent itemset mining in uncertain databases. In KDD'09.
[3]
T. Calders, C. Garboni, and B. Goethals. Approximation of frequentness probability of itemsets in uncertain data. In ICDM'10.
[4]
C. K. Chui, B. Kao, and E. Hung. Mining frequent itemsets from uncertain data. In PAKDD'07.
[5]
C. K.-S. Leung, M. A. F. Mateo, and D. A. Brajczuk. A tree-based approach for frequent pattern mining from uncertain data. In PAKDD'08.
[6]
L. Sun, R. Cheng, D. W. Cheung, and J. Cheng. Mining uncertain data with probabilistic guarantees. In KDD'10.
[7]
Y. Tong, L. Chen, Y. Cheng, and P. S. Yu. Mining frequent itemsets over uncertain databases. In VLDB'12.
[8]
Y. Tong, L. Chen, and B. Ding. Discovering threshold-based frequent closed itemsets over probabilistic data. In ICDE'12.
[9]
L. Wang, R. Cheng, S. D. Lee, and D. W.-L. Cheung. Accelerating probabilistic frequent itemset mining: a model-based approach. In CIKM'10.

Cited By

View all
  • (2023)BurstSketch: Finding Bursts in Data StreamsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.322368635:11(11126-11140)Online publication date: 1-Nov-2023
  • (2021)A Review on Frequent Itemsets Generation Techniques and Their Comparative Analysis Using FIMAKSN Computer Science10.1007/s42979-021-00916-x3:1Online publication date: 23-Oct-2021
  • (2019)Comprehensive mining of frequent itemsets for a combination of certain and uncertain databasesInternational Journal of Information Technology10.1007/s41870-019-00310-0Online publication date: 30-Apr-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2012
1616 pages
ISBN:9781450314626
DOI:10.1145/2339530
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 August 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. UFIMT
  2. frequent itemset mining
  3. uncertain database

Qualifiers

  • Demonstration

Conference

KDD '12
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)BurstSketch: Finding Bursts in Data StreamsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.322368635:11(11126-11140)Online publication date: 1-Nov-2023
  • (2021)A Review on Frequent Itemsets Generation Techniques and Their Comparative Analysis Using FIMAKSN Computer Science10.1007/s42979-021-00916-x3:1Online publication date: 23-Oct-2021
  • (2019)Comprehensive mining of frequent itemsets for a combination of certain and uncertain databasesInternational Journal of Information Technology10.1007/s41870-019-00310-0Online publication date: 30-Apr-2019
  • (2019)Fine-grained probability counting for cardinality estimation of data streamsWorld Wide Web10.1007/s11280-018-0583-022:5(2065-2081)Online publication date: 2-Aug-2019
  • (2019)Discovering Frequent High Average Utility Itemset Without Transaction InsertionSustainable Communication Networks and Application10.1007/978-3-030-34515-0_58(555-569)Online publication date: 7-Nov-2019
  • (2018)Frequent Itemset Mining on Uncertain Database Using OWA OperatorProceedings of 2nd International Conference on Communication, Computing and Networking10.1007/978-981-13-1217-5_75(745-755)Online publication date: 8-Sep-2018
  • (2018)Frequent Itemset Mining for a Combination of Certain and Uncertain DatabasesRecent Developments and the New Direction in Soft-Computing Foundations and Applications10.1007/978-3-319-75408-6_3(25-39)Online publication date: 29-May-2018
  • (2016)Tracking frequent items over distributed probabilistic dataWorld Wide Web10.1007/s11280-015-0341-519:4(579-604)Online publication date: 1-Jul-2016
  • (2015)Mining High Utility Itemsets over Uncertain DatabasesProceedings of the 2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery10.1109/CyberC.2015.76(235-238)Online publication date: 17-Sep-2015
  • (2015)A Parallel Job Execution Time Estimation Approach Based on User Submission Patterns within Computational GridsInternational Journal of Parallel Programming10.1007/s10766-013-0294-143:3(440-454)Online publication date: 1-Jun-2015
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media