Tag Archives: frequent pattern mining
On the Completeness of the CloSpan and IncSpan algorithms
In this blog post, I will briefly discuss the fact that the popular CloSpan algorithm for frequent sequential pattern mining is incomplete. This means that in some special situations, CloSpan does not produce the results that it was designed to produce, and …
An Introduction to Sequential Pattern Mining
In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications, from text analysis to market basket analysis. This blog post aims to be a short …
Posted in Big data, Data Mining, Data science
Tagged big data, data mining, data science, frequent pattern mining, frequent patterns, pattern, sequence, sequential pattern
51 Comments
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of …
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, high-utility mining, itemset mining, java, opensource, source code, spmf, utility mining
121 Comments
How to auto-adjust the minimum support threshold according to the data size
Today, I will write a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FP-Growth and PrefixSpan according to the size of the data. The problem is simple. Let’s consider …
Posted in Data Mining, Programming
Tagged apriori, fpgrowth, frequent pattern mining, itemset mining, minsup, prefixspan
61 Comments
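
To make the idea in the last excerpt above more concrete, here is a minimal Python sketch of one possible way to tie the minimum support threshold to the data size. The excerpt is truncated, so this is only an illustration under my own assumptions and may differ from what the full post proposes: the function names `auto_minsup` and `absolute_minsup`, the exponential-decay form, and the constants `a`, `b` and `c` are all hypothetical. The intent is that a small database gets a high relative threshold, while a large one gets a threshold that levels off near `c`.

```python
import math


def auto_minsup(num_transactions: int,
                a: float = 0.0005, b: float = 0.4, c: float = 0.2) -> float:
    """Return a relative minimum support (a fraction, at most 1.0).

    Assumed form (hypothetical, for illustration only):
        minsup(n) = exp(-a * n - b) + c
    The value decreases as n grows and approaches c for large n.
    """
    if num_transactions < 0:
        raise ValueError("num_transactions must be non-negative")
    return min(1.0, math.exp(-a * num_transactions - b) + c)


def absolute_minsup(num_transactions: int) -> int:
    """Convert the relative threshold into a transaction count (at least 1)."""
    relative = auto_minsup(num_transactions)
    return max(1, math.ceil(relative * num_transactions))


if __name__ == "__main__":
    # Show how the threshold changes with the database size.
    for n in (10, 100, 1_000, 10_000, 100_000):
        print(n, round(auto_minsup(n), 4), absolute_minsup(n))
```

The absolute count returned by `absolute_minsup` is what one would typically compare pattern supports against; the constants would need to be tuned for a given application.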