Tag Archives: frequent pattern mining
On the Completeness of the CloSpan and IncSpan algorithms
In this blog post, I will briefly discuss the fact that CloSpan, a popular algorithm for frequent sequential pattern mining, is incomplete. This means that in some special situations, CloSpan does not produce the results that it was designed to produce, and …
An Introduction to Sequential Pattern Mining
In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications, from text analysis to market basket analysis. This blog post is intended to be a short …
Posted in Big data, Data Mining, Data science
Tagged big data, data mining, data science, frequent pattern mining, frequent patterns, pattern, sequence, sequential pattern
44 Comments
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of …
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, high-utility mining, itemset mining, java, opensource, source code, spmf, utility mining
120 Comments
How to auto-adjust the minimum support threshold according to the data size
Today, I will write a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FP-Growth and PrefixSpan according to the size of the data. The problem is simple. Let’s consider …
Posted in Data Mining, Programming
Tagged apriori, fpgrowth, frequent pattern mining, itemset mining, minsup, prefixspan
61 Comments