Tag Archives: frequent pattern mining

(video) Identifying Stable Periodic Frequent Patterns using SPP-Growth

Today, I present a video about finding stable periodic patterns in data, and discuss a new algorithm named SPP-Growth for this task. VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/SPPGrowth.mp4 The  SPP-Growth algorithm and datasets for evaluating its performance are available in the SPMF software, which is open-source and programmed in Java. Source code and … Continue reading

Posted in Big data, Data Mining, Data science, Video | Tagged , , , , , , | Leave a comment

25 years of pattern mining

This year, we are in 2019, and it is already 25 years since Agrawal wrote his seminal papers on frequent itemset mining and association rule mining in 1994. Since then, there has been thousands of papers published on this topic, some about algorithm design, new pattern mining … Continue reading

Posted in Big data, Data Mining, Data science, Pattern Mining | Tagged , , , | Leave a comment

(video) Mining Frequent Itemsets with the Apriori algorithm

This is a video presentation of the Apriori algorithm for discovering frequent itemsets in data. Frequent itemset mining is one of the most popular data mining task. VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/apriori.mp4 The Java source code of the Apriori algorithm and datasets for evaluating its performance are available in the SPMF software. If you want … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Video | Tagged , , , , , , , | Leave a comment

An Introduction to Sequential Pattern Mining

In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications from text analysis to market basket analysis.  This blog post is aimed to be a short introduction. If you want to read a more … Continue reading

Posted in Big data, Data Mining, Pattern Mining | Tagged , , , , | 20 Comments

An Introduction to High-Utility Itemset Mining

In this blog post, I will give an introduction about a popular problem in data mining, which is called “high-utility itemset mining” or more generally utility mining. I  will give an overview of this problem, explains why it is interesting, and provide source code of … Continue reading

Posted in Data Mining, Research, Utility Mining | Tagged , , , , , , , , , | 99 Comments

How to auto-adjust the minimum support threshold according to the data size

Today, I will do a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FPGrowth and PrefixSpan according to the size of the data. The problem is simple.  Let’s consider … Continue reading

Posted in Data Mining, Programming | Tagged , , , , , | 48 Comments