Category Archives: Pattern Mining

25 years of pattern mining

It is now 2019, already 25 years since Agrawal wrote his seminal papers on frequent itemset mining and association rule mining in 1994. Since then, thousands of papers have been published on this topic, some about algorithm design, new pattern mining … Continue reading

Posted in Big data, Data Mining, Data science, Pattern Mining

(video) Mining Frequent Itemsets with the Apriori algorithm

This is a video presentation of the Apriori algorithm for discovering frequent itemsets in data. Frequent itemset mining is one of the most popular data mining tasks. VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/apriori.mp4 The Java source code of the Apriori algorithm and datasets for evaluating its performance are available in the SPMF software. If you want … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Video

(video) Mining Sequential Rules with RuleGrowth

This is a video presentation of the paper “Mining Partially-Ordered Sequential Rules Common to Multiple Sequences” about discovering sequential rules in sequences using the RuleGrowth algorithm. VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/rulegrowth.mp4 More information about the RuleGrowth algorithm is provided in this research paper: Fournier-Viger, P., Wu, C.-W., Tseng, V.S., Cao, L., Nkambou, R. (2015). Mining Partially-Ordered Sequential Rules Common to Multiple … Continue reading

Posted in Big data, Data Mining, Pattern Mining

Introduction to frequent subranking mining

Rankings are made in many fields, as we naturally tend to rank objects, persons, or things in different contexts. For example, in a singing or sports competition, judges will rank participants from worst to best and give prizes to … Continue reading

Posted in Big data, Data Mining, Data science, Pattern Mining

(video) Minimal High Utility Itemset Mining with MinFHM

This is a video presentation of the paper “Mining Minimal High Utility Itemsets” about high utility itemset mining using MinFHM. It is the first video of a series of videos that will explain various data mining algorithms. VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/minfhm.mp4 More information … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Video

The future of pattern mining

In this blog post, I will talk about the future of research on pattern mining. I will also discuss some lessons learnt from the decades of research in this field and talk about research opportunities. What is the state of research on pattern mining? Over the last … Continue reading

Posted in Data Mining, Pattern Mining

How to run SPMF without installing Java?

SPMF is a popular open-source data mining software for discovering patterns in data and for performing other data mining tasks. Typically, Java must be installed on a computer to run SPMF. However, it is possible to run SPMF on a computer that does not have Java installed. For example, … Continue reading

Posted in Data Mining, Data science, open-source, Pattern Mining, Research, spmf

On the Completeness of the CloSpan and IncSpan algorithms

In this blog post, I will briefly discuss the fact that the popular CloSpan algorithm for frequent sequential pattern mining is an incomplete algorithm. This means that in some special situations, CloSpan does not produce the results it was designed to produce, and in particular some patterns are … Continue reading

Posted in Data Mining, Pattern Mining

On the correctness of the FSMS algorithm for frequent subgraph mining

In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I am publishing this blog post because I found that the algorithm is incorrect after spending a few days implementing it in 2017, and I wish to save time for other researchers … Continue reading

Posted in Big data, Data Mining, Pattern Mining | 2 Comments

Introduction to the Apriori algorithm (with Java code)

This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1994, more than 20 years ago, it remains one of the most important data mining algorithms, not because it is the fastest, but because it has … Continue reading

Posted in Big data, Data Mining, Pattern Mining, Programming | 12 Comments