This year, we are in 2019, and it is already 25 years since Agrawal wrote his seminal papers on frequent itemset mining and association rule mining in 1994. Since then, there has been thousands of papers published on this topic, … Continue reading

# Tag Archives: pattern mining

This is a video presentation of the Apriori algorithm for discovering frequent itemsets in data. The Java source code of the Apriori algorithm and datasets for evaluating its performance are available in the SPMF software. If you want to … Continue reading

This is a video presentation of the paper “Mining Partially-Ordered Sequential Rules Common to Multiple Sequences” about discovering sequential rules in sequences using the RuleGrowth algorithm. More information about the RuleGrowth algorithm are provided in this research paper: Fournier-Viger, P., Wu, C.-W., Tseng, … Continue reading

Rankings are made in many fields, as we naturally tend to rank objects, persons or things, in different contexts. For example, in a singing or a sport competition, some judges will rank participants from worst to best and give prizes … Continue reading

This is a video presentation of the paper “Mining Minimal High Utility Itemsets” about high utility itemset mining using MinFHM. It is the first video of a series of videos that will explain various data mining algorithms. (link to download … Continue reading

I am happy to announce that the draft of the book about high utility pattern mining has been finalized and submitted to the publisher (Springer). It should thus be published in the very near future. The book contains 12 chapters written … Continue reading

In this blog post, I will talk about the future of research on pattern mining. I will also discuss some lessons learnt from the decades of research in this field and talk about research opportunities. What is the state of … Continue reading

In this blog post, I will briefly discuss the fact that the popular CloSpan algorithm for frequent sequential pattern mining is an incomplete algorithm. This means that in some special situations, CloSpan does not produce the expected results that it has been designed for, and … Continue reading

In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I will publish this blog post because I have found that the algorithm is incorrect after spending a few days to … Continue reading

Discovering interesting patterns in data is often referred as data mining, data science or big data. In the last few years, I have written several blog posts providing introduction to data mining and key topics in data mining: An Introduction to … Continue reading