(video) Minimal High Utility Itemset Mining with MinFHM

This is a video presentation of the paper “Mining Minimal High Utility Itemsets” about high utility itemset mining using MinFHM. It is the first video of a series of videos that will explain various data mining algorithms.  (link to download … Continue reading

Upcoming book: High Utility Itemset Mining: Theory, Algorithms and Applications

I am happy to announce that the draft of the book about high utility pattern mining has been finalized and submitted to the publisher (Springer). It should thus be published in the very near future. The book contains 12 chapters written … Continue reading

On the Completeness of the CloSpan and IncSpan algorithms

In this blog post, I will briefly discuss the fact that the popular CloSpan algorithm for frequent sequential pattern mining is an incomplete algorithm.  This means that in some special situations, CloSpan does not produce the expected results that it has been designed for, and … Continue reading

On the correctness of the FSMS algorithm for frequent subgraph mining

In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm.  I will publish this blog post because I have found that the algorithm is incorrect after spending a few days to … Continue reading

How to discover interesting patterns in data?

Discovering interesting patterns in data is often referred as data mining, data science or big data.  In the last few years, I have written several blog posts providing introduction to data mining and key topics in data mining: An Introduction to … Continue reading

Call for chapters: High Utility Pattern Mining, the book

CALL FOR CHAPTERS High-Utility Pattern Mining: Theory, Algorithms and Applications Editors: Philippe Fournier-Viger, Chun-Wei Lin, Roger Nkambou, Bay Vo An edited book to be published by Springer in 2018 Introduction This book will provide an introduction to the high utility mining, reviews state-of-the-art … Continue reading

An introduction to frequent subgraph mining

In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graph in many domains (e.g. … Continue reading

Introduction to time series mining with SPMF

This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06).  It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading

An introduction to frequent pattern mining

In this blog post, I will give a brief overview of an important subfield of data mining that is  called pattern mining.  Pattern mining consists of using/developing data mining algorithms to discover interesting,  unexpected and useful patterns in databases. Pattern … Continue reading