Tag Archives: itemset

A yet more efficient high utility itemset mining algorithm (HAMM, to appear in IEEE TKDE)

Today, I will share the good news that I have participated as co-author in a paper proposing a new algorithm for high utility itemset mining called HAMM that is very efficient, and outperforms the state-of-the-art algorithm. This is not an … Continue reading

Posted in Pattern Mining, Utility Mining | Tagged , , , , , , , , , | 2 Comments

Introduction to Rare Itemset Mining

In the last decades, many data mining algorithms were developed to find interesting patterns in data (pattern mining). In particular, a lot of studies have been done about discovering frequent patterns (patterns that appear frequently) in a data. A classic … Continue reading

Posted in Pattern Mining | Tagged , , , , , , , , , , | 1 Comment

A Glossary of High Utility Pattern Mining

Today, I present a glossary of key terms usedin high utility itemset mining. This glossary will be useful to researchers and practitioners working on this important subtopic of pattern mining. By the way, if you are new to this research … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Utility Mining | Tagged , , , , , , , , | 1 Comment

What are Generator Itemsets?

Today, I will briefly explain what is a generator itemset. I will give some example and explain why generator itemsets are interesting and useful for some applications. I will also mention that efficient implementations can be found in the SPMF … Continue reading

Posted in Data Mining, Pattern Mining | Tagged , , , , , , , , | 1 Comment

How to find cost-effective patterns in data?

Have you ever wondered how to find patterns in data that are not only frequent but also profitable and cost-effective? For example, if you are an online retailer, you may want to know what products are often bought together by … Continue reading

Posted in Data Mining, Data science, Pattern Mining, spmf, Utility Mining | Tagged , , , , , , , , , , , | Leave a comment

What is a Closed Itemset and Why is it Useful?

In this blog post, I will explain in simple terms what is a closed itemset and give some examples. I will also mention a few algorithms that can be used to find closed itemsets and that they can be found … Continue reading

Posted in Data Mining, Data science, Pattern Mining | Tagged , , , , , , , , , , | 7 Comments

How to call SPMF from a C++ Program (Windows)?

I will explain how to call SPMF from a C++ program for the Windows platform. If you are interested by other programming languages, you can check my previous blog posts, where I give examples of how to call SPMF from … Continue reading

Posted in Data Mining, Data science, open-source, spmf | Tagged , , , , , , , | Leave a comment

How to call SPMF from R?

In previous blog posts, I have explained how to call SPMF as an external program from Python and how to call SPMF from C#. Today, I will explain how to call SPMF from an R program. Requirements Since SPMF is … Continue reading

Posted in Data Mining, Data science, open-source, spmf | Tagged , , , , , , , , | 8 Comments

How to draw an FP-Tree in Latex? (using TIKZ)

In this blog post, I will show how to draw a beautiful FP-Tree data structure in a Latex document. The FP-Tree is a tree-like structure that was proposed in the FP-Growth algorithm for itemset mining, and is also used in … Continue reading

Posted in Latex, Pattern Mining | Tagged , , , , , , , , , | 1 Comment

How to Analyze the Complexity of Pattern Mining Algorithms?

Today, I will explain how to analyze the complexity of pattern mining algorithms for topics such as itemset mining, sequential pattern mining, subgraph mining, sequential rule mining and periodic pattern mining. This is something that is often asked by reviewers … Continue reading

Posted in Pattern Mining | Tagged , , , , , , , | Leave a comment