Category Archives: Data science

Some interesting statistics about SPMF

While I am preparing the next version of Java SPMF data mining software (2.60), here are some interesting statistics about the project, that I have generated directly from the metadata provided by SPMF. Here it is: The number of algorithms … Continue reading

Posted in Data Mining, Data science, spmf | Tagged , , , , , | Leave a comment

Sneak peak at the new user interface of SPMF (part 1)

I am currently working on the next version of SPMF, which will be called 2.60. There will be several improvements to the user interface of SPMF. Here is an overview of some of the improvements to give you a sneak … Continue reading

Posted in Big data, Data Mining, Data science, spmf | Leave a comment

UDML 2024 Accepted papers

Today, I want to talk to you about the upcoming UDML 2024 workshop at the PAKDD 2024 conference. This year is the 6th edition of the UDML workshop. I am happy to say that this year, we received a record … Continue reading

Posted in Conference, Data Mining, Data science, Pattern Mining, Utility Mining | Tagged , , , , , | Leave a comment

A new survey paper on episode mining!

I am pleased to announce today that my collaborators and I have published a new survey paper about episode mining to give an introduction to this nice and interesting subfield of pattern mining. To our knowledge this is the most … Continue reading

Posted in Data Mining, Data science, Pattern Mining | Tagged , , , , , , , | Leave a comment

K-Means Interactive Demo in your browser

In this blog post, I introduce a new interactive tool for showing a demonstration of the K-Means algorithm for students (for teaching purposes). The K-Means clustering demo tool can be accessed here: philippe-fournier-viger.com/tools/kmeans_demo.php The K-Means demo, first let you enter … Continue reading

Posted in Data Mining, Data science | Tagged , , , , , | Leave a comment

SPMF: upcoming feature: The Memory Viewer

Today, I would like to introduce an upcoming feature that will be released in the next version of SPMF (v. 2.60). It is a tool called the Memory Viewer. This tool is very simple yet useful for investigating the performance … Continue reading

Posted in Data Mining, Data science, open-source, Pattern Mining, spmf | Tagged , , , , , , , | Leave a comment

An Online Demo of the Eclat Algorithm

I have created a new interactive webpage to demonstrate how the Eclat algorithm is applied for frequent itemset mining. This webpage allows to enter a transaction database, select the minimum support and to see step by step what the Eclat … Continue reading

Posted in Big data, Data Mining, Data science, Pattern Mining, Research | Leave a comment

An Online Tool to Draw FP-Trees

This blog post is to introduce a new tool made of HTML5 and JavaScript for drawing FP-trees, which you can access here:¬†https://www.philippe-fournier-viger.com/tools/draw_fptree.php If you are not familiar with FP-trees, the FP-tree is a data structure used in the field of … Continue reading

Posted in Data Mining, Data science, Pattern Mining | Tagged , , , , , , , , | Leave a comment

How to count the frequency of words and ngrams in a text document? – Two online tools

I have added two new webpages offering tools to count the frequency of each word and ngrams (consecutive sequences of words) in a text document. These webpages can be found here: The Word Frequency Counter First, let me show you … Continue reading

Posted in Data Mining, Data science | Tagged , , , , , , , , | Leave a comment

A Glossary of High Utility Pattern Mining

Today, I present a glossary of key terms usedin high utility itemset mining. This glossary will be useful to researchers and practitioners working on this important subtopic of pattern mining. By the way, if you are new to this research … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Utility Mining | Tagged , , , , , , , , | 1 Comment