Tag Archives: spmf

The SPMF data mining library v.2.40 is released!

Hi all, I am pleased to announce that a new version of SPMF has just been published (v.2.40). It contains 9 novel algorithms: the HUIM-ABC algorithm for mining high utility itemsets using Artificial Bee Colony Optimization (thanks to Wei Song and Chaoming Huang for contributing the code) … Continue reading

Posted in Data Mining, open-source, Pattern Mining

Analyzing the source code of SPMF (5 years later)

Five years ago, I analyzed the source code of the SPMF data mining software using an open-source tool called CodeAnalyzer (http://sourceforge.net/projects/codeanalyze-gpl/). This provided some interesting insights about the structure of the project, especially in terms of lines of code and code to … Continue reading

Posted in Data Mining, Data science, open-source

(video) Minimal High Utility Itemset Mining with MinFHM

This is a video presentation of the paper “Mining Minimal High Utility Itemsets” about high utility itemset mining using MinFHM. It is the first in a series of videos that will explain various data mining algorithms. Video link: http://www.philippe-fournier-viger.com/spmf/videos/minfhm.mp4 More information … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Video

How to run SPMF without installing Java?

The SPMF data mining software is popular open-source software for discovering patterns in data and for performing other data mining tasks. Typically, Java must be installed on a computer to run SPMF. However, it is possible to run SPMF on a computer that does not have Java installed. For example, … Continue reading

Posted in Data Mining, Data science, open-source, Pattern Mining, Research

Introduction to clustering: the K-Means algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what the goal of clustering is, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, open-source

Introduction to time series mining with SPMF

This blog post briefly explains how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06). It first explains what a time series is and then discusses how data mining can be performed on time series. What is … Continue reading

Posted in Big data, Data Mining, open-source, Time series

SPMF data mining library 0.98: new pattern visualization window

This blog post is to let you know that I have just published a new version of the SPMF open-source Java data mining library (0.98) that offers a new window for visualizing the patterns found by data mining algorithms. This … Continue reading

Posted in Data Mining, General, open-source, Research

The SPMF data mining library: a brief history and what’s next?

In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading

Posted in Data Mining, Programming, Research

An Introduction to High-Utility Itemset Mining

In this blog post, I will give an introduction to a popular problem in data mining called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading

Posted in Data Mining, Research, Utility Mining | 58 Comments

Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz

Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show how to visualize the frequent sequential patterns found using GraphViz. Step 1: getting the … Continue reading

Posted in Data Mining, Programming | 8 Comments