Category Archives: open-source

SPMF 2.52 is released

This is just a short blog post to let you know that a new version of the SPMF library has been released, called version 2.52. This new version contains two new algorithms for high utility itemset mining and one for … Continue reading

Posted in Data Mining, open-source, Pattern Mining, spmf, Utility Mining | Tagged , , , , , , , , | Leave a comment

Towards SPMF v3.0…

Today, I will talk a little bit about the recent improvements and future direction for the SPMF data mining library. How SPMF started? SPMF is a software project that I started around 2008 when I was a Ph.D student in … Continue reading

Posted in Data Mining, Data science, open-source, spmf | Tagged , , , , , | Leave a comment

New version of SPMF (2.44): 4 new algorithms, datasets and features

Today, I am happy to announce that a new version of the SPMF open-source data mining software is released (v. 2.44). This is the download page. This new version was made possible due to several contributors. What is new? New … Continue reading

Posted in Data Mining, Data science, open-source, Pattern Mining, spmf, Utility Mining | Tagged , , , , , , , | Leave a comment

The SPMF data mining library v.2.40 is released!

Hi all, I am please to announce that a new version of SPMF has just been published (v 2.40). It contains 9 novel algorithms: It also includes some bug fixes and other minor improvements. I did not release a new version of SPMF since a few … Continue reading

Posted in Data Mining, open-source, Pattern Mining, spmf | Tagged , , , , , | Leave a comment

Analyzing the source code of SPMF (5 years later)

Five years ago, I had analyzed the source code of the SPMF data mining software using an open-source tool called CodeAnalyzer ( http://sourceforge.net/projects/codeanalyze-gpl/ ). This had provided some interesting insights about the structure of the project, especially in terms of lines of codes and code to … Continue reading

Posted in Data Mining, Data science, open-source, spmf | Tagged , , , , , | Leave a comment

How to run SPMF without installing Java?

The SPMF data mining software is a popular open-source software for discovering patterns in data and for performing other data mining tasks. Typically, to run SPMF, Java must have been installed on a computer. However, it is possible to run SPMF on a computer that does not have Java installed. For example, … Continue reading

Posted in Data Mining, Data science, open-source, Pattern Mining, Research, spmf | Tagged , , , | Leave a comment

Introduction to clustering: the K-Means algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis).  I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, open-source | Tagged , , , , , , | 1 Comment

Introduction to time series mining with SPMF

This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06).  It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading

Posted in Big data, Data Mining, open-source, spmf, Time series | Tagged , , , , , , , , | 1 Comment

Discovering hidden patterns in texts using SPMF

This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words.  We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading

Posted in Big data, Data Mining, Data science, open-source, spmf | Tagged , , , | 9 Comments

An introduction to periodic pattern mining

In this blog post I will give an introduction to the discovery of periodic patterns in data. Mining periodic patterns is an important data mining task as patterns may periodically appear in all kinds of data, and it may be desirable to find them … Continue reading

Posted in Big data, Data Mining, Data science, open-source, Research, Uncategorized, Utility Mining | 29 Comments