Tag Archives: pattern mining

Introduction to the Apriori algorithm (with Java code)

This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not because it is the fastest, but because it has … Continue reading

Posted in Big data, Data Mining, Pattern Mining, Programming | Tagged , , , , , , | 12 Comments

An Introduction to Sequential Pattern Mining

In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications from text analysis to market basket analysis.  This blog post is aimed to be a short introduction. If you want to read a more … Continue reading

Posted in Big data, Data Mining, Pattern Mining | Tagged , , , , | 20 Comments

Introduction to time series mining with SPMF

This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06).  It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading

Posted in Big data, Data Mining, open-source, spmf, Time series | Tagged , , , , , , , , | 1 Comment

Discovering hidden patterns in texts using SPMF

This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words.  We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading

Posted in Big data, Data Mining, Data science, open-source, spmf | Tagged , , , | 9 Comments

News about SPMF

Some quick news about the SPMF project. First, this month I have made a few updates to SPMF.  Two new algorithms have been added: USpan (for high-utility sequential pattern mining) and FCHM (for correlated high utility itemset mining). Moreover, I have fixed … Continue reading

Posted in spmf | Tagged , , , , | Leave a comment

Interview with the SPMF library founder

Today, I will just write a short blog post to let you know that I was recently interviewed on Rahaman’s blog.  The interview talks about various topics such as (1) why creating the SPMF data mining library, (2) why choosing to work in … Continue reading

Posted in Big data, Data Mining, Data science, Research, spmf | Tagged , , , | 2 Comments

200,000 visitors on the SPMF website!

Today, I will just write a short blog post to mention that the SPMF open-source data mining library has recently passed the milestone of 200,000 visitors.  This is possible thanks to the support of all users of SPMF, and the contributors … Continue reading

Posted in Data Mining, Data science, open-source, Research, spmf | Tagged , , , , , | Leave a comment

New version of SPMF Java open-source data mining library (0.95)

Today, I write a post to announce a new version of the SPMF Java open-source data mining library.  It is SPMF version 0.95 and it is a major revision. It offers 11 new  data mining algorithms for various data mining … Continue reading

Posted in Data Mining, Data science, Java, open-source, spmf | Tagged , , , | 1 Comment

An introduction to frequent pattern mining

In this blog post, I will give a brief overview of an important subfield of data mining that is  called pattern mining.  Pattern mining consists of using/developing data mining algorithms to discover interesting,  unexpected and useful patterns in databases. Pattern … Continue reading

Posted in Data Mining, Research | Tagged , , , , | 28 Comments