Tag Archives: data mining

An introduction to frequent subgraph mining

In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graphs in many domains (e.g. … Continue reading

Posted in Big data, Data Mining, Data science | 4 Comments

We are launching a new data mining journal

In this blog post, I will discuss one of my current projects. I have recently been working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition”. This is a new open-access journal, … Continue reading

Posted in Big data, Data Mining, Data science, Research | Leave a comment

Introduction to clustering: the K-Means algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what the goal of clustering is, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, Open-source | Leave a comment

Introduction to time series mining with SPMF

This blog post briefly explains how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06). It first explains what a time series is and then discusses how data mining can be performed on time series. What is … Continue reading

Posted in Big data, Data Mining, Open-source, Time series | Leave a comment

Brief report about the IEA AIE 2016 conference

This week, I attended the IEA AIE 2016 conference, held in Morioka, Japan, from the 2nd to the 4th of August 2016. In this blog post, I will briefly discuss the conference. About the conference: IEA AIE 2016 (29th International Conference on … Continue reading

Posted in artificial intelligence, Conference, Data Mining, Data science, Research | 2 Comments

SPMF data mining library 0.98: new pattern visualization window

This blog post is to let you know that I have just published a new version of the SPMF open-source Java data mining library (0.98) that offers a new window for visualizing the patterns found by data mining algorithms. This … Continue reading

Posted in Data Mining, General, Open-source, Research | Leave a comment

The SPMF data mining library: a brief history and what’s next?

In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading

Posted in Data Mining, Programming, Research | Leave a comment

An Introduction to Sequential Rule Mining

In this blog post, I will discuss an interesting topic in data mining: sequential rule mining. It consists of discovering rules in sequences. This data mining task has many applications, for example analyzing the behavior of … Continue reading

Posted in Big data, Data Mining, Data science, Research | 5 Comments

How to test if a data mining algorithm implementation is correct?

In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading

Posted in Data Mining, Programming, Research | 3 Comments

An Introduction to High-Utility Itemset Mining

In this blog post, I will give an introduction to a popular problem in data mining called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading

Posted in Data Mining, Research, Utility Mining | 60 Comments