Tag Archives: data mining
An introduction to frequent subgraph mining
In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graphs in many domains (e.g. …
Posted in Big data, Data Mining, Data science
Tagged algorithm, big data, data mining, data science, frequent subgraphs, graph, pattern mining
4 Comments
We are launching a new data mining journal
In this blog post, I will discuss one of my current projects. I have recently been working with my colleague Chun-Wei Lin on launching a new journal, titled "Data Science and Pattern Recognition". This is a new open-access journal, …
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, journal
Introduction to clustering: the K-Means algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of …
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
Introduction to time series mining with SPMF
This blog post briefly explains how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06). It first explains what a time series is and then discusses how data mining can be performed on time series. What is …
Posted in Big data, Data Mining, Opensource, Time series
Tagged big data, data mining, data science, java, opensource, pattern mining, SAX algorithm, spmf, time series
Brief report about the IEA AIE 2016 conference
This week, I attended the IEA AIE 2016 conference, held in Morioka, Japan, from the 2nd to the 4th of August 2016. In this blog post, I will briefly discuss the conference. About the conference IEA AIE 2016 (29th International Conference on …
Posted in artificial intelligence, Conference, Data Mining, Data science, Research
Tagged artificial intelligence, conferenced, data mining
2 Comments
SPMF data mining library 0.98: new pattern visualization window
This blog post is to let you know that I have just published a new version of the SPMF open-source Java data mining library (0.98) that offers a new window for visualizing the patterns found by data mining algorithms. This …
Posted in Data Mining, General, Opensource, Research
Tagged big data, data mining, GPL, library, opensource, spmf
The SPMF data mining library: a brief history and what’s next?
In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of …
Posted in Data Mining, Programming, Research
Tagged data mining, library, opensource, spmf
An Introduction to Sequential Rule Mining
In this blog post, I will discuss an interesting topic in data mining: sequential rule mining. It consists of discovering rules in sequences. This data mining task has many applications, for example analyzing the behavior of …
Posted in Big data, Data Mining, Data science, Research
Tagged data mining, frequent patterns, high utility, sequential rules
5 Comments
How to test if a data mining algorithm implementation is correct?
In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. …
Posted in Data Mining, Programming, Research
Tagged algorithm, correctness, data mining, debugging
3 Comments
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining, which is called "high-utility itemset mining" or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of …
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, highutility mining, itemset mining, java, opensource, source code, spmf, utility mining
60 Comments