Tag Archives: data mining

Interview with the SPMF library founder

Today, I will just write a short blog post to let you know that I was recently interviewed on Rahaman’s blog.  The interview talks about various topics such as (1) why creating the SPMF data mining library, (2) why choosing to work in … Continue reading

Posted in Big data, Data Mining, Data science, Research, spmf | Tagged , , , | 2 Comments

200,000 visitors on the SPMF website!

Today, I will just write a short blog post to mention that the SPMF open-source data mining library has recently passed the milestone of 200,000 visitors.  This is possible thanks to the support of all users of SPMF, and the contributors … Continue reading

Posted in Data Mining, Data science, open-source, Research, spmf | Tagged , , , , , | Leave a comment

The SPMF data mining library: a brief history and what’s next?

In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, which I am the founder of. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading

Posted in Data Mining, open-source, Programming, Research, spmf | Tagged , , , | Leave a comment

An Introduction to Sequential Rule Mining

In this blog post, I will discuss an interesting topic in data mining, which is the topic of sequential rule mining. It consists of discovering rules in sequences. This data mining task has many applications for example for analyzing the behavior of … Continue reading

Posted in Data Mining, Research | Tagged , , , | 33 Comments

How to test if a data mining mining algorithm implementation is correct?

In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading

Posted in Data Mining, Programming, Research, Uncategorized | Tagged , , , | 3 Comments

The PAKDD 2015 Conference (a brief report)

I have attended the 19th PAKDD 2015 conference in Ho Chi Minh City, Vietnam from the 19th to 22nd May. In this blog post, I give some brief comments about the PAKDD 2015 conference.  PAKDD ( Pacific-Asia Conference series on Knowledge Discovery and Data Mining ) is a top data mining conference, held every … Continue reading

Posted in Conference, Data Mining, Data science | Tagged , , , , | 4 Comments

An Introduction to High-Utility Itemset Mining

In this blog post, I will give an introduction about a popular problem in data mining, which is called “high-utility itemset mining” or more generally utility mining. I  will give an overview of this problem, explains why it is interesting, and provide source code of … Continue reading

Posted in Data Mining, Research, Utility Mining | Tagged , , , , , , , , , | 99 Comments

Big Problems only found in Big Data?

Today, I will discuss the topic of Big Data, which is a very popular topic nowadays.  The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading

Posted in artificial intelligence, Data Mining, General, Programming | Tagged , , , | Leave a comment

New version of SPMF Java open-source data mining library (0.95)

Today, I write a post to announce a new version of the SPMF Java open-source data mining library.  It is SPMF version 0.95 and it is a major revision. It offers 11 new  data mining algorithms for various data mining … Continue reading

Posted in Data Mining, Data science, Java, open-source, spmf | Tagged , , , | 1 Comment

Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz

Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show to how visualize the frequent sequential patterns found using GraphViz. Step 1 :  getting the … Continue reading

Posted in Data Mining, Java, open-source, Programming, spmf | Tagged , , , , , | 8 Comments