Tag Archives: data mining

The SPMF data mining library: a brief history and what’s next?

In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, which I am the founder of. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading

Posted in Data Mining, Programming, Research | Tagged , , , | Leave a comment

An Introduction to Sequential Rule Mining

In this blog post, I will discuss an interesting topic in data mining, which is the topic of sequential rule mining. It consists of discovering rules in sequences. This data mining task has many applications for example for analyzing the behavior of … Continue reading

Posted in Big data, Data Mining, Data science, Research | Tagged , , , | 18 Comments

How to test if a data mining mining algorithm implementation is correct?

In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading

Posted in Data Mining, Programming, Research | Tagged , , , | 5 Comments

An Introduction to High-Utility Itemset Mining

In this blog post, I will give an introduction about a popular problem in data mining, which is called “high-utility itemset mining” or more generally utility mining. I  will give an overview of this problem, explains why it is interesting, and provide source code of … Continue reading

Posted in Data Mining, Research, Utility Mining | Tagged , , , , , , , , , | 97 Comments

Big Problems only found in Big Data?

Today, I will discuss the topic of Big Data, which is a very popular topic nowadays.  The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading

Posted in artificial intelligence, Data Mining, General, Programming | Tagged , , , | Leave a comment

Report of the PAKDD 2014 conference (part 3)

This post continue my report of the PAKDD 2014 in Tainan (Taiwan). The panel about big data Friday, there was a great panel about big data with 7 top researchers from the field of data mining.  I will try to faithfully report some … Continue reading

Posted in Academia, Big data, Conference, Data Mining, Data science | Tagged , , , , | 1 Comment

Report of the PAKDD 2014 conference (part 2)

This post will continue my report of the PAKDD 2014 in Tainan (Taiwan). About big data Another interesting talk at this conference was given by Jian Pei. The topic was Big Data. Some key ideas in this talk was that to make … Continue reading

Posted in Academia, Big data, Conference, Data Mining, Data science | Tagged , , , , | 2 Comments

Report of the PAKDD 2014 conference (part 1)

I am currently at the PAKDD 2014 conference in Tainan, In this post, I will report interesting information about the conference and talks that I have attended. Importance of Succint Data Structures for Data Mining I have attended a very nice … Continue reading

Posted in Big data, Conference, Data Mining, Data science | Tagged , , , | 2 Comments

Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz

Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show to how visualize the frequent sequential patterns found, using GraphViz. Step 1 :  getting the … Continue reading

Posted in Big data, Data Mining, Data science, Programming | Tagged , , , , , | 8 Comments

Brief report about the ADMA 2013 conference

In this blog post, I will discuss my recent trip to the ADMA 2013 conference (9th Intern. Conf. on Advanced Data Mining and Applications in China (December 14-16 2013 in Hangzhou, China at Zhejiang University). Note that the view expressed … Continue reading

Posted in Data Mining, General | Tagged , , , | 2 Comments