Author Archives: Philippe Fournier-Viger

How to design memory-efficient data mining algorithms in Java?

A while ago, I wrote a blog post about How to measure the memory usage of algorithms in Java. Today, I will discuss how to optimize the memory usage of algorithms written in Java to design memory-efficient data mining algorithms. … Continue reading

Posted in Uncategorized | Leave a comment

The SPMF data mining library: a brief history and what’s next?

In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading

Posted in Data Mining, open-source, Programming, Research, spmf | Leave a comment

An Introduction to Sequential Rule Mining

In this blog post, I will discuss an interesting topic in data mining: sequential rule mining. It consists of discovering rules in sequences. This data mining task has many applications, for example analyzing the behavior of … Continue reading

Posted in Data Mining, Research | 33 Comments

How to test if a data mining algorithm implementation is correct?

In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading

Posted in Data Mining, Programming, Research, Uncategorized | 3 Comments

The PAKDD 2015 Conference (a brief report)

I attended the 19th PAKDD conference (PAKDD 2015) in Ho Chi Minh City, Vietnam, from the 19th to the 22nd of May. In this blog post, I give some brief comments about the conference. PAKDD (Pacific-Asia Conference on Knowledge Discovery and Data Mining) is a top data mining conference, held every … Continue reading

Posted in Conference, Data Mining, Data science | 4 Comments

An Introduction to High-Utility Itemset Mining

In this blog post, I will give an introduction to a popular problem in data mining, which is called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading

Posted in Data Mining, Research, Utility Mining | 99 Comments

Drawing a set-enumeration tree using Java and GraphViz

In this blog post, I will explain and provide source code to automatically draw the set-enumeration tree of a set using Java and GraphViz. Drawing a set-enumeration tree is useful in computer science, for example in frequent itemset mining, a subfield of data … Continue reading

Posted in Data Mining, Mathematics, Programming, Research | Leave a comment

How to give a good oral presentation at an academic conference?

Today, I will discuss how to give a good oral presentation of your research at academic conferences. This is a very important topic since if a presentation is interesting, more researchers will be interested in your work and it may bring … Continue reading

Posted in Conference, General, Research, Uncategorized | 4 Comments

Brief report about the ADMA 2014 conference

In this blog post, I will discuss my current trip to the ADMA 2014 conference (10th Intern. Conf. on Advanced Data Mining and Applications), held December 19-21, 2014 in Guilin, China. Note that the views expressed in this post are my personal opinion/interpretation … Continue reading

Posted in Conference, Data Mining | 6 Comments

Drawing the Powerset of a Set using Java and GraphViz (Hasse Diagram)

In this blog post, I will explain and provide source code to automatically draw the powerset of a set using Java and GraphViz. Drawing a powerset is useful in mathematics and also in computer science, for example in frequent itemset mining, it can … Continue reading

Posted in Data Mining, General, Mathematics | 3 Comments