Author Archives: Philippe Fournier-Viger

An introduction to frequent pattern mining

In this blog post, I will give a brief overview of an important subfield of data mining called pattern mining. Pattern mining consists of using or developing data mining algorithms to discover interesting, unexpected and useful patterns in databases. Pattern … Continue reading

Posted in Data Mining, Research | 28 Comments

The importance of sociability for researchers

There are several characteristics required to become a great researcher. Today, I will discuss one of them that is sometimes overlooked: sociability. Sociability means building and maintaining relationships with other researchers. The nature of the social relationships … Continue reading

Posted in General, Research | 2 Comments

How to encourage data mining researchers to share their source code and datasets?

A few months ago, I wrote a popular post on this blog about why it is important for researchers to publish source code and datasets. I explained several advantages that researchers can get by sharing the source code of … Continue reading

Posted in Data Mining, Research

The importance of constraints in data mining

Today, I will discuss an important concept in data mining: the use of constraints. Data mining is a broad field incorporating many different kinds of techniques for discovering unexpected and new knowledge from data. Some main data mining … Continue reading

Posted in Data Mining

How to measure the memory usage of data mining algorithms in Java?

Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading

Posted in Data Mining, Programming, Research | 1 Comment

How to make good looking charts for research papers?

Charts are often used in research papers to present experimental results. Today, I will discuss how to make good-looking charts for presenting research results. I will not cover everything about this topic, but I will explain some key ideas. … Continue reading

Posted in General, Research | 5 Comments

How to search for a research advisor by e-mail?

In this blog post, I will talk about how to search for a research advisor by e-mail. I am writing about this because today I received an e-mail from a Ph.D. student from abroad asking to work with me as … Continue reading

Posted in General, Research | 3 Comments

What are the steps to implement a data mining algorithm?

In this post, I will discuss the steps that I follow to implement a data mining algorithm. The subject of this post comes from a question that I recently received by e-mail, and I think that it … Continue reading

Posted in Data Mining, Programming | 45 Comments

Choosing data structures according to what you want to do

Today, I am writing a post about programming. I want to share a simple but important idea for writing optimized code. The idea is to choose data structures according to what you want to do instead of what you want to … Continue reading

Posted in Data Mining, Programming

Analyzing the source code of the SPMF data mining software

Hi everyone. In this blog post, I will discuss how I applied an open-source tool named Code Analyzer (http://sourceforge.net/projects/codeanalyze-gpl/) to analyze the source code of my open-source data mining software, SPMF. I have applied … Continue reading

Posted in Data Mining, Java, open-source, Pattern Mining, Programming, spmf | 2 Comments