Today, I am writing a post about programming. I want to share a simple but important idea for writing optimized code. The idea is to choose data structures according to what you want to do instead of what you want to … Continue reading

# Category Archives: Data Mining

Hi everyone. In this blog post, I will discuss how I applied an open-source tool named Code Analyzer (http://sourceforge.net/projects/codeanalyze-gpl/) to analyze the source code of my open-source data mining software, SPMF. I have applied … Continue reading

Today, I will do a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FPGrowth and PrefixSpan according to the size of the data. The problem is simple. Let’s consider … Continue reading

Hi, today, I will discuss how to compare data mining algorithms. This is an important question for data mining researchers who want to evaluate which algorithm is “better” in general or for a given situation. This question is also important … Continue reading

Hi, I have made a map to visualize the relationships between the 52 different data mining algorithms offered in the SPMF data mining software. You can view it in PNG format by clicking on the picture below: Or you can … Continue reading

I have seen many people asking for help in data mining forums and on other websites about how to choose a good thesis topic in data mining. Therefore, in this post, I will address this question. The first thing … Continue reading

In this post, I will discuss what it takes to be a good data mining programmer and how to become one. Data mining is a broad field that can be approached from several angles. Some people with a mathematical background will … Continue reading