Tag Archives: data mining

Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz

Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show to how visualize the frequent sequential patterns found using GraphViz. Step 1 :  getting the … Continue reading

Posted in Data Mining, Programming | Tagged , , , , , | 8 Comments

Brief report about the ADMA 2013 conference

In this blog post, I will discuss my recent trip to the ADMA 2013 conference (9th Intern. Conf. on Advanced Data Mining and Applications in China (December 14-16 2013 in Hangzhou, China at Zhejiang University). Note that the view expressed … Continue reading

Posted in Data Mining, General | Tagged , , , | 2 Comments

How to encourage data mining researchers to share their source code and datasets?

A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“.  I explained several advantages that researchers can get by sharing the source code of … Continue reading

Posted in Data Mining, Research | Tagged , , , | Leave a comment

The importance of constraints in data mining

Today, I will discuss an important concept in data mining which is the use of constraints. Data mining is a broad field incorporating many different kind of techniques for discovering unexpected and new knowledge from data. Some main data mining … Continue reading

Posted in Data Mining | Tagged , , | Leave a comment

How to measure the memory usage of data mining algorithms in Java?

Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading

Posted in Data Mining, Programming, Research | Tagged , , , , , | 1 Comment

What are the steps to implement a data mining algorithm?

In this post, I will discuss what are the steps that I follow to implement a data mining algorithm.  The subject of this post comes from a question that I have received by e-mail recently, and I think that it … Continue reading

Posted in Data Mining, Programming | Tagged , , , , | 42 Comments

Analyzing the source code of the SPMF data mining software

Hi everyone, In this blog post, I will discuss how I have applied an open-source tool that is named Code Analyzer ( http://sourceforge.net/projects/codeanalyze-gpl/ )  to analyze the source code of my open-source data mining software named SPMF. I have applied … Continue reading

Posted in Data Mining, Programming | Tagged , , , , | 1 Comment

How to characterize and compare data mining algorithms?

Hi, today, I will discuss how to compare data mining algorithms.  This is an important question for data mining researchers who want to evaluate which algorithm is “better” in general or for a given situation.  This question is also important … Continue reading

Posted in Data Mining, Programming, Research | Tagged , , , , , | 7 Comments

A Map of Data Mining Algorithms (offered in SPMF v092c)

Hi, I have made a map to visualize the relationship between the 52 different data mining algorithms offered in the SPMF data mining software.  You can view it in PNG format by clicking on the picture below: Or you can … Continue reading

Posted in Data Mining, Programming | Tagged , , , , , | 2 Comments

How to choose a good thesis topic in Data Mining?

I have seen many people asking for help in data mining forums and on other websites about how to choose a good thesis topic in data mining.  Therefore, in this this post, I will address this question. The first thing … Continue reading

Posted in Data Mining, Research | Tagged , , | 475 Comments