Category Archives: Data Mining

Report of the PAKDD 2014 conference (part 3)

This post continue my report of the PAKDD 2014 in Tainan (Taiwan). The panel about big data Friday, there was a great panel at PAKDD about big data with 7 top researchers from the field of data mining.  I will try … Continue reading

Posted in Conference, Data Mining | 1 Comment

Report of the PAKDD 2014 conference (part 2)

This post will continue my report of the PAKDD 2014 in Tainan (Taiwan). About big data Another interesting talk at this conference was given by Prof. Jian Pei. The topic was Big Data. Some key ideas in this talk was that to … Continue reading

Posted in Conference, Data Mining | 2 Comments

Report of the PAKDD 2014 conference (part 1)

I am currently at the PAKDD 2014 conference in Tainan, In this post, I will report interesting information about the conference and talks that I have attended. Importance of Succint Data Structures for Data Mining I have attended a very nice … Continue reading

Posted in Conference, Data Mining | 7 Comments

New version of SPMF Java open-source data mining library (0.95)

Today, I write a post to announce a new version of the SPMF Java open-source data mining library.  It is SPMF version 0.95 and it is a major revision. It offers 11 new  data mining algorithms for various data mining … Continue reading

Posted in Data Mining, Data science, Java, open-source, spmf | Tagged , , , | 1 Comment

Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz

Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show to how visualize the frequent sequential patterns found using GraphViz. Step 1 :  getting the … Continue reading

Posted in Data Mining, Java, open-source, Programming, spmf | Tagged , , , , , | 8 Comments

Why data mining researchers should evaluate their algorithms against state-of-the-art algorithms?

sA common problem in research on data mining is that researchers proposing new data mining algorithms often do not compare the performance of their new algorithm with the current state-of-the art data mining algorithms. For example, let me illustrate this … Continue reading

Posted in Data Mining, Programming, Research | 4 Comments

Brief report about the ADMA 2013 conference

In this blog post, I will discuss my recent trip to the ADMA 2013 conference (9th Intern. Conf. on Advanced Data Mining and Applications in China (December 14-16 2013 in Hangzhou, China at Zhejiang University). Note that the view expressed … Continue reading

Posted in Data Mining, General | Tagged , , , | 5 Comments

An introduction to frequent pattern mining

In this blog post, I will give a brief overview of an important subfield of data mining that is  called pattern mining.  Pattern mining consists of using/developing data mining algorithms to discover interesting,  unexpected and useful patterns in databases. Pattern … Continue reading

Posted in Data Mining, Research | Tagged , , , , | 28 Comments

How to encourage data mining researchers to share their source code and datasets?

A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“.  I explained several advantages that researchers can get by sharing the source code of … Continue reading

Posted in Data Mining, Research | Tagged , , , | Leave a comment

The importance of constraints in data mining

Today, I will discuss an important concept in data mining which is the use of constraints. Data mining is a broad field incorporating many different kind of techniques for discovering unexpected and new knowledge from data. Some main data mining … Continue reading

Posted in Data Mining | Tagged , , | Leave a comment