Category Archives: Big data

Introduction to clustering: the K-Means algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis).  I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, Open-source | Tagged , , , , , , | 8 Comments

Introduction to time series mining with SPMF

This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06).  It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading

Posted in Big data, Data Mining, Open-source, Time series | Tagged , , , , , , , , | 2 Comments

The KDDCup 2015 dataset

The KDD cup 2015 dataset is about MOOC dropout prediction. I have  had recently found that the dataset had been offline on the official website. Thus, I have uploaded a copy of the KDD cup 2015 dataset on my website. You can download … Continue reading

Posted in Big data, Data Mining, Data science, Research | Leave a comment

Discovering hidden patterns in texts using SPMF

This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words.  We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading

Posted in Big data, Data Mining, Data science, Open-source | 10 Comments

Brief report about the Dexa 2016 and Dawak 2016 conferences

This week, I have been attending the DEXA 2016 and DA‎WAK 2016 conferences, in Porto, Portugal, from the 4th to 8th September 2016, to present three papers. In this blog post, I will give a brief report about these conferences. About these … Continue reading

Posted in Big data, Conference, Data Mining, Research | 2 Comments

Brief report about the MLDM 2016 Conference (12th International Conference on Machine Learning and Data Mining conference)

In this blog post, I will provide a brief report about the 12th Intern. Conference on Machine Learning and Data Mining (MLDM 2016), that I have attended from the 18th to 20th July 2016 in Newark, USA. About the conference This is the 12th edition … Continue reading

Posted in Big data, Conference, Data Mining, Data science | 1 Comment

Brief report about the 16th Industrial Conference on Data mining 2016 (ICDM 2016)

In this blog post, I will provide a brief report about the 16th Industrial Conference on Data mining 2016, that I have attended from the 13 to 14 July 2016 in Newark, USA. About the conference The Industrial Conference on Data Mining is an … Continue reading

Posted in Big data, Conference, Data Mining | 3 Comments

The top journals and conferences in data mining / data science

A key question for data mining and data science researchers is to know what are the top journals and conferences in the field, since it is always best to publish in the most popular journals or conferences. In this blog post, … Continue reading

Posted in Big data, Data Mining, Data science, Research | 3 Comments

An introduction to periodic pattern mining

In this blog post I will give an introduction to the discovery of periodic patterns in data. Mining periodic patterns is an important data mining task as patterns may periodically appear in all kinds of data, and it may be desirable to find them … Continue reading

Posted in Big data, Data Mining, Data science, Open-source, Research, Utility Mining | 3 Comments

Full-time faculty positions at Harbin Institute of Technology (data mining, statistics, psychology, design…)

Full-time faculty positions at Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China The Center of Innovative Industrial Design is currently looking to hire full-time faculty members at the rank of assistant professor, associate professor or professor, with expertise in … Continue reading

Posted in artificial intelligence, Big data, Data Mining, Data science, Research | Leave a comment