
Recent Posts
 Plagiarism by Bhawna Mallick and Kriti Raj at Galgotias College of Engineering & Technology
 How to publish in top conferences/journals? The Blue Ocean Strategy
 This is why you should visualize your data!
 An Introduction to Sequential Pattern Mining
 An Introduction to Data Mining
 Write more papers or write better papers? (quantity vs quality)
 Using LaTeX for writing research papers
 An introduction to frequent subgraph mining
 We are launching a new data mining journal
 What is the job of a university professor?
Categories
 Academia (8)
 artificial intelligence (4)
 Big data (24)
 Conference (12)
 Data Mining (55)
 Data science (20)
 General (27)
 Mathematics (2)
 Opensource (6)
 Plagiarism (2)
 Programming (15)
 Research (48)
 Time series (1)
 Utility Mining (2)
Tag cloud
academia academic journal algorithm algorithms articles big data classification comparison conference data mining data science datasets frequent pattern mining frequent patterns graph internet itemset mining java journal library M.Sc. map opensource paper papers pattern mining peerreview Ph.D. phd plagiarism programmer programming publications Research research advisor research papers sequence sequential patterns software source code spmf thesis topic visualization website writingArchives
Recent Comments
 Philippe FournierViger on How to test if a data mining mining algorithm implementation is correct?
 Hitesh Pujari on How to test if a data mining mining algorithm implementation is correct?
 Philippe FournierViger on How to autoadjust the minimum support threshold according to the data size
 Ko Moe on How to autoadjust the minimum support threshold according to the data size
 Philippe FournierViger on An introduction to frequent subgraph mining
Number of visitors:
525611
Author Archives: Philippe FournierViger
Introduction to clustering: the KMeans algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular KMeans algorithm with an example. Moreover, I will briefly explain how an opensource Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
2 Comments
Happy New Year!
To all those reading this blog and/or using the SPMF library, I wish you a Merry Christmas and Happy new year! Related posts:How to measure the memory usage of data mining algorithms in Java? New version of SPMF Java opensource data … Continue reading
Posted in General
Leave a comment
Introduction to time series mining with SPMF
This blog post briefly explain how time series data mining can be performed with the Java opensource data mining library SPMF (v.2.06). It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, Opensource, Time series
Tagged big data, data mining, data science, java, opensource, pattern mining, SAX algorithm, spmf, time series
2 Comments
Plagiarism at Ilahia College of Engineering and Technology by Nasreen Ali A and Arunkumar M
I have found that two professors from the Ilahia College of Engineering and Technology named Nasreen Ali A ( arunpvmn@gmail.com ) and Arunkumar M have plagiarized one of my paper. The plagiarized paper is the following: asreen Ali A.1 , Arunkumar M. Mining … Continue reading
Postdoctoral positions in data mining in Shenzhen, China (apply now)
The CIID research center of the Harbin Institute of Technology (Shenzhen campus, China) is looking to hire two postdoctoral researchers to carry research on data mining / big data. The applicant must have: a Ph.D. in computer Science, a strong research background in data mining/big … Continue reading
Posted in artificial intelligence, Big data, Data Mining, Research
1 Comment
The KDDCup 2015 dataset
The KDD cup 2015 dataset is about MOOC dropout prediction. I have had recently found that the dataset had been offline on the official website. Thus, I have uploaded a copy of the KDD cup 2015 dataset on my website. You can download … Continue reading
Posted in Big data, Data Mining, Data science, Research
Leave a comment
What not to do when applying for a M.Sc. or Ph.D position?
This brief blog discusses what not to do when applying for a M.Sc. or Ph.D. position in a research lab. The aim of this post is to give advices to those applying for such positions. I had previously discussed about this … Continue reading
Posted in General, Research
Leave a comment
Discovering hidden patterns in texts using SPMF
This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words. We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
6 Comments
Why I left Canada to work as a University Professor in China
One year and a half ago, I was working as a professor at a university in Canada. But I took the decision to not renew my contract and move to China. At that time, some people may have thought that I was crazy to leave my job … Continue reading
Posted in Data Mining, General, Research
3 Comments
Brief report about the Dexa 2016 and Dawak 2016 conferences
This week, I have been attending the DEXA 2016 and DAWAK 2016 conferences, in Porto, Portugal, from the 4th to 8th September 2016, to present three papers. In this blog post, I will give a brief report about these conferences. About these … Continue reading
Posted in Big data, Conference, Data Mining, Research
2 Comments