Author Archives: Philippe Fournier-Viger

An Introduction to Data Mining

In this blog post, I will introduce the topic of data mining. The goal is to give a general overview of what is data mining. What is data mining? Data mining is a field of research that has emerged in … Continue reading

Posted in Data Mining, Data science, General | 3 Comments

Write more papers or write better papers? (quantity vs quality)

In this blog post, I will discuss an important question for young researchers, which is: Is it better to try to write more papers  or to try to write fewer but better papers?  In other words, what is more important: quantity … Continue reading

Posted in Academia, General, Research | Leave a comment

Using LaTeX for writing research papers

Many researchers are using Microsoft Word  for writing research papers. However, Microsoft Word has several problems or limitations.  In this blog post, I will discuss the use of LaTeX as an alternative to Microsoft Word for writing research papers. What is LaTeX? … Continue reading

Posted in Academia, General, Research | Tagged , , , | 2 Comments

An introduction to frequent subgraph mining

In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graph in many domains (e.g. … Continue reading

Posted in Big data, Data Mining, Data science | Tagged , , , , , , | 9 Comments

We are launching a new data mining journal

In this blog post, I will discuss one of my recent and current project. I have been recently working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition“. This is a new open-access journal, … Continue reading

Posted in Big data, Data Mining, Data science, Research | Tagged , , , | 2 Comments

What is the job of a university professor?

In this blog post, I will discuss the job of university professor. And, I will discuss why I have chosen to become one. This post is especially aimed at students who are considering working in academia after their Ph.D. What is … Continue reading

Posted in Academia, General, Research | Tagged , , , | Leave a comment

Introduction to clustering: the K-Means algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis).  I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, Open-source | Tagged , , , , , , | 2 Comments

Happy New Year!

To all those reading this blog and/or using the SPMF library, I wish you a Merry Christmas and Happy new year! Related posts:Conference reviewers procrastinate?Drawing the Powerset of a Set using Java and GraphViz (Hasse Diagram)Full-time faculty positions at Harbin Institute … Continue reading

Posted in General | Leave a comment

Introduction to time series mining with SPMF

This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06).  It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading

Posted in Big data, Data Mining, Open-source, Time series | Tagged , , , , , , , , | 2 Comments

Plagiarism at Ilahia College of Engineering and Technology by Nasreen Ali A and Arunkumar M

I have found that two professors from the Ilahia College of Engineering and Technology  named Nasreen Ali A ( arunpvmn@gmail.com )  and Arunkumar M have plagiarized one of my paper. The plagiarized paper is the following: asreen Ali A.1 , Arunkumar M. Mining … Continue reading

Posted in Academia, Plagiarism | Tagged , , | 12 Comments