Archives
Categories
 Academia (21)
 artificial intelligence (4)
 Big data (33)
 Conference (13)
 Data Mining (63)
 Data science (29)
 General (33)
 Graph mining (3)
 Mathematics (2)
 Opensource (7)
 Other (2)
 Plagiarism (5)
 Programming (15)
 Research (57)
 Time series (1)
 Utility Mining (3)
Tag cloud
 academia
 algorithm
 algorithms
 articles
 association rules
 big data
 comparison
 conference
 data mining
 data science
 datasets
 frequent pattern mining
 frequent patterns
 graph
 highutility mining
 ilahia
 itemset mining
 java
 journal
 library
 M.Sc.
 opensource
 pakdd
 paper
 papers
 pattern mining
 Ph.D.
 phd
 plagiarism
 programmer
 programming
 publications
 Research
 research advisor
 research papers
 review
 reviewer
 sequence
 sequential patterns
 software
 source code
 spmf
 thesis topic
 visualization
 writing
Recent Comments
 Philippe FournierViger on Why I left Canada to work as a University Professor in China
 infi on Why I left Canada to work as a University Professor in China
 Philippe FournierViger on How to choose a good thesis topic in Data Mining?
 Kanika on How to choose a good thesis topic in Data Mining?
 Philippe FournierViger on Introduction to clustering: the KMeans algorithm (with Java code)

Recent Posts
 The conference that tolerates up to 20 % plagiarism
 Subgraph mining datasets
 Plagiarism by Divvela Srinivasa Rao at Lakireddy Balireddy College of Engineering (LBRCE)
 Why doing a Ph.D.?
 How to review a research paper?
 On the Completeness of the CloSpan and IncSpan algorithms
 10 ways of becoming more efficient at doing research
 IEEE and its language polishing service
 On the correctness of the FSMS algorithm for frequent subgraph mining
 The ontology book by Kerry Taylor that was never published
Number of visitors:
730715
Category Archives: Big data
Subgraph mining datasets
In this post, I will provide two standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that I have created for debugging subgraph mining algorithms. The format of … Continue reading
Posted in Big data, Data Mining, Data science, Graph mining
Tagged big data, data mining, data science, graph
Leave a comment
On the correctness of the FSMS algorithm for frequent subgraph mining
In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I will publish this blog post because I have found that the algorithm is incorrect after spending a few days to … Continue reading
Posted in Big data, Data Mining, Graph mining
Tagged data mining, frequent subgraph, fsms, graph, graph mining, pattern mining
3 Comments
Postdoctoral positions in data mining in Shenzhen, China (apply now)
The CIID research center of the Harbin Institute of Technology (Shenzhen campus, China) is looking to hire two postdoctoral researchers to carry research on data mining / big data. The applicant must have: a Ph.D. in computer Science, a strong research background in data mining/big … Continue reading
Posted in artificial intelligence, Big data, Data Mining, Research
4 Comments
How to discover interesting patterns in data?
Discovering interesting patterns in data is often referred as data mining, data science or big data. In the last few years, I have written several blog posts providing introduction to data mining and key topics in data mining: An Introduction to … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, pattern mining, survey
Leave a comment
Call for chapters: High Utility Pattern Mining, the book
CALL FOR CHAPTERS HighUtility Pattern Mining: Theory, Algorithms and Applications Editors: Philippe FournierViger, ChunWei Lin, Roger Nkambou, Bay Vo An edited book to be published by Springer in 2018 Introduction This book will provide an introduction to the high utility mining, reviews stateoftheart … Continue reading
Introduction to the Apriori algorithm (with Java code)
This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged apriori algorithm, frequent itemset, frequent pattern, itemset mining, java, source code
8 Comments
The PAKDD 2017 conference (a brief report)
This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May. PAKDD is the top data mining conference for the asiapacific region. It is held every year in a … Continue reading
Posted in Academia, Big data, Conference, Data Mining, Data science
Tagged big data, conference, data mining, pakdd
2 Comments
This is why you should visualize your data!
In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data. This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, visualization
3 Comments
An Introduction to Sequential Pattern Mining
In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications from text analysis to market basket analysis. This blog post is aimed to be a short … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data mining, data science, frequent pattern mining, frequent patterns, pattern, sequence, sequential pattern
22 Comments
An introduction to frequent subgraph mining
In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graph in many domains (e.g. … Continue reading
Posted in Big data, Data Mining, Data science, Graph mining
Tagged algorithm, big data, data mining, data science, frequent subgraphs, graph, pattern mining
15 Comments