Tag Archives: data mining
On the Completeness of the CloSpan and IncSpan algorithms
In this blog post, I will briefly discuss the fact that the popular CloSpan algorithm for frequent sequential pattern mining is an incomplete algorithm. This means that in some special situations, CloSpan does not produce the results that it was designed to produce, and … Continue reading
On the correctness of the FSMS algorithm for frequent subgraph mining
In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I am publishing this blog post because I found that the algorithm is incorrect after spending a few days to … Continue reading
Posted in Big data, Data Mining, Graph mining
Tagged data mining, frequent subgraph, fsms, graph, graph mining, pattern mining
5 Comments
How to discover interesting patterns in data?
Discovering interesting patterns in data is often referred to as data mining, data science or big data. In the last few years, I have written several blog posts providing an introduction to data mining and to key topics in data mining: An Introduction to … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, pattern mining, survey
Leave a comment
The PAKDD 2017 conference (a brief report)
This week, I attended the PAKDD 2017 conference on Jeju Island, South Korea, from the 23rd to the 26th of May. PAKDD is the top data mining conference for the Asia-Pacific region. It is held every year in a … Continue reading
Posted in Academia, Big data, Conference, Data Mining, Data science
Tagged big data, conference, data mining, pakdd
2 Comments
This is why you should visualize your data!
In the data science and data mining communities, several practitioners apply various algorithms to data without attempting to visualize it. This is a big mistake because visualizing the data sometimes greatly helps to understand it. Some phenomena are obvious … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, visualization
3 Comments
An Introduction to Sequential Pattern Mining
In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications from text analysis to market basket analysis. This blog post aims to be a short … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data mining, data science, frequent pattern mining, frequent patterns, pattern, sequence, sequential pattern
51 Comments
An Introduction to Data Mining
In this blog post, I will introduce the topic of data mining. The goal is to give a general overview of what data mining is. What is data mining? Data mining is a field of research that has emerged in … Continue reading
Posted in Data Mining, Data science, General
Tagged big data, data mining, data science, introduction
8 Comments
An introduction to frequent subgraph mining
In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as a graph in many domains (e.g. … Continue reading
Posted in Big data, Data Mining, Data science, Graph mining
Tagged algorithm, big data, data mining, data science, frequent subgraphs, graph, pattern mining
17 Comments
We are launching a new data mining journal
In this blog post, I will discuss one of my current projects. I have recently been working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition”. This is a new open-access journal, … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, journal
4 Comments
Introduction to clustering: the K-Means algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what the goal of clustering is, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
26 Comments