Archives
Categories
- Academia (91)
- artificial intelligence (35)
- Big data (83)
- Bioinformatics (7)
- cfp (12)
- China (1)
- Chinese posts (1)
- Conference (77)
- Data Mining (190)
- Data science (107)
- Database (2)
- General (43)
- Industry (2)
- Java (12)
- Latex (12)
- Machine Learning (22)
- Mathematics (2)
- open-source (42)
- Other (3)
- Pattern Mining (93)
- Plagiarism (1)
- Programming (17)
- Research (108)
- spmf (62)
- Time series (3)
- Uncategorized (27)
- Utility Mining (23)
- Video (19)
- Website (6)
-
Recent Posts
- The 1st HP4MoDa workshop was held at BIBM 2025
- Another release of SPMF: v2.64b
- A prototype of an improved GUI for the SPMF pattern mining software
- Upcoming in SPMF 2.64b : The “Pattern Diff Analyzer”
- A new version of SPMF (v2.64, november 2025)!
- GMP: A new algorithm for compressing protein sequences
- A new tool for visualizing algorithms from SPMF
- Fixing the reviewresponse.cls LaTeX Class to Allow Multi-Page Comments
- How to fix reviewresponse.cls for custom reviewer numbering
- The Conference Hotel Booking Scam
Recent Comments
- Philippe Fournier-Viger on What is a Closed Itemset and Why is it Useful?
- Philippe Fournier-Viger on An Improved Pattern Viewer
- Bairavi Balaguru on What is a Closed Itemset and Why is it Useful?
- Kushal P. Birla on An Improved Pattern Viewer
- Kushal P. Birla on An Improved Pattern Viewer


-

-


Tag cloud
- academia
- ai
- algorithm
- apriori
- article
- artificial intelligence
- association rule
- big data
- cfp
- china
- conference
- data
- data mining
- data science
- graph
- high utility itemset mining
- icdm
- itemset
- itemset mining
- java
- journal
- latex
- machine learning
- open-source
- open source
- pakdd
- paper
- papers
- pattern
- pattern mining
- periodic pattern
- phd
- Research
- researcher
- reviewer
- sequence
- sequential pattern
- software
- spmf
- udml
- utility mining
- video
- website
- workshop
- writing
Number of visitors:
2,435,037
Tag Archives: data mining
Subgraph mining datasets
In this post, I will provide links to standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that can be used for debugging subgraph mining algorithms. The format of graph datasets A graph dataset is a text … Continue reading
Posted in Big data, Data Mining
Tagged data mining, dataset, frequent subgraph, graph, subgraph
Leave a comment
On the correctness of the FSMS algorithm for frequent subgraph mining
In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I will publish this blog post because I have found that the algorithm is incorrect after spending a few days to implement the algorithm in 2017 and wish to save time to other researchers … Continue reading
Posted in Big data, Data Mining, Pattern Mining
Tagged algorithm, correctness, data mining, pattern mining, subgraph mining
2 Comments
The PAKDD 2017 conference (a brief report)
This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May. PAKDD is the top data mining conference for the asia-pacific region. It is held every year in a different pacific-asian country. In this blog post, I will write a brief report about … Continue reading
Posted in Academia, Conference, Data Mining, Data science
Tagged asia, big data, conference, data mining, data science, korea, pakdd
4 Comments
This is why you should visualize your data!
In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data. This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious when visualizing the data. In this blog post, I will give a few … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, visualization
Leave a comment
We are launching a new data mining journal
In this blog post, I will discuss one of my recent and current project. I have been recently working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition“. This is a new open-access journal, … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, journal
Leave a comment
Introduction to the K-Means clustering algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, open-source
Tagged clustering, data mining, data science, java, k-means, open-source, spmf
2 Comments
Introduction to time series mining with SPMF
This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06). It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, open-source, spmf, Time series
Tagged big data, data mining, data science, java, open-source, pattern mining, SAX algorithm, spmf, time series
1 Comment
Discovering hidden patterns in texts using SPMF
This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words. We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading
Posted in Big data, Data Mining, Data science, open-source, spmf
Tagged data mining, data science, pattern mining, spmf
9 Comments
Brief report about the IEA AIE 2016 conference
This week, I have attended the IEA AIE 2016 conference, held at Morioka, Japan from the 2nd to the 4th August 2016. In this blog post, I will briefly discuss the conference. About the conference IEA AIE 2016 (29th International Conference … Continue reading
Posted in artificial intelligence, Conference, Data Mining, Data science, Research
Tagged artificial intelligence, conferenced, data mining
7 Comments
News about SPMF
Some quick news about the SPMF project. First, this month I have made a few updates to SPMF. Two new algorithms have been added: USpan (for high-utility sequential pattern mining) and FCHM (for correlated high utility itemset mining). Moreover, I have fixed … Continue reading