Archives
Categories
- Academia (91)
- artificial intelligence (35)
- Big data (83)
- Bioinformatics (6)
- cfp (12)
- China (1)
- Chinese posts (1)
- Conference (76)
- Data Mining (189)
- Data science (107)
- Database (2)
- General (43)
- Industry (2)
- Java (12)
- Latex (12)
- Machine Learning (22)
- Mathematics (2)
- open-source (42)
- Other (3)
- Pattern Mining (91)
- Plagiarism (1)
- Programming (17)
- Research (108)
- spmf (59)
- Time series (3)
- Uncategorized (23)
- Utility Mining (23)
- Video (19)
- Website (6)
-
Recent Posts
- Fixing the reviewresponse.cls LaTeX Class to Allow Multi-Page Comments
- How to fix reviewresponse.cls for custom reviewer numbering
- The Conference Hotel Booking Scam
- Huge traffic from a botnet looking for datasets
- New version of SPMF: 2.63!
- Upcoming feature of SPMF 2.63: Taxonomy Viewer
- CFP: The OCSA 2025 conference
- Update on SPMF next version: the visual pattern viewer
- CFP: HP4MoDa 2025 @ IEEE BIBM 2025
- An Improved Pattern Viewer
Recent Comments
- Philippe Fournier-Viger on What is a Closed Itemset and Why is it Useful?
- Philippe Fournier-Viger on An Improved Pattern Viewer
- Bairavi Balaguru on What is a Closed Itemset and Why is it Useful?
- Kushal P. Birla on An Improved Pattern Viewer
- Kushal P. Birla on An Improved Pattern Viewer


-

-


Tag cloud
- academia
- ai
- algorithm
- apriori
- article
- artificial intelligence
- association rule
- big data
- cfp
- china
- conference
- data
- data mining
- data science
- episode
- graph
- high utility itemset mining
- icdm
- itemset
- itemset mining
- java
- journal
- latex
- machine learning
- open-source
- open source
- pakdd
- paper
- papers
- pattern mining
- periodic pattern
- phd
- Research
- researcher
- reviewer
- sequence
- sequential pattern
- software
- spmf
- udml
- utility mining
- video
- website
- workshop
- writing
Number of visitors:
2,413,134
Category Archives: Data Mining
An interview with P. Fournier-Viger about AI and data mining
I recently was interviewed by Djavan de Clercq, a graduate student from Tsinghua University, working on Machine Learning and Optimization. The interview can be read here (on LinkedIn). I answer ten questions related to data mining and AI research. —-Philippe Fournier-Viger is a professor of Computer Science and also the founder of the open-source data mining software SPMF, offering … Continue reading
Posted in Big data, Data Mining, Data science
Tagged artificial intelligence, big data, data mining, interview
Leave a comment
A Model for Football Pass Prediction (source code + dataset)
In this blog post, I will discuss the data challenge of the Machine Learning for Sport Analytics workshop (MLSA 2018) at PKDD 2018. The challenge consisted of predicting the receivers of football passes (pass prediction). I will first briefly describe the data and then … Continue reading
Posted in Data Mining, Data science, Video
Tagged data, data mining, football, mlsa, prediction
Leave a comment
PAKDD 2018 Conference (a brief report)
In this blog post, I will discuss the PAKDD 2018 conference (Pacific Asia Conference on Knowledge Discovery and Data Mining), in Melbourne Australia, from the 3rd June to the 6th June 2018. About the PAKDD conference PAKDD is an important conference in the data science / data mining research community, mainly attended by researchers from … Continue reading
Posted in Big data, Conference, Data Mining, Data science
Tagged asia, australia, big data, conference, data mining, pakdd
2 Comments
China International BigData Industry Expo 2018 (a brief report)
This week I have attended the China International Big Data Industry Expo 2018 in Guiyang, China. I will describe this event and some of the key things that I have observed so far. What is the China International Big Data Industry Expo? It is an international event targeted toward the industry and focused on big data, which is aimed … Continue reading
Posted in Big data, Conference, Data Mining, Data science
Tagged big data, china, expo, industry
2 Comments
How to run SPMF without installing Java?
The SPMF data mining software is a popular open-source software for discovering patterns in data and for performing other data mining tasks. Typically, to run SPMF, Java must have been installed on a computer. However, it is possible to run SPMF on a computer that does not have Java installed. For example, … Continue reading
Posted in Data Mining, Data science, open-source, Pattern Mining, Research, spmf
Tagged data mining, pattern mining, software, spmf
Leave a comment
Subgraph mining datasets
In this post, I will provide links to standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that can be used for debugging subgraph mining algorithms. The format of graph datasets A graph dataset is a text … Continue reading
Posted in Big data, Data Mining
Tagged data mining, dataset, frequent subgraph, graph, subgraph
Leave a comment
On the Completeness of the CloSpan and IncSpan algorithms
In this blog post, I will briefly discuss the fact that the popular CloSpan algorithm for frequent sequential pattern mining is an incomplete algorithm. This means that in some special situations, CloSpan does not produce the expected results that it has been designed for, and in particular some patterns are … Continue reading
Posted in Data Mining, Pattern Mining
Tagged clospan, frequent pattern, incspan, pattern mining, sequential pattern
Leave a comment
On the correctness of the FSMS algorithm for frequent subgraph mining
In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I will publish this blog post because I have found that the algorithm is incorrect after spending a few days to implement the algorithm in 2017 and wish to save time to other researchers … Continue reading
Posted in Big data, Data Mining, Pattern Mining
Tagged algorithm, correctness, data mining, pattern mining, subgraph mining
2 Comments
Introduction to the Apriori algorithm (with Java code)
This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not because it is the fastest, but because it has … Continue reading
Posted in Big data, Data Mining, Pattern Mining, Programming
Tagged apriori, code, frequent itemset, frequent pattern, itemset, java, pattern mining
12 Comments
The PAKDD 2017 conference (a brief report)
This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May. PAKDD is the top data mining conference for the asia-pacific region. It is held every year in a different pacific-asian country. In this blog post, I will write a brief report about … Continue reading
Posted in Academia, Conference, Data Mining, Data science
Tagged asia, big data, conference, data mining, data science, korea, pakdd
4 Comments