Archives
Categories
- Academia (83)
- artificial intelligence (34)
- Big data (80)
- Bioinformatics (3)
- cfp (10)
- Chinese posts (1)
- Conference (72)
- Data Mining (181)
- Data science (101)
- Database (1)
- General (42)
- Industry (2)
- Java (11)
- Latex (10)
- Machine Learning (20)
- Mathematics (2)
- open-source (37)
- Other (3)
- Pattern Mining (83)
- Plagiarism (1)
- Programming (17)
- Research (109)
- spmf (53)
- Time series (3)
- Uncategorized (23)
- Utility Mining (22)
- Video (19)
- Website (3)
-
Recent Posts
- Upcoming SPMF features for v.2.62 – More Dataset Stats Tools
- UDML 2024 Workshop program @ PAKDD 2024
- SPMF: bug fix about screen resolution
- SPMF 2.60 is released!
- How to download an offline copy of the SPMF documentation?
- Is EasyChair still good?
- Some interesting statistics about SPMF
- Sneak peak at the new user interface of SPMF (part 3)
- ChatGPT, LLMs and homework
- When ChatGPT is used to write papers…
Recent Comments
- K. P. Birla on About the author
- An Overview of Pattern Mining Techniques | The Data Blog on An Introduction to Data Mining
- Key Papers about Episode Mining | The Data Blog on An introduction to periodic pattern mining
- Dr J Gangadhar Naik on How to improve the quality of your research papers?
- Philippe Fournier-Viger on About the author
Tag cloud
- academia
- ai
- algorithm
- apriori
- article
- articles
- artificial intelligence
- association rule
- big data
- cfp
- china
- conference
- data
- data mining
- data science
- episode
- graph
- high utility itemset mining
- icdm
- itemset
- itemset mining
- java
- journal
- latex
- machine learning
- open-source
- open source
- pakdd
- paper
- papers
- pattern mining
- periodic pattern
- phd
- Research
- researcher
- reviewer
- sequence
- sequential pattern
- software
- spmf
- udml
- utility mining
- video
- workshop
- writing
Number of visitors:
2,282,054
Category Archives: Data science
An interview with P. Fournier-Viger about AI and data mining
I recently was interviewed by Djavan de Clercq, a graduate student from Tsinghua University, working on Machine Learning and Optimization. The interview can be read here (on LinkedIn). I answer ten questions related to data mining and AI research. —-Philippe Fournier-Viger is a professor of Computer Science and also the founder of the open-source data mining software SPMF, offering … Continue reading
Posted in Big data, Data Mining, Data science
Tagged artificial intelligence, big data, data mining, interview
Leave a comment
A Model for Football Pass Prediction (source code + dataset)
In this blog post, I will discuss the data challenge of the Machine Learning for Sport Analytics workshop (MLSA 2018) at PKDD 2018. The challenge consisted of predicting the receivers of football passes (pass prediction). I will first briefly describe the data and then … Continue reading
Posted in Data Mining, Data science, Video
Tagged data, data mining, football, mlsa, prediction
Leave a comment
PAKDD 2018 Conference (a brief report)
In this blog post, I will discuss the PAKDD 2018 conference (Pacific Asia Conference on Knowledge Discovery and Data Mining), in Melbourne Australia, from the 3rd June to the 6th June 2018. About the PAKDD conference PAKDD is an important conference in the data science / data mining research community, mainly attended by researchers from … Continue reading
Posted in Big data, Conference, Data Mining, Data science
Tagged asia, australia, big data, conference, data mining, pakdd
2 Comments
China International BigData Industry Expo 2018 (a brief report)
This week I have attended the China International Big Data Industry Expo 2018 in Guiyang, China. I will describe this event and some of the key things that I have observed so far. What is the China International Big Data Industry Expo? It is an international event targeted toward the industry and focused on big data, which is aimed … Continue reading
Posted in Big data, Conference, Data Mining, Data science
Tagged big data, china, expo, industry
2 Comments
How to run SPMF without installing Java?
The SPMF data mining software is a popular open-source software for discovering patterns in data and for performing other data mining tasks. Typically, to run SPMF, Java must have been installed on a computer. However, it is possible to run SPMF on a computer that does not have Java installed. For example, … Continue reading
Posted in Data Mining, Data science, open-source, Pattern Mining, Research, spmf
Tagged data mining, pattern mining, software, spmf
Leave a comment
The PAKDD 2017 conference (a brief report)
This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May. PAKDD is the top data mining conference for the asia-pacific region. It is held every year in a different pacific-asian country. In this blog post, I will write a brief report about … Continue reading
Posted in Academia, Conference, Data Mining, Data science
Tagged asia, big data, conference, data mining, data science, korea, pakdd
Leave a comment
This is why you should visualize your data!
In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data. This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious when visualizing the data. In this blog post, I will give a few … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, visualization
Leave a comment
We are launching a new data mining journal
In this blog post, I will discuss one of my recent and current project. I have been recently working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition“. This is a new open-access journal, … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, journal
Leave a comment
Introduction to clustering: the K-Means algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, open-source
Tagged clustering, data mining, data science, java, k-means, open-source, spmf
1 Comment
The KDDCup 2015 dataset
The KDD cup 2015 dataset is about MOOC dropout prediction. I have had recently found that the dataset had been offline on the official website. Thus, I have uploaded a copy of the KDD cup 2015 dataset on my website. You can … Continue reading
Posted in Big data, Data Mining, Data science, Research
6 Comments