Category Archives: Data science
Categorical data clustering: 25 years beyond K-modes (a survey)
If you are interested in categorical data clustering, I am glad to announce that a new and up-to-date survey paper on this topic, named “Categorical data clustering: 25 years beyond K-modes”, will appear in the Expert Systems with Applications journal. … Continue reading
Vertical and horizontal databases in itemset mining
Itemset mining is a data mining task for discovering patterns that appear frequently in transaction databases. In this context, a pattern, also called a frequent itemset, is a set of values that frequently occur together in transactions (records) of a … Continue reading
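To make the two representations named in the title concrete, here is a minimal sketch in Python. It assumes the usual definitions: a horizontal database stores one set of items per transaction, while a vertical database stores, for each item, the set of identifiers of the transactions that contain it. The toy data and the helper names `to_vertical` and `support` are hypothetical and are not taken from the post or from SPMF.

```python
from collections import defaultdict

# Hypothetical toy transaction database in horizontal format:
# one entry per transaction id, each holding a set of items.
horizontal = {
    1: {"bread", "milk"},
    2: {"bread", "butter", "milk"},
    3: {"butter", "milk"},
    4: {"bread", "butter"},
}


def to_vertical(db):
    """Map each item to the set of transaction ids that contain it."""
    vertical = defaultdict(set)
    for tid, items in db.items():
        for item in items:
            vertical[item].add(tid)
    return dict(vertical)


def support(vertical, itemset):
    """Count the transactions containing every item of a non-empty itemset.

    The count is obtained by intersecting the tid-sets of the items.
    An item that never occurs has an empty tid-set, so the support is 0.
    """
    tidsets = [vertical.get(item, set()) for item in itemset]
    return len(set.intersection(*tidsets))


vertical = to_vertical(horizontal)
print(support(vertical, {"bread", "milk"}))  # transactions 1 and 2 contain both items
```

With a minimum support threshold of 2, the itemset {bread, milk} would count as frequent in this toy database. The vertical format makes this check a set intersection, whereas the horizontal format requires scanning every transaction.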
CFP: PM4B 2025: A new workshop on pattern mining and machine learning in bioinformatics @ PAKDD 2025
Today, I would like to announce that we are creating a new workshop for PAKDD 2025 called the 1st Workshop on Pattern mining and Machine learning for Bioinformatics (PM4B 2025). The goal of the workshop is to establish a collaborative platform … Continue reading
Posted in artificial intelligence, Big data, Bioinformatics, Conference, Data Mining, Data science
Tagged big data, bioinformatics, cfp, machine learning, pakdd, pattern mining, pm4b, workshop
SPMF 2.62 is released!
This is a short blog post to announce that SPMF 2.62 is released, and can be downloaded from the SPMF website's download page. The previous version of SPMF (2.60) introduced a lot of new features, also with some code refactoring, … Continue reading
Posted in Data Mining, Data science, open-source, spmf
Tagged association rule, data mining, itemset mining, java, new version, open source, pattern mining, spmf
The story of the most influential paper award of PAKDD 2024
I recently attended the PAKDD 2024 conference, where I was happy to receive the most influential paper award with my co-authors. This is a test-of-time award that is given to the paper from PAKDD … Continue reading
Posted in Big data, Data Mining, Data science, Pattern Mining
Tagged award, data mining, data science, pakdd, pakdd2014, pakdd2024, pattern mining, sequential pattern
SPMF 2.60 is released!
This is a short message to announce that the new version of SPMF, 2.60, is finally released! This is a major version as it contains many new things. The full list of changes can be found on the download … Continue reading
Posted in Data Mining, Data science, Java, Pattern Mining, spmf
Tagged algorithms, data mining, data science, fast, implementations, java, open source, pattern mining, software, spmf
Some interesting statistics about SPMF
While I am preparing the next version of the SPMF Java data mining software (2.60), here are some interesting statistics about the project, which I generated directly from the metadata provided by SPMF. Here they are: The number of algorithms … Continue reading
Posted in Data Mining, Data science, spmf
Tagged big data, data mining, data science, itemset mining, pattern mining, spmf
Sneak peek at the new user interface of SPMF (part 1)
I am currently working on the next version of SPMF, which will be called 2.60. There will be several improvements to the user interface of SPMF. Here is an overview of some of the improvements to give you a sneak … Continue reading
Posted in Big data, Data Mining, Data science, spmf
UDML 2024 Accepted papers
Today, I want to talk to you about the upcoming UDML 2024 workshop at the PAKDD 2024 conference. This year is the 6th edition of the UDML workshop. I am happy to say that this year, we received a record … Continue reading
Posted in Conference, Data Mining, Data science, Pattern Mining, Utility Mining
Tagged data mining, itemset mining, pakdd, pattern mining, udml, workshop
A new survey paper on episode mining!
I am pleased to announce that my collaborators and I have published a new survey paper about episode mining, which gives an introduction to this interesting subfield of pattern mining. To our knowledge, this is the most … Continue reading
Posted in Data Mining, Data science, Pattern Mining
Tagged data mining, data science, episode, episode mining, event sequence, pattern mining, sequence, survey