Category Archives: Programming
A Tribute to Hypercard
In this blog post, I will talk about the first programming language that I learned, which is HyperTalk. Younger readers may never have heard of it, as it was mostly popular in the 1980s and 1990s. Though, it is not … Continue reading
Introduction to the Apriori algorithm (with Java code)
This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not because it is the fastest, but because it has … Continue reading
Posted in Big data, Data Mining, Pattern Mining, Programming
Tagged apriori, code, frequent itemset, frequent pattern, itemset, java, pattern mining
The SPMF data mining library: a brief history and what’s next?
In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, which I founded. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading
Posted in Data Mining, Programming, Research
Tagged data mining, library, opensource, spmf
How to test if a data mining algorithm implementation is correct?
In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading
Posted in Data Mining, Programming, Research, Uncategorized
Tagged algorithm, correctness, data mining, debugging
3 Comments
Drawing a set-enumeration tree using Java and GraphViz
In this blog post, I will explain and provide source code to automatically draw the set-enumeration tree of a set using Java and GraphViz. Drawing a set-enumeration tree is useful in computer science, for example in frequent itemset mining, a subfield of data … Continue reading
Posted in Data Mining, Mathematics, Programming, Research
Big Problems only found in Big Data?
Today, I will discuss the topic of Big Data, which is a very popular topic nowadays. The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading
Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz
Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show how to visualize the frequent sequential patterns found using GraphViz. Step 1 : getting the … Continue reading
Posted in Data Mining, Programming
Tagged data mining, graph, patterns, sequential patterns, spmf, visualization
8 Comments
Why should data mining researchers evaluate their algorithms against state-of-the-art algorithms?
A common problem in research on data mining is that researchers proposing new data mining algorithms often do not compare the performance of their new algorithm with the current state-of-the-art data mining algorithms. For example, let me illustrate this … Continue reading
Posted in Data Mining, Programming, Research
4 Comments
How to measure the memory usage of data mining algorithms in Java?
Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading
Posted in Data Mining, Programming, Research
Tagged comparison, data mining, experiment, java, memory, performance
1 Comment
What are the steps to implement a data mining algorithm?
In this post, I will discuss the steps that I follow to implement a data mining algorithm. The subject of this post comes from a question that I recently received by email, and I think that it … Continue reading
Posted in Data Mining, Programming
Tagged algorithm, data mining, design, implementation, programming
43 Comments