Category Archives: Programming
Efficiency problems with using java.util.HashMap
Today, I will talk about some efficiency problems when using the java.util.HashMap data structure provided by Java. This is an important topic for programmers who aim to implement efficient software that relies on maps. Brief review about HashMap First, let’s … Continue reading
Posted in Java, Programming
Tagged complexity, efficiency, efficiency problems, hashmap, implementation, java, map, memory, performance, programming, time
A Tribute to Hypercard
In this blog post, I will talk about the first programming language that I learned, which is HyperTalk. Younger readers may never have heard of it, as it was mostly popular in the 1980s and 1990s. Though, it is not … Continue reading
Introduction to the Apriori algorithm (with Java code)
This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not because it is the fastest, but because it has … Continue reading
Posted in Big data, Data Mining, Pattern Mining, Programming
Tagged apriori, code, frequent itemset, frequent pattern, itemset, java, pattern mining
11 Comments
The SPMF data mining library: a brief history and what’s next?
In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading
Posted in Data Mining, open-source, Programming, Research, spmf
Tagged data mining, library, open-source, spmf
How to test if a data mining algorithm implementation is correct?
In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading
Posted in Data Mining, Programming, Research, Uncategorized
Tagged algorithm, correctness, data mining, debugging
3 Comments
Drawing a set-enumeration tree using Java and GraphViz
In this blog post, I will explain and provide source code to automatically draw the set-enumeration tree of a set using Java and GraphViz. Drawing a set-enumeration tree is useful in computer science, for example in frequent itemset mining, a subfield of data … Continue reading
Posted in Data Mining, Mathematics, Programming, Research
Big Problems only found in Big Data?
Today, I will discuss the topic of Big Data, which is a very popular topic nowadays. The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading
Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz
Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show how to visualize the frequent sequential patterns found using GraphViz. Step 1 : getting the … Continue reading
Posted in Data Mining, Java, open-source, Programming, spmf
Tagged data mining, graph, patterns, sequential patterns, spmf, visualization
8 Comments
Why data mining researchers should evaluate their algorithms against state-of-the-art algorithms?
A common problem in research on data mining is that researchers proposing new data mining algorithms often do not compare the performance of their new algorithm with the current state-of-the-art data mining algorithms. For example, let me illustrate this … Continue reading
Posted in Data Mining, Programming, Research
4 Comments
How to measure the memory usage of data mining algorithms in Java?
Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading
Posted in Data Mining, Programming, Research
Tagged comparison, data mining, experiment, java, memory, performance
1 Comment