Tag Archives: java
Introduction to the Apriori algorithm (with Java code)
This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, it remains one of the most important data mining algorithms, not because it is the fastest, but because it has … Continue reading
Posted in Big data, Data Mining, Pattern Mining, Programming
Tagged apriori, code, frequent itemset, frequent pattern, itemset, java, pattern mining
Introduction to clustering: the K-Means algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, open-source
Tagged clustering, data mining, data science, java, k-means, open-source, spmf
Introduction to time series mining with SPMF
This blog post briefly explains how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06). It first explains what a time series is and then discusses how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, open-source, Time series
Tagged big data, data mining, data science, java, open-source, pattern mining, SAX algorithm, spmf, time series
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining, which is called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, high-utility mining, itemset mining, java, open-source, source code, spmf, utility mining
59 Comments
Drawing the Powerset of a Set using Java and GraphViz (Hasse Diagram)
In this blog post, I will explain and provide source code to automatically draw the powerset of a set using Java and GraphViz. Drawing a powerset is useful in mathematics and also in computer science. For example, in frequent itemset mining, it can … Continue reading
Posted in Data Mining, General, Mathematics
Tagged graphviz, hasse diagram, itemset mining, java, powerset
2 Comments
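To give a rough idea of what drawing a powerset with Java and GraphViz can look like, here is a minimal sketch. It is not the code from the post itself: the class name PowersetDot and its methods are hypothetical names chosen for this illustration. The sketch encodes each subset as a bit mask, emits one node per subset, and adds an edge from each subset to every superset that has exactly one more item, which gives the edges of the Hasse diagram.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not from the original post): prints the powerset of a
// small set as a GraphViz DOT graph laid out as a Hasse diagram.
public class PowersetDot {

    /** Builds the label of the subset encoded by the bits of mask, e.g. "{a,c}". */
    public static String label(List<String> items, int mask) {
        List<String> members = new ArrayList<>();
        for (int i = 0; i < items.size(); i++) {
            if ((mask & (1 << i)) != 0) {
                members.add(items.get(i));
            }
        }
        return "{" + String.join(",", members) + "}";
    }

    /** Returns the DOT text: one node per subset, one edge per "add one item" step. */
    public static String toDot(List<String> items) {
        int n = items.size();
        StringBuilder sb = new StringBuilder("digraph powerset {\n");
        sb.append("  rankdir=BT;\n"); // draw the empty set at the bottom
        for (int mask = 0; mask < (1 << n); mask++) {
            String from = label(items, mask);
            sb.append("  \"").append(from).append("\";\n");
            for (int i = 0; i < n; i++) {
                if ((mask & (1 << i)) == 0) {
                    // Superset obtained by adding item i to the current subset.
                    String to = label(items, mask | (1 << i));
                    sb.append("  \"").append(from).append("\" -> \"")
                      .append(to).append("\";\n");
                }
            }
        }
        sb.append("}\n");
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.print(toDot(Arrays.asList("a", "b", "c")));
    }
}
```

Assuming GraphViz is installed and the output was saved as powerset.dot, the picture can be rendered with: dot -Tpng powerset.dot -o powerset.png. The bit-mask encoding keeps the sketch short but only suits small sets, since a set of n items has 2^n subsets.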
How to measure the memory usage of data mining algorithms in Java?
Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading
Posted in Data Mining, Programming, Research
Tagged comparison, data mining, experiment, java, memory, performance
1 Comment
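As a small illustration of the topic, here is one common way to estimate heap usage in Java. This is only a sketch and not necessarily one of the strategies recommended in the post: the class MemoryProbe and its methods are hypothetical names. It samples the used heap (total memory minus free memory, as reported by java.lang.Runtime) at checkpoints placed inside an algorithm and keeps the largest value observed.

```java
// Hypothetical helper (not from the original post): records the peak heap
// usage observed at explicit checkpoints.
public class MemoryProbe {

    private long peakBytes = 0;

    /** Heap currently in use, as seen by the JVM (includes garbage not yet collected). */
    public static long usedBytes() {
        Runtime rt = Runtime.getRuntime();
        return rt.totalMemory() - rt.freeMemory();
    }

    /** Samples the used heap and keeps the maximum seen so far. */
    public void checkpoint() {
        peakBytes = Math.max(peakBytes, usedBytes());
    }

    /** Largest value recorded by checkpoint(), or 0 if it was never called. */
    public long peakBytes() {
        return peakBytes;
    }

    public static void main(String[] args) {
        MemoryProbe probe = new MemoryProbe();
        probe.checkpoint();
        int[][] data = new int[100][];
        for (int i = 0; i < data.length; i++) {
            data[i] = new int[100_000]; // about 400 KB per array
            probe.checkpoint();         // sample after each allocation
        }
        System.out.printf("Peak heap usage: %.2f MB%n",
                probe.peakBytes() / (1024.0 * 1024.0));
    }
}
```

A measurement like this is approximate: the value includes objects that are no longer reachable but have not yet been garbage collected, and a peak that occurs between two checkpoints is missed. Results can therefore vary from one run to another.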
Analyzing the source code of the SPMF data mining software
Hi everyone, in this blog post, I will discuss how I applied an open-source tool named Code Analyzer ( http://sourceforge.net/projects/codeanalyze-gpl/ ) to analyze the source code of my open-source data mining software named SPMF. I have applied … Continue reading
A Map of Data Mining Algorithms (offered in SPMF v092c)
Hi, I have made a map to visualize the relationships between the 52 different data mining algorithms offered in the SPMF data mining software. You can view it in PNG format by clicking on the picture below: Or you can … Continue reading
Posted in Data Mining, Programming
Tagged algorithms, data mining, java, map, open-source, spmf
2 Comments