
Recent Posts
 How to discover interesting patterns in data?
 Call for chapters: High Utility Pattern Mining, the book
 Introduction to the Apriori algorithm (with Java code)
 Do not link to impact factors, they will censor you!
 How to publish in top conferences/journals? (Part 2) – The opportunity cost of research
 Plagiarism by K. Raghu Naga Dhareswararao, T. Kishore
 The PAKDD 2017 conference (a brief report)
 Plagiarism by Bhawna Mallick and Kriti Raj at Galgotias College of Engineering & Technology
 How to publish in top conferences/journals? (Part 1) – The Blue Ocean Strategy
 This is why you should visualize your data!
Categories
 Academia (14)
 artificial intelligence (4)
 Big data (31)
 Conference (13)
 Data Mining (60)
 Data science (27)
 General (28)
 Mathematics (2)
 Opensource (7)
 Plagiarism (3)
 Programming (15)
 Research (52)
 Time series (1)
 Utility Mining (3)
Tag cloud
 academia
 algorithm
 algorithms
 articles
 association rules
 big data
 comparison
 conference
 data mining
 data science
 datasets
 frequent pattern mining
 frequent patterns
 graph
 highutility mining
 ilahia
 internet
 itemset mining
 java
 journal
 library
 M.Sc.
 opensource
 pakdd
 paper
 papers
 pattern mining
 Ph.D.
 phd
 plagiarism
 programmer
 programming
 publications
 Research
 research advisor
 research papers
 sequence
 sequential patterns
 software
 source code
 spmf
 thesis topic
 visualization
 website
 writing
Archives
Recent Comments
 Mr.X ( Victim of Internal) on Plagiarism at Ilahia College of Engineering and Technology by Nasreen Ali A and Arunkumar M
 Philippe FournierViger on An Introduction to HighUtility Itemset Mining
 Philippe FournierViger on Introduction to clustering: the KMeans algorithm (with Java code)
 Philippe FournierViger on Introduction to clustering: the KMeans algorithm (with Java code)
 Philippe FournierViger on About the author
Number of visitors:
684881
Tag Archives: java
Introduction to the Apriori algorithm (with Java code)
This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged apriori algorithm, frequent itemset, frequent pattern, itemset mining, java, source code
6 Comments
Introduction to clustering: the KMeans algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular KMeans algorithm with an example. Moreover, I will briefly explain how an opensource Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
6 Comments
Introduction to time series mining with SPMF
This blog post briefly explain how time series data mining can be performed with the Java opensource data mining library SPMF (v.2.06). It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, Opensource, Time series
Tagged big data, data mining, data science, java, opensource, pattern mining, SAX algorithm, spmf, time series
2 Comments
An Introduction to HighUtility Itemset Mining
In this blog post, I will give an introduction about a popular problem in data mining, which is called “highutility itemset mining” or more generally utility mining. I will give an overview of this problem, explains why it is interesting, and provide source code of … Continue reading
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, highutility mining, itemset mining, java, opensource, source code, spmf, utility mining
99 Comments
Drawing the Powerset of a Set using Java and GraphViz (Hasse Diagram)
In this blog post, I will explain and provide source code to automatically draw the powerset of a set using Java and GraphViz. Drawing a powerset is useful in mathematics and also in computer science, for example in frequent itemset mining, it can … Continue reading
Posted in Data Mining, General, Mathematics
Tagged graphviz, hasse diagram, itemset mining, java, powerset
2 Comments
How to measure the memory usage of data mining algorithms in Java?
Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading
Posted in Data Mining, Programming, Research
Tagged comparison, data mining, experiment, java, memory, performance
1 Comment
Analyzing the source code of the SPMF data mining software
Hi everyone, In this blog post, I will discuss how I have applied an opensource tool that is named Code Analyzer ( http://sourceforge.net/projects/codeanalyzegpl/ ) to analyze the source code of my opensource data mining software named SPMF. I have applied … Continue reading
A Map of Data Mining Algorithms (offered in SPMF v092c)
Hi, I have made a map to visualize the relationship between the 52 different data mining algorithms offered in the SPMF data mining software. You can view it in PNG format by clicking on the picture below: Or you can … Continue reading
Posted in Data Mining, Programming
Tagged algorithms, data mining, java, map, opensource, spmf
2 Comments