Category Archives: Programming
What are the steps to implement a data mining algorithm?
In this post, I will discuss the steps that I follow to implement a data mining algorithm. The subject of this post comes from a question that I received by email recently, and I think that it … Continue reading
Posted in Data Mining, Programming
Tagged algorithm, data mining, design, implementation, programming
57 Comments
Choosing data structures according to what you want to do
Today, I am writing a post about programming. I want to share a simple but important idea for writing optimized code. The idea is to choose data structures according to what you want to do instead of what you want to … Continue reading
Posted in Data Mining, Programming
Tagged association rules, bitset, optimization, programming, red-black tree, topkrules
Leave a comment
Analyzing the source code of the SPMF data mining software
Hi everyone, in this blog post, I will discuss how I applied an open-source tool named Code Analyzer ( http://sourceforge.net/projects/codeanalyzegpl/ ) to analyze the source code of my open-source data mining software, SPMF. I have applied … Continue reading
How to auto-adjust the minimum support threshold according to the data size
Today, I will do a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FP-Growth and PrefixSpan according to the size of the data. The problem is simple. Let’s consider … Continue reading
Posted in Data Mining, Programming
Tagged apriori, fpgrowth, frequent pattern mining, itemset mining, minsup, prefixspan
61 Comments
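The teaser above is cut off before it describes the actual method, so the following is only a minimal sketch of one common way to tie the minimum support to the data size: express the threshold as a fraction of the database and convert it to a transaction count at run time. The class name MinSupSketch and the method absoluteMinSup are hypothetical names chosen for illustration; they are not taken from SPMF or from the post itself.

```java
// Hypothetical helper for illustration only; not part of SPMF and not
// necessarily the approach described in the full post.
final class MinSupSketch {

    /**
     * Converts a relative minimum support (a fraction in (0, 1]) into an
     * absolute number of transactions for a database of the given size.
     */
    static int absoluteMinSup(double relativeMinSup, int databaseSize) {
        if (relativeMinSup <= 0.0 || relativeMinSup > 1.0) {
            throw new IllegalArgumentException("relativeMinSup must be in (0, 1]");
        }
        if (databaseSize < 0) {
            throw new IllegalArgumentException("databaseSize must not be negative");
        }
        // Round up so that a pattern must appear in at least the requested
        // share of the transactions, and never let the threshold drop below 1.
        int count = (int) Math.ceil(relativeMinSup * databaseSize);
        return Math.max(1, count);
    }

    public static void main(String[] args) {
        // The same relative threshold yields a larger absolute threshold
        // as the database grows.
        System.out.println(absoluteMinSup(0.25, 1_000));
        System.out.println(absoluteMinSup(0.25, 20_000));
    }
}
```

With a relative threshold, the same setting can be reused on databases of different sizes, because the absolute count passed to an algorithm such as Apriori scales with the number of transactions.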
How to characterize and compare data mining algorithms?
Hi, today, I will discuss how to compare data mining algorithms. This is an important question for data mining researchers who want to evaluate which algorithm is “better” in general or for a given situation. This question is also important … Continue reading
Posted in Data Mining, Programming, Research
Tagged algorithms, characteristics, classification, comparison, data mining, evaluation
7 Comments
A Map of Data Mining Algorithms (offered in SPMF v0.92c)
Hi, I have made a map to visualize the relationships among the 52 different data mining algorithms offered in the SPMF data mining software. You can view it in PNG format by clicking on the picture below: Or you can … Continue reading
Posted in Data Mining, Programming
Tagged algorithms, data mining, java, map, opensource, spmf
2 Comments
How to become a good data mining programmer?
In this post, I will discuss what it takes to be a good data mining programmer and how to become one. Data mining is a broad field that can be approached from several angles. Some people with a mathematical background will … Continue reading
Posted in Data Mining, Programming
Tagged algorithms, data mining, programmer, programming
17 Comments