Archives
Categories
 Academia (21)
 artificial intelligence (4)
 Big data (33)
 Conference (13)
 Data Mining (63)
 Data science (29)
 General (33)
 Graph mining (3)
 Mathematics (2)
 Opensource (7)
 Other (2)
 Plagiarism (5)
 Programming (15)
 Research (57)
 Time series (1)
 Utility Mining (3)
Tag cloud
 academia
 algorithm
 algorithms
 articles
 association rules
 big data
 comparison
 conference
 data mining
 data science
 datasets
 frequent pattern mining
 frequent patterns
 graph
 highutility mining
 ilahia
 itemset mining
 java
 journal
 library
 M.Sc.
 opensource
 pakdd
 paper
 papers
 pattern mining
 Ph.D.
 phd
 plagiarism
 programmer
 programming
 publications
 Research
 research advisor
 research papers
 review
 reviewer
 sequence
 sequential patterns
 software
 source code
 spmf
 thesis topic
 visualization
 writing
Recent Comments
 Philippe FournierViger on Why I left Canada to work as a University Professor in China
 infi on Why I left Canada to work as a University Professor in China
 Philippe FournierViger on How to choose a good thesis topic in Data Mining?
 Kanika on How to choose a good thesis topic in Data Mining?
 Philippe FournierViger on Introduction to clustering: the KMeans algorithm (with Java code)

Recent Posts
 The conference that tolerates up to 20 % plagiarism
 Subgraph mining datasets
 Plagiarism by Divvela Srinivasa Rao at Lakireddy Balireddy College of Engineering (LBRCE)
 Why doing a Ph.D.?
 How to review a research paper?
 On the Completeness of the CloSpan and IncSpan algorithms
 10 ways of becoming more efficient at doing research
 IEEE and its language polishing service
 On the correctness of the FSMS algorithm for frequent subgraph mining
 The ontology book by Kerry Taylor that was never published
Number of visitors:
730715
Category Archives: Programming
How to design memoryefficient data mining algorithms in Java?
A while ago, I had written a blog post about How to measure the memory usage of algorithms in Java. Today, I will discuss the topic of optimizing the memory usage of algorithms written in Java to design memoryefficient data mining algorithms. … Continue reading
Posted in Academia, Big data, Data Mining, Data science, Programming
Leave a comment
The SPMF data mining library: a brief history and what’s next?
In this blog post, I will talk about the wellknown opensource library of data mining algorithms implemented in Java, which I am the founder of. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading
Posted in Data Mining, Programming, Research
Tagged data mining, library, opensource, spmf
Leave a comment
How to test if a data mining mining algorithm implementation is correct?
In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading
Posted in Data Mining, Programming, Research
Tagged algorithm, correctness, data mining, debugging
5 Comments
Drawing a setenumeration tree using Java and GraphViz
In this blog post, I will explain and provide source code to automatically draw the setenumeration tree of a set using Java and GraphViz. Drawing a setenumeration tree is useful in computer science, for example in frequent itemset mining, a subfield of data … Continue reading
Posted in Data Mining, Mathematics, Programming, Research
3 Comments
Big Problems only found in Big Data?
Today, I will discuss the topic of Big Data, which is a very popular topic nowadays. The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading
Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz
Today, I will show how to use the opensource SPMF data mining software to discover sequential patterns in web log data. Then, I will show to how visualize the frequent sequential patterns found, using GraphViz. Step 1 : getting the … Continue reading
Posted in Big data, Data Mining, Data science, Programming
Tagged data mining, graph, patterns, sequential patterns, spmf, visualization
8 Comments
Why data mining researchers should evaluate their algorithms against stateoftheart algorithms?
A common problem in research on data mining is that researchers proposing new data mining algorithms often do not compare the performance of their new algorithm with the current stateofthe art data mining algorithms. For example, let me illustrate this … Continue reading
Posted in Data Mining, Programming, Research
4 Comments
How to measure the memory usage of data mining algorithms in Java?
Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading
Posted in Data Mining, Programming, Research
Tagged comparison, data mining, experiment, java, memory, performance
1 Comment
What are the steps to implement a data mining algorithm?
In this post, I will discuss what are the steps that I follow to implement a data mining algorithm. The subject of this post comes from a question that I have received by email recently, and I think that it … Continue reading
Posted in Data Mining, Programming
Tagged algorithm, data mining, design, implementation, programming
55 Comments
Choosing data structures according to what you want to do
Today, I write a post about programming. I want to share a simple but important idea for writing optimized code. The idea is to choose data structures according to what you want to do instead of what you want to … Continue reading
Posted in Data Mining, Programming
Tagged association rules, bitset, optimization, programming, redblack tree, topkrules
Leave a comment