Tag Archives: itemset mining
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining called “high-utility itemset mining”, or more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, high-utility mining, itemset mining, java, open-source, source code, spmf, utility mining
58 Comments
Drawing the Powerset of a Set using Java and GraphViz (Hasse Diagram)
In this blog post, I will explain and provide source code to automatically draw the powerset of a set using Java and GraphViz. Drawing a powerset is useful in mathematics and also in computer science. For example, in frequent itemset mining, it can … Continue reading
Posted in Data Mining, General, Mathematics
Tagged graphviz, hasse diagram, itemset mining, java, powerset
2 Comments
Big Problems only found in Big Data?
Today, I will discuss the topic of Big Data, which is a very popular topic nowadays. The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading
An introduction to frequent pattern mining
In this blog post, I will give a brief overview of an important subfield of data mining called pattern mining. Pattern mining consists of using or developing data mining algorithms to discover interesting, unexpected and useful patterns in databases. Pattern … Continue reading
Posted in Data Mining, Research
Tagged association rule, itemset, itemset mining, pattern mining, sequential patterns
14 Comments
How to auto-adjust the minimum support threshold according to the data size
Today, I will do a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FPGrowth and PrefixSpan according to the size of the data. The problem is simple. Let’s … Continue reading
Posted in Data Mining, Programming
Tagged apriori, fpgrowth, frequent pattern mining, itemset mining, minsup, prefixspan
42 Comments