
Tag Archives: data mining
The SPMF data mining library: a brief history and what’s next?
In this blog post, I will talk about SPMF, the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading
Posted in Data Mining, Programming, Research
Tagged data mining, library, opensource, spmf
An Introduction to Sequential Rule Mining
In this blog post, I will discuss an interesting topic in data mining: sequential rule mining. It consists of discovering rules in sequences. This data mining task has many applications, for example analyzing the behavior of … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged data mining, frequent patterns, high utility, sequential rules
18 Comments
How to test if a data mining algorithm implementation is correct?
In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading
Posted in Data Mining, Programming, Research
Tagged algorithm, correctness, data mining, debugging
5 Comments
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, highutility mining, itemset mining, java, opensource, source code, spmf, utility mining
97 Comments
Big Problems only found in Big Data?
Today, I will discuss Big Data, which is a very popular topic nowadays. Its popularity can be seen, for example, in universities: many are currently searching for professors who do research on “big data”. Moreover, … Continue reading
Report of the PAKDD 2014 conference (part 3)
This post continues my report of the PAKDD 2014 conference in Tainan (Taiwan). The panel about big data: On Friday, there was a great panel about big data with 7 top researchers from the field of data mining. I will try to faithfully report some … Continue reading
Posted in Academia, Big data, Conference, Data Mining, Data science
Tagged big data, conference, data mining, data science, pakdd
1 Comment
Report of the PAKDD 2014 conference (part 2)
This post will continue my report of the PAKDD 2014 conference in Tainan (Taiwan). About big data: Another interesting talk at this conference was given by Jian Pei. The topic was Big Data. Some key ideas in this talk were that to make … Continue reading
Posted in Academia, Big data, Conference, Data Mining, Data science
Tagged big data, conference, data mining, data science, pakdd
2 Comments
Report of the PAKDD 2014 conference (part 1)
I am currently at the PAKDD 2014 conference in Tainan. In this post, I will report interesting information about the conference and the talks that I have attended. Importance of Succinct Data Structures for Data Mining: I have attended a very nice … Continue reading
Posted in Big data, Conference, Data Mining, Data science
Tagged big data, data mining, data science, pakdd
2 Comments
Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz
Today, I will show how to use the open-source SPMF data mining software to discover sequential patterns in web log data. Then, I will show how to visualize the frequent sequential patterns found, using GraphViz. Step 1: getting the … Continue reading
Posted in Big data, Data Mining, Data science, Programming
Tagged data mining, graph, patterns, sequential patterns, spmf, visualization
8 Comments
Brief report about the ADMA 2013 conference
In this blog post, I will discuss my recent trip to the ADMA 2013 conference (9th Intern. Conf. on Advanced Data Mining and Applications, December 14-16, 2013 in Hangzhou, China at Zhejiang University). Note that the view expressed … Continue reading