
Recent Posts
 The PAKDD 2017 conference (a brief report)
 Plagiarism by Bhawna Mallick and Kriti Raj at Galgotias College of Engineering & Technology
 How to publish in top conferences/journals? The Blue Ocean Strategy
 This is why you should visualize your data!
 An Introduction to Sequential Pattern Mining
 An Introduction to Data Mining
 Write more papers or write better papers? (quantity vs quality)
 Using LaTeX for writing research papers
 An introduction to frequent subgraph mining
 We are launching a new data mining journal
Categories
 Academia (11)
 artificial intelligence (4)
 Big data (28)
 Conference (13)
 Data Mining (56)
 Data science (24)
 General (27)
 Mathematics (2)
 Opensource (6)
 Plagiarism (2)
 Programming (15)
 Research (48)
 Time series (1)
 Utility Mining (2)
Tag cloud
academia academic journal algorithm algorithms articles big data comparison conference data mining data science datasets frequent pattern mining frequent patterns graph internet itemset mining java journal library M.Sc. map opensource pakdd paper papers pattern mining peerreview Ph.D. phd plagiarism programmer programming publications Research research advisor research papers sequence sequential patterns software source code spmf thesis topic visualization website writingArchives
Recent Comments
 The PAKDD 2017 conference (a brief report)  The Data Mining Blog on Report of the PAKDD 2014 conference (part 1)
 Philippe FournierViger on How to test if a data mining mining algorithm implementation is correct?
 Hitesh Pujari on How to test if a data mining mining algorithm implementation is correct?
 Philippe FournierViger on How to autoadjust the minimum support threshold according to the data size
 Ko Moe on How to autoadjust the minimum support threshold according to the data size
Number of visitors:
532495
Category Archives: Opensource
Introduction to clustering: the KMeans algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular KMeans algorithm with an example. Moreover, I will briefly explain how an opensource Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
2 Comments
Introduction to time series mining with SPMF
This blog post briefly explain how time series data mining can be performed with the Java opensource data mining library SPMF (v.2.06). It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, Opensource, Time series
Tagged big data, data mining, data science, java, opensource, pattern mining, SAX algorithm, spmf, time series
2 Comments
Discovering hidden patterns in texts using SPMF
This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words. We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
6 Comments
An introduction to periodic pattern mining
In this blog post I will give an introduction to the discovery of periodic patterns in data. Mining periodic patterns is an important data mining task as patterns may periodically appear in all kinds of data, and it may be desirable to find them … Continue reading
Posted in Big data, Data Mining, Data science, Opensource, Research, Utility Mining
3 Comments
SPMF data mining library 0.98: new pattern visualization window
This blog post is to let you know that I have just published a new version of the SPMF opensource Java data mining library (0.98) that offers a new window for visualizing the patterns found by data mining algorithms. This … Continue reading
Posted in Data Mining, General, Opensource, Research
Tagged big data, data mining, GPL, library, opensource, spmf
Leave a comment
200,000 visitors on the SPMF website!
Today, I will just write a short blog post to mention that the SPMF opensource data mining library has recently passed the milestone of 200,000 visitors. This is possible thanks to the support of all users of SPMF, and the contributors … Continue reading
Posted in Data Mining, Data science, Opensource, Research
Leave a comment