Tag Archives: opensource
Introduction to clustering: the K-Means algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
Introduction to time series mining with SPMF
This blog post briefly explains how time series data mining can be performed with the open-source Java data mining library SPMF (v.2.06). It first explains what a time series is and then discusses how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, opensource, Time series
Tagged big data, data mining, data science, java, opensource, pattern mining, SAX algorithm, spmf, time series
An Introduction to Sequence Prediction
In this blog post, I will give an introduction to the task of sequence prediction, a popular data mining/machine learning task, which consists of predicting the next symbol of a sequence of symbols. This task is important as it has many … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged opensource, prediction model, sequence, sequence prediction
SPMF data mining library 0.98: new pattern visualization window
This blog post is to let you know that I have just published a new version of the SPMF open-source Java data mining library (0.98) that offers a new window for visualizing the patterns found by data mining algorithms. This … Continue reading
Posted in Data Mining, General, opensource, Research
Tagged big data, data mining, GPL, library, opensource, spmf
The SPMF data mining library: a brief history and what’s next?
In this blog post, I will talk about the well-known open-source library of data mining algorithms implemented in Java, of which I am the founder. I will give a brief overview of its history, discuss some lessons learned from the development of … Continue reading
Posted in Data Mining, Programming, Research
Tagged data mining, library, opensource, spmf
An Introduction to High-Utility Itemset Mining
In this blog post, I will give an introduction to a popular problem in data mining, which is called “high-utility itemset mining” or, more generally, utility mining. I will give an overview of this problem, explain why it is interesting, and provide source code of … Continue reading
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, highutility mining, itemset mining, java, opensource, source code, spmf, utility mining
58 Comments
How to encourage data mining researchers to share their source code and datasets?
A few months ago, I wrote a popular post on this blog about “why it is important to publish source code and datasets for researchers”. I explained several advantages that researchers can gain by sharing the source code of … Continue reading
Posted in Data Mining, Research
Tagged data mining, dataset, opensource, source code
Analyzing the source code of the SPMF data mining software
Hi everyone, in this blog post, I will discuss how I have applied an open-source tool named Code Analyzer ( http://sourceforge.net/projects/codeanalyzegpl/ ) to analyze the source code of my open-source data mining software, SPMF. I have applied … Continue reading
A Map of Data Mining Algorithms (offered in SPMF v092c)
Hi, I have made a map to visualize the relationships between the 52 different data mining algorithms offered in the SPMF data mining software. You can view it in PNG format by clicking on the picture below: Or you can … Continue reading
Posted in Data Mining, Programming
Tagged algorithms, data mining, java, map, opensource, spmf
2 Comments
Why it is important to publish source code and datasets for researchers?
Today, I will discuss why it is important that researchers share their source code and data. As some of you know, I’m working on the design of data mining algorithms. More specifically, I’m working on algorithms for discovering patterns … Continue reading
Posted in Research
Tagged algorithms, data mining, datasets, opensource, Research, software, source code
2 Comments