Category Archives: Big data
An Introduction to Sequential Pattern Mining
In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications, from text analysis to market basket analysis. This blog post aims to be a short … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data mining, data science, frequent pattern mining, frequent patterns, pattern, sequence, sequential pattern
43 Comments
An introduction to frequent subgraph mining
In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graphs in many domains (e.g. … Continue reading
Posted in Big data, Data Mining, Data science, Graph mining
Tagged algorithm, big data, data mining, data science, frequent subgraphs, graph, pattern mining
17 Comments
We are launching a new data mining journal
In this blog post, I will discuss one of my current projects. I have recently been working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition”. This is a new open-access journal, … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, journal
2 Comments
Introduction to clustering: the K-Means algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what the goal of clustering is, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
18 Comments
Introduction to time series mining with SPMF
This blog post briefly explains how time series data mining can be performed with the open-source Java data mining library SPMF (v.2.06). It first explains what a time series is and then discusses how data mining can be performed on time series. What is … Continue reading
Posted in Big data, Data Mining, Opensource, Time series
Tagged big data, data mining, data science, java, opensource, pattern mining, SAX algorithm, spmf, time series
2 Comments
The KDD Cup 2015 dataset
The KDD Cup 2015 dataset is about MOOC dropout prediction. I recently found that the dataset had gone offline on the official website. Thus, I have uploaded a copy of the KDD Cup 2015 dataset to my website. You can download … Continue reading
Posted in Big data, Data Mining, Data science, Research
Discovering hidden patterns in texts using SPMF
This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words. We will illustrate this with a Sherlock Holmes novel. Moreover, we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
10 Comments
Brief report about the DEXA 2016 and DAWAK 2016 conferences
This week, I have been attending the DEXA 2016 and DAWAK 2016 conferences in Porto, Portugal, from the 4th to the 8th of September 2016, to present three papers. In this blog post, I will give a brief report about these conferences. About these … Continue reading
Posted in Big data, Conference, Data Mining, Research
2 Comments
Brief report about the MLDM 2016 Conference (12th International Conference on Machine Learning and Data Mining)
In this blog post, I will provide a brief report about the 12th International Conference on Machine Learning and Data Mining (MLDM 2016), which I attended from the 18th to the 20th of July 2016 in Newark, USA. About the conference This is the 12th edition … Continue reading
Posted in Big data, Conference, Data Mining, Data science
1 Comment
Brief report about the 16th Industrial Conference on Data Mining 2016 (ICDM 2016)
In this blog post, I will provide a brief report about the 16th Industrial Conference on Data Mining 2016, which I attended from the 13th to the 14th of July 2016 in Newark, USA. About the conference The Industrial Conference on Data Mining is an … Continue reading
Posted in Big data, Conference, Data Mining
3 Comments