
Recent Posts
 Do not link to impact factors, they will censor you!
 How to publish in top conferences/journals? (Part 2) – The opportunity cost of research
 Plagiarism by K. Raghu Naga Dhareswararao, T. Kishore
 The PAKDD 2017 conference (a brief report)
 Plagiarism by Bhawna Mallick and Kriti Raj at Galgotias College of Engineering & Technology
 How to publish in top conferences/journals? (Part 1) – The Blue Ocean Strategy
 This is why you should visualize your data!
 An Introduction to Sequential Pattern Mining
 An Introduction to Data Mining
 Write more papers or write better papers? (quantity vs quality)
Categories
 Academia (14)
 artificial intelligence (4)
 Big data (28)
 Conference (13)
 Data Mining (57)
 Data science (24)
 General (28)
 Mathematics (2)
 Opensource (6)
 Plagiarism (3)
 Programming (15)
 Research (50)
 Time series (1)
 Utility Mining (2)
Tag cloud
academia academic journal algorithm algorithms articles big data comparison conference data mining data science datasets frequent pattern mining frequent patterns graph ilahia internet itemset mining java journal library M.Sc. opensource pakdd paper papers pattern mining peerreview Ph.D. phd plagiarism programmer programming publications Research research advisor research papers sequence sequential patterns software source code spmf thesis topic visualization website writingArchives
Recent Comments
 Philippe FournierViger on How to answer reviewers for a journal paper revision?
 Jamie on How to answer reviewers for a journal paper revision?
 Jumoke on How to choose a good thesis topic in Data Mining?
 Muhammad on Discovering hidden patterns in texts using SPMF
 Philippe FournierViger on How to choose a good thesis topic in Data Mining?
Number of visitors:
628945
Category Archives: Data science
The PAKDD 2017 conference (a brief report)
This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May. PAKDD is the top data mining conference for the asiapacific region. It is held every year in a … Continue reading
Posted in Academia, Big data, Conference, Data Mining, Data science
Tagged big data, conference, data mining, pakdd
2 Comments
This is why you should visualize your data!
In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data. This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, visualization
2 Comments
An Introduction to Sequential Pattern Mining
In this blog post, I will give an introduction to sequential pattern mining, an important data mining task with a wide range of applications from text analysis to market basket analysis. This blog post is aimed to be a short … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data mining, data science, frequent pattern mining, frequent patterns, pattern, sequence, sequential pattern
7 Comments
An Introduction to Data Mining
In this blog post, I will introduce the topic of data mining. The goal is to give a general overview of what is data mining. What is data mining? Data mining is a field of research that has emerged in … Continue reading
Posted in Data Mining, Data science, General
3 Comments
An introduction to frequent subgraph mining
In this blog post, I will give an introduction to an interesting data mining task called frequent subgraph mining, which consists of discovering interesting patterns in graphs. This task is important since data is naturally represented as graph in many domains (e.g. … Continue reading
Posted in Big data, Data Mining, Data science
Tagged algorithm, big data, data mining, data science, frequent subgraphs, graph, pattern mining
9 Comments
We are launching a new data mining journal
In this blog post, I will discuss one of my recent and current project. I have been recently working with my colleague ChunWei Lin on launching a new journal, titled “Data Science and Pattern Recognition“. This is a new openaccess journal, … Continue reading
Posted in Big data, Data Mining, Data science, Research
Tagged big data, data mining, data science, journal
2 Comments
Introduction to clustering: the KMeans algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular KMeans algorithm with an example. Moreover, I will briefly explain how an opensource Java implementation of … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
Tagged clustering, data mining, data science, java, kmeans, opensource, spmf
2 Comments
The KDDCup 2015 dataset
The KDD cup 2015 dataset is about MOOC dropout prediction. I have had recently found that the dataset had been offline on the official website. Thus, I have uploaded a copy of the KDD cup 2015 dataset on my website. You can download … Continue reading
Posted in Big data, Data Mining, Data science, Research
Leave a comment
Discovering hidden patterns in texts using SPMF
This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words. We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading
Posted in Big data, Data Mining, Data science, Opensource
10 Comments
Brief report about the IEA AIE 2016 conference
This week, I have attended the IEA AIE 2016 conference, held at Morioka, Japan from the 2nd to the 4th August 2016. In this blog post, I will briefly discuss the conference. About the conference IEA AIE 2016 (29th International Conference on … Continue reading
Posted in artificial intelligence, Conference, Data Mining, Data science, Research
Tagged artificial intelligence, conferenced, data mining
2 Comments