Category Archives: Data science

China International BigData Industry Expo 2018 (a brief report)

This week I have attended the China International Big Data Industry Expo 2018 in Guiyang, China. I will describe this event and some of the key things that I have observed so far. What is the China International Big Data Industry Expo? It is an international event targeted toward the industry and focused on big data, which is aimed … Continue reading

Posted in Big data, Conference, Data Mining, Data science | Tagged , , , | 2 Comments

How to run SPMF without installing Java?

The SPMF data mining software is a popular open-source software for discovering patterns in data and for performing other data mining tasks. Typically, to run SPMF, Java must have been installed on a computer. However, it is possible to run SPMF on a computer that does not have Java installed. For example, … Continue reading

Posted in Data Mining, Data science, open-source, Pattern Mining, Research, spmf | Tagged , , , | Leave a comment

The PAKDD 2017 conference (a brief report)

This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May.  PAKDD is the top data mining conference for the asia-pacific region. It is held every year in a different pacific-asian country. In this blog post, I will write a brief report about … Continue reading

Posted in Academia, Conference, Data Mining, Data science | Tagged , , , , , , | 4 Comments

This is why you should visualize your data!

In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data.  This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious when visualizing the data. In this blog post, I will give a few … Continue reading

Posted in Big data, Data Mining, Data science | Tagged , , , , | Leave a comment

We are launching a new data mining journal

In this blog post, I will discuss one of my recent and current project. I have been recently working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition“. This is a new open-access journal, … Continue reading

Posted in Big data, Data Mining, Data science, Research | Tagged , , , | Leave a comment

Introduction to the K-Means clustering algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis).  I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, open-source | Tagged , , , , , , | 2 Comments

Discovering hidden patterns in texts using SPMF

This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words.  We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading

Posted in Big data, Data Mining, Data science, open-source, spmf | Tagged , , , | 9 Comments

Brief report about the IEA AIE 2016 conference

This week, I have attended the IEA AIE 2016 conference, held at Morioka, Japan from the 2nd to the 4th August 2016. In this blog post, I will briefly discuss the conference. About the conference IEA AIE 2016 (29th International Conference … Continue reading

Posted in artificial intelligence, Conference, Data Mining, Data science, Research | Tagged , , | 7 Comments

Brief report about the 12th International Conference on Machine Learning and Data Mining conference (MLDM 2016)

In this blog post, I will provide a brief report about the 12th Intern. Conference on Machine Learning and Data Mining (MLDM 2016), that I have attended from the 18th to 20th July 2016 in Newark, USA. First I have to say, that I … Continue reading

Posted in Big data, Conference, Data Mining, Data science | Tagged , | 2 Comments

The top journals and conferences in data mining / data science

A key question for data mining and data science researchers is to know what are the top journals and conferences in the field, since it is always best to publish in the most popular journals or conferences. In this blog post, … Continue reading

Posted in Big data, Data Mining, Data science, Research | 4 Comments