Tag Archives: data science

Introduction to frequent subranking mining

Rankings are made in many fields, as we naturally tend to rank objects, persons or things, in different contexts. For example, in a singing or a sport competition, some judges will rank participants from worst to best and give prizes to … Continue reading

Posted in Big data, Data Mining, Data science, Pattern Mining | Tagged , , , , | Leave a comment

The DSPR journal is growing… and we are looking for your papers!

This week, I want to talk to you about a journal that I am one of the editors-in-chief of, called Data Science and Pattern Recognition. The journal has been established in 2017 and currently 18 papers have already been published, are in press, … Continue reading

Posted in artificial intelligence, Big data, Data Mining, Research | Tagged , , , | Leave a comment

Skills needed for a data scientists? (comments on the HBR article)

Recently, I have read an article of the Harvard Business Review (HBR) website about data sciences skills for businesses. This article proposes to categorize skills related to data on a 2×2 matrix where skills are labelled as useful VS not useful, and … Continue reading

Posted in Big data, Data science | Tagged , , , , | Leave a comment

Report about the DEXA 2018 and DAWAK 2018 conferences

This week, I am attending the DEXA 2018 (29th International Conference on Database and Expert Systems Applications) and the DAWAK 2018 (20th Intern. Conf. on Data Warehousing and Knowledge Discovery) conferences from the 3rd to 6th September in Regensburg, Germany. Those two conferences are well established European conferences dedicated mainly to research on database and data mining. These conferences are always collocated. … Continue reading

Posted in Big data, Conference | Tagged , , , , , , | 1 Comment

Report about the KDD 2018 conference

This week, I am participating to the KDD 2018 ( 24th ACM SIGKDD Intern. Conference on Knowledge Discovery and Data Mining), in London, UK from the 19th to 23rd August 2018. The KDD conference is an international conference, established 24 years ago. It is the top conference in the field of data mining / … Continue reading

Posted in Big data, Conference, Data Mining, Data science | Tagged , , , , | Leave a comment

The future of pattern mining

In this blog post, I will talk about the future of research on pattern mining. I will also discuss some lessons learnt from the decades of research in this field and talk about research opportunities. What is the state of research on pattern mining? Over the last … Continue reading

Posted in Data Mining, Pattern Mining | Tagged , , , | Leave a comment

The PAKDD 2017 conference (a brief report)

This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May.  PAKDD is the top data mining conference for the asia-pacific region. It is held every year in a different pacific-asian country. In this blog post, I will write a brief report about … Continue reading

Posted in Academia, Conference, Data Mining, Data science | Tagged , , , , , , | Leave a comment

This is why you should visualize your data!

In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data.  This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious when visualizing the data. In this blog post, I will give a few … Continue reading

Posted in Big data, Data Mining, Data science | Tagged , , , , | Leave a comment

We are launching a new data mining journal

In this blog post, I will discuss one of my recent and current project. I have been recently working with my colleague Chun-Wei Lin on launching a new journal, titled “Data Science and Pattern Recognition“. This is a new open-access journal, … Continue reading

Posted in Big data, Data Mining, Data science, Research | Tagged , , , | Leave a comment

Introduction to clustering: the K-Means algorithm (with Java code)

In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis).  I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading

Posted in Big data, Data Mining, Data science, open-source | Tagged , , , , , , | Leave a comment