Datasets of 30 English novels for pattern mining and text mining

Today, I want to announce that I have just made public datasets of 30 English novels by 10 authors of the 19th century. These datasets can be used for testing algorithms for sequential pattern mining, sequential rule mining,

(video) Discovering interpretable high utility patterns in databases

Today, I will share a short keynote talk (28 min) about discovering interpretable high utility patterns in data, which I presented at the CCNS 2020 conference. This talk gives an overview of techniques for finding interesting and useful patterns

Brief report about the ADMA 2019 conference

This week, I am also attending the 14th International Conference on Advanced Data Mining and Applications (ADMA 2019) in Dalian, China, from the 21st to the 23rd of November at Dalian Neusoft University of Information. About ADMA: The ADMA conference is focused on

The SPMF data mining library v.2.40 is released!

Hi all, I am pleased to announce that a new version of SPMF has just been published (v 2.40). It contains 9 novel algorithms: the HUIM-ABC algorithm for mining high utility itemsets using Artificial Bee Colony Optimization (thanks to Wei Song and Chaoming Huang for

Funny pictures about data mining / machine learning

Today, I will share a few funny pictures related to data mining and machine learning that I have found online. These pictures come from various sources (I don't remember who created them). I may also add more later on

This is why you should visualize your data!

In the data science and data mining communities, several practitioners apply various algorithms to data without attempting to visualize it. This is a big mistake, because visualizing the data can greatly help to understand it. Some phenomena are obvious