MLiSE 2021 @ PKDD 2021 – a new workshop!
I am glad to announce that I am co-organizing a new workshop called MLiSE 2021 (1st International Workshop on Machine Learning in Software Engineering), held in conjunction with the ECML PKDD 2021 conference. Briefly, the aim of this workshop is … Continue reading
Analyzing the COVID-19 genome with AI and data mining techniques (paper + data + code)
My team has been analyzing COVID-19 genome sequences using pattern mining and other AI techniques, and we recently published a paper about this work in the Applied Intelligence journal. In this blog post, I will give some brief … Continue reading
A Brief Report about the IEEE ICDM 2020 Conference
In this blog post, I will talk about the IEEE ICDM 2020 conference, which I attended virtually. The conference was supposed to be held in Italy, but due to the coronavirus pandemic it was held online. About the ICDM … Continue reading
Five recent books on pattern mining
In this blog post, I will list a few interesting and recent books on the topic of pattern mining (discovering interesting patterns in data). This mainly lists books from the last 5 years. High utility pattern mining: Theory, Applications and algorithms (2019). This … Continue reading
Correlation does not imply causation
There is a well-known principle in statistics that correlation does not imply causation. It means that even if we observe that two variables behave in the same way, we should not conclude that the behavior of one of those variables … Continue reading
The best data mining mailing lists (for researchers)
Today, I will list a few useful mailing lists related to data mining and big data. Subscribing to these mailing lists is useful for PhD students and researchers, as many jobs, conferences, special issues and other opportunities are advertised on them. It is … Continue reading
(video) Minimal Correlated High Utility Itemsets with FCHM
This is a video presentation of the paper “Mining Correlated High-Utility Itemsets Using the bond Measure” about correlated high utility pattern mining using FCHM. VIDEO LINK: http://www.philippefournierviger.com/spmf/videos/FCHM_correlated_itemsets.mp4 More information about the FCHM algorithm is provided in this research paper: Fournier-Viger, P., Zhang, Y., Lin, J. C.-W., … Continue reading
Introduction to frequent subranking mining
Rankings are made in many fields, as we naturally tend to rank objects, persons or things in different contexts. For example, in a singing or sports competition, some judges will rank participants from worst to best and give prizes to … Continue reading
Skills needed for a data scientist? (comments on the HBR article)
Recently, I read an article on the Harvard Business Review (HBR) website about data science skills for businesses. This article proposes to categorize skills related to data on a 2×2 matrix where skills are labelled as useful vs. not useful, and … Continue reading
Periodic patterns in Web log time series
Recently, I analysed visitor trends on this blog and made two observations. First, there are about 500 to 1000 visitors per day. For this, I want to thank you all for reading and commenting on the blog. Second, if we … Continue reading
