Category Archives: Data science

Brief report about the ADMA 2019 conference

This week, I am also attending the 14th International Conference on Advanced Data Mining and Applications (ADMA 2019) conference in Dalian, China, from the 21st to 23rd November at Dalian Neusoft University of Information. About ADMA The ADMA conference is focused on data mining and its applications, and is … Continue reading

Posted in Conference, Data Mining, Data science | Tagged , , , , | 3 Comments

25 years of pattern mining

This year, we are in 2019, and it is already 25 years since Agrawal wrote his seminal papers on frequent itemset mining and association rule mining in 1994. Since then, there has been thousands of papers published on this topic, some about algorithm design, new pattern mining … Continue reading

Posted in Big data, Data Mining, Data science, Pattern Mining | Tagged , , , | Leave a comment

Brief report about DAWAK 2019 / DEXA 2019

This week, I am attending the DAWAK 2019 and DEXA 2019 conferences in Linz, Austria from the 26th to the 29th August 2019. In this blog post, I will provide a report about these conferences. About the DAWAK and DEXA conferences DAWAK ( Intern. Conf. on Data Warehousing … Continue reading

Posted in Big data, Conference, Data Mining, Data science | Tagged , , , , | 1 Comment

Correlation does not imply causation

There is a well known principle in statistics that correlation does not imply causation. It means that even if we observe that two variables behave in the same way, we should not conclude that the behavior of one of those variables … Continue reading

Posted in Big data, Data Mining, Data science | Tagged , , , , , | Leave a comment

(video) Mining Frequent Itemsets with the Apriori algorithm

This is a video presentation of the Apriori algorithm for discovering frequent itemsets in data. Frequent itemset mining is one of the most popular data mining task. VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/apriori.mp4 The Java source code of the Apriori algorithm and datasets for evaluating its performance are available in the SPMF software. If you want … Continue reading

Posted in Data Mining, Data science, Pattern Mining, Video | Tagged , , , , , , , | Leave a comment

The PAKDD 2019 conference (a brief report)

This year, I am attending the PAKDD 2019 conference (23rd Pacific Asia Conference on Knowledge Discovery and Data Mining), in Macau, China, from the 14th to the 17th April 2019. In this blog post, I will provide information about the conference. About the PAKDD conference PAKDD is one of the most important … Continue reading

Posted in Big data, Conference, Data Mining, Data science | Tagged , , , , | 4 Comments

The best data mining mailing lists (for researchers)

Today, I will list a few useful mailing lists related to data mining and big data. Subscribing to these mailing list is useful for PhD students and researchers, as many jobs, conferences, special issues and other opportunities are advertised on these mailing lists. It is … Continue reading

Posted in Big data, Data Mining, Data science | Tagged , , , , , , | Leave a comment

Analyzing the source code of SPMF (5 years later)

Five years ago, I had analyzed the source code of the SPMF data mining software using an open-source tool called CodeAnalyzer ( http://sourceforge.net/projects/codeanalyze-gpl/ ). This had provided some interesting insights about the structure of the project, especially in terms of lines of codes and code to … Continue reading

Posted in Data Mining, Data science, open-source, spmf | Tagged , , , , , | Leave a comment

Report about the 2018 International Workshop on Mining of Massive Data and IoT

This week, I have attended the 2018 International Workshop on Mining of Massive Data and IoT  (2018 年大数据与物联网挖掘国际研讨会) organized by the Fujian Normal University in the city of  Fuzhou, China from the 18th to 20thDecember 2018. I have attended the workshop to give a talk … Continue reading

Posted in Conference, Data Mining, Data science | Tagged , , , | Leave a comment

(video) Minimal Correlated High Utility Itemsets with FCHM

This is a video presentation of the paper “Mining Correlated High-Utility Itemsets Using the bond Measure” about correlated high utility pattern mining using FCHM.  VIDEO LINK: https://www.philippe-fournier-viger.com/spmf/videos/FCHM_correlated_itemsets.mp4 More information about the FCHM algorithm are provided in this research paper: Fournier-Viger, P., Zhang, Y., Lin, J. C.-W., … Continue reading

Posted in Big data, Data Mining, Data science, Video | Tagged , , , , , | Leave a comment