This week I have attended the China International Big Data Industry Expo 2018 in Guiyang, China. I will describe this event and some of the key things that I have observed so far. What is the China International Big Data Industry … Continue reading

# Category Archives: Big data

In this blog post, I will talk about the vision of the Semantic Web that was proposed in the years 2000s, and why it failed. Then, I will talk about how it has been replaced today by the use of … Continue reading

哈尔滨工业大学(深圳)工业设计研究中心正在招聘两名博士后研究人员进行数据挖掘/大数据方向的研究。 招聘条件: 计算机科学博士学位, 在数据挖掘或人工智能领域有着深厚的研究背景, 在数据挖掘或人工智能领域的优秀会议或期刊上发表过论文， 对数据挖掘算法的开发和应用有浓厚兴趣， 211/ 985大学或国外优秀学校博士学位优先考虑。 成功申请人将： 工作在与时间序列和空间序列相关方面或者其它与数据挖掘领域相关的理论或者工业应用。（确切的主题会根据申请人的优势讨论后确定）。 加入由Philippe Fournier-Viger教授领导的优秀研究团队，Philippe Fournier-Viger教授是流行数据挖掘库SPMF的创始人，并且与其他领域的优秀研究人员有密切合作。 工作在具有先进设备的实验室（实验室配备高端的工作站，用于大数据研究的服务器集群，GPU服务器，虚拟现实设备，身体传感器等）。 以年薪17.6万元人民币聘用两年（其中51600来自学校，120,000来自深圳市政府）。请注意，博士后研究员不需要对工资支付任何税费，学校会提供低价格的租赁公寓（大约1500/月，很大地节省了住宿费用）。 工作在全球计算机科学领域排名前50的大学之一，以及中国排名前10的大学之一。 工作在中国东南部增长最快的城市之一深圳，这里污染低，全年气候温暖，接近香港。 如果您对此职位感兴趣，请尽快发送您的详细简历（包括出版物和参考文献清单）至Philippe Fournier-Viger教授(philfv8@yahoo.com )，可以申请2018年或2019年的博士后名额。 Related posts:How to choose a research advisor for M.Sc. / Ph.D ?An Introduction to Sequential Rule MiningThe conference that tolerates up to … Continue reading

In this post, I will provide two standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that I have created for debugging subgraph mining algorithms. The format of … Continue reading

In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is an incorrect algorithm. I will publish this blog post because I have found that the algorithm is incorrect after spending a few days to … Continue reading

Discovering interesting patterns in data is often referred as data mining, data science or big data. In the last few years, I have written several blog posts providing introduction to data mining and key topics in data mining: An Introduction to … Continue reading

CALL FOR CHAPTERS High-Utility Pattern Mining: Theory, Algorithms and Applications Editors: Philippe Fournier-Viger, Chun-Wei Lin, Roger Nkambou, Bay Vo An edited book to be published by Springer in 2018 Introduction This book will provide an introduction to the high utility mining, reviews state-of-the-art … Continue reading

This blog post provides an introduction to the Apriori algorithm, a classic data mining algorithm for the problem of frequent itemset mining. Although Apriori was introduced in 1993, more than 20 years ago, Apriori remains one of the most important data mining algorithms, not … Continue reading

This week, I have attended the PAKDD 2017 conference in Jeju Island, South Korea, this week, from the 23 to 26th May. PAKDD is the top data mining conference for the asia-pacific region. It is held every year in a … Continue reading

In the data science and data mining communities, several practitioners are applying various algorithms on data, without attempting to visualize the data. This is a big mistake because sometimes, visualizing the data greatly helps to understand the data. Some phenomena are obvious … Continue reading