Archives
Categories
- Academia (54)
- artificial intelligence (21)
- Big data (57)
- cfp (5)
- Conference (44)
- Data Mining (108)
- Data science (54)
- General (35)
- Industry (1)
- Interview (1)
- Machine Learning (4)
- Mathematics (2)
- open-source (10)
- Pattern Mining (23)
- Plagiarism (6)
- Programming (16)
- Research (91)
- Time series (2)
- Uncategorized (14)
- Utility Mining (8)
- Video (10)
- Website (1)
-
Recent Posts
- Analyzing the COVID-19 genome with AI and data mining techniques (paper + data + code)
- New version of SPMF (2.44): 4 new algorithms, datasets and features
- An introduction to frequent subgraph mining (repost)
- Sequential pattern mining vs Sequence prediction ?
- Email invitation to be a “special” speaker, a scam?
- Two journal special issues with deadlines in 2021
- Atomic Habits to Become a Better Researcher
- The Hard Road to Success in Academia
- If I would do a PhD again, what would I do differently?
- Analyzing COVID-19 tweets to understand the public opinion
Recent Comments
- Philippe Fournier-Viger on Six important skills to become a succesful researcher
- Analyzing the COVID-19 genome with AI and data mining techniques (paper + data + code) | The Data Mining Blog on An Introduction to Sequential Rule Mining
- Analyzing the COVID-19 genome with AI and data mining techniques (PAPER + CODE + DATA) | The Data Mining Blog on An Introduction to Sequential Pattern Mining
- Irfan Nurhidayat on Six important skills to become a succesful researcher
- Lankarie Chamoda on Six important skills to become a succesful researcher
Tag cloud
- academia
- ai
- algorithm
- algorithms
- article
- articles
- artificial intelligence
- big data
- china
- conference
- data
- data mining
- data science
- dataset
- frequent pattern mining
- frequent patterns
- high-utility mining
- high utility itemset mining
- icdm
- ieaaie
- itemset mining
- java
- journal
- machine learning
- open-source
- pakdd
- paper
- papers
- pattern mining
- phd
- plagiarism
- programming
- Research
- researcher
- research paper
- review
- reviewer
- sequence
- sequence prediction
- software
- spmf
- utility mining
- video
- workshop
- writing
Number of visitors:
1,946,574
Tag Archives: data science
New version of SPMF (2.44): 4 new algorithms, datasets and features
Today, I am happy to announce that a new version of the SPMF open-source data mining software is released (v. 2.44). This is the download page. This new version was made possible due to several contributors. What is new? New … Continue reading
Posted in Data Mining, Data science, open-source, Pattern Mining, Utility Mining
Tagged algorithm, data mining, data science, dataset, open source, open-source, pattern mining, spmf
Leave a comment
Sequential pattern mining vs Sequence prediction ?
In this blog post, I will answer a question that I have received in my e-mail about what is the difference between sequential pattern mining and sequence prediction. I think that this is a good question and sharing the answer … Continue reading
Posted in Big data, Data Mining
Tagged data science, pattern mining, sequence, sequence prediction, sequential pattern
Leave a comment
The 10 most cited papers in our “Data Science and Pattern Recognition” journal!
As some of you know, I am editor-in-chief of the Data Science and Pattern Recognition (DSPR) journal. This journal has started in 2017 and four volumes have been published already with 28 papers, which I consider as a success. One … Continue reading
Posted in Academia, Big data, cfp, Data science, General, Machine Learning
Tagged big data, data mining, data science, dspr, journal, kdd, pattern recognition
Leave a comment
The PAKDD 2020 conference (a brief report)
In this report, I will talk about the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020), from the 11th to 14th May 2020. The PAKDD conference PAKDD is a top international conference on data mining / big data in the Pacific-Asia part of the world. … Continue reading
Posted in Conference, Data Mining, Data science
Tagged conference, data mining, data science, pakdd
1 Comment
(Video) Sequence prediction with the CPT and CPT+ Models
Today, I presents the CPT and CPT+ sequence prediction models in a video. Sequence prediction is an important task in data mining which consists of predicting the next symbols of a sequence. It can be used for example to predict the next word that someone will … Continue reading
Correlation does not imply causation
There is a well known principle in statistics that correlation does not imply causation. It means that even if we observe that two variables behave in the same way, we should not conclude that the behavior of one of those variables … Continue reading
Posted in Big data, Data Mining, Data science
Tagged causation, correlation, data, data analysis, data mining, data science
Leave a comment
(video) Mining Frequent Itemsets with the Apriori algorithm
This is a video presentation of the Apriori algorithm for discovering frequent itemsets in data. Frequent itemset mining is one of the most popular data mining task. VIDEO LINK: http://www.philippe-fournier-viger.com/spmf/videos/apriori.mp4 The Java source code of the Apriori algorithm and datasets for evaluating its performance are available in the SPMF software. If you want … Continue reading
Posted in Data Mining, Data science, Pattern Mining, Video
Tagged algorithm, apriori, big data, data mining, data science, frequent pattern mining, frequent patterns, video
Leave a comment
Interview with Prof. Rage Uday Kiran about Data Mining
Today, I have the pleasure to interview Rage Uday Kiran researcher at the National Institute of Informatics in Tokyo, Japan. R. Uday Kiran is an Indian researcher who has been working in Japan for several years. He has been active mainly in the field of data mining, and … Continue reading
Posted in Data Mining, Data science, Interview, Pattern Mining
Tagged data mining, data science, pattern mining, periodic pattern
Leave a comment
The best data mining mailing lists (for researchers)
Today, I will list a few useful mailing lists related to data mining and big data. Subscribing to these mailing list is useful for PhD students and researchers, as many jobs, conferences, special issues and other opportunities are advertised on these mailing lists. It is … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, machine learning, mailing list, Research
Leave a comment
Analyzing the source code of SPMF (5 years later)
Five years ago, I had analyzed the source code of the SPMF data mining software using an open-source tool called CodeAnalyzer ( http://sourceforge.net/projects/codeanalyze-gpl/ ). This had provided some interesting insights about the structure of the project, especially in terms of lines of codes and code to … Continue reading
Posted in Data Mining, Data science, open-source
Tagged data analysis, data mining, data science, open source, software, spmf
Leave a comment