Tag Archives: data science
New version of SPMF (2.44): 4 new algorithms, datasets and features
Today, I am happy to announce that a new version of the SPMF open-source data mining software has been released (v. 2.44). This is the download page. This new version was made possible thanks to several contributors. What is new? New … Continue reading
Posted in Data Mining, Data science, opensource, Pattern Mining, Utility Mining
Tagged algorithm, data mining, data science, dataset, open source, opensource, pattern mining, spmf
Sequential pattern mining vs Sequence prediction ?
In this blog post, I will answer a question that I received by email about the difference between sequential pattern mining and sequence prediction. I think that this is a good question and sharing the answer … Continue reading
Posted in Big data, Data Mining
Tagged data science, pattern mining, sequence, sequence prediction, sequential pattern
The 10 most cited papers in our “Data Science and Pattern Recognition” journal!
As some of you know, I am the editor-in-chief of the Data Science and Pattern Recognition (DSPR) journal. This journal started in 2017, and four volumes have already been published with 28 papers, which I consider a success. One … Continue reading
Posted in Academia, Big data, cfp, Data science, General, Machine Learning
Tagged big data, data mining, data science, dspr, journal, kdd, pattern recognition
The PAKDD 2020 conference (a brief report)
In this report, I will talk about the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020), held from the 11th to the 14th of May 2020. The PAKDD conference: PAKDD is a top international conference on data mining / big data in the Pacific-Asia part of the world. … Continue reading
Posted in Conference, Data Mining, Data science
Tagged conference, data mining, data science, pakdd
(Video) Sequence prediction with the CPT and CPT+ Models
Today, I present the CPT and CPT+ sequence prediction models in a video. Sequence prediction is an important task in data mining which consists of predicting the next symbols of a sequence. It can be used for example to predict the next word that someone will … Continue reading
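To make the task concrete, here is a minimal sketch of sequence prediction. It is not the CPT or CPT+ model from the video: it is a much simpler first-order frequency predictor that only looks at the last symbol of the prefix. The function names `train` and `predict_next` and the toy sequences are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def train(sequences):
    """Count, for each symbol, how often every other symbol directly follows it."""
    successors = defaultdict(Counter)
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            successors[current][following] += 1
    return successors


def predict_next(successors, prefix):
    """Predict the symbol that most often followed the last symbol of prefix.

    Returns None when the prefix is empty or its last symbol was never
    followed by anything in the training sequences.
    """
    if not prefix or prefix[-1] not in successors:
        return None
    return successors[prefix[-1]].most_common(1)[0][0]


# Toy training data (hypothetical): three short sequences of symbols.
training_sequences = [
    ["a", "b", "c"],
    ["a", "b", "d"],
    ["x", "b", "c"],
]
model = train(training_sequences)
print(predict_next(model, ["a", "b"]))  # "c" followed "b" twice, "d" once
```

A model such as CPT uses more of the prefix than its last symbol, so this sketch should be read only as an illustration of the input and output of the task.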
Correlation does not imply causation
There is a well-known principle in statistics that correlation does not imply causation. It means that even if we observe that two variables behave in the same way, we should not conclude that the behavior of one of those variables … Continue reading
Posted in Big data, Data Mining, Data science
Tagged causation, correlation, data, data analysis, data mining, data science
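One common way this principle shows up is a hidden common cause. The sketch below uses simulated data, not data from the post: a variable `z` drives both `x` and `y`, which never influence each other. The variable names, the noise level, and the sample size are all assumptions chosen for illustration.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def simulate(n=2000, seed=0):
    """Simulate a hidden cause z and two variables that both depend only on z."""
    rng = random.Random(seed)
    z = [rng.gauss(0, 1) for _ in range(n)]
    x = [zi + rng.gauss(0, 0.1) for zi in z]  # x is caused by z, not by y
    y = [zi + rng.gauss(0, 0.1) for zi in z]  # y is caused by z, not by x
    return z, x, y


z, x, y = simulate()
raw = pearson(x, y)
# Remove the effect of z. Subtracting z directly works here only because
# the simulation uses a coefficient of exactly 1 for z in both x and y.
adjusted = pearson([xi - zi for xi, zi in zip(x, z)],
                   [yi - zi for yi, zi in zip(y, z)])
print(f"correlation of x and y: {raw:.2f}")
print(f"after removing z:       {adjusted:.2f}")
```

The first value is close to 1 although neither variable causes the other, and the correlation essentially disappears once the common cause is accounted for.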
(video) Mining Frequent Itemsets with the Apriori algorithm
This is a video presentation of the Apriori algorithm for discovering frequent itemsets in data. Frequent itemset mining is one of the most popular data mining tasks. VIDEO LINK: http://www.philippefournierviger.com/spmf/videos/apriori.mp4 The Java source code of the Apriori algorithm and datasets for evaluating its performance are available in the SPMF software. If you want … Continue reading
Posted in Data Mining, Data science, Pattern Mining, Video
Tagged algorithm, apriori, big data, data mining, data science, frequent pattern mining, frequent patterns, video
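For readers who want a quick feel for the idea before watching the video, the following is a minimal sketch of the level-wise Apriori search in Python. It is not the SPMF implementation (which is in Java); the function name `apriori`, the use of an absolute support count as threshold, and the toy transactions are assumptions made for illustration.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return a dict mapping each frequent itemset (frozenset) to its support count.

    min_support is an absolute number of transactions.
    """
    transactions = [frozenset(t) for t in transactions]
    # Level 1 candidates: every single item that appears in the data.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Count in how many transactions each candidate appears.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: keep a candidate only if all of its subsets with
        # k-1 items are frequent (the Apriori property).
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


# Toy transaction database (hypothetical).
database = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a", "b", "c"},
]
for itemset, support in sorted(apriori(database, 3).items(),
                               key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

With a minimum support of 3, each single item and each pair is frequent, while {a, b, c} appears in only two transactions and is discarded.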
Interview with Prof. Rage Uday Kiran about Data Mining
Today, I have the pleasure of interviewing Rage Uday Kiran, a researcher at the National Institute of Informatics in Tokyo, Japan. R. Uday Kiran is an Indian researcher who has been working in Japan for several years. He has been active mainly in the field of data mining, and … Continue reading
Posted in Data Mining, Data science, Interview, Pattern Mining
Tagged data mining, data science, pattern mining, periodic pattern
The best data mining mailing lists (for researchers)
Today, I will list a few useful mailing lists related to data mining and big data. Subscribing to these mailing lists is useful for PhD students and researchers, as many jobs, conferences, special issues and other opportunities are advertised on them. It is … Continue reading
Posted in Big data, Data Mining, Data science
Tagged big data, data, data mining, data science, machine learning, mailing list, Research
Analyzing the source code of SPMF (5 years later)
Five years ago, I analyzed the source code of the SPMF data mining software using an open-source tool called CodeAnalyzer (http://sourceforge.net/projects/codeanalyzegpl/). This provided some interesting insights about the structure of the project, especially in terms of lines of code and code to … Continue reading
Posted in Data Mining, Data science, opensource
Tagged data analysis, data mining, data science, open source, software, spmf