Author Archives: Philippe Fournier-Viger
What are the steps to implement a data mining algorithm?
In this post, I will discuss the steps that I follow to implement a data mining algorithm. The subject of this post comes from a question that I recently received by e-mail, and I think that it …
Posted in Data Mining, Programming
Tagged algorithm, data mining, design, implementation, programming
43 Comments
Choosing data structures according to what you want to do
Today, I am writing a post about programming. I want to share a simple but important idea for writing optimized code. The idea is to choose data structures according to what you want to do instead of what you want to …
Posted in Data Mining, Programming
Tagged association rules, bitset, optimization, programming, red-black tree, topkrules
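As a small illustration of picking a data structure for the operation you need, here is a minimal sketch. It is an assumption for illustration and not necessarily the example used in the post: it supposes that the operation to optimize is counting how many transactions contain two itemsets at once, and it stores transaction ids as bits of an integer so that the count becomes a bitwise AND. The function names `tidset_to_bits` and `joint_support` are hypothetical.

```python
def tidset_to_bits(tids):
    """Encode a collection of transaction ids as the set bits of an integer."""
    bits = 0
    for tid in tids:
        bits |= 1 << tid  # bit number `tid` marks that transaction as containing the itemset
    return bits


def joint_support(bits_a, bits_b):
    """Count the transactions that contain both itemsets."""
    # The AND keeps only transaction ids present in both bitsets;
    # counting the '1' digits gives the size of the intersection.
    return bin(bits_a & bits_b).count("1")


# Hypothetical data: itemset A occurs in transactions 0, 2, 3, 5; itemset B in 2, 3, 4.
a = tidset_to_bits([0, 2, 3, 5])
b = tidset_to_bits([2, 3, 4])
shared = joint_support(a, b)
```

With lists of ids, the same intersection would need a loop or a conversion to sets; the bitset suits this case because intersecting is the operation performed most often.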
Analyzing the source code of the SPMF data mining software
Hi everyone, in this blog post, I will discuss how I applied an open-source tool named Code Analyzer (http://sourceforge.net/projects/codeanalyze-gpl/) to analyze the source code of my open-source data mining software, SPMF. I have applied …
How to auto-adjust the minimum support threshold according to the data size
Today, I will write a quick post on how to automatically adjust the minimum support threshold of frequent pattern mining algorithms such as Apriori, FPGrowth and PrefixSpan according to the size of the data. The problem is simple. Let’s …
Posted in Data Mining, Programming
Tagged apriori, fpgrowth, frequent pattern mining, itemset mining, minsup, prefixspan
42 Comments
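To make the idea of a size-dependent threshold concrete, here is a minimal sketch. It assumes the simplest scheme, in which the minimum support is given as a fraction of the database and converted to a number of transactions; this is an illustration only and may differ from the adjustment described in the post. The function name `absolute_minsup` is hypothetical.

```python
import math


def absolute_minsup(relative_minsup, transaction_count):
    """Convert a relative minimum support (a fraction in (0, 1]) to a transaction count."""
    if not 0 < relative_minsup <= 1:
        raise ValueError("relative_minsup must be in (0, 1]")
    # Round up so that a pattern must appear in at least this share of the data,
    # and never go below 1, otherwise every pattern would qualify on tiny datasets.
    return max(1, math.ceil(relative_minsup * transaction_count))


# The same relative threshold gives a different absolute threshold as the data grows.
small = absolute_minsup(0.25, 8)
large = absolute_minsup(0.25, 1000)
```

A fixed fraction keeps the threshold proportional to the data, but it can be too strict or too loose at the extremes, which is the kind of situation an automatic adjustment has to handle.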
How to characterize and compare data mining algorithms?
Hi, today, I will discuss how to compare data mining algorithms. This is an important question for data mining researchers who want to evaluate which algorithm is “better” in general or for a given situation. This question is also important …
Posted in Data Mining, Programming, Research
Tagged algorithms, characteristics, classification, comparison, data mining, evaluation
7 Comments
A Map of Data Mining Algorithms (offered in SPMF v092c)
Hi, I have made a map to visualize the relationships between the 52 different data mining algorithms offered in the SPMF data mining software. You can view it in PNG format by clicking on the picture below: Or you can …
Posted in Data Mining, Programming
Tagged algorithms, data mining, java, map, open-source, spmf
2 Comments
On the quality of peer-review for academic journals (updated)
Nowadays, a lot of low-quality academic journals are popping up everywhere on the Web. Actually, it does not take much to start a low-quality journal (just a website). What is the goal of low-quality journals? It is generally to earn …
Why researchers should make their research papers available on internet?
In this blog post, I will discuss the importance of making research papers available on the internet. As you probably know, many researchers nowadays prefer to search for papers/articles on the internet instead of searching at their university’s library. Searching …
What does it takes to do a good Ph.D?
Today, I will discuss this question: “What does it takes to do a good Ph.D ?”. We can answer this question from several points of view. First, on a personal level, some people think that “talent” is the most important …
How to choose a research advisor for M.Sc. / Ph.D ?
In this post, I will discuss how to choose a research advisor for an M.Sc. or Ph.D. This is a very important decision for any graduate student, one that can have an important impact on their success, and on their …