
Recent Posts
 Plagiarism by Bhawna Mallick and Kriti Raj at Galgotias College of Engineering & Technology
 How to publish in top conferences/journals? The Blue Ocean Strategy
 This is why you should visualize your data!
 An Introduction to Sequential Pattern Mining
 An Introduction to Data Mining
 Write more papers or write better papers? (quantity vs quality)
 Using LaTeX for writing research papers
 An introduction to frequent subgraph mining
 We are launching a new data mining journal
 What is the job of a university professor?
Categories
 Academia (8)
 artificial intelligence (4)
 Big data (24)
 Conference (12)
 Data Mining (55)
 Data science (20)
 General (27)
 Mathematics (2)
 Opensource (6)
 Plagiarism (2)
 Programming (15)
 Research (48)
 Time series (1)
 Utility Mining (2)
Tag cloud
academia academic journal algorithm algorithms articles big data classification comparison conference data mining data science datasets frequent pattern mining frequent patterns graph internet itemset mining java journal library M.Sc. map opensource paper papers pattern mining peerreview Ph.D. phd plagiarism programmer programming publications Research research advisor research papers sequence sequential patterns software source code spmf thesis topic visualization website writingArchives
Recent Comments
 Philippe FournierViger on How to test if a data mining mining algorithm implementation is correct?
 Hitesh Pujari on How to test if a data mining mining algorithm implementation is correct?
 Philippe FournierViger on How to autoadjust the minimum support threshold according to the data size
 Ko Moe on How to autoadjust the minimum support threshold according to the data size
 Philippe FournierViger on An introduction to frequent subgraph mining
Number of visitors:
525605
Tag Archives: data mining
How to test if a data mining mining algorithm implementation is correct?
In this blog post, I will discuss how to check if a data mining algorithm implementation is correct and complete. This is a very important topic for researchers who are implementing data mining algorithms since an incorrect implementation may generate unexpected results. … Continue reading
Posted in Data Mining, Programming, Research
Tagged algorithm, correctness, data mining, debugging
5 Comments
An Introduction to HighUtility Itemset Mining
In this blog post, I will give an introduction about a popular problem in data mining, which is called “highutility itemset mining” or more generally utility mining. I will give an overview of this problem, explains why it is interesting, and provide source code of … Continue reading
Posted in Data Mining, Research, Utility Mining
Tagged data mining, datasets, frequent pattern mining, highutility mining, itemset mining, java, opensource, source code, spmf, utility mining
77 Comments
Big Problems only found in Big Data?
Today, I will discuss the topic of Big Data, which is a very popular topic nowadays. The popularity of big data can be seen for example in universities. Many universities are currently searching for professors who do research on “big data”. Moreover, … Continue reading
Discovering and visualizing sequential patterns in web log data using SPMF and GraphViz
Today, I will show how to use the opensource SPMF data mining software to discover sequential patterns in web log data. Then, I will show to how visualize the frequent sequential patterns found, using GraphViz. Step 1 : getting the … Continue reading
Posted in Big data, Data Mining, Data science, Programming
Tagged data mining, graph, patterns, sequential patterns, spmf, visualization
8 Comments
Brief report about the ADMA 2013 conference
In this blog post, I will discuss my recent trip to the ADMA 2013 conference (9th Intern. Conf. on Advanced Data Mining and Applications in China (December 1416 2013 in Hangzhou, China at Zhejiang University). Note that the view expressed … Continue reading
How to encourage data mining researchers to share their source code and datasets?
A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“. I explained several advantages that researchers can get by sharing the source code of … Continue reading
Posted in Data Mining, Research
Tagged data mining, dataset, opensource, source code
Leave a comment
The importance of constraints in data mining
Today, I will discuss an important concept in data mining which is the use of constraints. Data mining is a broad field incorporating many different kind of techniques for discovering unexpected and new knowledge from data. Some main data mining … Continue reading
How to measure the memory usage of data mining algorithms in Java?
Today, I will discuss the topic of accurately evaluating the memory usage of data mining algorithms in Java. I will share several problems that I have discovered with memory measurements in Java for data miners and strategies to avoid these … Continue reading
Posted in Data Mining, Programming, Research
Tagged comparison, data mining, experiment, java, memory, performance
1 Comment
What are the steps to implement a data mining algorithm?
In this post, I will discuss what are the steps that I follow to implement a data mining algorithm. The subject of this post comes from a question that I have received by email recently, and I think that it … Continue reading
Posted in Data Mining, Programming
Tagged algorithm, data mining, design, implementation, programming
51 Comments
Analyzing the source code of the SPMF data mining software
Hi everyone, In this blog post, I will discuss how I have applied an opensource tool that is named Code Analyzer ( http://sourceforge.net/projects/codeanalyzegpl/ ) to analyze the source code of my opensource data mining software named SPMF. I have applied … Continue reading