Archives
Categories
- Academia (91)
 - artificial intelligence (35)
 - Big data (83)
 - Bioinformatics (6)
 - cfp (12)
 - China (1)
 - Chinese posts (1)
 - Conference (76)
 - Data Mining (189)
 - Data science (107)
 - Database (2)
 - General (43)
 - Industry (2)
 - Java (12)
 - Latex (12)
 - Machine Learning (22)
 - Mathematics (2)
 - open-source (42)
 - Other (3)
 - Pattern Mining (91)
 - Plagiarism (1)
 - Programming (17)
 - Research (108)
 - spmf (59)
 - Time series (3)
 - Uncategorized (23)
 - Utility Mining (23)
 - Video (19)
 - Website (6)
 
- 
		
Recent Posts
- Fixing the reviewresponse.cls LaTeX Class to Allow Multi-Page Comments
 - How to fix reviewresponse.cls for custom reviewer numbering
 - The Conference Hotel Booking Scam
 - Huge traffic from a botnet looking for datasets
 - New version of SPMF: 2.63!
 - Upcoming feature of SPMF 2.63: Taxonomy Viewer
 - CFP: The OCSA 2025 conference
 - Update on SPMF next version: the visual pattern viewer
 - CFP: HP4MoDa 2025 @ IEEE BIBM 2025
 - An Improved Pattern Viewer
 
 Recent Comments
- Philippe Fournier-Viger on What is a Closed Itemset and Why is it Useful?
 - Philippe Fournier-Viger on An Improved Pattern Viewer
 - Bairavi Balaguru on What is a Closed Itemset and Why is it Useful?
 - Kushal P. Birla on An Improved Pattern Viewer
 - Kushal P. Birla on An Improved Pattern Viewer
 


- 

 - 

 
Tag cloud
- academia
 - ai
 - algorithm
 - apriori
 - article
 - artificial intelligence
 - association rule
 - big data
 - cfp
 - china
 - conference
 - data
 - data mining
 - data science
 - episode
 - graph
 - high utility itemset mining
 - icdm
 - itemset
 - itemset mining
 - java
 - journal
 - latex
 - machine learning
 - open-source
 - open source
 - pakdd
 - paper
 - papers
 - pattern mining
 - periodic pattern
 - phd
 - Research
 - researcher
 - reviewer
 - sequence
 - sequential pattern
 - software
 - spmf
 - udml
 - utility mining
 - video
 - website
 - workshop
 - writing
 
Number of visitors:
2,412,748
Category Archives: Big data
Introduction to the K-Means clustering algorithm (with Java code)
In this blog post, I will introduce the popular data mining task of clustering (also called cluster analysis). I will explain what is the goal of clustering, and then introduce the popular K-Means algorithm with an example. Moreover, I will briefly explain how an open-source Java implementation of … Continue reading
									
						Posted in Big data, Data Mining, Data science, open-source					
					
				
								
					Tagged clustering, data mining, data science, java, k-means, open-source, spmf				
				
				
				2 Comments
							
		Introduction to time series mining with SPMF
This blog post briefly explain how time series data mining can be performed with the Java open-source data mining library SPMF (v.2.06). It first explain what is a time series and then discuss how data mining can be performed on time series. What is … Continue reading
									
						Posted in Big data, Data Mining, open-source, spmf, Time series					
					
				
								
					Tagged big data, data mining, data science, java, open-source, pattern mining, SAX algorithm, spmf, time series				
				
				
				1 Comment
							
		Discovering hidden patterns in texts using SPMF
This tutorial will explain how to analyze text documents to discover complex and hidden relationships between words. We will illustrate this with a Sherlock Holmes novel. Moreover we will explain how hidden patterns in text can be used to recognize the author of a … Continue reading
									
						Posted in Big data, Data Mining, Data science, open-source, spmf					
					
				
								
					Tagged data mining, data science, pattern mining, spmf				
				
				
				9 Comments
							
		Brief report about the Dexa 2016 and Dawak 2016 conferences
This week, I have been attending the DEXA 2016 and DAWAK 2016 conferences, in Porto, Portugal, from the 4th to 8th September 2016, to present three papers. In this blog post, I will give a brief report about these conferences. About these … Continue reading
Brief report about the 12th International Conference on Machine Learning and Data Mining conference (MLDM 2016)
In this blog post, I will provide a brief report about the 12th Intern. Conference on Machine Learning and Data Mining (MLDM 2016), that I have attended from the 18th to 20th July 2016 in Newark, USA. First I have to say, that I … Continue reading
Brief report about the 16th Industrial Conference on Data mining 2016 (ICDM 2016)
In this blog post, I will provide a brief report about the 16th Industrial Conference on Data mining 2016, that I have attended from the 13 to 14 July 2016 in New York, USA. About the conference This conference is an established conference in … Continue reading
									
						Posted in Big data, Conference, Data Mining					
					
				
				
				4 Comments
							
		The top journals and conferences in data mining / data science
A key question for data mining and data science researchers is to know what are the top journals and conferences in the field, since it is always best to publish in the most popular journals or conferences. In this blog post, … Continue reading
									
						Posted in Big data, Data Mining, Data science, Research					
					
				
				
				4 Comments
							
		An introduction to periodic pattern mining
In this blog post I will give an introduction to the discovery of periodic patterns in data. Mining periodic patterns is an important data mining task as patterns may periodically appear in all kinds of data, and it may be desirable to find them … Continue reading
									
						Posted in Big data, Data Mining, Data science, open-source, Research, Uncategorized, Utility Mining					
					
				
				
				29 Comments
							
		An Introduction to Sequence Prediction
In this blog post, I will give an introduction to the task of sequence prediction, a popular data mining/machine learning task, which consist of predicting the next symbol of a sequence of symbols. This task is important as it have many … Continue reading
									
						Posted in Big data, Data Mining, Data science, Research					
					
				
								
					Tagged open-source, prediction model, sequence, sequence prediction				
				
				
				9 Comments
							
		Finding a Data Scientist Unicorn or building a Data Science Team?
In recent months/years, many blog posts have been trending on the social Web about what is a “data scientist“, as this term has become very popular. As there is much hype about this term, some people have even jokingly said that … Continue reading
									
						Posted in Big data, Data Mining, Data science					
					
				
				
				Leave a comment