Archives
Categories
- Academia (74)
- artificial intelligence (36)
- Big data (80)
- cfp (8)
- Chinese posts (1)
- Conference (67)
- Data Mining (164)
- Data science (89)
- Database (1)
- General (42)
- Industry (2)
- Interview (1)
- Java (10)
- Latex (8)
- Machine Learning (19)
- Mathematics (2)
- open-source (35)
- Other (2)
- Pattern Mining (60)
- Programming (17)
- Research (108)
- spmf (41)
- Time series (3)
- Uncategorized (16)
- Utility Mining (17)
- Video (18)
- Website (3)
-
Recent Posts
- UDML 2023 workshop!
- How to Detect and Classify Metamorphic Malware with Sequential Pattern Mining (MalSPM)
- How to propose a special issue for a journal?
- ASCII Art for SPMF
- Efficiency problems with using java.util.Hashmap
- SPMF’s architecture (5) The Graphical User Interface
- SPMF 3.0: Towards even more efficiency
- SPMF’s architecture (4) The MemoryLogger
- Unethical services in academia
- SPMF’s architecture (3) The Preference Manager
Recent Comments
- Philippe Fournier-Viger on How to auto-adjust the minimum support threshold according to the data size
- Nabil R Adam on How to auto-adjust the minimum support threshold according to the data size
- SPMF’s architecture (5) The Graphical User Interface | The Data Mining Blog on SPMF’s architecture (2) The Main class and the Command Processor
- SPMF’s architecture (3) The Preference Manager | The Data Mining Blog on SPMF’s architecture (1) The Algorithm Manager
- Philippe Fournier-Viger on The KDDCup 2015 dataset
Tag cloud
- academia
- ai
- algorithm
- algorithms
- apriori
- article
- articles
- artificial intelligence
- big data
- china
- conference
- data
- data mining
- data science
- frequent pattern mining
- graph
- high utility itemset mining
- icdm
- itemset
- itemset mining
- java
- journal
- latex
- machine learning
- open-source
- open source
- paper
- papers
- pattern mining
- periodic pattern
- phd
- programming
- Research
- researcher
- review
- reviewer
- sequence
- sequential pattern
- software
- spmf
- udml
- utility mining
- video
- workshop
- writing
Number of visitors:
2,157,903
Tag Archives: dataset
New version of SPMF (2.44): 4 new algorithms, datasets and features
Today, I am happy to announce that a new version of the SPMF open-source data mining software is released (v. 2.44). This is the download page. This new version was made possible due to several contributors. What is new? New … Continue reading
Posted in Data Mining, Data science, open-source, Pattern Mining, spmf, Utility Mining
Tagged algorithm, data mining, data science, dataset, open source, open-source, pattern mining, spmf
Leave a comment
Datasets of 30 English novels for pattern mining and text mining
Today, I want to announce that I have just made public datasets of 30 novels from English Novels from 10 authors of the XIX century. These datasets can be used for testing algorithms for sequential pattern mining, sequential rule mining, as well as for some text … Continue reading
Subgraph mining datasets
In this post, I will provide links to standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that can be used for debugging subgraph mining algorithms. The format of graph datasets A graph dataset is a text … Continue reading
Posted in Big data, Data Mining
Tagged data mining, dataset, frequent subgraph, graph, subgraph
Leave a comment
How to encourage data mining researchers to share their source code and datasets?
A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“. I explained several advantages that researchers can get by sharing the source code of … Continue reading
Posted in Data Mining, Research
Tagged data mining, dataset, open-source, source code
Leave a comment