Archives
Categories
- Academia (78)
- artificial intelligence (34)
- Big data (79)
- Bioinformatics (3)
- cfp (9)
- Chinese posts (1)
- Conference (69)
- Data Mining (174)
- Data science (96)
- Database (1)
- General (42)
- Industry (2)
- Interview (1)
- Java (10)
- Latex (8)
- Machine Learning (19)
- Mathematics (2)
- open-source (36)
- Other (3)
- Pattern Mining (79)
- Programming (17)
- Research (108)
- spmf (45)
- Time series (3)
- Uncategorized (20)
- Utility Mining (21)
- Video (19)
- Website (3)
-
Recent Posts
- Busy times
- The MDLM 2023 conference: a scam?
- Fake reviews…
- SPMF: upcoming feature: The Memory Viewer
- (video) The EFIM algorithm
- (video) Introduction to episode mining
- As a speaker, we should always be ready for the unexpected…
- Two common English errors in pattern mining papers
- How to become a well-known researcher?
- Call for tutorials at BESC 2023
Recent Comments
- Darsh on The top journals and conferences in data mining / data science
- michael Tabi on Six important skills to become a succesful researcher
- Philippe Fournier-Viger on MLDM 2019… still not in New York! …and not published by Springer!
- Pr. Jeremie Sublime on MLDM 2019… still not in New York! …and not published by Springer!
- Philippe Fournier-Viger on How to call SPMF from R?
DSSBA2023
Tag cloud
- academia
- ai
- algorithm
- apriori
- article
- articles
- artificial intelligence
- association rule
- big data
- china
- conference
- data
- data mining
- data science
- frequent pattern mining
- graph
- high utility itemset mining
- icdm
- itemset
- itemset mining
- java
- journal
- latex
- machine learning
- open-source
- open source
- paper
- papers
- pattern mining
- periodic pattern
- phd
- programming
- Research
- researcher
- research paper
- reviewer
- sequence
- sequential pattern
- software
- spmf
- udml
- utility mining
- video
- workshop
- writing
Number of visitors:
2,216,098
Tag Archives: dataset
Visualizing the item frequency distribution of pattern mining datasets
In this blog post, I will explain a quick and easy way of visualizing the frequency distribution of items in a dataset in SPMF format for pattern mining. To do this, we will use a new online tool that I … Continue reading
Posted in Pattern Mining, spmf
Tagged association rule, association rule mining, data, dataset, density, frequency, frequency distribution, itemset, pattern mining, spmf format, support
Leave a comment
New version of SPMF (2.44): 4 new algorithms, datasets and features
Today, I am happy to announce that a new version of the SPMF open-source data mining software is released (v. 2.44). This is the download page. This new version was made possible due to several contributors. What is new? New … Continue reading
Posted in Data Mining, Data science, open-source, Pattern Mining, spmf, Utility Mining
Tagged algorithm, data mining, data science, dataset, open source, open-source, pattern mining, spmf
Leave a comment
Datasets of 30 English novels for pattern mining and text mining
Today, I want to announce that I have just made public datasets of 30 novels from English Novels from 10 authors of the XIX century. These datasets can be used for testing algorithms for sequential pattern mining, sequential rule mining, as well as for some text … Continue reading
Subgraph mining datasets
In this post, I will provide links to standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that can be used for debugging subgraph mining algorithms. The format of graph datasets A graph dataset is a text … Continue reading
Posted in Big data, Data Mining
Tagged data mining, dataset, frequent subgraph, graph, subgraph
Leave a comment
How to encourage data mining researchers to share their source code and datasets?
A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“. I explained several advantages that researchers can get by sharing the source code of … Continue reading
Posted in Data Mining, Research
Tagged data mining, dataset, open-source, source code
Leave a comment