Archives
Categories
- Academia (85)
- artificial intelligence (34)
- Big data (82)
- Bioinformatics (3)
- cfp (10)
- Chinese posts (1)
- Conference (74)
- Data Mining (184)
- Data science (103)
- Database (2)
- General (43)
- Industry (2)
- Java (11)
- Latex (10)
- Machine Learning (20)
- Mathematics (2)
- open-source (39)
- Other (3)
- Pattern Mining (86)
- Plagiarism (1)
- Programming (17)
- Research (109)
- spmf (55)
- Time series (3)
- Uncategorized (23)
- Utility Mining (23)
- Video (19)
- Website (4)
-
Recent Posts
- Reducing the cost of web hosting…
- SPMF 2.62 is released!
- My research is open-source
- Two new shopping datasets with taxonomy
- How to deal with unethical reviewers? The good example of the EAAI journal
- CSRankings: still a biased ranking
- The story of the most influential paper award of PAKDD 2024
- A brief report about PAKDD 2024
- Report on the UDML 2024 workshop @ PAKDD 2024
- Upcoming SPMF features for v.2.62 – More Dataset Stats Tools
Recent Comments
- Philippe Fournier-Viger on A simple BAT script to unzip ZIP files in all sub-directories and then delete the ZIP files
- hamjii on A simple BAT script to unzip ZIP files in all sub-directories and then delete the ZIP files
- Philippe Fournier-Viger on How to call SPMF from R?
- Toni Garcia on How to call SPMF from R?
- K. P. Birla on About the author
Tag cloud
- academia
- ai
- algorithm
- apriori
- article
- articles
- artificial intelligence
- association rule
- big data
- cfp
- china
- conference
- data
- data mining
- data science
- episode
- graph
- high utility itemset mining
- icdm
- itemset
- itemset mining
- java
- journal
- latex
- machine learning
- open-source
- open source
- pakdd
- paper
- papers
- pattern mining
- periodic pattern
- phd
- Research
- researcher
- reviewer
- sequence
- sequential pattern
- software
- spmf
- udml
- utility mining
- video
- workshop
- writing
Number of visitors:
2,300,803
Tag Archives: dataset
Visualizing the item frequency distribution of pattern mining datasets
In this blog post, I will explain a quick and easy way of visualizing the frequency distribution of items in a dataset in SPMF format for pattern mining. To do this, we will use a new online tool that I … Continue reading
Posted in Pattern Mining, spmf
Tagged association rule, association rule mining, data, dataset, density, frequency, frequency distribution, itemset, pattern mining, spmf format, support
Leave a comment
New version of SPMF (2.44): 4 new algorithms, datasets and features
Today, I am happy to announce that a new version of the SPMF open-source data mining software is released (v. 2.44). This is the download page. This new version was made possible due to several contributors. What is new? New … Continue reading
Posted in Data Mining, Data science, open-source, Pattern Mining, spmf, Utility Mining
Tagged algorithm, data mining, data science, dataset, open source, open-source, pattern mining, spmf
Leave a comment
Datasets of 30 English novels for pattern mining and text mining
Today, I want to announce that I have just made public datasets of 30 novels from English Novels from 10 authors of the XIX century. These datasets can be used for testing algorithms for sequential pattern mining, sequential rule mining, as well as for some text … Continue reading
Subgraph mining datasets
In this post, I will provide links to standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that can be used for debugging subgraph mining algorithms. The format of graph datasets A graph dataset is a text … Continue reading
Posted in Big data, Data Mining
Tagged data mining, dataset, frequent subgraph, graph, subgraph
Leave a comment
How to encourage data mining researchers to share their source code and datasets?
A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“. I explained several advantages that researchers can get by sharing the source code of … Continue reading
Posted in Data Mining, Research
Tagged data mining, dataset, open-source, source code
Leave a comment