Tag Archives: dataset

Datasets of 30 English novels for pattern mining and text mining

Today, I want to announce that I have just made public datasets of 30 novels from English Novels from 10 authors of the XIX century. These datasets can be used for testing algorithms for sequential pattern mining, sequential rule mining, as well as for some text … Continue reading

Posted in Data Mining, Data science | Tagged , , | Leave a comment

Subgraph mining datasets

In this post, I will provide two standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that I have created for debugging subgraph mining algorithms. The format of graph datasets A graph dataset is a text file … Continue reading

Posted in Big data, Data Mining | Tagged , , , , | Leave a comment

How to encourage data mining researchers to share their source code and datasets?

A few months ago, I wrote a popular blog post on this blog about why it is important to publish source code and datasets for researchers“.  I explained several advantages that researchers can get by sharing the source code of … Continue reading

Posted in Data Mining, Research | Tagged , , , | Leave a comment