This is to announce that a new version of SPMF has been released on the 27th November 2022. This version has 7 new pattern mining algorithms:
the HUCI-Miner algorithm to mine closed high utility itemsets and generators at the same time (thanks to Jayakrushna Sahoo et al. for the original code )
the FHIM algorithm to mine all high utility itemsets (thanks to Jayakrushna Sahoo et al. for the original code)
the HGB algorithm to mine non redundant high utility association rules (thanks to Jayakrushna Sahoo et al. for the original code)
the HGB-all algorithm to derive all high utility association rules from the non redundant high utility association rules (thanks to Jayakrushna Sahoo et al. for the original code)
algorithms for mining sequential patterns with flexible constraints in a time-extended sequence database (eg. MOOC data)
the SPM-FC-L algorithm fi (Thanks to Wei Song et al. for the original code)
the SPM-FC-P algorithm (Thanks to Wei Song et al. for the original code)
Besides, it has several new features such as:
(1) An integrated text editor to open output file (to give an alternative to the system’s default text editor)
(2) Some improvements to the graphical user interface, such as shown below, such as colors to highlight algorithm categories and a window icon:
And some bugs have been fixed.
Besides a new MOOC.txt dataset of sequences of courses with timestamps has been added to the dataset page of SPMF.
Thanks again to all users and contributors to SPMF!
In this blog post, I will talk about the 11th International Conference on Model & Data Engineering (MEDI 2022), which I have attended this week. It was held from the 21st to 23rd November 2022, in Cairo, Egypt, and also online.
MEDI is a good conference for modelling, database, and related topics. MEDI is ranked C in the CORE 2021 ranking. It has been held in various countries over the years such as: Egypt, Estonia, France, Morocco, Spain, Cyprus, Italy and Portugal.
The local organization of the 2022 edition of MEDI is by the Nile University.
This year, I am glad to play the role of Program Committee (PC) Chair at this conference. Below, I give an overview of the event.
Opening ceremony
The opening ceremony was at 10:00 AM. The conference was introduced, and an overview of the program was given. Below, I present some slides from the opening ceremony with some more details.
Countries were MEDI was held:
Organizations. Here is an overview of the committees behind MEDI 2022:
Paper selection. About the program, this year 65 papers have been submitted to the main conference, and from that 18 were accepted for long presentation (28%) and 11 for short presentation (17%). The program committee consisted of 60 researchers from 23 countries, which provided 200 reviews, for an average of 3.5 reviews per paper.
Proceedings. Full presentation papers are published in a Springer LNCS volume, while short presentation papers are published together with workshop papers in a Springer CCIS volume.
Totally, there was 190 authors and submissions were made from 18 countries on five main topics: (1) Modelling, (2) Image processing and diagnosis, (3) Machine learning and optimization, (4) Natural language processing and (5) Database systems.
Workshop. A workshop called DETECT was co-organized with MEDI 2022, and 4 papers were accepted in that workshop (44%).
Special issues. Two special issues are organized for best papers of MEDI 2022
Keynote talks
There was three keynote talks.
The first keynote talk was by Prof. Vincent S. Tseng from National Yang Ming Chiao Tung University about “Broad and Deep Learning of Big Heterogeneous Health Data for Medical AI: Opportunities and Challenges“.
The second talk was “A service-based approach to drone service delivery in Skyway networks” by Prof. Athman Bouguettaya from The University of Sydney, Australia
The third talk was “Safety and security are key considerations in the design of critical systems” by Dr. Colin Snook from University of Southampton, United Kingdom.
Best paper awards of MEDI 2022
The best paper awards were announced during the closing ceremony. These papers were selected based on the peer reviews and also based on the presentations given at the MEDI conference.
Conclusion
This is all for this blog post! Hope this has been interesting. Looking forward to MEDI 2023! In about 1 week, I will talk to you about ICDM 2022.
In this blog post, I will talk about the 15th International Conference on Multi-disciplinary Trends in Artificial Intelligence (MIWAI 2022), which was held as a virtual event on November 18th, 2022. I have attended this conference to present two papers related to pattern mining.
MIWAI is a conference that has been held every year in the pacific/Asia region, for 15 years. In the past, MIWAI has been held in countries such as Vietnam, India, Thailand, China, Malaysia and Brunei. I have also attended MIWAI in Malaysia in 2019 (see my blog post about MIWAI 2019 here).
Proceedingsof MIWAI 2022
This year, the MIWAI conference received 42 papers from which 19 were accepted, which includes 14 full papers (acceptance rate of 33% for full papers) and 5 short papers (acceptance rate of 45% for full + short papers).
The proceedings of the conference are published by Springer in a book from the LNAI series. Hence, the papers are well-indexed in various publication databases such as DBLP, which is good.
Schedule of the conference
The conference was organized during a single day (November 18) and online using the Zoom platform. It started with an opening session, followed by a keynote talk by Prof. Rapeepan Pitakaso from Thailand about “Artificial Multiple Intelligence System (AMIS) and its Applications”. Then, there was paper presentations organized as parallel sessions during the rest of the day.
Opening
During the opening, the organizers talked about the program, the review process, the organization, etc. It was nice to see several people that I knew already from MIWAI 2019. The organizers are very friendly and professional.
It was announced that next year, MIWAI 2023 will be in Hyderabad, India. The submission deadline is planned for 10th January 2023.
Then, MIWAI 2024 will be in Chongqing, China.
Awards were also announced. I am please that the best paper award was given to “LCIM: Mining Low Cost High Utility Itemsets“, which is my paper. There was also another paper who received an award.
Keynote talk
There was a keynote talk “AMIS – Artificial Multiple Intelligence System: Theory and Application” by Prof. Rapeepan Pitakaso from Ubon Ratchathani Unviersity, Thailand. The talk was about a system called Artificial Multiple Intelligent System (AMIS), that aims to combine multiple types of intelligence, just like humans do (verbal, linguistic intelligences, etc.). The system is based on heuristic algorithms and also combines CNN (Convolutional Neural Networks) and other techniques. Here are some pictures and slides from this presentation.
Papers on pattern mining
As readers of this blog knows, I am interested in the field of pattern mining. At the conference, there was two papers about pattern mining (my papers):
5121
Philippe Fournier-Viger, M. Saqib Nawaz, Yulin He, Youxi Wu, Farid Nouioua and Unil Yun
A good thing about this conference is that the registration fee is cheap. For one paper, it costs 250$ USD and for two papers, I spent only 350$ USD. This is much cheaper than many other conferences published by Springer and also IEEE. For example, an alternative that I considered would have been to publish in some European conferences published by Springer such as DAWAK or DEXA but registering two papers for those conferences would have cost me over 1240 euros instead of 350$ USD! This is a big difference. There are also many IEEE top conferences that are very expensive and have an increasing price in recent years. For example, this year, IEEE ICDM has a registration price of 1300 USD, while the price 10 years ago was about only 500$ USD. Thus, I appreciate that MIWAI offers registration at a very reasonable price. I think it can allow researchers from all around the world to more easily publish their papers, especially when research funding is limited.
Conclusion
MIWAI 2022 was a good conference. It is not a very large conference but there are some interesting papers of good quality, and the conference is well-organized. I will try to attend again for MIWAI 2023. Later this month, I will talk to you about the MEDI 2022 and ICDM 2022 conference.
This is a short post about how to change the language of Tableau if the option is not available in the Help Menu.
Close Tableau
On Windows, open Regedit
Search for “Language code” to find the keys containing the languages options of Tableau.
Change the three following keys: “Language code”, “Repository language” and “SamplesLanguage” to your preferred language. For example “en_US” is for English and “zh_CN” is for Chinese.
5. Start Tableau again and the language will have been changed.
Today, I will talk about some upcoming feature for the next release of SPMF (2.58), which is a simple integrated text editor. This new release should happen in about 1 or 2 weeks (as I am very busy recently) and will contain some new algorithms. But also, it will contain an integrated text editor. This may seem very strange? Why a text editor? I will explain briefly the idea in this blog post to give an overview of this feature as I am testing it now. If you have comments, you may leave them in the comment section below
Why a text editor in SPMF?
Well, in previous version of SPMF, there was a few options to open the output files produced by the algorithms: using the system’s default text editor (e.g. Notepad on Windows) or using the Pattern Viewer tool of SPMF. Although using the system’s text editor can be good, I was thinking that having a customized text editor could be interesting and it is actually not difficult to implement. So, I will explain how it works below.
The SPMF text editor
In the next release of SPMF, there will be a new option to open the output file using the SPMF text editor instead of the default system text editor.
The SPMF text editor looks like this:
It has some useful features such as showing the line count and the line and column numbers in the status bar at the bottom. This bar can also be hidden.
Also a useful feature is to always highlight the current line (which is not done by NotePad, for example):
Besides, it is possible to activate a “night mode” from the menu:
And there is also a search bar that is very convenient for highlighting some keywords in a file, and works like the search bar in some Web browsers:
Also, there is of course the possibility of changing the font and the font size.
Besides, all the user-defined preferences are saved. So next time that you open SPMF, the same font, font size, night mode preference, window size and location, and other preferences of the SPMF text editor are kept.
The SPMF text editor can also display the size of the last file that is opened:
And here is an overview of the menus:
It has the basic important functions such as “Line wrap” and “Word wrap”.
Conclusion
Interesting? You will be able to try it in the next version of SPMF to be released soon.
For now, the features are quite simple because the aim is to provide an alternative to the system text editor specialized for opening the output files of SPMF. It is not designed to compete with a more complex text editor and to be used as a general-purpose text editor (although it could).
A limitation of the SPMF text editor is that text files are loaded in memory so it is restricted to opening files that are not too big.
Many other features could be added like highlighting keywords in the output file. So there is something to think about. Which features would be useful? You may let me know what you think. If some features are not too hard to implement and useful, I may add them. I will also do some more debugging before it can be released.
If you have any suggestions or ideas, you can let me know in the comment section or by e-mail at philfv AT qq.com
In this blog post, I will talk about the IEEE DSAA 2022 conference, which was held from the 13th to the 16th October 2022.
What is IEEE DSAA?
DSAA 2022 is the 9th edition of the IEEE International Conference on Data Science and Advanced Analytics (DSAA). DSAA is an international conference that has been held in many countries, and focuses on data mining, data science, big data, machine learning and relatedtopics.
DSAA is a relatively young conference compared to some top data mining and machine learning conferences, but DSAA has become more and more successful over the years.
Location
This year, the DSAA conference was planned to be held in the city of Shenzhen, China. But due to the COVID-19 pandemic, the conference was held in online mode (local organization by Shenzhen University), and using Zoom as online videoconferencing platform.
Proceedings of DSAA 2022
The proceedings of DSAA are published by IEEE.
Authors could submit a paper to the main track of DSAA. But besides that, the DSAA conference hosts several special sessions on emerging topics. It is interesting for authors that papers accepted in the special sessions are published as regular papers in the proceedings of DSAA. This year, I have been co-organizer of a special session called DSSBA (1st Special Session on Data Science for Social and Behavioral Analytics), that has been quite successful with 5 papers accepted (more on that later).
Another interesting aspect about DSAA is that there are two special issues, respectively organized in the International Journal on Data Science and Analytics (JDSA) journal and the Machine Learning Journal (MLJ). Authors could submit papers to these special issues and then present the articles at the conference.
Keynote speaker from the main conference
This year, there was a good line up of keynote speakers:
Conference opening
The conference opening was on the 13th October. Several interesting information were given about the DSAA conference. Here are some slides from the opening:
Country distribution for application track:
Country distribution for the research track:
The paper acceptance rate statistics:
The main topics of papers published in DSAA:
Some awards were given with some of them receiving a cash prize of 1000$ USD.
A keynote talk was given in this special session by Prof. Yun Sing Koh from the University of Auckland, New Zealand. She presented some of her latest research work related to machine learning to tackle environmental science challenges. In particular, she presented two recent research projects published in the Machine Learning journal and in AAAI 2022, which are about air quality index inference and about algal bloom monitoring, respectively. Below, I share a few slides from her talk:
For more details about these two research projects, you can see these two papers:
Olivier Graffeuille, Yun Sing Koh, Jörg Wicker, Moritz K. Lehmann: Semi-supervised Conditional Density Estimation with Wasserstein Laplacian Regularisation. AAAI 2022: 6746-6754
Ben Halstead, Yun Sing Koh, Patricia Riddle, Russel Pears, Mykola Pechenizkiy, Albert Bifet, Gustavo Olivares, Guy Coulson: Analyzing and repairing concept drift adaptation in data stream classification. Mach. Learn. 111(10): 3489-3523 (2022)
In the DSSBA special session, we also had four paper presentations, where the last two papers are about pattern mining.
SA-FGDEM: A Self-adaptive E-Learning Performance Prediction Model Wang, Liping; Ye, Mingtao; Zhang, Guodao; Sheng, Xin; Zhang, Jingran
Heterogeneous Drift Learning: Classification of Mix-Attribute Data with Concept Drifts Zhao, Lang; Zhang, Yiqun; Ji, Yuzhu; Zeng, An; Gu, Fangqing; Luo, Xiaopeng
Fast Mining RFM Patterns for Behavioral Analytics Wan, Shicheng; Deng, Jieyin; Chen, Jiahui; Gan, Wensheng; Yu, Philip S
There was also several interesting paper presentations at the conference and talks but due to my busy schedule, I was not able to attend many of them. An interesting paper that I saw related to periodic pattern mining is:
Discovering Periodicity in Locally Repeating Patterns Alfred Krzywicki (University of Adelaide); Ashesh Mahidadia (Rich Data Corporation); Michael Bain (University of New South Wales)
Conclusion
Overall, the conference has been very interesting and well-organized. I will try to participate again to IEEE DSAA, next year.
This is a short blog post to let you know that a new version of the SPMF data mining library has been released (version 2.55) with 10 new algorithms for pattern mining. SPMF is by far the most complete library for pattern mining and can be used from various languages. Thanks again to all contributors and users who made this project a success and supported it through the years.
If you are a researcher and would like your algorithms to be included in SPMF to provide more visibility to your work, feel free to send me an e-mail.
In this blog post, I provide instructions about how to update the Cloudera VM, even though the CentOS version used by Cloudera has reached the end-of-life, and the Cloudera VM is no longer updated. I have spent quite a lot of time on this and finally found how to do based on various sources from the Internet.
4. Next, go back to the terminal to type the following commands:
sudo yum-config-manager –disable epel
sudo yum-config-manager –disable cloudera-cdh5
sudo yum-config-manager –disable cloudera-manager
5. Type this command in the terminal to open the configuration file of Yum:
sudo gedit /etc/yum.conf
6.Add this entry at the end of the [main] section:
sslverify=false
Then, save the file and close the application.
7. After that, we will type some additional commands to update and install additional software. During this step, you will be asked several times if you want to continue. When this happens, press “y” and “enter”. The commands are:
sudo yum update ca-certificates
sudo yum update
sudo yum install epel-release
sudo yum –enablerepo=extras install epel-release
8. Let’s say we want to install some Python libraries “mathplotlib” and “imaging”:
sudo yum install python-matplotlib
sudo yum install python-imaging
We can also install PIP:
sudo yum install -y –enablerepo=”epel” python-pip
And we can also install other software like LibreOffice to use some software like Oocalc:
I have not written on the blog in the past few weeks. This is because I was quite busy and also it was the summer vacation. I had to temporarilly focus on other things. Today, I want to give you some quick news:
A new version of SPMF (v 2.55) is under preparation. There will be about 10 new algorithms, including 5 high utility itemset mining algorithms and 3 episode mining algorithms. I am very excited to release this new version soon. If you have some algorithm implementation that you would like to include in SPMF, feel free to contact with me at philfv8 AT yahoo DOT COM.
You may also want to consider the BDA 2022 conference on Big Data Analytics for submitting your papers. It is an international conference held in India and published by Springer. The deadline is September 5th.
That is all for today. I will be back soon with more content for the blog and my Youtube channel. Thanks for reading! 🙂