https://github.com/critical-infrastructure-systems-lab/multi-objective-feature-selection
MatLab implementation of W-QEISS, F-QEISS and W-MOSS: three algorithms for the selection of (quasi) equally informative subsets
https://github.com/critical-infrastructure-systems-lab/multi-objective-feature-selection
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 4 DOI reference(s) in README -
✓Academic publication links
Links to: ieee.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary
Keywords
feature-selection
machine-learning
multi-objective-optimization
Last synced: 5 months ago
·
JSON representation
Repository
MatLab implementation of W-QEISS, F-QEISS and W-MOSS: three algorithms for the selection of (quasi) equally informative subsets
Basic Info
Statistics
- Stars: 29
- Watchers: 3
- Forks: 14
- Open Issues: 2
- Releases: 0
Topics
feature-selection
machine-learning
multi-objective-optimization
Created over 10 years ago
· Last pushed almost 3 years ago
Metadata Files
Readme
readme.txt
This toolbox implements the WQEISS/WMOSS/FQEISS feature selection techniques for classification problems presented in Karakaya et al. (2015) and Taormina et al. (2016).
This toolbox depends on the following contributions, which have to be donwloaded and appropriately referenced in the code.
1) Hanchuan Peng's Mutual Information computation toolbox
http://www.mathworks.com/matlabcentral/fileexchange/14888-mutual-information-computation
2)Yi Cao's Pareto-front toolbox
http://www.mathworks.com/matlabcentral/fileexchange/17251-pareto-front
3) Song Lin's NGPM toolbox for NSGA-II (Deb et al., 2002)
http://in.mathworks.com/matlabcentral/fileexchange/31166-ngpm-a-nsga-ii-program-in-matlab-v1-4
The scripts also support the Borg multi-objective algorithm, which was originally used for the experiments reported in Karakaya et al. (2015).
Interested users who want to employ Borg instead of NSGA-II are referred to http://borgmoea.org for the MATLAB files required by this package.
The NSGA-II version of the algorithms is illustrated in "script_example_NSGAII.m" for the "Heart" dataset of the UCI Repository (Lichman, 2013).
Users may refer to "script_example_BORG.m" for the equivalent version in Borg.
NOTE: Contrary to the experiments reported in Karakaya et al. (2015), this illustrative implementation features only one run for each algorithm on the chosen dataset. We suggest the user to run each algorithm several times, possibly using different randomizations of the employed dataset, in order to maximize the number of solutions returned by the methods and better assess the accuracy of the trained models. An overall Pareto-front should then be constructed from all the solutions returned by the multiple runs, making sure that the same value of accuracy is assigned to equal solutions (equal subsets) returned on different runs. This could be done by averaging the accuracies across the runs. For a fair comparison of the results of the three algorithms, it is also important that the same (average) accuracy is assigned for the same solutions returned by the different techniques.
*** UPDATE 08/2016: the W-QEISS algorithm for regression problems is described in
Taormina, R., Galelli, S., Karakaya, G., Ahipasaoglu, S.D., 2016. An information theoretic approach to select alternate subsets of predictors for data-driven hydrological models. J. Hydro. doi:10.1016/j.jhydrol.2016.07.045.
See script_example_BORG__REGRESSION and script_example_NSGA__REGRESSION ***
References:
Karakaya, G., Galelli, S., Ahipasaoglu, S.D., Taormina, R., 2015.
Identifying (Quasi) Equally Informative Subsets in Feature Selection Problems for Classification:
A Max-Relevance Min-Redundancy Approach.
IEEE Trans. Cybern. doi:10.1109/TCYB.2015.2444435
(available at http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7150365&tag=1)
Deb, K., Pratap, A., Agarwal, S., Meyarivan, T., 2002.
A Fast and Elitist Multiobjective Genetic Algorithm.
IEEE Trans. Evol. Comput. 6, 182–197. doi:10.1109/4235.996017
Hadka, D., Reed, P., 2012.
Borg: An Auto-Adaptive Many-Objective Evolutionary Computing Framework.
Evol. Comput. 21, 1–30. doi:10.1162/EVCO_a_00075
Lichman, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml].
Irvine, CA: University of California, School of Information and Computer Science.
Copyright 2015 Riccardo Taormina (riccardo_taormina@sutd.edu.sg), Gulsah Karakaya (gulsahkilickarakaya@gmail.com;), Stefano Galelli (stefano_galelli@sutd.edu.sg), and Selin Damla Ahipasaoglu (ahipasaoglu@sutd.edu.sg;.
This file is part of Matlab-Multi-objective-Feature-Selection.
Matlab-Multi-objective-Feature-Selection is free software: you can redistribute
it and/or modify it under the terms of the GNU General Public License
as published by the Free Software Foundation, either version 3 of the
License, or (at your option) any later version.
This code is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with Matlab-Multi-objective-Feature-Selection.
If not, see .
Owner
- Name: CRITICAL Infrastructure Systems Lab
- Login: Critical-Infrastructure-Systems-Lab
- Kind: organization
- Email: stefano_galelli@sutd.edu.sg
- Location: Singapore
- Website: https://people.sutd.edu.sg/~stefano_galelli/
- Twitter: GalelliStefano
- Repositories: 1
- Profile: https://github.com/Critical-Infrastructure-Systems-Lab
Connecting climate | water | energy