https://github.com/amlalejini/gecco-2024-phylogeny-informed-subsampling
Repository associated with 2024 GECCO paper submission.
https://github.com/amlalejini/gecco-2024-phylogeny-informed-subsampling
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 3 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.3%) to scientific vocabulary
Keywords
Repository
Repository associated with 2024 GECCO paper submission.
Basic Info
- Host: GitHub
- Owner: amlalejini
- License: mit
- Language: HTML
- Default Branch: main
- Homepage: https://lalejini.com/GECCO-2024-phylogeny-informed-subsampling/bookdown/book/
- Size: 14.7 MB
Statistics
- Stars: 3
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 1
Topics
Metadata Files
README.md
Phylogeny-informed subsampling
Overview
Abstract
Phylogenies (ancestry trees) tell the evolutionary history of an evolving population. In evolutionary computing, phylogenies reveal how evolutionary algorithms steer populations through a search space by illuminating the step-by-step evolution of solutions. To date, phylogenetic analyses have almost exclusively been applied in post-hoc analyses of evolutionary algorithms for performance tuning and research. Here, we apply phylogenetic information at runtime to augment parent selection procedures that use training sets to assess candidate solution quality. We propose phylogeny-informed fitness estimation, thinning a fraction of costly training case evaluations by substituting the fitness profiles of near relatives as a heuristic estimate. We evaluate phylogeny-informed fitness estimation in the context of the down-sampled lexicase and cohort lexicase selection algorithms on two diagnostic analyses and four genetic programming (GP) problems. Our results indicate that phylogeny-informed fitness estimation can mitigate the drawbacks of down-sampled lexicase, improving diversity maintenance and search space exploration. However, the extent to which phylogeny-informed fitness estimation improves problem-solving success for GP varies by problem, subsampling method, and subsampling level. This work serves as an initial step toward improving evolutionary algorithms by exploiting runtime phylogenetic analysis.
Repository guide
docs/contains supplemental documentation for our methods.experiments/contains HPC job submission scripts, configuration files, and data analyses for all experiments.include/contains C++ implementations of experiment software (header only).scripts/contains generically useful scripts used in this work.source/contains .cpp files that can be compiled to run our experiments.
Owner
- Name: Alex Lalejini
- Login: amlalejini
- Kind: user
- Location: Grand Rapids, MI
- Company: Grand Valley State University
- Website: https://lalejini.com
- Twitter: amlalejini
- Repositories: 98
- Profile: https://github.com/amlalejini
Assistant Professor @ Grand Valley State University
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 12 months ago
All Time
- Total issues: 0
- Total pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: about 20 hours
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 1.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- mmore500 (2)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- numpy ==1.24.3
- pandas ==2.0.1
- python-dateutil ==2.8.2
- pytz ==2023.3
- pyvarco ==1.0.0
- six ==1.16.0
- style ==1.1.0
- tzdata ==2023.3
- update ==0.0.1