aamas-2023-fcql

A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.

https://github.com/johnhostetter/aamas-2023-fcql

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: researchgate.net, zenodo.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.7%) to scientific vocabulary

Last synced: 7 months ago · JSON representation ·

Repository

A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.

Basic Info

Host: GitHub
Owner: johnHostetter
License: mit
Language: Jupyter Notebook
Default Branch: main
Homepage:
Size: 72.3 KB

Statistics

Stars: 9
Watchers: 2
Forks: 2
Open Issues: 0
Releases: 1

Created about 3 years ago · Last pushed almost 3 years ago

Metadata Files

Readme License Citation

A Self-Organizing Neuro-Fuzzy Q-Network: Systematic Design with Offline Hybrid Learning

TLDR: A systematic design process for a self-organizing neuro-fuzzy Q-network by using two learning paradigms in tandem: unsupervised learning and an offline, model-free fuzzy RL algorithm called Fuzzy Conservative Q-learning (FCQL).

Abstract: We propose a systematic design process for automatically generating self-organizing neuro-fuzzy Q-networks by leveraging unsupervised learning and an offline, model-free fuzzy reinforcement learning algorithm called Fuzzy Conservative Q-learning (FCQL). Our FCQL offers more effective and interpretable policies than deep neural networks, facilitating human-in-the-loop design and explainability. The effectiveness of FCQL is empirically demonstrated in Cart Pole and in an Intelligent Tutoring System that teaches probability principles to real humans.

overview

What is the contribution? The primary novelty and contribution of this work is the proposal of a systematic design process for a neuro-fuzzy Q-network that works with a model-free offline fuzzy RL algorithm. To our knowledge, existing methods within fuzzy RL are either developed for online interaction due to their dependency on exploration, are unable to accommodate for distributional shift, or rely upon the existence of a simulation; furthermore, the only existing work capable of automatic design for offline fuzzy RL is fuzzy particle swarm RL, but this requires a simulation of real system dynamics that may be impractical for many real-world complex applications such as e-learning and healthcare.

A Demonstration using Cart Pole

The systematic design process can run CLIP and ECM in parallel, but we will first look at CLIP.

clip_highlighted

What is "CLIP"? CLIP stands for "Categorical Learning Induced Partitioning" and is one way of generating Gaussian fuzzy sets dynamically and iteratively. It is inspired by human's behavior category learning and automatically discovers the appropriate number of fuzzy sets within each input dimension. Fuzzy sets must be defined in order to create a neuro-fuzzy Q-networks' knowledge base.

https://github.com/johnHostetter/AAMAS-2023-FCQL/assets/35469358/9388a511-e850-42eb-af4c-297dc2170ef8

Now, we will look at ECM.

ecm_highlighted

What is "ECM"? ECM is the "Evolving Clustering Method". It was originally proposed with the intention of immediately using the resulting clusters as the fuzzy premise for fuzzy logic rules, resulting in approximate fuzzy logic controllers. Aapproximate fuzzy logic controllers are less interpretable as their semantics are local and they share no overlapping linguistic terms between the rules. Here, we have modified how its output is used. Specifically, we use the identified centers of the clusters produced by ECM as the exemplars. Exemplary data is the most "noteworthy" or "interesting" moments. As such, they are candidates for fuzzy logic rules, as it is in our interest to map exemplars to actions' Q-values.

https://github.com/johnHostetter/AAMAS-2023-FCQL/assets/35469358/6ef5fbd6-37ec-4662-8e87-17ceaa54fbee

The Gaussian fuzzy sets produced by CLIP and the candidate exemplary data from ECM are then given to the widely-used and famous Wang-Mendel Method for fuzzy logic rule generation.

wm_highlighted

What is the "Wang-Mendel Method"? The Wang-Mendel Method is a common and famous technique for producing fuzzy logic rules in a very simple and straightforward way. That said, it is widely adopted not necessarily due to its effectiveness or efficiency, but because of its simplicity in implementation. Simply put, each data point is examined to see how strongly it belongs to each fuzzy set, and fuzzy rules are generated from the fuzzy sets with a maximum degree of membership. We used the Wang-Mendel Method to generate textual descriptions of the state environments.

https://github.com/johnHostetter/AAMAS-2023-FCQL/assets/35469358/e425b298-9cd2-4140-9bb4-d7613787903d

After completing the Wang-Mendel Method, each fuzzy logic rule only contains fuzzy premises in the form of textual descriptions, but have no associated Q-values for the available actions. These must be learned with FCQL.

fcql_highlighted

What is FCQL? FCQL is primarily based on Fuzzy Q-Learning, which treats a fuzzy logic rule as a “state” in the environment —similar to how Tabular Q-Learning behaves —and trains Q-values for each of the rule’s possible actions. To prevent overestimating Q-values in the offline setting, we use the updated formula proposed in Conservative Q-Learning (CQL).

formula

Did this paper or code help you? Please consider citing the paper or code. Your support helps ensure the authors can continue researching efficient and robust systematic design of neuro-fuzzy Q-networks!

Would you like to contact us? If you wish to reach out to the authors, we would love to hear from you and discuss neuro-fuzzy networks with you! Please reach us at: jwhostet@ncsu.edu

Stay tuned for code updates! :smile:

If you have any issues with the code, please report them via the Issues tab.

Owner

Name: John Wesley Hostetter
Login: johnHostetter
Kind: user
Location: Raleigh, North Carolina

Repositories: 1
Profile: https://github.com/johnHostetter

Computer Science PhD student with a research interest in fuzzy logic and its applications.

Citation (CITATION.cff)

cff-version: 1.1.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Hostetter
    given-names: John
    orcid: https://orcid.org/0000-0002-7008-7493
title: johnHostetter/AAMAS-2023-FCQL: First release
version: v1.0.0
date-released: 2023-02-22

GitHub Events

Total

Watch event: 2

Last Year

Watch event: 2

Committers

Last synced: 8 months ago

All Time

Total Commits: 7
Total Committers: 1
Avg Commits per committer: 7.0
Development Distribution Score (DDS): 0.0

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
John Wesley Hostetter	3****r	7

Issues and Pull Requests

Last synced: 8 months ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

aamas-2023-fcql

Science Score: 54.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

A Self-Organizing Neuro-Fuzzy Q-Network: Systematic Design with Offline Hybrid Learning

A Demonstration using Cart Pole

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels