Recent Releases of https://github.com/cn-upb/deepcomp
https://github.com/cn-upb/deepcomp - deepcomp 1.4.2
DeepCoMP is now accepted for publication in the 2023 IEEE Transaction on Network and Service Management (TNSM) as "Multi-Agent Deep Reinforcement Learning for Coordinated Multipoint in Mobile Networks" 🎉
- Updated Readme
- Fix dependencies for correct installation: Pin
protobufandpydantic
- Python
Published by stefanbschneider almost 3 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.4.1
- Enable
'avg'reward aggregation for DeepCoMP by default (wassum) - Add
--debugCLI option for running in a debugger
- Python
Published by stefanbschneider over 4 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.4.0
Improvements regarding utility functions: * Use constants to define the max and min utility, which are then applied for normalization, reward clipping, and rendering * Support two additional utility functions (in addition to log): Linear (ie, just data rate) and step function. * Configurable via CLI. But: Requires manual adjustment of MINUTILITY and MAXUTILITY
Full Changelog: https://github.com/CN-UPB/DeepCoMP/compare/v1.3.0...v1.4.0
- Python
Published by stefanbschneider over 4 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.3.0
- Two, configurable heuristics: Dynamic and static
- Configurable dynamic UE arrival and departure over time
- Changed reward function for multi-agent: Weighted avg. QoE over all cells in range (based on their connected UEs)
- Added observation for multi-agent: Avg. QoE of connected UEs at each cell
- Multiple smaller changes, fixes, eg, upgrade to Ray 1.4
- Python
Published by stefanbschneider almost 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.2.5
Updated Docker support: https://hub.docker.com/r/stefanbschneider/deepcomp
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.2.4
- Critical bug fix in CLI
- Improved formatting
- Docker support
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.2.3
Another small fix in the Readme
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.2.2
Minor fix in readme for PyPi
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.2.1
Minor fixes in Readme and rendered video.
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.2.0
- Much improved and extended render function for nicer visualization. New
--dashboardmode, new icons, metrics, etcs. - New, restructured CLI args
- Clean up issues (with DeepSource)
- Minor other changes
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - deepcomp 1.1.0
- Update to
ray 1.2 - New CLI features, eg, for multi-node cluster, simplified videos, etc.
- Update readme, setup, license
- Python
Published by stefanbschneider about 5 years ago
https://github.com/cn-upb/deepcomp - PyPi Release
Release of deepcomp package on PyPi. Install via
pip install deepcomp
Functionally equivalent to v1.0. Now using semantic versioning for new releases.
- Python
Published by stefanbschneider over 5 years ago
https://github.com/cn-upb/deepcomp - Major release v1.0
Major release of DeepCoMP, DD-CoMP, and D3-CoMP
- Python
Published by stefanbschneider over 5 years ago
https://github.com/cn-upb/deepcomp - Cooperative Multi-Agent
- New observation space with better normalization improving performance of both central and multi agent PPO
- Extra observations and new reward function for multi agent PPO to learn non-greedy, cooperative & fair behavior, taking other UEs into account
- Support for continuous instead of episodic training
- Refactoring, fixes, improvements
Details: v0.10 details
- Python
Published by stefanbschneider over 5 years ago
https://github.com/cn-upb/deepcomp - Preparation for Evaluation
- New variants for observation (components, normalization, ...) and reward (utility function and penalties)
- New larger scenario and adjusted rendering
- New utility scripts for evaluation: Running experiments and visualzing results
- Bug fixes and refactoring
- Default radio model is resource-fair again (more stable than proportional-fair)
Details: v0.9 details
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - Proportional-fair sharing, Heuristic baselines, Improved Env
- Support for proportional-fair sharing (new default)
- 2 new greedy heuristic algorithms as baselines
- New default UE movement: Random waypoint
- New default UE utility: Log function with increasing data rate
- Improved and refactored environment and model
Details: v0.8 details
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - Larger Environment, CLI support
- Larger environment with 3 BS and 4 moving UEs.
- Extra observation (optional) showing number of connected UEs per BS. To help learn balancing connections. Seems not to be very useful.
- Improved visualization
- Improved install. Added CLI support.
Details: v0.7 details
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - Multi-agent RL
- Support for multi-agent RL: Each UE is trained by its own RL agent
- Currently, all agents share the same RL algorithm and NN
- Already with 2 UEs, multi-agent leads to better results more quickly than a central agent
Details: v0.6 details
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - Improved radio model and observations
- Improved radio model: Configurable sharing/fairness models for multiple UEs connected to a BS. New default: Rate-fair sharing.
- Improved observations: Extra observation indicating the current total data rate of each UE combined over all its connections (normalized)
- New penalty for losing connection rather than disconnecting actively
- Many smaller improvements and adjustments
Details: v0.5 details
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - RLlib
- Replaced
stable_baselineswith ray's RLlib, which is more powerful and supports multi-agent RL - Major refactoring of most code
- No changes in radio model or MDP
Details: MDP description
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - 2ue-2bs-central-agent-simple-radio
- Multiple moving UEs
- Controlled by single, centralized agent that sees combined observations and takes combined actions for all UEs
- Updated radio model: Split rate among connected UEs, allow connecting from farther away, data rate of connections adds up
Details: MDP description
- Python
Published by stefanbschneider almost 6 years ago
https://github.com/cn-upb/deepcomp - 1ue-2bs-simple-radio
- Simplest case works: Just 1 moving UE, 2+ fixed basestation.
- Simple radio model for calculating SNR based on distance to BS. Then calculate achievable data rate from SNR. Interference supported but currently disabled. No schedules yet.
- Advanced observation space using clipping and normalization (configurable)
- Baseline RL algorithm (PPO) learns to connect the UE to always at least one BS as it moves
Details: MDP description
- Python
Published by stefanbschneider about 6 years ago
https://github.com/cn-upb/deepcomp - 1ue-2bs-no-radio
- Simplest case works: Just 1 moving UE, 2+ fixed basestation.
- No radio/wireless model implemented yet. Instead, each BS just has a fixed radius of coverage.
- Baseline RL algorithm (PPO) learns to connect the UE to always at least one BS as it moves
- Python
Published by stefanbschneider about 6 years ago