reinforcement-learning-based-prefetching-using-access-frequency-weighting
Pythia-AF is an enhanced reinforcement learning-based hardware prefetcher built on the Pythia framework. It introduces Access Frequency Weighting (AFW) to prioritize frequently accessed memory patterns—improving learning speed, reducing cache pollution, and saving bandwidth.
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: found CITATION.cff file
- ✓ codemeta.json file: found codemeta.json file
- ✓ .zenodo.json file: found .zenodo.json file
- ✓ DOI references: found 4 DOI reference(s) in README
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (12.6%) to scientific vocabulary
Repository
Basic Info
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Reinforcement-Learning-Based-Prefetching-Using-Access-Frequency-Weighting
Table of Contents
- What is Reinforcement Learning Based Prefetching Using Access Frequency Weighting?
- About the Framework
- Prerequisites
- Installation
- Preparing Traces
- Experimental Workflow
- HDL Implementation
- Code Walkthrough
- Citation
- License
- Contact
- Acknowledgements
What is Reinforcement Learning Based Prefetching Using Access Frequency Weighting?
This project introduces Pythia-AF, an enhanced version of the RL-based Pythia prefetcher. It applies Access Frequency Weighting (AFW) to guide prefetch decisions using the recurrence of memory access patterns. By modifying the reward function to prioritize frequent accesses, the prefetcher becomes more accurate, timely, and efficient—reducing cache pollution and memory bandwidth waste.
About the Framework
The project builds on the Pythia hardware prefetcher, a reinforcement learning-based system that dynamically learns how to issue prefetches based on observed memory behavior. Our contributions include:
1. Adding a lightweight counter vector to track offset frequency
2. Modifying the SARSA reward function to factor in frequency weighting
3. Achieving improved IPC, reduced LLC misses, and smarter prefetch-depth decisions
4. A total added hardware overhead of just 0.04 KB per core
Prerequisites
Make sure the following packages and tools are installed:
1. Ubuntu 24.04 (or a compatible Linux environment)
2. Oracle VirtualBox (for VM testing)
3. G++ 6.3.0
4. CMake 3.20.2
5. Perl 5.24.1 (make sure PERL5LIB is properly configured)
6. Python 3.x with:
   - pandas
   - matplotlib
Installation
1. Install necessary prerequisites
   ```bash
   sudo apt install perl
   ```
2. Clone the GitHub repo
   ```bash
   git clone https://github.com/CMU-SAFARI/Pythia.git
   ```
3. Clone the bloomfilter library inside the Pythia home directory
   ```bash
   cd Pythia
   git clone https://github.com/mavam/libbf.git libbf
   ```
4. Build the bloomfilter library. This should create the static `libbf.a` library inside the `build` directory.
   ```bash
   cd libbf
   mkdir build && cd build
   cmake ../
   make clean && make
   ```
5. Build Pythia for single/multi core using the build script. This should create the executable inside the `bin` directory. Please use `build_champsim_highcore.sh` to build ChampSim for more than four cores.
   ```bash
   cd $PYTHIA_HOME
   # ./build_champsim.sh <l1_pref> <l2_pref> <llc_pref> <ncores>
   ./build_champsim.sh multi multi no 1
   ```
6. Set appropriate environment variables as follows:
   ```bash
   source setvars.sh
   ```
Preparing Traces
Use the `download_traces.pl` perl script to download all traces, except Ligra and PARSEC.
```bash
mkdir $PYTHIA_HOME/traces/
cd $PYTHIA_HOME/scripts/
perl download_traces.pl --csv artifact_traces.csv --dir ../traces/
```
Note: the script should download 138 traces. Please check the final log for any incomplete downloads.
Once the trace download completes, please verify the checksums as follows. Make sure all traces pass the checksum test.
```bash
cd $PYTHIA_HOME/traces
md5sum -c ../scripts/artifact_traces.md5
```
Download the Ligra and PARSEC 2.1 traces from their respective repositories.
If the traces are downloaded to some other path, please update the full paths in `experiments/MICRO21_1C.tlist` and `experiments/MICRO21_4C.tlist` accordingly.
Experimental Workflow
Our experimental process is divided into two main stages: (1) running the experiments and (2) aggregating statistics from the results.
Launching Experiments
- To create the necessary experiment commands in bulk, we use `scripts/create_jobfile.pl`.
- `create_jobfile.pl` requires three arguments:
  - `exe`: the full path of the executable to run
  - `tlist`: contains trace definitions
  - `exp`: contains knobs of the experiments to run
- Create experiments as follows. Please make sure the paths used in the tlist and exp files are appropriate.
  ```bash
  cd $PYTHIA_HOME/experiments/
  perl ../scripts/create_jobfile.pl --exe $PYTHIA_HOME/bin/perceptron-multi-multi-no-ship-1core --tlist MICRO21_1C.tlist --exp MICRO21_1C.exp --local 1 > jobfile.sh
  ```
- Go to a run directory (or create one) inside `experiments` to launch runs in the following way:
  ```bash
  cd experiments_1C
  source ../jobfile.sh
  ```
- If your system uses Slurm to manage multiple jobs on a compute cluster, be sure to pass `--local 0` to `create_jobfile.pl`.
Rolling-up Statistics
- To aggregate statistics in bulk, use the `scripts/rollup.pl` script.
- The script requires the following three arguments:
  - `tlist`: defines the traces
  - `exp`: specifies experiment configurations
  - `mfile`: lists the statistic names and the method used for reduction
- Roll up statistics as follows. Make sure the file paths provided in the tlist and exp files are correct.
  ```bash
  cd experiments_1C/
  perl ../../scripts/rollup.pl --tlist ../MICRO21_1C.tlist --exp ../MICRO21_1C.exp --mfile ../rollup_1C_base_config.mfile > rollup.csv
  ```
- Open the `rollup.csv` file in your preferred data analysis tool (such as Python pandas, Excel, or Numbers) to explore and interpret the results.
HDL Implementation
Although our work primarily focused on simulation and performance evaluation, we kept hardware implementation in mind throughout. The enhancement we proposed—adding Access Frequency Weighting (AFW) to the Pythia prefetcher—requires only a tiny amount of additional hardware (about 0.04 KB per core). This was achieved using a simple counter vector built into Pythia's existing structures. The overall storage overhead remains very low at 25.5 KB per core, and synthesis results show the design would add just 1.03% area overhead on a desktop-class processor. This makes it a practical option for real-world hardware deployment with minimal changes to the original architecture.
Code Walkthrough
The enhanced version of Pythia integrates Access Frequency Weighting (AFW) into its reinforcement learning-based prefetcher. Here's a quick overview of the key component:
inc/feature_knowledge.h & src/feature_knowledge.cc
- These files are responsible for defining and processing program features that the reinforcement learning engine uses to make prefetching decisions. As part of our Access Frequency Weighting (AFW) enhancement, we modified these files to help track how often specific memory offsets are accessed.
- We introduced a mechanism that monitors access patterns and integrates frequency data as part of the reward calculation during training. This allows the prefetcher to prioritize memory addresses that are accessed more frequently, improving learning efficiency and reducing unnecessary prefetches.
- If you're looking to extend AFW or add new features, this is the place to do it. Start by updating the FeatureType enum in feature_knowledge.h, then implement the logic to extract or compute your new feature in feature_knowledge.cc.
Contact
Feel free to reach out if you have questions or ideas!
- Keerthana Palani – k.palani@student.fdu.edu
- Sai Ajith Reddy Bureddy – s.bureddy@student.fdu.edu
- Mengfei Liu – m.liu1@student.fdu.edu
- Xinyi Wang – x.wang7@student.fdu.edu
- Shuihan Jiang – s.jiang1@student.fdu.edu
Acknowledgements
This project was developed as part of the Capstone for the Master of Science in Applied Computer Science program at Fairleigh Dickinson University, Vancouver Campus. We’d like to sincerely thank:
- Our professors and mentors for their valuable guidance
- The creators of the Pythia Prefetching Framework
- Benchmark providers: SPEC 2017 and CloudSuite
- The open-source community for tools like Python, Pandas, and Matplotlib, which helped power our analysis and visualization
Owner
- Name: Keerthana Palani
- Login: KeerthanaPalani-1
- Kind: user
- Repositories: 1
- Profile: https://github.com/KeerthanaPalani-1
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use our framework to evaluate Pythia, please cite the work as follows."
authors:
- family-names: "Bera"
given-names: "Rahul"
- family-names: "Kanellopoulos"
given-names: "Konstantinos"
title: "Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning"
version: 1.3
doi: 10.5281/zenodo.5520125
date-released: 2021-09-22
url: "https://github.com/CMU-SAFARI/Pythia"
preferred-citation:
title: "Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning"
type: proceedings
year: 2021
conference:
name: "54th International Symposium on Microarchitecture (MICRO)"
authors:
- family-names: "Bera"
given-names: "Rahul"
- family-names: "Kanellopoulos"
given-names: "Konstantinos"
- family-names: "Nori"
given-names: "Anant V."
- family-names: "Shahroodi"
given-names: "Taha"
- family-names: "Subramoney"
given-names: "Sreenivas"
- family-names: "Mutlu"
given-names: "Onur"
doi: "10.1145/3466752.3480114"
GitHub Events
Total
- Create event: 4
Last Year
- Create event: 4
Dependencies
- ExtUtils::MakeMaker 0