rat

🐀 Regular Expression Denial of Service analyzer.

https://github.com/phreppo/rat

Keywords

denial-of-service redos redos-checker redos-detector regular-expressions

Repository

Basic Info

Host: GitHub
Owner: phreppo
License: mit
Language: OCaml
Default Branch: master
Homepage:
Size: 136 KB

Statistics

Stars: 25
Watchers: 4
Forks: 2
Open Issues: 0
Releases: 0

Created about 4 years ago · Last pushed over 2 years ago

rat - ReDoS Abstract Tester 🐁

rat is a sound tool to detect exponential Regular Expression Denial of Service (ReDoS) attacks. Since the algorithm is proved to be sound, the tool cannot produce false negatives: if rat determines that a regular expression is safe, an attacker cannot use it to trigger an exponential ReDoS. Furthermore, rat takes advantage of efficient data structures to be particularly fast.

The paper is available here. The preprint PDF version is here.

Build, Install and Run

To compile the project you need opam installed, with a version of the OCaml compiler >= 4.08.0.

```bash
make deps     # Install the dependencies.
make          # Build the project.
make test     # Run the tests.
make install  # Install the executable.
```

To run rat without installing it:

```bash
dune exec rat -- <args>
```

Run with --help to print the help message. If there are no arguments, the command runs the interactive interpreter.

To run rat inside Docker:

```bash
# Build (you just need to do this once)
docker build . -t rat:latest

# Run rat in interactive mode
docker run -it rat:latest

# Example of running rat with arguments
docker run rat:latest rat --regex '(a|a)*b'
```

Example Analysis

Assume we have rat installed on the machine. The following command analyzes the regular expression (a|a)*b:

```bash
rat --regex '(a|a)*b'
```

The output should be similar to:

bash ~ ⟦🐁⟧ ~ Exponential ReDoS: true Exploit string: { prefix = '' pump = 'a' suffix = '' } Example: aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa Runtime(ms): 0.399

The analyzer reports that the regular expression is vulnerable to a ReDoS attack and prints an exploit string. The reported runtime covers only the analysis itself; it excludes the time needed to create a printable exploit string. Building that string requires an exponential algorithm (constructing the minimal DFA that recognizes the attack language), so it can take a long time. The implementation of this step can be improved considerably, and building the exploit string is not a necessary part of the analysis.
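As a rough illustration of how the reported exploit string could be turned into concrete attack inputs, here is a minimal Python sketch. It assumes the attack word is the prefix, followed by the pump repeated some number of times, followed by the suffix, and that the output contains fields shaped like the sample above. `parse_exploit` and `build_attack` are hypothetical helpers, not part of rat, and the exact text printed by rat may differ.

```python
import re

# Hypothetical helper pattern: picks up fields such as  pump = 'a'.
# Assumption: values are single-quoted and contain no single quotes.
FIELD = re.compile(r"(prefix|pump|suffix)\s*=\s*'([^']*)'")


def parse_exploit(output: str) -> dict:
    """Extract prefix, pump and suffix from rat-like output text."""
    fields = dict(FIELD.findall(output))
    missing = {"prefix", "pump", "suffix"} - fields.keys()
    if missing:
        raise ValueError(f"missing fields: {sorted(missing)}")
    return fields


def build_attack(exploit: dict, repetitions: int) -> str:
    """Concatenate prefix, the pump repeated `repetitions` times, and suffix."""
    return exploit["prefix"] + exploit["pump"] * repetitions + exploit["suffix"]


# Sample line copied from the example output in this README.
sample = "Exploit string: { prefix = '' pump = 'a' suffix = '' }"
exploit = parse_exploit(sample)
attack = build_attack(exploit, 16)
print(attack)
```

When trying such a string against a real matching engine, it is prudent to increase `repetitions` gradually, since matching time may grow very quickly.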

Analysis Options

The option --semantics sets the semantics of the matching. If you are not sure, use the default, which is match. With match, the analyzer assumes that a regular expression matches an input string as soon as a prefix of the string matches. This is the default behaviour of matching engines, and corresponds to the behaviour of the function match in the re module in Python. With fullmatch, the analyzer assumes that matching engines compute language membership, that is, the whole input string must match. This is the semantics of the fullmatch function in Python. If you read the paper about rat and want to try its examples, use --semantics fullmatch. In all other cases the match semantics is more appropriate.
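The difference between the two semantics can be seen directly in Python's re module. The pattern and input below are illustrative choices, not taken from rat or its paper.

```python
import re

pattern = r"a*b"
text = "aaabXYZ"

# match semantics: succeeds because the prefix "aaab" matches the pattern.
prefix_match = re.match(pattern, text)

# fullmatch semantics: fails because the trailing "XYZ" is not consumed.
full_match = re.fullmatch(pattern, text)

print(prefix_match.group(0) if prefix_match else None)
print(full_match)
```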

The option --show-lang prints the full attack language for a regular expression. Since this language is the specification of the attack words, it quickly becomes difficult to read.

Limitations

rat cannot analyze a wide variety of non-regular constructs in regular expressions, most notably backreferences and lookarounds. Some non-regular constructs, such as $, are supported in a limited form. Future improvements might fix this.
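Before handing a large set of regular expressions to the analyzer, it may help to flag the ones that use these constructs. The following is a rough heuristic sketch in Python, not something rat provides: `unsupported_constructs` is a hypothetical helper that assumes Python-style syntax for backreferences and lookarounds, and it does not account for character classes, so a pattern such as `[(?=]` would be flagged incorrectly.

```python
def unsupported_constructs(pattern: str) -> set:
    """Heuristically report backreferences and lookarounds in a pattern."""
    found = set()
    i = 0
    while i < len(pattern):
        if pattern[i] == "\\" and i + 1 < len(pattern):
            # An escape followed by a non-zero digit is a numbered backreference.
            if pattern[i + 1] in "123456789":
                found.add("backreference")
            i += 2  # Skip the escaped character so it is not re-examined.
            continue
        if pattern.startswith(("(?=", "(?!", "(?<=", "(?<!"), i):
            found.add("lookaround")
        elif pattern.startswith("(?P=", i):
            # Python-style named backreference.
            found.add("backreference")
        i += 1
    return found


print(unsupported_constructs(r"(a)\1"))
print(unsupported_constructs(r"(?=a)b"))
print(unsupported_constructs(r"(a|a)*b"))
```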

Benchmarking

If you want to benchmark rat against other detectors, please use an OCaml switch with flambda enabled. For instance, this can be done with the following:

```bash
# Create a switch with OCaml 4.14 and flambda.
opam switch create 4.14.0+flambda --package=ocaml-variants.4.14.0+options,ocaml-option-flambda

# Install dependencies, build, and install.
make deps
make
make install
```

Authors

Owner

Name: Francesco Parolini
Login: phreppo
Kind: user
Location: London, UK
Company: Lacework

Website: phreppo.github.io
Repositories: 1
Profile: https://github.com/phreppo

Making programs more secure (WIP)

Citation (CITATION.cff)

cff-version: 1.1.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Parolini
    given-names: Francesco
    orcid: https://orcid.org/0000-0002-1077-7812
  - family-names: Miné
    given-names: Antoine
    orcid: https://orcid.org/0000-0002-6375-3179
title: "rat - ReDoS Abstract Tester"
version: 0.1
date-released: 2022-05-04
url: "https://github.com/parof/rat"
