Lerche

Lerche: Generating data file processors in Julia from EBNF grammars - Published in JOSS (2021)

https://github.com/jamesrhester/lerche.jl

Science Score: 93.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 6 DOI reference(s) in README and JOSS metadata
✓
Academic publication links
Links to: joss.theoj.org, zenodo.org
○
Committers with academic emails
○
Institutional organization owner
✓
JOSS paper metadata
Published in Journal of Open Source Software

Scientific Fields

Biology Life Sciences - 40% confidence

Last synced: 9 months ago · JSON representation

Repository

A Julia port of the Lark parser

Basic Info

Host: GitHub
Owner: jamesrhester
License: mit
Language: Julia
Default Branch: master
Size: 393 KB

Statistics

Stars: 49
Watchers: 4
Forks: 4
Open Issues: 9
Releases: 15

Created over 6 years ago · Last pushed 10 months ago

Metadata Files

Readme License

Introduction

Lerche (German for Lark) is a partial port of the Lark grammar processor from Python to Julia. Lark grammars should work unchanged in Lerche.

Installation: at the Julia REPL, using Pkg; Pkg.add("Lerche")

Documentation:

Quick start

Citation

If you are publishing work where Lerche has been useful, please consider citing the Lerche paper.

Issues

Please raise any issues or problems with using Lerche in the Github issue tracker.

Contributions

Contributions of all types are welcome. Examples include: * Improvements to processing speed * Improved documentation * Links to projects using Lerche * Commenting and triaging issues

The most straightforward way to make a contribution is to fork the repository, make your changes, and create a pull request.

Notes for Lark users

Please read the Lark documentation. When converting from Lark programs written in Python to Lerche programs written in Julia, the changes outlined below are necessary.

All Transformer and Visitor classes become subtypes of Transformer/Visitor
All class method calls become Julia method calls with an instance of the type as the first argument (i.e. replacing self)
Transformation or visitor rules should be preceded by the @rule macro. Inline rules use the @inline_rule macro and token processing methods use @terminal.
The first argument of transformer and visitor rules is a variable of the desired transformer/visitor type.
Any grammars containing backslash-double quote sequences need to be fixed (see below).
Any grammars containing backslash-x to denote a byte value need to be fixed (see below).

Inconsistencies with Lark

Earley and CYK grammars are not implemented.
Dynamic lexer is not implemented.
All errors with messages attached must be at the bottom of the exception type hierarchy, as these are the only types that can have contents. Thus an UnexpectedInput exception must become e.g an UnexpectedCharacter exception if a message is included.
The PuppetParser invoked when there is a parse error is not yet functional
There may be issues with correctly interpreting import paths to find imported grammars: please raise an issue if this happens.
No choice of regex engine, Tree structure or byte/string choices are available as they make no sense for Julia.

Implementation notes and hints

Lerche is currently based off Lark 0.11.1. The priority has been on maintaining fidelity with Lark. For example, global regex flags which are integers in Lark are still integers in Lerche, which means you will need to look their values up. This may be changed to a more Julian approach in future.

The @rule and @inline_rule macros define methods of Lerche function transformer_func. Julia multiple dispatch is used to select the appropriate method at runtime. @terminal similarly defines methods of token_func.

Parsing a large (500K) file suggest Lerche is about 3 times faster than Lark with CPython for parsing. Parser generation is much slower as no optimisation techniques have been applied (yet). Calculating and storing your grammar in a Julia const variable at the top level of your package will allow it to be precompiled and thus avoid grammar re-analysis each time your package is loaded.

Owner

Name: James Hester
Login: jamesrhester
Kind: user

Repositories: 17
Profile: https://github.com/jamesrhester

JOSS Publication

Lerche: Generating data file processors in Julia from EBNF grammars

Published

August 24, 2021

DOI

10.21105/joss.03497

Volume 6, Issue 64, Page 3497

Authors

James R. Hester

Australian Nuclear Science and Technology Organisation, Sydney, Australia

Erez Shinan
Independent researcher

Editor

Sebastian Benthall

GitHub Events

Total

Release event: 2
Watch event: 2
Issue comment event: 6
Push event: 3
Create event: 2

Last Year

Release event: 2
Watch event: 2
Issue comment event: 6
Push event: 3
Create event: 2

Committers

Last synced: 10 months ago

All Time

Total Commits: 119
Total Committers: 4
Avg Commits per committer: 29.75
Development Distribution Score (DDS): 0.286

Past Year

Commits: 3
Committers: 2
Avg Commits per committer: 1.5
Development Distribution Score (DDS): 0.333

Top Committers

Name	Email	Commits
James.Hester	j**h@a**u	85
jamesrhester	j**r@g**m	30
Venkatesh Dayananda	v**h@j**m	3
GiggleLiu	c**9@g**m	1

Committer Domains (Top 20 + Academic)

juliacomputing.com: 1 ansto.gov.au: 1

Issues and Pull Requests

Last synced: 9 months ago

All Time

Total issues: 28
Total pull requests: 5
Average time to close issues: 2 months
Average time to close pull requests: 1 day
Total issue authors: 13
Total pull request authors: 4
Average comments per issue: 4.36
Average comments per pull request: 1.4
Merged pull requests: 3
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 1
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 1
Pull request authors: 0
Average comments per issue: 0.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

jamesrhester (10)
guyvdbroeck (5)
ziotom78 (2)
kskyten (2)
ArtHarg (1)
Amval (1)
willow-ahrens (1)
bilderbuchi (1)
stensmo (1)
GiggleLiu (1)
robertfeldt (1)
JuliaTagBot (1)
ddyok (1)

Pull Request Authors

GiggleLiu (2)
vdayanand (1)
erezsh (1)
danielskatz (1)

Top Labels

Issue Labels

enhancement (3) bug (2)

Pull Request Labels

Packages

Total packages: 3
Total downloads:
- julia 157 total

Total dependent packages: 8
(may contain duplicates)
Total dependent repositories: 0
(may contain duplicates)
Total versions: 47

proxy.golang.org: github.com/jamesrhester/lerche.jl

Documentation: https://pkg.go.dev/github.com/jamesrhester/lerche.jl#section-documentation
License: mit
Latest release: v0.5.7
published 10 months ago

Versions: 15
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 6.4%

Average: 6.7%

Dependent repos count: 6.9%

Last synced: 9 months ago

proxy.golang.org: github.com/jamesrhester/Lerche.jl

Documentation: https://pkg.go.dev/github.com/jamesrhester/Lerche.jl#section-documentation
License: mit
Latest release: v0.5.7
published 10 months ago

Versions: 15
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 6.4%

Average: 6.7%

Dependent repos count: 6.9%

Last synced: 9 months ago

juliahub.com: Lerche

A Julia port of the Lark parser

Documentation: https://docs.juliahub.com/General/Lerche/stable/
License: MIT
Latest release: 0.5.7
published 10 months ago

Versions: 17
Dependent Packages: 8
Dependent Repositories: 0
Downloads: 157 Total

Rankings

Dependent packages count: 7.0%

Dependent repos count: 9.9%

Average: 16.0%

Stargazers count: 18.9%

Forks count: 28.1%

Last synced: 9 months ago

Dependencies

.github/workflows/TagBot.yml actions

JuliaRegistries/TagBot v1 composite

.github/workflows/ci.yml actions

actions/checkout v2 composite
coverallsapp/github-action master composite
julia-actions/julia-buildpkg latest composite
julia-actions/julia-runtest latest composite
julia-actions/setup-julia latest composite
julia-actions/setup-julia v1 composite

Lerche

Science Score: 93.0%

Scientific Fields

Repository

Basic Info

Statistics

Metadata Files

README.md

Introduction

Quick start

Citation

Issues

Contributions

Notes for Lark users

Inconsistencies with Lark

Implementation notes and hints

Owner

JOSS Publication

Lerche: Generating data file processors in Julia from EBNF grammars

Authors

Editor

Tags

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

proxy.golang.org: github.com/jamesrhester/lerche.jl

Rankings

proxy.golang.org: github.com/jamesrhester/Lerche.jl

Rankings

juliahub.com: Lerche

Rankings

Dependencies