synfintabgen
SynFinTabGen: A package for generating synthetic financial tables.
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.2%) to scientific vocabulary
Repository
SynFinTabGen: A package for generating synthetic financial tables.
Basic Info
- Host: GitHub
- Owner: ethanbradley
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://ethanbradley.co.uk/research/synfintabs
- Size: 18.6 KB
Statistics
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
SynFinTabGen: Synthetic Financial Table Generator
A package for generating synthetic financial tables.
Quick Start
To generate a dataset of synthetic financial tables, create a generator and pass how many tables you would like.
```python3 from synfintabgen import DatasetGenerator
generator = DatasetGenerator()
generator(10) ```
The output directory defaults to dataset in the current working directory.
Configuration
You can configure the generator using the DatasetGeneratorConfig class.
```python3 from synfintabgen import DatasetGeneratorConfig
config = DatasetGeneratorConfig( datasetpath="my-datasets-dir", datasetname="my-dataset-name", documentwidth=745, documentheight=1503 )
generator = DatasetGenerator(config) ```
Note
Before the first use of this package, you'll need to make sure you have nltk words corpus downloaded. You can do this like so:
```python3 import nltk
nltk.download('words') ```
Citation
If you use this software, please cite both the article using the citation below and the software itself.
bib
@misc{bradley2024synfintabs,
title = {Syn{F}in{T}abs: A Dataset of Synthetic Financial Tables for Information and Table Extraction},
author = {Bradley, Ethan and Roman, Muhammad and Rafferty, Karen and Devereux, Barry},
year = {2024},
eprint = {2412.04262},
archivePrefix = {arXiv},
primaryClass = {cs.LG},
url = {https://arxiv.org/abs/2412.04262}
}
Owner
- Name: Ethan Bradley
- Login: ethanbradley
- Kind: user
- Location: Belfast, NI
- Company: Queen's University Belfast, @ewbb
- Website: ethanbradley.co.uk
- Twitter: ewbb_
- Repositories: 8
- Profile: https://github.com/ethanbradley
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite both the article from preferred-citation and the software itself."
type: software
authors:
- given-names: "Ethan"
family-names: "Bradley"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0002-3696-4253"
- given-names: "Muhammad"
family-names: "Roman"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0002-9035-2426"
- given-names: "Karen"
family-names: "Rafferty"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0002-7443-7876"
- given-names: "Barry"
family-names: "Devereux"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0003-2128-8632"
title: "SynFinTabGen"
version: 1.0.0
date-released: 2024-12-05
license: MIT
repository-code: "https://github.com/ethanbradley/synfintabgen"
preferred-citation:
type: generic
authors:
- given-names: "Ethan"
family-names: "Bradley"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0002-3696-4253"
- given-names: "Muhammad"
family-names: "Roman"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0002-9035-2426"
- given-names: "Karen"
family-names: "Rafferty"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0002-7443-7876"
- given-names: "Barry"
family-names: "Devereux"
affiliation: Queen's University Belfast
orcid: "https://orcid.org/0000-0003-2128-8632"
title: "SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction"
month: 12
year: 2024
url: "https://arxiv.org/abs/2412.04262"
GitHub Events
Total
- Release event: 1
- Watch event: 6
- Push event: 6
- Public event: 1
- Create event: 6
Last Year
- Release event: 1
- Watch event: 6
- Push event: 6
- Public event: 1
- Create event: 6
Packages
- Total packages: 1
-
Total downloads:
- pypi 27 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 6
- Total maintainers: 1
pypi.org: synfintabgen
A package for generating synthetic financial tables.
- Homepage: https://ethanbradley.co.uk/research/synfintabs
- Documentation: https://synfintabgen.readthedocs.io/
- License: MIT
-
Latest release: 1.0.1
published about 1 year ago