synfintabgen

SynFinTabGen: A package for generating synthetic financial tables.

https://github.com/ethanbradley/synfintabgen

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.2%) to scientific vocabulary
Last synced: 10 months ago

Repository

Statistics
  • Stars: 3
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files: Readme, License, Citation

README.md

SynFinTabGen: Synthetic Financial Table Generator

A package for generating synthetic financial tables.

Quick Start

To generate a dataset of synthetic financial tables, create a generator and call it with the number of tables you want.

```python3
from synfintabgen import DatasetGenerator

generator = DatasetGenerator()

generator(10)
```

The output directory defaults to `dataset` in the current working directory.
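After a run, it can be useful to confirm that files were actually written. The sketch below uses only the standard library and assumes nothing about the dataset's internal layout beyond the default `dataset` directory mentioned above. The helper name `summarise_output` is hypothetical and is not part of synfintabgen.

```python
from collections import Counter
from pathlib import Path


def summarise_output(dataset_dir="dataset"):
    """Count the files under dataset_dir, grouped by lower-cased extension.

    Hypothetical helper for illustration; not provided by synfintabgen.
    """
    root = Path(dataset_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"No dataset directory at {root.resolve()}")
    # Walk the tree recursively; files without an extension are counted under "".
    return Counter(p.suffix.lower() for p in root.rglob("*") if p.is_file())


# Example usage after running the generator:
#     for extension, count in sorted(summarise_output().items()):
#         print(extension or "(no extension)", count)
```

Passing a different path to `summarise_output` covers the case where the output directory has been changed through the configuration.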

Configuration

You can configure the generator using the `DatasetGeneratorConfig` class.

```python3
from synfintabgen import DatasetGenerator, DatasetGeneratorConfig

config = DatasetGeneratorConfig(
    datasetpath="my-datasets-dir",
    datasetname="my-dataset-name",
    documentwidth=745,
    documentheight=1503
)

generator = DatasetGenerator(config)
```

Note

Before using this package for the first time, make sure the NLTK `words` corpus is downloaded. You can do this as follows:

```python3
import nltk

nltk.download('words')
```
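If the download call sits in a script that runs repeatedly, one option is to look for the corpus first and download only when it is missing. The following is a minimal sketch using `nltk.data.find`, which raises `LookupError` when a resource cannot be located. The function name `ensure_words_corpus` is hypothetical and not part of synfintabgen or NLTK.

```python
def ensure_words_corpus():
    """Download the NLTK 'words' corpus only if it is not already available.

    Returns True if a download was triggered, False if the corpus was found.
    Hypothetical helper for illustration.
    """
    import nltk  # imported here so the module still loads if NLTK is absent

    try:
        # The corpus installed by nltk.download('words') lives under corpora/words.
        nltk.data.find("corpora/words")
        return False
    except LookupError:
        nltk.download("words")
        return True


# Example usage, before creating a DatasetGenerator:
#     ensure_words_corpus()
```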

Citation

If you use this software, please cite both the article using the citation below and the software itself.

```bib
@misc{bradley2024synfintabs,
  title         = {Syn{F}in{T}abs: A Dataset of Synthetic Financial Tables for Information and Table Extraction},
  author        = {Bradley, Ethan and Roman, Muhammad and Rafferty, Karen and Devereux, Barry},
  year          = {2024},
  eprint        = {2412.04262},
  archivePrefix = {arXiv},
  primaryClass  = {cs.LG},
  url           = {https://arxiv.org/abs/2412.04262}
}
```

Owner

  • Name: Ethan Bradley
  • Login: ethanbradley
  • Kind: user
  • Location: Belfast, NI
  • Company: Queen's University Belfast, @ewbb

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite both the article from preferred-citation and the software itself."
type: software
authors:
- given-names: "Ethan"
  family-names: "Bradley"
  affiliation: Queen's University Belfast
  orcid: "https://orcid.org/0000-0002-3696-4253"
- given-names: "Muhammad"
  family-names: "Roman"
  affiliation: Queen's University Belfast
  orcid: "https://orcid.org/0000-0002-9035-2426"
- given-names: "Karen"
  family-names: "Rafferty"
  affiliation: Queen's University Belfast
  orcid: "https://orcid.org/0000-0002-7443-7876"
- given-names: "Barry"
  family-names: "Devereux"
  affiliation: Queen's University Belfast
  orcid: "https://orcid.org/0000-0003-2128-8632"
title: "SynFinTabGen"
version: 1.0.0
date-released: 2024-12-05
license: MIT
repository-code: "https://github.com/ethanbradley/synfintabgen"
preferred-citation:
  type: generic
  authors:
  - given-names: "Ethan"
    family-names: "Bradley"
    affiliation: Queen's University Belfast
    orcid: "https://orcid.org/0000-0002-3696-4253"
  - given-names: "Muhammad"
    family-names: "Roman"
    affiliation: Queen's University Belfast
    orcid: "https://orcid.org/0000-0002-9035-2426"
  - given-names: "Karen"
    family-names: "Rafferty"
    affiliation: Queen's University Belfast
    orcid: "https://orcid.org/0000-0002-7443-7876"
  - given-names: "Barry"
    family-names: "Devereux"
    affiliation: Queen's University Belfast
    orcid: "https://orcid.org/0000-0003-2128-8632"
  title: "SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction"
  month: 12
  year: 2024
  url: "https://arxiv.org/abs/2412.04262"

GitHub Events

Total
  • Release event: 1
  • Watch event: 6
  • Push event: 6
  • Public event: 1
  • Create event: 6
Last Year
  • Release event: 1
  • Watch event: 6
  • Push event: 6
  • Public event: 1
  • Create event: 6

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 27 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 6
  • Total maintainers: 1
pypi.org: synfintabgen

A package for generating synthetic financial tables.

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 27 Last month
Rankings
Dependent packages count: 9.6%
Average: 32.0%
Dependent repos count: 54.3%
Maintainers (1)
Last synced: 10 months ago