rdocdump

rdocdump: Dump ‘R’ Package Source, Documentation, and Vignettes into One File

https://github.com/e-kotov/rdocdump

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 10 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.3%) to scientific vocabulary

Keywords

llm r-package rag text
Last synced: 6 months ago · JSON representation ·

Repository

rdocdump: Dump ‘R’ Package Source, Documentation, and Vignettes into One File

Basic Info
Statistics
  • Stars: 14
  • Watchers: 1
  • Forks: 0
  • Open Issues: 2
  • Releases: 3
Topics
llm r-package rag text
Created 11 months ago · Last pushed 6 months ago
Metadata Files
Readme Changelog Contributing License Code of conduct Citation Codemeta

README.md

rdocdump: Dump ‘R’ Package Source, Documentation, and Vignettes into One File rdocdump website

Project Status:
Active Lifecycle:
experimental CRAN
status CRAN/METACRAN Total downloads CRAN/METACRAN Downloads per month R-CMD-check pkgcheck

DOI DOI <!-- badges: end -->

Dump source code, documentation and vignettes of an R package into a single file. Supports installed packages, tar.gz archives, and package source directories. If the package is not installed, only its source is automatically downloaded from CRAN for processing. The output is a single plain text file or a character vector, which is useful to ingest complete package documentation and source into a large language model (LLM) or pass it further to other tools, such as {ragnar} to create a Retrieval-Augmented Generation (RAG) workflow.

Installation

Install the latest stable release of rdocdump from CRAN with:

r install.packages("rdocdump")

You can install the development version of rdocdump from R Universe with:

r install.packages('rdocdump', repos = c('https://e-kotov.r-universe.dev', 'https://cloud.r-project.org') )

or from GitHub with:

``` r

install.packages("pak")

pak::pak("e-kotov/rdocdump") ```

Example

Extract documenation and source code of {rJavaEnv} package by downloading source from CRAN and save it to file rJavaEnv_docs.txt

r rdd_to_txt( pkg = "rJavaEnv", file = tempfile("rJavaEnv_docs_", fileext = ".txt"), force_fetch = TRUE, # force download even if package is installed keep_files = "none" # delete temp files )

Citation

To cite package ‘rdocdump’ in publications use:

Kotov E (2025). rdocdump: Dump R Package Source, Documentation, and Vignettes into One File. doi:10.32614/CRAN.package.rdocdump https://doi.org/10.32614/CRAN.package.rdocdump, https://github.com/e-kotov/rdocdump.

BibTeX:

@Manual{rdocdump,
  title = {rdocdump: Dump R Package Source, Documentation, and Vignettes into One File},
  author = {Egor Kotov},
  year = {2025},
  url = {https://github.com/e-kotov/rdocdump},
  doi = {10.32614/CRAN.package.rdocdump},
}

Owner

  • Name: Egor Kotov
  • Login: e-kotov
  • Kind: user
  • Location: Rostock, Germany
  • Company: Max Planck Institute for Demographic Research

Spatial Data Scientist, Doctoral Reseacher at @MPIDR and UPF

Citation (CITATION.cff)

# --------------------------------------------
# CITATION file created with {cffr} R package
# See also: https://docs.ropensci.org/cffr/
# --------------------------------------------
 
cff-version: 1.2.0
message: 'To cite package "rdocdump" in publications use:'
type: software
license: MIT
title: 'rdocdump: Dump ''R'' Package Source, Documentation, and Vignettes into One
  File'
version: 0.1.1
doi: 10.32614/CRAN.package.rdocdump
identifiers:
- type: doi
  value: 10.32614/CRAN.package.rdocdump
abstract: Dump source code, documentation and vignettes of an 'R' package into a single
  file. Supports installed packages, tar.gz archives, and package source directories.
  If the package is not installed, only its source is automatically downloaded from
  CRAN for processing. The output is a single plain text file or a character vector,
  which is useful to ingest complete package documentation and source into a large
  language model (LLM) or pass it further to other tools, such as 'ragnar' <https://github.com/tidyverse/ragnar>
  to create a Retrieval-Augmented Generation (RAG) workflow.
authors:
- family-names: Kotov
  given-names: Egor
  email: kotov.egor@gmail.com
  orcid: https://orcid.org/0000-0001-6690-5345
preferred-citation:
  type: manual
  title: 'rdocdump: Dump R Package Source, Documentation, and Vignettes into One File'
  authors:
  - family-names: Kotov
    given-names: Egor
    email: kotov.egor@gmail.com
    orcid: https://orcid.org/0000-0001-6690-5345
  year: '2025'
  url: https://github.com/e-kotov/rdocdump
  doi: 10.32614/CRAN.package.rdocdump
repository: https://CRAN.R-project.org/package=rdocdump
repository-code: https://github.com/e-kotov/rdocdump
url: https://www.ekotov.pro/rdocdump/
contact:
- family-names: Kotov
  given-names: Egor
  email: kotov.egor@gmail.com
  orcid: https://orcid.org/0000-0001-6690-5345
keywords:
- llm
- r-package
- rag
- text
references:
- type: software
  title: curl
  abstract: 'curl: A Modern and Flexible Web Client for R'
  notes: Suggests
  url: https://jeroen.r-universe.dev/curl
  repository: https://CRAN.R-project.org/package=curl
  authors:
  - family-names: Ooms
    given-names: Jeroen
    email: jeroenooms@gmail.com
    orcid: https://orcid.org/0000-0002-4035-0289
  year: '2025'
  doi: 10.32614/CRAN.package.curl
- type: software
  title: quarto
  abstract: 'quarto: R Interface to ''Quarto'' Markdown Publishing System'
  notes: Suggests
  url: https://quarto-dev.github.io/quarto-r/
  repository: https://CRAN.R-project.org/package=quarto
  authors:
  - family-names: Allaire
    given-names: JJ
    email: jj@posit.co
    orcid: https://orcid.org/0000-0003-0174-9868
  - family-names: Dervieux
    given-names: Christophe
    email: cderv@posit.co
    orcid: https://orcid.org/0000-0003-4474-2498
  year: '2025'
  doi: 10.32614/CRAN.package.quarto
- type: software
  title: testthat
  abstract: 'testthat: Unit Testing for R'
  notes: Suggests
  url: https://testthat.r-lib.org
  repository: https://CRAN.R-project.org/package=testthat
  authors:
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
  year: '2025'
  doi: 10.32614/CRAN.package.testthat
  version: '>= 3.0.0'
- type: software
  title: withr
  abstract: 'withr: Run Code ''With'' Temporarily Modified Global State'
  notes: Suggests
  url: https://withr.r-lib.org
  repository: https://CRAN.R-project.org/package=withr
  authors:
  - family-names: Hester
    given-names: Jim
  - family-names: Henry
    given-names: Lionel
    email: lionel@posit.co
  - family-names: Müller
    given-names: Kirill
    email: krlmlr+r@mailbox.org
  - family-names: Ushey
    given-names: Kevin
    email: kevinushey@gmail.com
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
  - family-names: Chang
    given-names: Winston
  year: '2025'
  doi: 10.32614/CRAN.package.withr

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "@type": "SoftwareSourceCode",
  "identifier": "rdocdump",
  "description": "Dump source code, documentation and vignettes of an 'R' package into a single file. Supports installed packages, tar.gz archives, and package source directories. If the package is not installed, only its source is automatically downloaded from CRAN for processing. The output is a single plain text file or a character vector, which is useful to ingest complete package documentation and source into a large language model (LLM) or pass it further to other tools, such as 'ragnar' <https://github.com/tidyverse/ragnar> to create a Retrieval-Augmented Generation (RAG) workflow.",
  "name": "rdocdump: Dump 'R' Package Source, Documentation, and Vignettes into One File",
  "relatedLink": [
    "https://www.ekotov.pro/rdocdump/",
    "https://CRAN.R-project.org/package=rdocdump"
  ],
  "codeRepository": "https://github.com/e-kotov/rdocdump",
  "issueTracker": "https://github.com/e-kotov/rdocdump/issues",
  "license": "https://spdx.org/licenses/MIT",
  "version": "0.1.1",
  "programmingLanguage": {
    "@type": "ComputerLanguage",
    "name": "R",
    "url": "https://r-project.org"
  },
  "runtimePlatform": "R version 4.5.1 (2025-06-13)",
  "provider": {
    "@id": "https://cran.r-project.org",
    "@type": "Organization",
    "name": "Comprehensive R Archive Network (CRAN)",
    "url": "https://cran.r-project.org"
  },
  "author": [
    {
      "@type": "Person",
      "givenName": "Egor",
      "familyName": "Kotov",
      "email": "kotov.egor@gmail.com",
      "@id": "https://orcid.org/0000-0001-6690-5345"
    }
  ],
  "copyrightHolder": [
    {
      "@type": "Person",
      "givenName": "Egor",
      "familyName": "Kotov",
      "email": "kotov.egor@gmail.com",
      "@id": "https://orcid.org/0000-0001-6690-5345"
    }
  ],
  "maintainer": [
    {
      "@type": "Person",
      "givenName": "Egor",
      "familyName": "Kotov",
      "email": "kotov.egor@gmail.com",
      "@id": "https://orcid.org/0000-0001-6690-5345"
    }
  ],
  "softwareSuggestions": [
    {
      "@type": "SoftwareApplication",
      "identifier": "curl",
      "name": "curl",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=curl"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "quarto",
      "name": "quarto",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=quarto"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "testthat",
      "name": "testthat",
      "version": ">= 3.0.0",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=testthat"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "withr",
      "name": "withr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=withr"
    }
  ],
  "softwareRequirements": {
    "SystemRequirements": null
  },
  "fileSize": "80.824KB",
  "citation": [
    {
      "@type": "SoftwareSourceCode",
      "datePublished": "2025",
      "author": [
        {
          "@type": "Person",
          "givenName": "Egor",
          "familyName": "Kotov",
          "email": "kotov.egor@gmail.com",
          "@id": "https://orcid.org/0000-0001-6690-5345"
        }
      ],
      "name": "rdocdump: Dump R Package Source, Documentation, and Vignettes into One File",
      "identifier": "10.32614/CRAN.package.rdocdump",
      "url": "https://github.com/e-kotov/rdocdump",
      "@id": "https://doi.org/10.32614/CRAN.package.rdocdump",
      "sameAs": "https://doi.org/10.32614/CRAN.package.rdocdump"
    }
  ],
  "releaseNotes": "https://github.com/e-kotov/rdocdump/blob/master/NEWS.md",
  "readme": "https://github.com/e-kotov/rdocdump/blob/main/README.md",
  "contIntegration": [
    "https://github.com/e-kotov/rdocdump/actions/workflows/R-CMD-check.yaml",
    "https://github.com/e-kotov/rdocdump/actions?query=workflow%3Apkgcheck"
  ],
  "developmentStatus": [
    "https://www.repostatus.org/#active",
    "https://lifecycle.r-lib.org/articles/stages.html#experimental"
  ],
  "keywords": [
    "llm",
    "r-package",
    "rag",
    "text"
  ]
}

GitHub Events

Total
  • Create event: 13
  • Release event: 1
  • Issues event: 16
  • Watch event: 11
  • Delete event: 12
  • Issue comment event: 29
  • Public event: 1
  • Push event: 71
  • Pull request review comment event: 1
  • Pull request review event: 2
  • Pull request event: 18
Last Year
  • Create event: 13
  • Release event: 1
  • Issues event: 16
  • Watch event: 11
  • Delete event: 12
  • Issue comment event: 29
  • Public event: 1
  • Push event: 71
  • Pull request review comment event: 1
  • Pull request review event: 2
  • Pull request event: 18

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 79
  • Total Committers: 1
  • Avg Commits per committer: 79.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 79
  • Committers: 1
  • Avg Commits per committer: 79.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Egor Kotov k****r@g****m 79

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 9
  • Total pull requests: 11
  • Average time to close issues: about 21 hours
  • Average time to close pull requests: about 2 hours
  • Total issue authors: 2
  • Total pull request authors: 1
  • Average comments per issue: 2.33
  • Average comments per pull request: 0.09
  • Merged pull requests: 11
  • Bot issues: 2
  • Bot pull requests: 0
Past Year
  • Issues: 9
  • Pull requests: 11
  • Average time to close issues: about 21 hours
  • Average time to close pull requests: about 2 hours
  • Issue authors: 2
  • Pull request authors: 1
  • Average comments per issue: 2.33
  • Average comments per pull request: 0.09
  • Merged pull requests: 11
  • Bot issues: 2
  • Bot pull requests: 0
Top Authors
Issue Authors
  • e-kotov (8)
  • github-actions[bot] (2)
  • atsyplenkov (1)
Pull Request Authors
  • e-kotov (18)
Top Labels
Issue Labels
bug (3) enhancement (2)
Pull Request Labels
enhancement (2)

Packages

  • Total packages: 1
  • Total downloads:
    • cran 196 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 2
  • Total maintainers: 1
cran.r-project.org: rdocdump

Dump 'R' Package Source, Documentation, and Vignettes into One File

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 196 Last month
Rankings
Dependent packages count: 26.2%
Dependent repos count: 32.3%
Average: 48.3%
Downloads: 86.4%
Maintainers (1)
Last synced: 6 months ago