arkhe

Tools for cleaning rectangular data - :exclamation: This is a read-only mirror from https://codeberg.org/tesselle/arkhe

https://github.com/tesselle/arkhe

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 25 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    1 of 2 committers (50.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (19.1%) to scientific vocabulary

Keywords

data-cleaning r-package statistics
Last synced: 6 months ago

Repository

Basic Info
Statistics
  • Stars: 10
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 18
Topics
data-cleaning r-package statistics
Created over 6 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog License Citation Codemeta

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = NULL
)
Sys.setenv(LANGUAGE = "en") # Force locale
```

# arkhe 


[![status-badge](https://ci.codeberg.org/api/badges/14688/status.svg)](https://ci.codeberg.org/repos/14688){.pkgdown-devel}
[![Code coverage](https://packages.tesselle.org/arkhe/coverage/badge.svg)](https://packages.tesselle.org/arkhe/coverage/){.pkgdown-devel}
[![Dependencies](https://tinyverse.netlify.app/badge/arkhe)](https://cran.r-project.org/package=arkhe){.pkgdown-devel}

[![r-universe](https://tesselle.r-universe.dev/badges/arkhe)](https://tesselle.r-universe.dev/arkhe){.pkgdown-devel}
[![CRAN Version](https://www.r-pkg.org/badges/version/arkhe)](https://cran.r-project.org/package=arkhe){.pkgdown-release}
[![CRAN checks](https://badges.cranchecks.info/worst/arkhe.svg)](https://cran.r-project.org/web/checks/check_results_arkhe.html){.pkgdown-release}
[![CRAN Downloads](https://cranlogs.r-pkg.org/badges/arkhe)](https://cran.r-project.org/package=arkhe){.pkgdown-release}

[![Project Status: Active – The project has reached a stable, usable state and is being actively developed.](https://www.repostatus.org/badges/latest/active.svg)](https://www.repostatus.org/#active)

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.3526659.svg)](https://doi.org/10.5281/zenodo.3526659)


## Overview

A dependency-free collection of simple functions for cleaning rectangular data. This package lets you detect, count and replace values, or discard rows/columns, using a predicate function. In addition, it provides tools to check conditions and return informative error messages.

---

```{r citation, echo=FALSE, comment='', results='asis'}
cite <- utils::citation("arkhe")
print(cite, bibtex = FALSE)
```

## Installation

You can install the released version of **arkhe** from [CRAN](https://CRAN.R-project.org) with:

```{r cran-installation, eval=FALSE}
install.packages("arkhe")
```

And the development version from [Codeberg](https://codeberg.org/) with:

```{r gh-installation, eval=FALSE}
# install.packages("remotes")
remotes::install_git("https://codeberg.org/tesselle/arkhe")
```

## Usage

```{r}
## Load the package
library(arkhe)

## Set seed for reproducibility
set.seed(12345)

## Create a matrix
X <- matrix(sample(1:10, 25, TRUE), nrow = 5, ncol = 5)

## Add NA
k <- sample(1:25, 3, FALSE)
X[k] <- NA
X

## Count missing values in rows
count(X, f = is.na, margin = 1)

## Count non-missing values in columns
count(X, f = is.na, margin = 2, negate = TRUE)

## Find rows with at least one NA
detect(X, f = is.na, margin = 1)

## Find columns without any NA
detect(X, f = is.na, margin = 2, negate = TRUE, all = TRUE)

## Remove rows with any NA
discard(X, f = is.na, margin = 1, all = FALSE)

## Remove columns with any NA
discard(X, f = is.na, margin = 2, all = FALSE)

## Replace NA with zeros
replace_NA(X, value = 0)
```
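
To make the predicate-and-margin idea behind the calls above more concrete, here is a rough base R sketch. The helpers `count_by()`, `detect_by()` and `discard_by()` are hypothetical names introduced for illustration only; they are not part of **arkhe** and do not claim to reproduce its implementation or its handling of edge cases.

```r
## Hypothetical helpers (NOT arkhe functions): base R approximation of
## applying a predicate `f` over rows (margin = 1) or columns (margin = 2)
count_by <- function(x, f, margin = 1, negate = FALSE) {
  if (negate) f <- Negate(f)
  ## Number of values satisfying the predicate in each row/column
  apply(x, margin, function(v) sum(f(v)))
}

detect_by <- function(x, f, margin = 1, negate = FALSE, all = FALSE) {
  if (negate) f <- Negate(f)
  ## all = TRUE: every value must satisfy the predicate; otherwise any value
  test <- if (all) base::all else base::any
  apply(x, margin, function(v) test(f(v)))
}

discard_by <- function(x, f, margin = 1, all = FALSE) {
  ## Drop the rows/columns flagged by detect_by(), keeping a matrix
  drop_it <- detect_by(x, f, margin = margin, all = all)
  if (margin == 1) x[!drop_it, , drop = FALSE] else x[, !drop_it, drop = FALSE]
}

## Small deterministic matrix with three missing values
M <- matrix(c( 1, NA, 3,
               4,  5, 6,
              NA, NA, 9), nrow = 3, byrow = TRUE)

## Missing values per row
count_by(M, is.na, margin = 1)

## Columns without any NA
detect_by(M, is.na, margin = 2, negate = TRUE, all = TRUE)

## Keep only the rows without NA
discard_by(M, is.na, margin = 1)
```

Any function that returns a logical vector can serve as the predicate, for example `function(v) v == 0` or `is.infinite`.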

## Translation

This package provides translations of user-facing communications, such as messages, warnings and errors. By default, the preferred language is taken from the locale. This can be overridden by setting the environment variable `LANGUAGE` (you only need to do this once per session):

``` r
Sys.setenv(LANGUAGE = "<language code>") # e.g. "fr"
```

Languages currently available are English (`en`), French (`fr`) and Spanish (`es`).
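
If you only want a different language for a single call, one possible approach is to set `LANGUAGE` temporarily and restore the previous value afterwards. The sketch below uses base R only; `with_language()` is a hypothetical helper, not a function provided by **arkhe**, and it only manages the environment variable. Whether translated messages actually appear still depends on your R build and platform.

```r
## Hypothetical helper (NOT part of arkhe): evaluate `expr` with LANGUAGE
## set to `lang`, then restore the previous setting
with_language <- function(lang, expr) {
  old <- Sys.getenv("LANGUAGE", unset = NA)
  on.exit(
    if (is.na(old)) Sys.unsetenv("LANGUAGE") else Sys.setenv(LANGUAGE = old),
    add = TRUE
  )
  Sys.setenv(LANGUAGE = lang)
  ## `expr` is a promise, so it is evaluated here, after LANGUAGE is set
  force(expr)
}

## The variable is "fr" inside the call and restored once it returns
with_language("fr", Sys.getenv("LANGUAGE"))
```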

## Contributing

Please note that the **arkhe** project is released with a [Contributor Code of Conduct](https://www.tesselle.org/conduct.html). By contributing to this project, you agree to abide by its terms.

```{r metadata, include=FALSE}
## Update codemeta.json
codemetar::write_codemeta(verbose = FALSE)

## Update CITATION.cff
cff_keys <- list(
  identifiers = list(
    list(description = "The concept DOI.",
         type = "doi",
         value = "10.5281/zenodo.3526659"),
    list(description = "The versioned DOI for version 0.1.0.",
         type = "doi",
         value = "10.5281/zenodo.3526660"),
    list(description = "The versioned DOI for version 0.2.0.",
         type = "doi",
         value = "10.5281/zenodo.3583089"),
    list(description = "The versioned DOI for version 0.2.1.",
         type = "doi",
         value = "10.5281/zenodo.3668868"),
    list(description = "The versioned DOI for version 0.2.2.",
         type = "doi",
         value = "10.5281/zenodo.3724463"),
    list(description = "The versioned DOI for version 0.3.0.",
         type = "doi",
         value = "10.5281/zenodo.4675724"),
    list(description = "The versioned DOI for version 0.3.1.",
         type = "doi",
         value = "10.5281/zenodo.4761928"),
    list(description = "The versioned DOI for version 0.4.0.",
         type = "doi",
         value = "10.5281/zenodo.5515470"),
    list(description = "The versioned DOI for version 0.5.0.",
         type = "doi",
         value = "10.5281/zenodo.6647041"),
    list(description = "The versioned DOI for version 1.0.0.",
         type = "doi",
         value = "10.5281/zenodo.7261616"),
    list(description = "The versioned DOI for version 1.1.0.",
         type = "doi",
         value = "10.5281/zenodo.7546749"),
    list(description = "The versioned DOI for version 1.2.0.",
         type = "doi",
         value = "10.5281/zenodo.7925529"),
    list(description = "The versioned DOI for version 1.3.0.",
         type = "doi",
         value = "10.5281/zenodo.8308598"),
    list(description = "The versioned DOI for version 1.4.0.",
         type = "doi",
         value = "10.5281/zenodo.10148702"),
    list(description = "The versioned DOI for version 1.5.0.",
         type = "doi",
         value = "10.5281/zenodo.10534310"),
    list(description = "The versioned DOI for version 1.6.0.",
         type = "doi",
         value = "10.5281/zenodo.10694357"),
    list(description = "The versioned DOI for version 1.7.0.",
         type = "doi",
         value = "10.5281/zenodo.13121710"),
    list(description = "The versioned DOI for version 1.8.0.",
         type = "doi",
         value = "10.5281/zenodo.14054350"),
    list(description = "The versioned DOI for version 1.9.0.",
         type = "doi",
         value = "10.5281/zenodo.14355274"),
    list(description = "The versioned DOI for version 1.10.0.",
         type = "doi",
         value = "10.5281/zenodo.14926270"),
    list(description = "The versioned DOI for version 1.11.0.",
         type = "doi",
         value = "10.5281/zenodo.15390837"),
    list(description = "The CRAN DOI",
         type = "doi",
         value = "10.32614/cran.package.arkhe")
  )
)
cff <- cffr::cff_create("DESCRIPTION", keys = cff_keys)
if (cffr::cff_validate(cff)) cffr::cff_write(cff, outfile = "CITATION.cff")
```

Owner

  • Name: tesselle
  • Login: tesselle
  • Kind: organization
  • Location: France

A collection of R packages for archaeological research and teaching

Citation (CITATION.cff)

# --------------------------------------------
# CITATION file created with {cffr} R package
# See also: https://docs.ropensci.org/cffr/
# --------------------------------------------
 
cff-version: 1.2.0
message: 'To cite package "arkhe" in publications use:'
type: software
license: GPL-3.0-or-later
title: 'arkhe: Tools for Cleaning Rectangular Data'
version: 1.11.0
doi: 10.5281/zenodo.3526659
identifiers:
- description: The concept DOI.
  type: doi
  value: 10.5281/zenodo.3526659
- description: The versioned DOI for version 0.1.0.
  type: doi
  value: 10.5281/zenodo.3526660
- description: The versioned DOI for version 0.2.0.
  type: doi
  value: 10.5281/zenodo.3583089
- description: The versioned DOI for version 0.2.1.
  type: doi
  value: 10.5281/zenodo.3668868
- description: The versioned DOI for version 0.2.2.
  type: doi
  value: 10.5281/zenodo.3724463
- description: The versioned DOI for version 0.3.0.
  type: doi
  value: 10.5281/zenodo.4675724
- description: The versioned DOI for version 0.3.1.
  type: doi
  value: 10.5281/zenodo.4761928
- description: The versioned DOI for version 0.4.0.
  type: doi
  value: 10.5281/zenodo.5515470
- description: The versioned DOI for version 0.5.0.
  type: doi
  value: 10.5281/zenodo.6647041
- description: The versioned DOI for version 1.0.0.
  type: doi
  value: 10.5281/zenodo.7261616
- description: The versioned DOI for version 1.1.0.
  type: doi
  value: 10.5281/zenodo.7546749
- description: The versioned DOI for version 1.2.0.
  type: doi
  value: 10.5281/zenodo.7925529
- description: The versioned DOI for version 1.3.0.
  type: doi
  value: 10.5281/zenodo.8308598
- description: The versioned DOI for version 1.4.0.
  type: doi
  value: 10.5281/zenodo.10148702
- description: The versioned DOI for version 1.5.0.
  type: doi
  value: 10.5281/zenodo.10534310
- description: The versioned DOI for version 1.6.0.
  type: doi
  value: 10.5281/zenodo.10694357
- description: The versioned DOI for version 1.7.0.
  type: doi
  value: 10.5281/zenodo.13121710
- description: The versioned DOI for version 1.8.0.
  type: doi
  value: 10.5281/zenodo.14054350
- description: The versioned DOI for version 1.9.0.
  type: doi
  value: 10.5281/zenodo.14355274
- description: The versioned DOI for version 1.10.0.
  type: doi
  value: 10.5281/zenodo.14926270
- description: The versioned DOI for version 1.11.0.
  type: doi
  value: 10.5281/zenodo.15390837
- description: The CRAN DOI
  type: doi
  value: 10.32614/cran.package.arkhe
abstract: A dependency-free collection of simple functions for cleaning rectangular
  data. This package allows to detect, count and replace values or discard rows/columns
  using a predicate function. In addition, it provides tools to check conditions and
  return informative error messages.
authors:
- family-names: Frerebeau
  given-names: Nicolas
  email: nicolas.frerebeau@u-bordeaux-montaigne.fr
  orcid: https://orcid.org/0000-0001-5759-4944
preferred-citation:
  type: manual
  title: 'arkhe: Tools for Cleaning Rectangular Data'
  authors:
  - family-names: Frerebeau
    given-names: Nicolas
    email: nicolas.frerebeau@u-bordeaux-montaigne.fr
    orcid: https://orcid.org/0000-0001-5759-4944
  year: '2025'
  institution:
    name: Université Bordeaux Montaigne
    address: Pessac, France
  notes: R package version 1.11.0
  url: https://packages.tesselle.org/arkhe/
  doi: 10.5281/zenodo.3526659
repository: https://CRAN.R-project.org/package=arkhe
repository-code: https://codeberg.org/tesselle/arkhe
url: https://packages.tesselle.org/arkhe/
contact:
- family-names: Frerebeau
  given-names: Nicolas
  email: nicolas.frerebeau@u-bordeaux-montaigne.fr
  orcid: https://orcid.org/0000-0001-5759-4944
keywords:
- data-cleaning
- statistics
- r-package
references:
- type: software
  title: 'R: A Language and Environment for Statistical Computing'
  notes: Depends
  url: https://www.R-project.org/
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
  version: '>= 3.5'
- type: software
  title: methods
  abstract: 'R: A Language and Environment for Statistical Computing'
  notes: Imports
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
- type: software
  title: stats
  abstract: 'R: A Language and Environment for Statistical Computing'
  notes: Imports
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
- type: software
  title: utils
  abstract: 'R: A Language and Environment for Statistical Computing'
  notes: Imports
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
- type: software
  title: tinytest
  abstract: 'tinytest: Lightweight and Feature Complete Unit Testing Framework'
  notes: Suggests
  url: https://github.com/markvanderloo/tinytest
  repository: https://CRAN.R-project.org/package=tinytest
  authors:
  - family-names: Loo
    given-names: Mark
    name-particle: van der
    email: mark.vanderloo@gmail.com
    orcid: https://orcid.org/0000-0002-9807-4686
  year: '2025'
  doi: 10.32614/CRAN.package.tinytest

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "@type": "SoftwareSourceCode",
  "identifier": "arkhe",
  "description": "A dependency-free collection of simple functions for cleaning rectangular data. This package allows to detect, count and replace values or discard rows/columns using a predicate function. In addition, it provides tools to check conditions and return informative error messages.",
  "name": "arkhe: Tools for Cleaning Rectangular Data",
  "relatedLink": [
    "https://packages.tesselle.org/arkhe/",
    "https://CRAN.R-project.org/package=arkhe"
  ],
  "codeRepository": "https://codeberg.org/tesselle/arkhe",
  "issueTracker": "https://codeberg.org/tesselle/arkhe/issues",
  "license": "https://spdx.org/licenses/GPL-3.0",
  "version": "1.11.0",
  "programmingLanguage": {
    "@type": "ComputerLanguage",
    "name": "R",
    "url": "https://r-project.org"
  },
  "runtimePlatform": "R version 4.5.0 (2025-04-11)",
  "provider": {
    "@id": "https://cran.r-project.org",
    "@type": "Organization",
    "name": "Comprehensive R Archive Network (CRAN)",
    "url": "https://cran.r-project.org"
  },
  "author": [
    {
      "@type": "Person",
      "givenName": "Nicolas",
      "familyName": "Frerebeau",
      "email": "nicolas.frerebeau@u-bordeaux-montaigne.fr",
      "@id": "https://orcid.org/0000-0001-5759-4944"
    }
  ],
  "funder": [
    {
      "@type": "Organization",
      "name": "Université Bordeaux Montaigne"
    },
    {
      "@type": "Organization",
      "name": "CNRS"
    }
  ],
  "maintainer": [
    {
      "@type": "Person",
      "givenName": "Nicolas",
      "familyName": "Frerebeau",
      "email": "nicolas.frerebeau@u-bordeaux-montaigne.fr",
      "@id": "https://orcid.org/0000-0001-5759-4944"
    }
  ],
  "softwareSuggestions": [
    {
      "@type": "SoftwareApplication",
      "identifier": "tinytest",
      "name": "tinytest",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=tinytest"
    }
  ],
  "softwareRequirements": {
    "1": {
      "@type": "SoftwareApplication",
      "identifier": "R",
      "name": "R",
      "version": ">= 3.5"
    },
    "2": {
      "@type": "SoftwareApplication",
      "identifier": "methods",
      "name": "methods"
    },
    "3": {
      "@type": "SoftwareApplication",
      "identifier": "stats",
      "name": "stats"
    },
    "4": {
      "@type": "SoftwareApplication",
      "identifier": "utils",
      "name": "utils"
    },
    "SystemRequirements": null
  },
  "isPartOf": "https://www.tesselle.org",
  "keywords": [
    "data-cleaning",
    "statistics",
    "r-package"
  ],
  "fileSize": "361.461KB",
  "citation": [
    {
      "@type": "SoftwareSourceCode",
      "datePublished": "2025",
      "author": [
        {
          "@type": "Person",
          "givenName": "Nicolas",
          "familyName": "Frerebeau"
        }
      ],
      "name": "{arkhe: Tools for Cleaning Rectangular Data}",
      "identifier": "10.5281/zenodo.3526659",
      "url": "https://packages.tesselle.org/arkhe/",
      "description": "R package version 1.11.0",
      "@id": "https://doi.org/10.5281/zenodo.3526659",
      "sameAs": "https://doi.org/10.5281/zenodo.3526659"
    }
  ],
  "developmentStatus": "https://www.repostatus.org/#active"
}

GitHub Events

Total
  • Release event: 2
  • Watch event: 2
  • Push event: 20
  • Create event: 4
Last Year
  • Release event: 2
  • Watch event: 2
  • Push event: 20
  • Create event: 4

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 283
  • Total Committers: 2
  • Avg Commits per committer: 141.5
  • Development Distribution Score (DDS): 0.244
Past Year
  • Commits: 28
  • Committers: 1
  • Avg Commits per committer: 28.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
nfrerebeau n****u@u****r 214
nfrerebeau 3****u 69
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 2
  • Total pull requests: 0
  • Average time to close issues: about 4 hours
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • nfrerebeau (2)
Pull Request Authors
Top Labels
Issue Labels
bug (2)
Pull Request Labels

Dependencies

DESCRIPTION cran
  • R >= 3.3 depends
  • methods * imports
  • stats * imports
  • utils * imports
  • testthat >= 3.0.0 suggests