arkhe
Tools for cleaning rectangular data - :exclamation: This is a read-only mirror from https://codeberg.org/tesselle/arkhe
Science Score: 77.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 25 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
✓Committers with academic emails
1 of 2 committers (50.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (19.1%) to scientific vocabulary
Keywords
data-cleaning
r-package
statistics
Last synced: 6 months ago
·
JSON representation
·
Repository
Tools for cleaning rectangular data - :exclamation: This is a read-only mirror from https://codeberg.org/tesselle/arkhe
Basic Info
- Host: GitHub
- Owner: tesselle
- License: gpl-3.0
- Language: R
- Default Branch: main
- Homepage: https://packages.tesselle.org/arkhe/
- Size: 9.89 MB
Statistics
- Stars: 10
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 18
Topics
data-cleaning
r-package
statistics
Created over 6 years ago
· Last pushed 6 months ago
Metadata Files
Readme
Changelog
License
Citation
Codemeta
README.Rmd
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = NULL
)
Sys.setenv(LANGUAGE = "en") # Force locale
```
# arkhe
[](https://ci.codeberg.org/repos/14688){.pkgdown-devel}
[](https://packages.tesselle.org/arkhe/coverage/){.pkgdown-devel}
[](https://cran.r-project.org/package=arkhe){.pkgdown-devel}
[](https://tesselle.r-universe.dev/arkhe){.pkgdown-devel}
[](https://cran.r-project.org/package=arkhe){.pkgdown-release}
[](https://cran.r-project.org/web/checks/check_results_arkhe.html){.pkgdown-release}
[](https://cran.r-project.org/package=arkhe){.pkgdown-release}
[](https://www.repostatus.org/#active)
[](https://doi.org/10.5281/zenodo.3526659)
## Overview
A dependency-free collection of simple functions for cleaning rectangular data. This package allows to detect, count and replace values or discard rows/columns using a predicate function. In addition, it provides tools to check conditions and return informative error messages.
---
```{r citation, echo=FALSE, comment='', results='asis'}
cite <- utils::citation("arkhe")
print(cite, bibtex = FALSE)
```
## Installation
You can install the released version of **arkhe** from [CRAN](https://CRAN.R-project.org) with:
```{r cran-installation, eval=FALSE}
install.packages("arkhe")
```
And the development version from [Codeberg](https://codeberg.org/) with:
```{r gh-installation, eval=FALSE}
# install.packages("remotes")
remotes::install_git("https://codeberg.org/tesselle/arkhe")
```
## Usage
```{r}
## Load the package
library(arkhe)
## Set seed for reproductibility
set.seed(12345)
## Create a matrix
X <- matrix(sample(1:10, 25, TRUE), nrow = 5, ncol = 5)
## Add NA
k <- sample(1:25, 3, FALSE)
X[k] <- NA
X
## Count missing values in rows
count(X, f = is.na, margin = 1)
## Count non-missing values in columns
count(X, f = is.na, margin = 2, negate = TRUE)
## Find row with NA
detect(X, f = is.na, margin = 1)
## Find column without any NA
detect(X, f = is.na, margin = 2, negate = TRUE, all = TRUE)
## Remove row with any NA
discard(X, f = is.na, margin = 1, all = FALSE)
## Remove column with any NA
discard(X, f = is.na, margin = 2, all = FALSE)
## Replace NA with zeros
replace_NA(X, value = 0)
```
## Translation
This package provides translations of user-facing communications, like messages, warnings and errors. The preferred language is by default taken from the locale. This can be overridden by setting of the environment variable `LANGUAGE` (you only need to do this once per session):
``` r
Sys.setenv(LANGUAGE = "")
```
Languages currently available are English (`en`), French (`fr`) and Spanish (`es`).
## Contributing
Please note that the **arkhe** project is released with a [Contributor Code of Conduct](https://www.tesselle.org/conduct.html). By contributing to this project, you agree to abide by its terms.
```{r metadata, include=FALSE}
## Update codemeta.json
codemetar::write_codemeta(verbose = FALSE)
## Update CITATION.cff
cff_keys <- list(
identifiers = list(
list(description = "The concept DOI.",
type = "doi",
value = "10.5281/zenodo.3526659"),
list(description = "The versioned DOI for version 0.1.0.",
type = "doi",
value = "10.5281/zenodo.3526660"),
list(description = "The versioned DOI for version 0.2.0.",
type = "doi",
value = "10.5281/zenodo.3583089"),
list(description = "The versioned DOI for version 0.2.1.",
type = "doi",
value = "10.5281/zenodo.3668868"),
list(description = "The versioned DOI for version 0.2.2.",
type = "doi",
value = "10.5281/zenodo.3724463"),
list(description = "The versioned DOI for version 0.3.0.",
type = "doi",
value = "10.5281/zenodo.4675724"),
list(description = "The versioned DOI for version 0.3.1.",
type = "doi",
value = "10.5281/zenodo.4761928"),
list(description = "The versioned DOI for version 0.4.0.",
type = "doi",
value = "10.5281/zenodo.5515470"),
list(description = "The versioned DOI for version 0.5.0.",
type = "doi",
value = "10.5281/zenodo.6647041"),
list(description = "The versioned DOI for version 1.0.0.",
type = "doi",
value = "10.5281/zenodo.7261616"),
list(description = "The versioned DOI for version 1.1.0.",
type = "doi",
value = "10.5281/zenodo.7546749"),
list(description = "The versioned DOI for version 1.2.0.",
type = "doi",
value = "10.5281/zenodo.7925529"),
list(description = "The versioned DOI for version 1.3.0.",
type = "doi",
value = "10.5281/zenodo.8308598"),
list(description = "The versioned DOI for version 1.4.0.",
type = "doi",
value = "10.5281/zenodo.10148702"),
list(description = "The versioned DOI for version 1.5.0.",
type = "doi",
value = "10.5281/zenodo.10534310"),
list(description = "The versioned DOI for version 1.6.0.",
type = "doi",
value = "10.5281/zenodo.10694357"),
list(description = "The versioned DOI for version 1.7.0.",
type = "doi",
value = "10.5281/zenodo.13121710"),
list(description = "The versioned DOI for version 1.8.0.",
type = "doi",
value = "10.5281/zenodo.14054350"),
list(description = "The versioned DOI for version 1.9.0.",
type = "doi",
value = "10.5281/zenodo.14355274"),
list(description = "The versioned DOI for version 1.10.0.",
type = "doi",
value = "10.5281/zenodo.14926270"),
list(description = "The versioned DOI for version 1.11.0.",
type = "doi",
value = "10.5281/zenodo.15390837"),
list(description = "The CRAN DOI",
type = "doi",
value = "10.32614/cran.package.arkhe")
)
)
cff <- cffr::cff_create("DESCRIPTION", keys = cff_keys)
if (cffr::cff_validate(cff)) cffr::cff_write(cff, outfile = "CITATION.cff")
```
Owner
- Name: tesselle
- Login: tesselle
- Kind: organization
- Location: France
- Website: www.tesselle.org
- Repositories: 5
- Profile: https://github.com/tesselle
A collection of R packages for archaeological research and teaching
Citation (CITATION.cff)
# --------------------------------------------
# CITATION file created with {cffr} R package
# See also: https://docs.ropensci.org/cffr/
# --------------------------------------------
cff-version: 1.2.0
message: 'To cite package "arkhe" in publications use:'
type: software
license: GPL-3.0-or-later
title: 'arkhe: Tools for Cleaning Rectangular Data'
version: 1.11.0
doi: 10.5281/zenodo.3526659
identifiers:
- description: The concept DOI.
type: doi
value: 10.5281/zenodo.3526659
- description: The versioned DOI for version 0.1.0.
type: doi
value: 10.5281/zenodo.3526660
- description: The versioned DOI for version 0.2.0.
type: doi
value: 10.5281/zenodo.3583089
- description: The versioned DOI for version 0.2.1.
type: doi
value: 10.5281/zenodo.3668868
- description: The versioned DOI for version 0.2.2.
type: doi
value: 10.5281/zenodo.3724463
- description: The versioned DOI for version 0.3.0.
type: doi
value: 10.5281/zenodo.4675724
- description: The versioned DOI for version 0.3.1.
type: doi
value: 10.5281/zenodo.4761928
- description: The versioned DOI for version 0.4.0.
type: doi
value: 10.5281/zenodo.5515470
- description: The versioned DOI for version 0.5.0.
type: doi
value: 10.5281/zenodo.6647041
- description: The versioned DOI for version 1.0.0.
type: doi
value: 10.5281/zenodo.7261616
- description: The versioned DOI for version 1.1.0.
type: doi
value: 10.5281/zenodo.7546749
- description: The versioned DOI for version 1.2.0.
type: doi
value: 10.5281/zenodo.7925529
- description: The versioned DOI for version 1.3.0.
type: doi
value: 10.5281/zenodo.8308598
- description: The versioned DOI for version 1.4.0.
type: doi
value: 10.5281/zenodo.10148702
- description: The versioned DOI for version 1.5.0.
type: doi
value: 10.5281/zenodo.10534310
- description: The versioned DOI for version 1.6.0.
type: doi
value: 10.5281/zenodo.10694357
- description: The versioned DOI for version 1.7.0.
type: doi
value: 10.5281/zenodo.13121710
- description: The versioned DOI for version 1.8.0.
type: doi
value: 10.5281/zenodo.14054350
- description: The versioned DOI for version 1.9.0.
type: doi
value: 10.5281/zenodo.14355274
- description: The versioned DOI for version 1.10.0.
type: doi
value: 10.5281/zenodo.14926270
- description: The versioned DOI for version 1.11.0.
type: doi
value: 10.5281/zenodo.15390837
- description: The CRAN DOI
type: doi
value: 10.32614/cran.package.arkhe
abstract: A dependency-free collection of simple functions for cleaning rectangular
data. This package allows to detect, count and replace values or discard rows/columns
using a predicate function. In addition, it provides tools to check conditions and
return informative error messages.
authors:
- family-names: Frerebeau
given-names: Nicolas
email: nicolas.frerebeau@u-bordeaux-montaigne.fr
orcid: https://orcid.org/0000-0001-5759-4944
preferred-citation:
type: manual
title: 'arkhe: Tools for Cleaning Rectangular Data'
authors:
- family-names: Frerebeau
given-names: Nicolas
email: nicolas.frerebeau@u-bordeaux-montaigne.fr
orcid: https://orcid.org/0000-0001-5759-4944
year: '2025'
institution:
name: Université Bordeaux Montaigne
address: Pessac, France
notes: R package version 1.11.0
url: https://packages.tesselle.org/arkhe/
doi: 10.5281/zenodo.3526659
repository: https://CRAN.R-project.org/package=arkhe
repository-code: https://codeberg.org/tesselle/arkhe
url: https://packages.tesselle.org/arkhe/
contact:
- family-names: Frerebeau
given-names: Nicolas
email: nicolas.frerebeau@u-bordeaux-montaigne.fr
orcid: https://orcid.org/0000-0001-5759-4944
keywords:
- data-cleaning
- statistics
- r-package
references:
- type: software
title: 'R: A Language and Environment for Statistical Computing'
notes: Depends
url: https://www.R-project.org/
authors:
- name: R Core Team
institution:
name: R Foundation for Statistical Computing
address: Vienna, Austria
year: '2025'
version: '>= 3.5'
- type: software
title: methods
abstract: 'R: A Language and Environment for Statistical Computing'
notes: Imports
authors:
- name: R Core Team
institution:
name: R Foundation for Statistical Computing
address: Vienna, Austria
year: '2025'
- type: software
title: stats
abstract: 'R: A Language and Environment for Statistical Computing'
notes: Imports
authors:
- name: R Core Team
institution:
name: R Foundation for Statistical Computing
address: Vienna, Austria
year: '2025'
- type: software
title: utils
abstract: 'R: A Language and Environment for Statistical Computing'
notes: Imports
authors:
- name: R Core Team
institution:
name: R Foundation for Statistical Computing
address: Vienna, Austria
year: '2025'
- type: software
title: tinytest
abstract: 'tinytest: Lightweight and Feature Complete Unit Testing Framework'
notes: Suggests
url: https://github.com/markvanderloo/tinytest
repository: https://CRAN.R-project.org/package=tinytest
authors:
- family-names: Loo
given-names: Mark
name-particle: van der
email: mark.vanderloo@gmail.com
orcid: https://orcid.org/0000-0002-9807-4686
year: '2025'
doi: 10.32614/CRAN.package.tinytest
CodeMeta (codemeta.json)
{
"@context": "https://doi.org/10.5063/schema/codemeta-2.0",
"@type": "SoftwareSourceCode",
"identifier": "arkhe",
"description": "A dependency-free collection of simple functions for cleaning rectangular data. This package allows to detect, count and replace values or discard rows/columns using a predicate function. In addition, it provides tools to check conditions and return informative error messages.",
"name": "arkhe: Tools for Cleaning Rectangular Data",
"relatedLink": [
"https://packages.tesselle.org/arkhe/",
"https://CRAN.R-project.org/package=arkhe"
],
"codeRepository": "https://codeberg.org/tesselle/arkhe",
"issueTracker": "https://codeberg.org/tesselle/arkhe/issues",
"license": "https://spdx.org/licenses/GPL-3.0",
"version": "1.11.0",
"programmingLanguage": {
"@type": "ComputerLanguage",
"name": "R",
"url": "https://r-project.org"
},
"runtimePlatform": "R version 4.5.0 (2025-04-11)",
"provider": {
"@id": "https://cran.r-project.org",
"@type": "Organization",
"name": "Comprehensive R Archive Network (CRAN)",
"url": "https://cran.r-project.org"
},
"author": [
{
"@type": "Person",
"givenName": "Nicolas",
"familyName": "Frerebeau",
"email": "nicolas.frerebeau@u-bordeaux-montaigne.fr",
"@id": "https://orcid.org/0000-0001-5759-4944"
}
],
"funder": [
{
"@type": "Organization",
"name": "Universit Bordeaux Montaigne"
},
{
"@type": "Organization",
"name": "CNRS"
}
],
"maintainer": [
{
"@type": "Person",
"givenName": "Nicolas",
"familyName": "Frerebeau",
"email": "nicolas.frerebeau@u-bordeaux-montaigne.fr",
"@id": "https://orcid.org/0000-0001-5759-4944"
}
],
"softwareSuggestions": [
{
"@type": "SoftwareApplication",
"identifier": "tinytest",
"name": "tinytest",
"provider": {
"@id": "https://cran.r-project.org",
"@type": "Organization",
"name": "Comprehensive R Archive Network (CRAN)",
"url": "https://cran.r-project.org"
},
"sameAs": "https://CRAN.R-project.org/package=tinytest"
}
],
"softwareRequirements": {
"1": {
"@type": "SoftwareApplication",
"identifier": "R",
"name": "R",
"version": ">= 3.5"
},
"2": {
"@type": "SoftwareApplication",
"identifier": "methods",
"name": "methods"
},
"3": {
"@type": "SoftwareApplication",
"identifier": "stats",
"name": "stats"
},
"4": {
"@type": "SoftwareApplication",
"identifier": "utils",
"name": "utils"
},
"SystemRequirements": null
},
"isPartOf": "https://www.tesselle.org",
"keywords": [
"data-cleaning",
"statistics",
"r-package"
],
"fileSize": "361.461KB",
"citation": [
{
"@type": "SoftwareSourceCode",
"datePublished": "2025",
"author": [
{
"@type": "Person",
"givenName": "Nicolas",
"familyName": "Frerebeau"
}
],
"name": "{arkhe: Tools for Cleaning Rectangular Data}",
"identifier": "10.5281/zenodo.3526659",
"url": "https://packages.tesselle.org/arkhe/",
"description": "R package version 1.11.0",
"@id": "https://doi.org/10.5281/zenodo.3526659",
"sameAs": "https://doi.org/10.5281/zenodo.3526659"
}
],
"developmentStatus": "https://www.repostatus.org/#active"
}
GitHub Events
Total
- Release event: 2
- Watch event: 2
- Push event: 20
- Create event: 4
Last Year
- Release event: 2
- Watch event: 2
- Push event: 20
- Create event: 4
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| nfrerebeau | n****u@u****r | 214 |
| nfrerebeau | 3****u | 69 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 2
- Total pull requests: 0
- Average time to close issues: about 4 hours
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- nfrerebeau (2)
Pull Request Authors
Top Labels
Issue Labels
bug (2)
Pull Request Labels
Dependencies
DESCRIPTION
cran
- R >= 3.3 depends
- methods * imports
- stats * imports
- utils * imports
- testthat >= 3.0.0 suggests