https://github.com/appeler/ethnicolor

Race and Ethnicity based on name using data from census, voter reg. files, etc.

https://github.com/appeler/ethnicolor

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.5%) to scientific vocabulary

Keywords

census ethnicity machine-learning race
Last synced: 6 months ago · JSON representation

Repository

Race and Ethnicity based on name using data from census, voter reg. files, etc.

Basic Info
  • Host: GitHub
  • Owner: appeler
  • License: other
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 41.9 MB
Statistics
  • Stars: 11
  • Watchers: 4
  • Forks: 7
  • Open Issues: 2
  • Releases: 0
Topics
census ethnicity machine-learning race
Created about 9 years ago · Last pushed about 8 years ago

https://github.com/appeler/ethnicolor/blob/master/

## ethnicolor: Impute Race and Ethnicity Based on Name

[![Build Status](https://travis-ci.org/appeler/ethnicolor.svg?branch=master)](https://travis-ci.org/appeler/ethnicolor)
[![CRAN_Status_Badge](http://www.r-pkg.org/badges/version/ethnicolor)](https://cran.r-project.org/package=ethnicolor)
[![codecov](https://codecov.io/gh/soodoku/ethnicolor/branch/master/graph/badge.svg)](https://codecov.io/gh/soodoku/ethnicolor)
![](http://cranlogs.r-pkg.org/badges/grand-total/ethnicolor)

### Data

The data are from:

* [The 2000 Census](https://github.com/soodoku/ethnicolor/tree/master/data-raw/census)
* [The 2010 Census](https://github.com/soodoku/ethnicolor/tree/master/data-raw/census)
* The Florida Voter Registration File from February 2017. Request access to the data [here](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/UBIG3F) or request your own copy from the Florida Secretary of State. 
* [Wikipedia Data](https://github.com/soodoku/ethnicolor/tree/master/data-raw/wiki)

### Included Models for Imputing Race

* `census_ln` will simply append data from the 2000 and 2010 census. For some mild processing that underlies it, see [here](https://github.com/soodoku/ethnicolor/tree/master/data-raw/census).
* `pred_census_ln` exposes a model based off Florida Voter Registration Last Name Data. For details, see [here](https://github.com/soodoku/ethnicolor/tree/master/data-raw/fl_voter_reg)
* `pred_fl_reg_ln` exposes a model built using Florida Voter Registration data. Utilizes only the surname. For details, see [here](https://github.com/soodoku/ethnicolor/tree/master/data-raw/fl_voter_reg)
* `pred_fl_reg_ln` exposes a model built using Florida Voter Registration data. Utilizes the first name, middle name (if there), last name, and suffix (if there). For details, see [here](https://github.com/soodoku/ethnicolor/tree/master/data-raw/fl_voter_reg)

### Installation

To get the current development version from GitHub:

```r
# install.packages("devtools")
devtools::install_github("soodoku/ethnicolor")
```

### Related Information

For Python package that provides roughly the same functionality, see [ethnicolr](https://github.com/soodoku/ethnicolr).

### License

Scripts are released under the [MIT License](https://opensource.org/licenses/MIT).

### Contributor Code of Conduct

The project welcomes contributions from everyone! In fact, it depends on it. To maintain this welcoming atmosphere, and to collaborate in a fun and productive way, we expect contributors to the project to abide by the [Contributor Code of Conduct](http://contributor-covenant.org/version/1/0/0/).

Owner

  • Name: appeler
  • Login: appeler
  • Kind: organization

Making sense of names.

GitHub Events

Total
Last Year

Dependencies

DESCRIPTION cran
  • R >= 3.3.2 depends
  • knitr >= 1.11 suggests
  • lintr * suggests
  • rmarkdown * suggests
  • testthat * suggests