intelligent-crowdworker-selection
Code associated to the article "Who knows best? Intelligent Crowdworker Selection via Deep Learning"
https://github.com/ies-research/intelligent-crowdworker-selection
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.2%) to scientific vocabulary
Keywords
Repository
Code associated to the article "Who knows best? Intelligent Crowdworker Selection via Deep Learning"
Basic Info
- Host: GitHub
- Owner: ies-research
- Language: Python
- Default Branch: main
- Homepage: https://ceur-ws.org/Vol-3470/paper3.pdf
- Size: 49.8 KB
Statistics
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Who knows best? Intelligent Crowdworker Selection via Deep Learning
Authors: Marek Herde, Denis Huseljic, Bernhard Sick, Ulrich Bretschneider, and Sarah Oeste-Reiß
Project Structure
evaluation: collection of Python and Bash scripts required to perform experimental evaluationlfma: Python package consisting of several sub-packagesclassifiers: implementation of multi-annotator supervised learning techniques according to scikit-learn interfacesmodules: implementation of multi-annotator supervised learning techniques aspytorch_lightningmodules,pytorchdata sets, and special layersutils: helper functions
notebooks:annotator_simulation.ipynb: simulation of annotator sets for the data sets LETTER and CIFAR10data_set_creation_download.ipynb: download of LETTER and CIFAR10evaluation.ipynb: loading and presentation of experimental results
requirements.txt: list of Python packages required to reproduce experiments
How to execute experiments?
In the following, we describe step-by-step how to execute all experiments presented in the accompanied article.
As a prerequisite, we assume to have a Linux distribution as operating system and
conda installed on your machine.
- Setup Python environment:
bash projectpath$ conda create --name crowd python=3.9 projectpath$ conda activate crowdFirst, we need to installtorchwith the build (1.13.1). For this purpose, we refer topytorch. An exemplary command for a Linux operating system would be:bash projectpath$ pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116Subsequently, we install the remaining requirements:bash projectpath$ pip install -r requirements.txt - Create and download data sets: Start jupyter-notebook and follow the instructions in the jupyter-notebook file
notebooks/data_set_creation_download.ipynb.bash projectpath$ conda activate crowd projectpath$ jupyter-notebook - Simulate annotators: Start jupyter-notebook and follow the instructions in the jupyter-notebook file
notebooks/annotator_simulation.ipynb.bash projectpath$ conda activate crowd projectpath$ jupyter-notebook - Execute experiment scripts: The files
evaluation/letter.shandevaluation/cifar10.shcorresponds to evaluating MaDL on CIFAR10 and LETTER. Such a file consists of multiple commands executing the fileevaluation/run_experiment.pywith different configurations. For a better understanding of these possible configurations, we refer to the explanations in the fileevaluation/run_experiment.py. Further you need to specify certain paths, e.g., for logging before execution. You can now execute such abashscript via:bash projectpath$ conda activate crowd projectpath$ ./evaluation/crowd_letter.sh projectpath$ ./evaluation/crowd_cifar10.shAlternatively, you can use thesbatchcommand:bash projectpath$ conda activate crowd projectpath$ sbatch ./evaluation/letter.sh projectpath$ sbatch ./evaluation/crowd_cifar10.sh
How to investigate the experimental results?
Once, an experiment is completed, its associated results are saved as a .csv file at the directory specified by
evaluation.run_experiment.RESULT_PATH. For getting a summarized presentation of these results, you need
to start jupyter-notebook and follow the instructions in the jupyter-notebook file
notebooks/evaluation.ipynb.
bash
projectpath$ conda activate crowd
projectpath$ jupyter-notebook
References
The code is majorly based on and adopted from Multi-annotator Deep Learning (MaDL).
Citing
If you use this software in one of your research projects or would like to reference the accompanied article, please use the following:
@inproceedings{
herde2023who,
title={Who knows best? Intelligent Crowdworker Selection via Deep Learning},
author={Marek Herde and Denis Huseljic and Bernhard Sick and Ulrich Bretschneider and Sarah Oeste-Rei{\ss}},
booktitle={Interactive Adaptive Learning Workshop @ ECML/PKDD},
pages={14--18},
year={2023},
url={https://ceur-ws.org/Vol-3470/paper3.pdf},
}
Owner
- Name: Intelligent Embedded Systems
- Login: ies-research
- Kind: organization
- Repositories: 3
- Profile: https://github.com/ies-research
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Herde"
given-names: "Marek"
orcid: "https://orcid.org/0000-0003-4908-122X"
- family-names: "Huseljic"
given-names: "Denis"
orcid: "https://orcid.org/0000-0001-6207-1494"
- family-names: "Sick"
given-names: "Bernhard"
orcid: "https://orcid.org/0000-0001-9467-656X"
- family-names: "Ulrich"
given-names: "Bretschneider"
orcid: "https://orcid.org/0000-0002-2494-0457"
- family-names: "Sarah"
given-names: "Oeste-Rei{\ss}"
orcid: "https://orcid.org/0000-0002-6576-8841"
title: "intelligent-crowdworker-selection"
date-released: 2023-09-21
url: "https://github.com/ies-research/intelligent-crowdworker-selection"
preferred-citation:
type: inproceedings
authors:
- family-names: "Herde"
given-names: "Marek"
orcid: "https://orcid.org/0000-0003-4908-122X"
- family-names: "Huseljic"
given-names: "Denis"
orcid: "https://orcid.org/0000-0001-6207-1494"
- family-names: "Sick"
given-names: "Bernhard"
orcid: "https://orcid.org/0000-0001-9467-656X"
- family-names: "Ulrich"
given-names: "Bretschneider"
orcid: "https://orcid.org/0000-0002-2494-0457"
- family-names: "Sarah"
given-names: "Oeste-Rei{\ss}"
orcid: "https://orcid.org/0000-0002-6576-8841"
url: "https://ceur-ws.org/Vol-3470/paper3.pdf"
booktitle: "International Workshop on Interactive Adaptive Learning @ ECML/PKDD"
pages: "14--18"
title: "Who knows best? Intelligent Crowdworker Selection via Deep Learning"
year: 2023
GitHub Events
Total
Last Year
Dependencies
- Pillow ==10.0.1
- annotlib ==1.0.0
- ipywidgets ==7.6.3
- iteration_utilities ==0.11.0
- jupyterlab ==3.5.0
- matplotlib ==3.6.2
- numpy ==1.23.5
- pandas ==1.5.2
- pytorch-lightning ==1.8.3.post1
- requests ==2.31.0
- sacred ==0.8.2
- scikit-activeml ==0.3.1
- scikit-learn ==1.1.3
- scipy ==1.10.0