imc2024-cookielessbrowsing
Browsing without Third-Party Cookies: What Do You See?
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (5.4%) to scientific vocabulary
Repository
Browsing without Third-Party Cookies: What Do You See?
Basic Info
Statistics
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
IMC2024-CookielessBrowsing
This repository contains web crawling and analysis code for the IMC 2024 paper "Browsing without Third-Party Cookies: What Do You See?".
Setup
Only Ubuntu 20.04 LTS is officially supported.
To create the cookie conda environment, execute:
bash
conda env create -f environment.yml
Activate the environment with:
bash
conda activate cookie
Usage
To start a crawl, execute:
bash
python3 sbatch_main.py --jobs <number of slurm jobs>
If you do not have Slurm, you can start a single job using main.py.
After crawling, use extract_differences.py to compute differences in extracted features.
To analyze the differences, use classification_algo.ipynb.
Owner
- Name: Maxwell Lin
- Login: maxwellmlin
- Kind: user
- Location: Durham, North Carolina
- Website: maxwellmlin.com
- Repositories: 13
- Profile: https://github.com/maxwellmlin
Math & CS @ Duke
Citation (CITATION.cff)
cff-version: 1.2.0
title: Cookie-Classify
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
authors:
- given-names: Maxwell Lin
affiliation: Duke University
- given-names: Karen Wang
affiliation: Duke University
repository-code: 'https://github.com/maxwellmlin/cookie-classify'
GitHub Events
Total
- Watch event: 2
Last Year
- Watch event: 2
Dependencies
- Pillow *
- adjustText *
- beautifulsoup4 *
- filelock *
- ipykernel *
- lxml *
- matplotlib *
- mypy *
- pandas *
- pyyaml *
- scipy *
- selenium ==4.2.0
- selenium-wire *
- tldextract *
- types-requests *
- validators *