Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.4%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: yqqqq-q
  • Language: Jupyter Notebook
  • Default Branch: yang_branch_1
  • Size: 19.7 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 12 months ago · Last pushed 12 months ago
Metadata Files
Readme Citation

README.md

IMC2024-CookielessBrowsing

This repository contains web crawling and analysis code for the IMC 2024 paper "Browsing without Third-Party Cookies: What Do You See?".

Setup

Only Ubuntu 20.04 LTS is officially supported.

To create the cookie conda environment, execute:

bash conda env create -f environment.yml

Activate the environment with: bash conda activate cookie

Usage

To start a crawl, execute: bash python3 sbatch_main.py --jobs <number of slurm jobs> If you do not have Slurm, you can start a single job using main.py.

After crawling, use extract_differences.py to compute differences in extracted features.

To analyze the differences, use classification_algo.ipynb.

Owner

  • Login: yqqqq-q
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
title: Cookie-Classify
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Maxwell Lin
    affiliation: Duke University
  - given-names: Karen Wang
    affiliation: Duke University
repository-code: 'https://github.com/maxwellmlin/cookie-classify'

GitHub Events

Total
  • Public event: 1
Last Year
  • Public event: 1

Dependencies

environment.yml pypi
  • Pillow *
  • adjustText *
  • beautifulsoup4 *
  • filelock *
  • ipykernel *
  • lxml *
  • matplotlib *
  • mypy *
  • pandas *
  • pyyaml *
  • scipy *
  • selenium ==4.2.0
  • selenium-wire *
  • tldextract *
  • types-requests *
  • validators *