routeformer

Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction

https://github.com/meakbiyik/routeformer

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.7%) to scientific vocabulary

Keywords

autonomous-driving computer-vision
Last synced: 6 months ago

Repository

Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction

Basic Info
Statistics
  • Stars: 5
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Topics
autonomous-driving computer-vision
Created 11 months ago · Last pushed 8 months ago
Metadata Files
Readme License Citation

README.md

Routeformer: Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction

[![Website](docs/imgs/badges/badge_project_page.svg)](https://meakbiyik.com/routeformer/) [![Paper](docs/imgs/badges/badge_pdf.svg)](https://arxiv.org/abs/2312.08558) [![Dataset](docs/imgs/badges/badge_dataset.svg)](https://huggingface.co/datasets/meakbiyik/GEM_gaze-assisted-ego-motion-in-driving)

This repository hosts the code and supplementary materials for our paper "Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction", accepted at ICLR 2025. It includes the implementation of our multimodal ego-trajectory prediction network, Routeformer, and the GEM dataset.


Overview

Understanding drivers' decision-making is crucial for road safety. While predicting an ego-vehicle’s path is important for driver-assistance systems, most existing methods focus primarily on external factors like other vehicles' motions. Our work addresses this limitation by integrating the driver's attention with the surrounding scene, combining GPS data, environmental context, and driver field-of-view information (first-person video and gaze fixations).

In this repository, you will eventually find:

  • Code: The implementation of Routeformer and associated tools.
  • GEM Dataset: A comprehensive dataset of urban driving scenarios enriched with synchronized driver field-of-view and gaze data. The link to the GEM dataset will be provided once available.

Getting Started

Installation

  1. Clone the repository:

    ```bash
    git clone https://github.com/meakbiyik/routeformer.git
    cd routeformer
    ```

  2. Install the dependencies using Poetry:

    ```bash
    poetry install
    ```

    Note on the av dependency: This project uses the av library for video processing, which depends on ffmpeg. If ffmpeg is already installed on your system, you might encounter issues with the default installation, which ships a prebuilt wheel with its own bundled ffmpeg. In that case, it is recommended to install av with the following command, which builds it from source against your system ffmpeg:

    ```bash
    pip install av --no-binary av
    ```
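
    Whether the source build is needed depends on whether an ffmpeg binary is already on the PATH; a minimal sketch of that decision, assuming (as in the note above) that a system-wide ffmpeg is what triggers the conflict:

    ```python
    import shutil

    def av_install_command() -> str:
        """Pick the av install command based on whether a system ffmpeg exists.

        If ffmpeg is on the PATH, build av from source against it; otherwise
        the prebuilt wheel (with its bundled ffmpeg) is fine.
        """
        if shutil.which("ffmpeg"):
            return "pip install av --no-binary av"
        return "pip install av"

    print(av_install_command())
    ```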

Repository Structure

Here's a brief overview of the most important files and directories:

  • routeformer/models/routeformer.py: This file contains the core implementation of the Routeformer model.
  • experiments/full_comparison.py: This is the main script to run the experiments and reproduce the results from the paper.
  • routeformer/io/dataset.py: Contains the dataset loading and processing logic.
  • docs/: Contains additional documentation, including details on the dataset and data extraction.

Abstract

Understanding drivers' decision-making is crucial for road safety. Although predicting the ego-vehicle's path is valuable for driver-assistance systems, existing methods mainly focus on external factors like other vehicles' motions, often neglecting the driver's attention and intent. To address this gap, we infer the ego-trajectory by integrating the driver's attention and the surrounding scene. We introduce Routeformer, a novel multimodal ego-trajectory prediction network combining GPS data, environmental context, and driver field-of-view—comprising first-person video and gaze fixations. We also present the Path Complexity Index (PCI), a new metric for trajectory complexity that enables a more nuanced evaluation of challenging scenarios. To tackle data scarcity and enhance diversity, we introduce GEM, a comprehensive dataset of urban driving scenarios enriched with synchronized driver field-of-view and gaze data. Extensive evaluations on GEM and DR(eye)VE demonstrate that Routeformer significantly outperforms state-of-the-art methods, achieving notable improvements in prediction accuracy across diverse conditions. Ablation studies reveal that incorporating driver field-of-view data yields significantly better average displacement error, especially in challenging scenarios with high PCI scores, underscoring the importance of modeling driver attention. All data, code, and models will be made publicly available.
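
The ablation results above are reported in average displacement error (ADE), the standard trajectory-prediction metric: the mean Euclidean distance between predicted and ground-truth waypoints over the prediction horizon. A short illustrative sketch (the toy trajectories below are made up for demonstration, not taken from the paper):

```python
import numpy as np

def average_displacement_error(pred: np.ndarray, gt: np.ndarray) -> float:
    """Mean L2 distance between predicted and ground-truth waypoints.

    Both arrays have shape (T, 2): T future timesteps, (x, y) positions.
    """
    return float(np.linalg.norm(pred - gt, axis=-1).mean())

# Toy example: a prediction offset from the ground truth by 1 unit laterally
# at every timestep, so the ADE is exactly 1.0.
gt = np.array([[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]])
pred = gt + np.array([0.0, 1.0])
print(average_displacement_error(pred, gt))  # → 1.0
```

A perfect prediction scores 0; challenging scenarios (e.g. high-PCI routes with sharp turns) tend to inflate this value, which is why the abstract highlights gains there.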

Citation

If you use our work, please consider citing our paper:

```bibtex
@inproceedings{akbiyik2023routeformer,
  title={Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction},
  author={M. Eren Akbiyik and Nedko Savov and Danda Pani Paudel and Nikola Popovic and Christian Vater and Otmar Hilliges and Luc Van Gool and Xi Wang},
  booktitle={International Conference on Learning Representations},
  year={2025}
}
```

Owner

  • Name: M. Eren Akbiyik
  • Login: meakbiyik
  • Kind: user
  • Company: ETH Zurich

Data Science MSc at ETH Zurich

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction"
authors:
  - family-names: "Akbiyik"
    given-names: "M. Eren"
  - family-names: "Savov"
    given-names: "Nedko"
  - family-names: "Paudel"
    given-names: "Danda Pani"
  - family-names: "Popovic"
    given-names: "Nikola"
  - family-names: "Vater"
    given-names: "Christian"
  - family-names: "Hilliges"
    given-names: "Otmar"
  - family-names: "Van Gool"
    given-names: "Luc"
  - family-names: "Wang"
    given-names: "Xi"
date-released: 2025
url: "https://arxiv.org/abs/2312.08558"
preferred-citation:
  type: conference-paper
  title: "Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction"
  authors:
    - family-names: "Akbiyik"
      given-names: "M. Eren"
    - family-names: "Savov"
      given-names: "Nedko"
    - family-names: "Paudel"
      given-names: "Danda Pani"
    - family-names: "Popovic"
      given-names: "Nikola"
    - family-names: "Vater"
      given-names: "Christian"
    - family-names: "Hilliges"
      given-names: "Otmar"
    - family-names: "Van Gool"
      given-names: "Luc"
    - family-names: "Wang"
      given-names: "Xi"
  collection-title: "International Conference on Learning Representations"
  year: 2025

GitHub Events

Total
  • Issues event: 3
  • Watch event: 4
  • Issue comment event: 1
  • Push event: 10
  • Fork event: 1
  • Create event: 2
Last Year
  • Issues event: 3
  • Watch event: 4
  • Issue comment event: 1
  • Push event: 10
  • Fork event: 1
  • Create event: 2

Issues and Pull Requests

Last synced: 11 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • NielsRogge (1)
  • Mollylulu (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Dependencies

poetry.lock pypi
  • 243 dependencies
pyproject.toml pypi