routeformer
Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.7%) to scientific vocabulary
Keywords
Repository
Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction
Basic Info
- Host: GitHub
- Owner: meakbiyik
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://meakbiyik.com/routeformer
- Size: 66.5 MB
Statistics
- Stars: 5
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Routeformer: Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction
This repository will host the code and supplementary materials for our paper "Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction" accepted at ICLR 2025. It includes the implementation of our novel multimodal ego-trajectory prediction network, Routeformer, and the GEM dataset.
Overview
Understanding drivers' decision-making is crucial for road safety. While predicting an ego-vehicle’s path is important for driver-assistance systems, most existing methods focus primarily on external factors like other vehicles' motions. Our work addresses this limitation by integrating the driver's attention with the surrounding scene, combining GPS data, environmental context, and driver field-of-view information (first-person video and gaze fixations).
In this repository, you will eventually find:
- Code: The implementation of Routeformer and associated tools.
- GEM Dataset: A comprehensive dataset of urban driving scenarios enriched with synchronized driver field-of-view and gaze data. The link to the GEM dataset will be provided once available.
Getting Started
Installation
Clone the repository:
bash git clone https://github.com/meakbiyik/routeformer.git cd routeformerInstall the dependencies using Poetry:
bash poetry installNote on the
avdependency: This project uses theavlibrary for video processing, which hasffmpegas a dependency. If you haveffmpegalready installed on your system, you might encounter issues with the default installation. In that case, it is recommended to installavwith the following command to avoid building it from source:bash pip install av --no-binary av
Repository Structure
Here's a brief overview of the most important files and directories:
routeformer/models/routeformer.py: This file contains the core implementation of the Routeformer model.experiments/full_comparison.py: This is the main script to run the experiments and reproduce the results from the paper.routeformer/io/dataset.py: Contains the dataset loading and processing logic.docs/: Contains additional documentation, including details on the dataset and data extraction.
Abstract
Understanding drivers' decision-making is crucial for road safety. Although predicting the ego-vehicle's path is valuable for driver-assistance systems, existing methods mainly focus on external factors like other vehicles' motions, often neglecting the driver's attention and intent. To address this gap, we infer the ego-trajectory by integrating the driver's attention and the surrounding scene. We introduce Routeformer, a novel multimodal ego-trajectory prediction network combining GPS data, environmental context, and driver field-of-view—comprising first-person video and gaze fixations. We also present the Path Complexity Index (PCI), a new metric for trajectory complexity that enables a more nuanced evaluation of challenging scenarios. To tackle data scarcity and enhance diversity, we introduce GEM, a comprehensive dataset of urban driving scenarios enriched with synchronized driver field-of-view and gaze data. Extensive evaluations on GEM and DR(eye)VE demonstrate that Routeformer significantly outperforms state-of-the-art methods, achieving notable improvements in prediction accuracy across diverse conditions. Ablation studies reveal that incorporating driver field-of-view data yields significantly better average displacement error, especially in challenging scenarios with high PCI scores, underscoring the importance of modeling driver attention. All data, code, and models will be made publicly available.
Citation
If you use our work, please consider citing our paper:
```bibtex @inproceedings{akbiyik2023routeformer, title={Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction}, author={M. Eren Akbiyik, Nedko Savov, Danda Pani Paudel, Nikola Popovic, Christian Vater, Otmar Hilliges, Luc Van Gool, Xi Wang}, booktitle={International Conference on Learning Representations}, year={2025} }
Owner
- Name: M. Eren Akbiyik
- Login: meakbiyik
- Kind: user
- Company: ETH Zurich
- Repositories: 10
- Profile: https://github.com/meakbiyik
Data Science MSc at ETH Zurich
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction"
authors:
- family-names: "Akbiyik"
given-names: "M. Eren"
- family-names: "Savov"
given-names: "Nedko"
- family-names: "Paudel"
given-names: "Danda Pani"
- family-names: "Popovic"
given-names: "Nikola"
- family-names: "Vater"
given-names: "Christian"
- family-names: "Hilliges"
given-names: "Otmar"
- family-names: "Van Gool"
given-names: "Luc"
- family-names: "Wang"
given-names: "Xi"
date-released: 2025
url: "https://arxiv.org/abs/2312.08558"
preferred-citation:
type: conference-paper
title: "Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction"
authors:
- family-names: "Akbiyik"
given-names: "M. Eren"
- family-names: "Savov"
given-names: "Nedko"
- family-names: "Paudel"
given-names: "Danda Pani"
- family-names: "Popovic"
given-names: "Nikola"
- family-names: "Vater"
given-names: "Christian"
- family-names: "Hilliges"
given-names: "Otmar"
- family-names: "Van Gool"
given-names: "Luc"
- family-names: "Wang"
given-names: "Xi"
collection-title: "International Conference on Learning Representations"
year: 2025
GitHub Events
Total
- Issues event: 3
- Watch event: 4
- Issue comment event: 1
- Push event: 10
- Fork event: 1
- Create event: 2
Last Year
- Issues event: 3
- Watch event: 4
- Issue comment event: 1
- Push event: 10
- Fork event: 1
- Create event: 2
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- NielsRogge (1)
- Mollylulu (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- 243 dependencies