Recent Releases of perspective-occupancy-map

perspective-occupancy-map - Initial release v1.0

📦 Release Notes — POMv2 v1.0.0

🚀 Initial Release Highlights

This is the first public release of POMv2, a modular deep learning pipeline for semantic top-view segmentation from monocular perspective images.


✨ Features

  • Two-Branch Architecture:

    • Semantic POM Head using DeepLabV3 to predict object footprints in perspective view.
    • PON Encoder to extract spatial features projected into BEV space.
  • Learned Perspective-to-BEV Projection:

    • Uses camera geometry to map perspective logits into a top-down grid.
    • Fused with learned features and decoded using a UNet.
  • Multi-Level Supervision:

    • Joint loss on both Semantic POM and BEV segmentation for stronger training signals.
  • Temporal Stability via POM:

    • Perspective occupancy maps offer stable spatial cues across frames.
    • Helps improve generalization and model robustness.
  • Cross-Dataset Generalization:

    • Supports pretraining the perspective segmentation branch on large datasets (e.g., Cityscapes).
  • Experiment Management:

    • YAML-based configuration system.
    • Integrated with Weights & Biases (wandb) for logging, checkpointing, and visualization.

🛠 Project Structure

  • Modular codebase with models/, datasets/, utils/, and config-driven training (train.py) and evaluation (eval.py) scripts.

📄 License

Released under the MIT License.

- Jupyter Notebook
Published by shantanusingh16 7 months ago