Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: insdet
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 22.7 MB
Statistics
  • Stars: 14
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 3 years ago · Last pushed about 1 year ago
Metadata Files
Readme Citation

README.md

A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture

[**Qianqian Shen**](https://shenqq377.github.io/)1 · [**Yunhan Zhao**](https://yunhan-zhao.github.io/)2 · [**Nahyun Kwon**](https://nahyunkwon.github.io/)3 · [**Jeeeun Kim**](https://github.com/qubick)3 · [**Yanan Li**](https://yananlix1.github.io/)1 · [**Shu Kong**](https://aimerykong.github.io/)3,4,5 1Zhejiang Lab 2UC Irvine 3Texas A&M University 4University of Macau 5Institute of Collaborative Paper PDF Project Page Benchmark

The paper has been accepted by NeurIPS (Datasets and Benchmarks) 2023.

InsDet

Dataset

The InsDet datase is a high-resolution real-world dataset for Instance Detection with Multi-view Instance Capture.
We provide an InsDet-mini for demo and visualization, and the full dataset InsDet-FULL.

The full dataset contains 100 objects with multi-view profile images in 24 rotation positions (per 15°), 160 testing scene images with high-resolution, and 200 pure background images. The mini version contains 5 objects, 10 testing scene images, and 10 pure background images.

Details

The Objects contains: - 000avedashampoo - images: raw RGB images (e.g., "images/001.jpg") - masks: segmentation masks generated by GrabCut Annotation Toolbox (e.g., "masks/001.png") -

$\vdots$

- 099mugblue

vis-objects

Tip: The first three digits specify the instance id.

The Scenes contains: - easy - leisure_zone - raw RGB images with 6144×8192 pixels (e.g. “office001/rgb_000.jpg”) - bounding box annotation for objects in test scenes generated by labelImg toolbox and using PascalVOC format (e.g. “office_001/rgb_000.xml”) - meeting_room - office_002 - pantry_room_002 - sink - hard - office_001 - pantry_room_001

vis-scenes

Tip: Each bounding box is specified by [xmin, ymin, xmax, ymax].

The Background contains 200 pure background images that do not include any instances from Objects folder.

vis-background

Code

The project is built on detectron2, segment-anything, and DINOv2.
<!-- Detectron2 provides end-to-end detectors implementation and metric evaluation. Segment-anything is an off-the-shelf class-agnostic segmentation model that we used to produce instance proposals. DINOv2 is a self-supervised vision foundation model that we used to extract feature representation. -->

Demo

The Jupyter notebooks files demonstrate our non-learned method using SAM and DINOv2. We choose light pretrained models of SAM (vitl) and DINOv2 (dinov2vits14) for efficiency. <!-- | Pretrained Model | # of params | AP | AP50 | AP75 | | :--- | :---: | :---:| :---:| :---:| | ViT-S/14 distilled | 21M |41.61 |49.10 |45.95 | |ViT-B/14 distilled | 86M |41.89 |49.39 |46.30 | |ViT-L/14 distilled | 300M |43.33 |50.80 |47.84 | |ViT-g/14 | 1,100M |44.65 |53.47 |49.11 | -->

Citation

If you find our project useful, please consider citing: bibtex @inproceedings{shen2023high, title={A high-resolution dataset for instance detection with multi-view object capture}, author={Shen, Qianqian and Zhao, Yunhan and Kwon, Nahyun and Kim, Jeeeun and Li, Yanan and Kong, Shu}, booktitle={NeurIPS Datasets & Benchmark Track}, year={2023}

Owner

  • Name: Instance Detection
  • Login: insdet
  • Kind: user

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: InsDet
message: >-
  If you use this dataset, please cite it using the metadata
  from this file.
type: dataset
authors:
  - given-names: Qianqian
    family-names: Shen
    affiliation: Zhejiang Lab
  - given-names: Yunhan
    family-names: Zhao
    affiliation: UC-Irvine
  - given-names: Nahyun
    family-names: Kwon
    affiliation: Texas A&M University
  - given-names: Jeeeun
    family-names: Kim
    affiliation: Texas A&M University
  - given-names: Yanan
    family-names: Li
    affiliation: Zhejiang Lab
  - given-names: Shu
    family-names: Kong
    email: instance.detection@gmail.com
    affiliation: Texas A&M University
identifiers:
  - type: url
    value: 'https://github.com/insdet/instance-detection'
repository-code: 'https://github.com/insdet/instance-detection'
abstract: >-
  A high-resolution real-world dataset and an non-learned
  method for instance detection
keywords:
  - Instance Detection
license: MIT

GitHub Events

Total
  • Issues event: 3
  • Watch event: 12
  • Issue comment event: 3
  • Push event: 13
Last Year
  • Issues event: 3
  • Watch event: 12
  • Issue comment event: 3
  • Push event: 13