3D Rigid Tracking from RGB Images Dataset

The challenge consists in tracking three rigid, poorly textured, heavily occluded objects across sequences of monocular RGB images.

For each object dataset, we provide:

  • a simple CAD model of the object (.obj)
  • several learning videos
  • one or more testing videos
  • the ground-truth pose of the camera with respect to the object reference system for all the learning videos
  • a simple Matlab script Test.m showing how to use the pose ground truth.
All videos were shot with a Canon EOS 5D camera (f = 50 mm).
This is NOT an RGB-D dataset: no depth images are provided for any of the datasets.
The goal is to retrieve the pose of the camera with respect to the object in every frame of the testing videos. For training your algorithms, you may use anything except the testing video frames.
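As a sketch of how such a pose can be used (the actual file format is documented in the READ_ME shipped with each dataset; the 4x4 homogeneous-matrix convention below is an assumption for illustration, not the dataset's documented format), the ground-truth camera-wrt-object pose maps object-frame points into the camera frame:

```python
import numpy as np

# Illustrative assumption: the pose is a 4x4 homogeneous matrix T_co that
# maps object-frame coordinates to camera-frame coordinates. Check the
# READ_ME of each dataset for the actual convention and file format.

def object_to_camera(T_co, X_obj):
    """Map a 3D point from the object frame to the camera frame."""
    X_h = np.append(X_obj, 1.0)   # homogeneous coordinates
    return (T_co @ X_h)[:3]

# Example pose: 90-degree rotation about Z plus a translation along X.
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
t = np.array([0.5, 0.0, 0.0])
T_co = np.eye(4)
T_co[:3, :3] = R
T_co[:3, 3] = t

# Camera-frame coordinates of the object point (1, 0, 0).
print(object_to_camera(T_co, np.array([1.0, 0.0, 0.0])))
```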


This dataset contains 5 learning videos and 2 testing videos showing an electric box being filled with, and emptied of, several objects.


This dataset contains 4 learning videos and 2 testing videos showing a non-textured food can over 2 different backgrounds; the can is occasionally occluded by a user’s hand grasping it and by several distractor objects that create clutter.


This dataset contains 8 learning videos and 1 testing video showing a non-textured office door moving against a cluttered background. In the testing sequence, a user opens the door and passes through it.


Additionally, we provide accurate manual annotations of some parts of the target objects in the testing sequences. These can be used to evaluate 2D detectors for localizing 3D objects undergoing perspective and lighting changes.


Some of the manual annotations were kindly provided by Mahdi Rad of the Graz University of Technology.


All the video sequences (except the BOX negative video) are provided as 1920 x 1080 png images. The file XXX-TestAndInfo contains the testing video sequences of each dataset, the 3D model (.obj), a READ_ME, a Python script for importing the pose data, and a simple Matlab example.
Click on the corresponding link to download the data.
BOX Dataset

Negative video          BOX-Learn-video1       BOX-Learn-video2

BOX-Learn-video3    BOX-Learn-video4       BOX-Learn-video5

CAN Dataset

CAN-Learn-Part1    CAN-Learn-Part2    CAN-TestAndInfo

DOOR Dataset

DOOR-Learn-Part1    DOOR-Learn-Part2    DOOR-TestAndInfo

2D Annotations

2D annotations


The 3D pose estimation can be evaluated by computing the L2 norm of the rotation and translation components of the absolute pose error [2] for all the frames of each video sequence, and evaluating their Cumulative Distribution Function (CDF).

Quantitative results are provided in [1] in the form of the normalized Area Under the Curve (AUC) score for each error. The AUC score is computed by dividing the area under the CDF curve by the maximum error of the graph, which was set to 0.5 for both rotation and translation. See [1] for further details.
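The metric above can be sketched as follows. This is a minimal illustration of a normalized AUC over an empirical CDF, not the authors' evaluation code; the function and parameter names are assumptions, and the per-frame errors are assumed to be computed upstream as in [2].

```python
import numpy as np

# Hedged sketch: per-frame pose errors (rotation or translation component
# of the absolute pose error) are turned into an empirical CDF over error
# thresholds; the normalized AUC is the area under that CDF divided by the
# maximum error (0.5 in [1]).

def normalized_auc(errors, max_error=0.5, n_bins=500):
    """Normalized area under the CDF of per-frame errors."""
    errors = np.asarray(errors, dtype=float)
    thresholds = np.linspace(0.0, max_error, n_bins)
    # Fraction of frames with error at or below each threshold.
    cdf = np.array([np.mean(errors <= th) for th in thresholds])
    # Rectangle-rule integral of the CDF divided by max_error, i.e. the
    # mean CDF value over the threshold range.
    return float(np.mean(cdf))

# A perfect tracker (zero error on every frame) scores 1.0; errors all
# above max_error score 0.0.
print(normalized_auc(np.zeros(100)))  # -> 1.0
```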


You are free to download and use these data for research purposes only.
We appreciate an email message indicating who has copied the data.
By downloading and using the dataset you agree to acknowledge its source (CVLab EPFL) and to cite the paper [1] in case results obtained with these data are published.
For any further details, questions and remarks, you can write to:
alberto [dot] crivellaro [at] epfl.ch


[1] A. Crivellaro, M. Rad, Y. Verdie, K. M. Yi, P. Fua, et al. A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images. International Conference on Computer Vision (ICCV), Santiago, Chile, December 13-16, 2015.

[2] J. Sturm, N. Engelhard, F. Endres, W. Burgard, and D. Cremers. A Benchmark for the Evaluation of RGB-D SLAM Systems. IROS 2012.