3D Model-Assisted Learning for Object Detection and Pose Estimation

Georgios Georgakis

Files

Georgakis_gmu_0883E_12229.pdf (39.26 MB)

Date

2020

Authors

Georgios Georgakis

Abstract

The supervised learning paradigm for training Deep Convolutional Neural Networks (DCNNs) rests on the availability of large amounts of manually annotated images, which are necessary for training deep models with millions of parameters. In this thesis, we present novel techniques for reducing the manual annotation required, by generating large object instance datasets through compositing textured 3D models onto commonly encountered background scenes to synthesize training images. Models trained on the generated data augmented with real-world annotations outperform models trained only on real data. Non-textured 3D models are subsequently used for keypoint learning and matching, and for 3D object pose estimation from RGB images. The proposed methods show promising results with regard to generalization on new and standard benchmark datasets. In the final part of the thesis, we investigate how these perception capabilities can be leveraged and encoded in a spatial map to enable an agent to successfully navigate towards a target object.

URI

https://hdl.handle.net/1920/12381

Collections

College of Engineering and Computing
