Maciej Halber will present his FPO, "RGBD Pipeline for Indoor Scene Reconstruction and Understanding," on Monday, July 15th, 2019, at 1pm in CS 402.

The members of his committee are as follows: Thomas Funkhouser (adviser); Readers: Szymon Rusinkiewicz and Matthias Niessner (Technical University of Munich); Examiners: Thomas Funkhouser, Olga Russakovsky, and Adam Finkelstein.

A copy of his thesis is available upon request.

Everyone is invited to attend his talk. The abstract follows:

In this work, we consider the problem of reconstructing a 3D model from a sequence of color and depth frames. Generating such a model has many important applications, ranging from the entertainment industry to real estate. However, transforming the RGBD frames into high-quality 3D models is a challenging problem, especially if additional semantic information is required. In this document, we introduce three projects, which implement various stages of a robust RGBD processing pipeline.

First, we consider the challenges arising during the RGBD data capture process. While depth cameras provide dense, per-pixel depth measurements, there is a non-trivial error associated with the resulting data. We discuss the depth generation problem and propose an error reduction technique based on estimating an image-space undistortion field. We describe how the data required to generate such an undistortion field is captured, and we show how correcting the depth measurements improves the reconstruction quality.
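As a rough illustration of what applying an image-space undistortion field could look like, the sketch below scales each valid depth reading by a per-pixel correction factor. The function name `apply_undistortion`, the multiplicative form of the field, and the use of 0.0 as the "no reading" value are all assumptions made for this example; the thesis may parameterize and estimate the field differently.

```python
def apply_undistortion(depth, field, invalid=0.0):
    """Scale each valid depth reading by its per-pixel correction factor.

    depth   -- 2D list of depth values (rows of pixels)
    field   -- 2D list of multiplicative factors, same shape as depth
               (assumed form of the undistortion field, for illustration)
    invalid -- value marking pixels with no depth reading (assumed 0.0)
    """
    same_rows = len(depth) == len(field)
    if not same_rows or any(len(d) != len(f) for d, f in zip(depth, field)):
        raise ValueError("depth image and field must have the same shape")

    corrected = []
    for depth_row, field_row in zip(depth, field):
        corrected.append([
            d * f if d != invalid else invalid  # leave missing pixels untouched
            for d, f in zip(depth_row, field_row)
        ])
    return corrected


# Toy 2x2 depth image with one missing pixel (bottom-left).
depth = [[1.0, 2.0], [0.0, 4.0]]
field = [[1.0, 1.5], [1.25, 0.5]]
corrected = apply_undistortion(depth, field)
```

Missing pixels are passed through unchanged so that a correction factor never turns "no reading" into a spurious depth value.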

Second, we address the problem of registering RGBD frames over a long video sequence into a globally consistent 3D model. We propose a “fine-to-coarse” global registration algorithm that leverages robust registrations at finer scales to seed the detection and enforcement of geometrical constraints, modeled as planar structures, at coarser scales. To test global registration algorithms, we provide a benchmark with 10,401 manually clicked point correspondences in 25 scenes from the SUN3D dataset. We find that our fine-to-coarse algorithm registers long RGBD sequences better than previous methods.
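One plausible way to score a registration against point correspondences is the root-mean-square distance between corresponding points once both have been mapped into the common world frame. The sketch below assumes that form; the function name `correspondence_rmse` and the choice of RMSE as the metric are assumptions for illustration, not necessarily the benchmark's exact protocol.

```python
import math


def correspondence_rmse(pairs):
    """Root-mean-square distance over pairs of corresponding 3D points.

    pairs -- iterable of (p, q), where p and q are (x, y, z) tuples that
             should coincide if the registration were perfect.
    """
    pairs = list(pairs)
    if not pairs:
        raise ValueError("at least one correspondence is required")

    total_sq = 0.0
    for p, q in pairs:
        # Squared Euclidean distance between the two world-space points.
        total_sq += sum((a - b) ** 2 for a, b in zip(p, q))
    return math.sqrt(total_sq / len(pairs))


# Two toy correspondences, each 5 units apart after registration.
pairs = [
    ((0.0, 0.0, 0.0), (3.0, 4.0, 0.0)),
    ((1.0, 1.0, 1.0), (1.0, 4.0, 5.0)),
]
rmse = correspondence_rmse(pairs)
```

A lower value means the corresponding points land closer together, which is the sense in which one registration can be said to be better than another under this assumed metric.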

Last, we show how repeated scans of the same space can be used to establish associations between the different observations. Specifically, we consider a situation where 3D scans are acquired repeatedly at sparse time intervals. We develop an algorithm that analyzes these “rescans” and builds a temporal model of a scene with semantic instance information. The proposed algorithm operates inductively by using a temporal model resulting from past observations to infer instance segmentation of a new scan. The temporal model is continuously updated to reflect the changes that occur in the scene over time, providing object associations across time. The algorithm outperforms alternative approaches based on state-of-the-art networks for semantic instance segmentation.