Hei Law will present his FPO "Learning to Detect Objects by Grouping" on Tuesday, August 16, 2022 at at 10:00 AM in Friend 202 and Zoom.

Location: Zoom link: https://princeton.zoom.us/my/heilaw?pwd=a3U4ZmZBZXdTbWhqOFBHbmN1c2pEUT09

The members of Hei’s committee are as follows:

Examiners: Jia Deng (Adviser), Adam Finkelstein, Felix Heide

Readers: Szymon Rusinkiewicz, Olga Russakovsky

A copy of his thesis is available upon request. Please email gradinfo@cs.princeton.edu if you would like a copy of the thesis.

Everyone is invited to attend his talk.

Abstract follows below:

Extracting high level semantic information from visual input is an ability that human rely on to perform daily tasks. This often requires identifying and locating relevant objects before any higher level information is inferred. This step is known as object detection which is a fundamental task in computer vision. It has numerous real world applications and serves as an upstream task for many computer vision tasks. This dissertation makes three contributions to object detection.

First we propose CornerNet, a new approach to object detection. CornerNet reformulates object detection as detecting and grouping pairs of keypoints. More specifically, CornerNet detects corners of the bounding boxes and predicts similar embedding vectors for corners from the same objects. CornerNet also introduces corner pooling, a new pooling layer that helps localize corners. Experiments on COCO show that CornerNet achieves an AP of 42.2%, outperforming all one-stage detectors.

Second we propose CornerNet-Lite, a collection of two efficient variants, CornerNet- Saccade and CornerNet-Squeeze, of CornerNet. CornerNet-Lite explores two orthog- onal directions: processing fewer pixels and reducing the processing cost of each pixel. Inspired by saccade in human vision system, CornerNet-Saccade processes fewer pix- els by estimating object locations on a downsampled image and processes a subet of regions in high resolution. It is 6x faster than CornerNet and achieves a better AP. CornerNet-Squeeze reduces the cost by introducing a new compact hourglass backbone network. It is faster and more accurate than YOLOv3.

Third we propose “Synthetic Opitmized Layout with Instance Detection (SOLID)”, a new pretraining approach for object detection. SOLID consists of two main compo- nents. The first component generates synthetic images from a collection of unlabelled 3D models with optimized scene arrangement. The second component is an instance detection task where given a query image depicting a 3D model, a detector is trained to locate the instances of the same object in a target image. Experiments show that synthetic data can be effective for pretraining an object detector.

From grouping corners in CornerNet and CornerNet-Lite to grouping instances in SOLID, this dissertation presents a new approach to object detection – learning to detect objects by grouping.

Louis Riehl
Graduate Administrator
Computer Science Department, CS213
Princeton University
(609) 258-8014