Min Sun will present his research seminar/general exam on Wednesday
April 22 at 3:30PM in Room 402.
The members of his committee are; Fei-Fei Li (advisor), David
Blei, Adam Finkelstein. Everyone is
invited to attend his talk and those faculty wishing to remain for
the oral exam following are welcome
to do so. His abstract and reading list follow below.
-------------------------------------------------
We propose a novel
probabilistic framework for learning visual models of 3D object categories
by combining appearance information and geometric constraints. Objects
are represented as a coherent ensemble of parts that are consistent under
3D viewpoint transformations. Each part is a collection of salient image
features. A generative framework is used for learning a model that captures the
relative position of parts within each of the discretized
viewpoints. Contrary to most of the existing mixture of viewpoints models,
our model establishes explicit correspondences of parts across different
viewpoints of the object class. Given a new image, detection and
classification are achieved by determining the position and viewpoint of the
model that maximize recognition scores of the candidate objects. Our approach
is among the first to propose a generative probabilistic framework for 3D object
categorization. We test our algorithm on the detection task and the
viewpoint classification task by using “car” category from both the Savarese et
al. 2007 and PASCAL VOC 2006 datasets. We show promising results in both
the detection and viewpoint classification tasks on these two challenging
datasets.
=======
Reading list:
• Textbooks
[1] C. M. Bishop. Pattern Recognition and Machine
Learning (Information Science and Statistics).
Springer, August 2006. Chapter 1, 2, 8, and 10.
[2] D. A. Forsyth and J. Ponce. Computer Vision: A
Modern
Approach. Prentice Hall, August 2002.
• Papers
[1] D. M. Blei. Variational methods for the
dirichlet process.
In In Proceedings of the 21st International
Conference
on Machine Learning, 2004.
[2] P. F. Felzenszwalb and D. P. Huttenlocher.
Pictorial
structures for object recognition. IJCV, 61:2005,
2005.
[3] D. Hoeim, C. Rother, and J. Winn. 3d layoutcrf
for
multi-view object class recognition and segmentation.
2007. In Proc. In IEEE Conference on Computer
Vision
and Pattern Recognition.
[4] A. Kushal, C. Schmid, , and J. Ponce. Flexible
object
models for category-level 3d object recognition.
2007. Flexible object models for category-level 3d
object
recognition. In Proc. In IEEE Conf. on Comp. Vis.
and Patt. Recogn.
[5] B. Leibe, A. Leonardis, and B. Schiele.
Combined object
categorization and segmentation with an implicit
shape model. In In ECCV workshop on statistical
learning in computer vision, pages 17–32, 2004.
[6] J. Liebelt, C. Schmid, and K. Schertler.
Viewpointindependent
object class detection using 3d feature
maps. Computer Vision and Pattern Recognition,
2008.
CVPR 2008. IEEE Conference on, pages 1–8, June
2008.
[7] S. Savarese and L. Fei-Fei. 3d generic object
categorization,
localization and pose estimation. 2007. IEEE
Intern. Conf. in Computer Vision (ICCV).
[8] E. B. Sudderth, A. Torralba, W. T. Freeman,
and A. S.
Willsky. Describing visual scenes using
transformed
dirichlet processes. In In NIPS, pages 1297–1304.
MIT
Press, 2005.
[9] A. Thomas, V. Ferrari, B. Leibe, T.
Tuytelaars,
B. Schiele, and L. V. Goo. Towards multi-view
object
class detection. In Proc. In IEEE Conference on
Computer
Vision and Pattern Recognition, volume 2, pages
1589-1596.
[10] P. Yan, D. Khan, and M. Shah. 3d model based
object
class detection in an arbitrary view. ICCV, 2007.