Sachin Ravi will present his MSE thesis talk on Thursday May 1 at 2:30 PM in Room 301 (note room!). The members of his committee are: Kai Li (advisor) and Moses Charikar. Everyone is invited to attend his talk. His abstract follows below. Nearest Neighbor search is a fundamental problem in computer vision, machine learning, data mining, and other fields. In today's Internet age, as the size of images and other non-text data grows exponentially, it is crucial to scale solutions to larger datasets. Current research involves the approximate nearest neighbor problem where we are willing to sacrice some percentage of accuracy so that the approximate algorithm is one or two orders of magnitude faster than a naive linear scan of the data. Most recently, advances in performance for the K-Nearest Neighbor (K-NN) search problem have been made using a graph-based data structure. To this end, we consider adapting a graph-based index for approximate Epsilon-Nearest Neighbor (Epsilon- NN) search. Through analysis of various data sets, we show that this problem is very relevant to the requirements of current applications. We then introduce a new data structure Epsilon-NN graph for online nearest neighbor search and an algorithm for constructing this index on existing data. Through experimental evaluation on extracted data sets, we show the high performance of this data structure compared to existing methods.
participants (1)
-
Melissa M. Lawson