Image Parsing with a Wide Range of Classes and Scene-Level Context

Marian George
Department of Computer Science, ETH Zurich, Switzerland

Abstract: This paper presents a nonparametric scene parsing approach that improves the overall accuracy, as well as the coverage of foreground classes in scene images. We first improve the label likelihood estimates at superpixels by merging likelihood scores from different probabilistic classifiers. This boosts the classification performance and enriches the representation of less-represented classes. Our second contribution consists of incorporating semantic context in the parsing process through global label costs. Our method does not rely on image retrieval sets but rather assigns a global likelihood estimate to each label, which is plugged into the overall energy function. We evaluate our system on two large-scale datasets, SIFTflow and LMSun. We achieve state-of-the-art performance on the SIFTflow dataset and near-record results on LMSun
Paper:
Marian George "Image Parsing with a Wide Range of Classes and Scene-Level Context," IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. (PDF)
Complete results on SIFTflow Dataset
Classification models on SIFTflow Dataset
Complete results on LMSun Dataset
Classification models on LMSun Dataset