Image understanding is a high-level vision task that deals with interpreting images and associating meaning with them. This includes, but is not limited to, determining which real-world objects occur in the image (object recognition), where these objects occur (localization), and the spatio-temporal relationships between them. Semantic image understanding has been an active area of research in the vision community, resulting in a number of paradigms:
Though significant progress has been made in the field, a unified framework is still lacking. Numerous papers are published each year developing new algorithms for domain-dependent image understanding. Algorithms that work well in one domain often perform poorly in others. For instance, medical image understanding algorithms seldom perform well on aerial image analysis tasks. Adapting algorithms across domains usually requires a nontrivial amount of work, much of it ad hoc. This lack of generalization is what I intend to address in my thesis. I investigate whether generalization is possible at all and, if so, under what circumstances. Initial experiments have evaluated generalization capabilities across similar domains by adapting concepts from transfer learning [2]. Another aspect of this research has been to understand which kinds of image features are the most domain independent. The work in this thesis provides a first step towards a unified image understanding framework.

[1] Z. W. Tu, X. R. Chen, A. L. Yuille, and S. C. Zhu, "Image parsing: unifying segmentation, detection and recognition", Int'l J. of Computer Vision, 63(2), 113-140, 2005.
[2] M. T. Rosenstein, Z. Marx, L. P. Kaelbling, and T. G. Dietterich, "To transfer or not to transfer", NIPS 2005 Workshop on Transfer Learning, Whistler, BC, 2005.

Last modified 11 December 2007 at 2:07 am by Soumya