Computer Vision
Paper: IEEE CVPR (2005) "Tracking multiple objects through occlusions"
Huang, Y and Essa, I. (2005) “Tracking multiple objects through occlusions”, In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005 (CVPR 2005), Volume: 2 page(s): 1051 – 1058 vol. 2, ISSN: 1063-6919, ISBN: 0-7695-2372-2, INSPEC Accession Number:8633324 DOI: 10.1109/CVPR.2005.350, [IEEEXplore#] 20-25 June 2005 ABSTRACT We present an approach for tracking […]
Talk at USC's IRIS (2004): "Temporal Reasoning from Video to Temporal Synthesis of Video"
Irfan Essa (2004), “Temporal Reasoning from Video to Temporal Synthesis of Video” Talk at USC’s IRIS-Vision Seminars (Fall 2004). Temporal Reasoning from Video to Temporal Synthesis of Video Abstract In this talk, I will present some ongoing work on extracting spatio-temporal cues from video for both synthesis of novel video sequences, and recognition of complex […]
Paper: IEEE CVPR (2004) "Asymmetrically boosted HMM for speech reading"
Pei Yin Essa, I. Rehg, J.M. (2004) “Asymmetrically boosted HMM for speech reading,”, In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (CVPR 2004). Publication Date: 27 June-2 July 2004, Volume: 2, On page(s): II-755 – II-761 Vol.2 ISSN: 1063-6919, ISBN: 0-7695-2158-, INSPEC Accession Number:8161546, Digital Object Identifier: 10.1109/CVPR.2004.1315240 […]
Paper: IEEE CVPR (2004) "Propagation networks for recognition of partially ordered sequential action"
Yifan Shi, Yan Huang, Minnen, D., Bobick, A., Essa, I. (2004), “Propagation networks for recognition of partially ordered sequential action” In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (CVPR 2004). Volume: 2, page(s): II-862 – II-869 Vol.2, ISSN: 1063-6919, ISBN: 0-7695-2158-4, INSPEC Accession Number:8161557, Digital Object Identifier: […]
Thesis: Gabriel Brostow's PhD (2004): "Novel Skeletal Representation for Articulated Creatures"
We define a Spine as a branching axial structure representing the shape and topology of a 3D objects limbs, and capturing the limbs correspondence and motion over time. … In general, our approach combines the objectives of generalized cylinders, 3D scanning, and markerless motion capture to generate baseline models from real puppets, animals, and human subjects.
Paper: ICCV (2003) "Spectral partitioning for structure from motion"
Steedly, D., Essa, I., Dellaert, F. (2003), “Spectral partitioning for structure from motion”, In Proceedings. Ninth IEEE International Conference on Computer Vision, 2003, 13-16 Oct. 2003, page(s): 996 – 1003 vol.2, Nice, France, ISBN: 0-7695-1950-4, INSPEC Accession Number:7971018, Digital Object Identifier: 10.1109/ICCV.2003.1238457, [IEEEXplore#] Abstract We propose a spectral partitioning approach for large-scale optimization problems, specifically […]
Papers: ACM SIGGRAPH (2003) "Graphcut textures"
Vivek Kwatra, Arno Schödl, Irfan Essa, Greg Turk, Aaron Bobick (2003), “Graphcut textures: image and video synthesis using graph cuts” In ACM Transactions on Graphics (TOG), Volume 22 , Issue 3, Proceedings of ACM SIGGRAPH 2003, Pages: 277 – 286, July 2003, ISSN:0730-0301. (DOI|Paper| SIGGRAPH Video (160 MB, 50 MB) | Video Results 87 MB […]
Funding: NSF/ITR (2002) "Analysis of Complex Audio-Visual Events Using Spatially Distributed Sensors"
Award#0205507 – ITR: Analysis of Complex Audio-Visual Events Using Spatially Distributed Sensors ABSTRACT We propose to develop a comprehensive framework for the joint analysis of audio-visual signals obtained from spatially distributed microphones and cameras. We desire solutions to the audio-visual sensing problem that will scale to an arbitrary number of cameras and microphones and can […]
Paper AAAI (2002): "Recognizing Multitasked Activities from Video using Stochastic Context-Free Grammar"
D. Moore and I. Essa (2002). “Recognizing multitasked activities from video using stochastic context-free grammar”, in Proceedings of AAAI 2002. [PDF | Project Site] Abstract In this paper, we present techniques for recognizing com- plex, multitasked activities from video. Visual information like image features and motion appearances, combined with domain-specific information, like object context is […]