Abstract
We address changes in an object's appearance caused by a change in camera orientation (that is, a change of viewpoint) using three-dimensional invariant representations. The approach is based on a Euclidean-invariant representation of three-dimensional objects, in which the metric information is kept either in the Gramian of four basis points together with the affine coordinates of the remaining points, or in the generalized inverse Gramian of all the object points. We describe functions that operate on two-dimensional images of three-dimensional objects and that are invariant under changes of viewpoint. These functions can be used to improve and extend several existing recognition approaches, including alignment, linear combination, and indexing. The invariant representation can be computed from a sequence of images with a linear algorithm.
This paper describes research done at IBM T.J. Watson Res. Ctr., Hawthorne, NY.
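The first representation named in the abstract (the Gramian of four basis points plus the affine coordinates of the remaining points) can be illustrated with a small numerical check. The sketch below is not taken from the paper: the function names, the sample points, and the chosen rotation and translation are all assumptions made for illustration. It only shows that both quantities are unchanged when the 3D object is rotated and translated. It does not reproduce the paper's image-based invariant functions or its linear algorithm.

```python
import math

def sub(a, b):
    return [x - y for x, y in zip(a, b)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def det3(m):
    # Determinant of a 3x3 matrix given as a list of rows.
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def gramian(basis):
    # basis = [p0, p1, p2, p3]; Gram matrix of the vectors p_i - p0.
    p0 = basis[0]
    v = [sub(p, p0) for p in basis[1:]]
    return [[dot(vi, vj) for vj in v] for vi in v]

def affine_coords(basis, p):
    # Solve p - p0 = a1*v1 + a2*v2 + a3*v3 by Cramer's rule.
    p0 = basis[0]
    v = [sub(q, p0) for q in basis[1:]]
    w = sub(p, p0)
    d = det3(v)  # nonzero when the four basis points are not coplanar
    return [det3([w if k == i else v[k] for k in range(3)]) / d
            for i in range(3)]

def rigid(p, t=(5.0, -2.0, 1.0)):
    # Hypothetical viewpoint change: rotate about z, then about x, then translate.
    c, s = math.cos(0.7), math.sin(0.7)
    x, y, z = c * p[0] - s * p[1], s * p[0] + c * p[1], p[2]
    c, s = math.cos(-0.4), math.sin(-0.4)
    x, y, z = x, c * y - s * z, s * y + c * z
    return [x + t[0], y + t[1], z + t[2]]

# Made-up object: four non-coplanar basis points and one extra point.
basis = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
extra = [1.0, 1.0, 1.5]

G_before = gramian(basis)
coords_before = affine_coords(basis, extra)

moved_basis = [rigid(p) for p in basis]
G_after = gramian(moved_basis)
coords_after = affine_coords(moved_basis, rigid(extra))
```

Comparing `G_before` with `G_after` and `coords_before` with `coords_after` should show agreement up to floating-point error. The Gramian carries the metric (Euclidean) information, while the affine coordinates alone would also survive any non-rigid affine distortion, which is why the representation pairs the two.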
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Weinshall, D. (1994). Model-based invariant functions and their use for recognition. In: Mundy, J.L., Zisserman, A., Forsyth, D. (eds) Applications of Invariance in Computer Vision. AICV 1993. Lecture Notes in Computer Science, vol 825. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58240-1_19
DOI: https://doi.org/10.1007/3-540-58240-1_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58240-3
Online ISBN: 978-3-540-48583-4
eBook Packages: Springer Book Archive