On the equations relating a three-dimensional object and its two-dimensional images

Article ID	Journal	Published Year	Pages	File Type
9506059	Advances in Applied Mathematics	2005	27 Pages	PDF

Abstract

Our goal is to interpret computer vision in terms of the vanishing or non-vanishing of certain sets of polynomials. To this end, we establish the algebraic geometric foundations of computer vision using transformation groups. Let Î± be a three-dimensional object in whose structure m points and n lines play an important role. Let Î² be a possible two-dimensional image in whose structure are m points and n lines which may be an image of those chosen in Î±. We show that there is a set of polynomials Fi(Î±,Î²) which must vanish when Î² is a true image of Î±. The Fi are the maximal subdeterminants of a (3m+2n)Ã(12+m) matrix and their vanishing says that the rank of this matrix is at most 11+m. Let Z={(Î±,Î²):Fi(Î±,Î²)=0Â for allÂ i} and let Î={(Î±,Î²):Î²Â is a true image ofÂ Î±}. We show that the Zariski-closure of Î is an irreducible component of Z and give a precise description of the other components. We construct a set Uâ², defined by the non-vanishing of certain polynomials, so that Uâ²â©Î=Uâ²â©Z. We conclude with two applications to object recovery: when (m,n)=(3,3), we show that different objects have the same image sets; for any (m, n), we show that objects cannot be distinguished by any polynomial function on their image sets.

Keywords

Irreducible component Recognition Image Rank Polynomial equations Object