Doug Kerr
Well-known member
Most programs for the assembly of multi-image panoramic photographs offer a choice which projection is represented by the final result. I thought I would offer a little technical background in this matter.
****************
In this context, projection is the process by which object points in three-dimensional space are mapped onto locations on a (flat or flattenable) two dimensional surface. A projection is a particular scheme for doing so.
Projection is involved in any photography, but we most often hear the term in connection with panoramic photography. This is likely because the implications of different projections become most significant to us in the case of a large field of view.
The matter of projections was first highly developed in the field of cartography, that matter of making maps of the earth's surface. The issue arises here because the points on the earth surface (even if we ignore local changes in altitude) must be treated as a three dimensional field, while of course maps are expected to be flat and two dimensional.
Many of the projections we use were first developed in this contexts, and often their names reflect that history.
Four projections we often hear of used in this work are:
• Rectilinear
• Cylindrical ("classical")
• Mercator
• Equirectangular
The rectilinear projection is ordinarily the one we aspire to have followed in ordinary photography at "moderate" fields of view. Its two most important properties are:
• If we have an object surface that lies in a plane, perpendicular to the axis of the lens, then the image is just a "scaled" version of the object surface, with all distances between points uniformly proportional to the distances between the corresponding object points. All angles between lines we might construct between object points are preserved in the image. (We say that the image array of points is similar, in the formal geometric sense, to the object array of points.)
• All straight lines in the three-dimensional object space are represented as straight lines in the image.
The use of the rectilinear projection is useful in multi-image panoramic photography in such cases as the photography of a building with a "mostly planar" facade, spanning a substantial field of view from the camera location, when we want the image to look like an architect's "elevation" drawing of the facade.
A rectilinear projection is absolutely limited to a field of view of 180°, and various considerations make it unsuitable for fields of view even approaching that.
We can understand the concept of the rectilinear projection with a pinhole camera metaphor. We have a pinhole in an opaque plate. The film is flat and its plane is parallel to the plane of our "test object surface".
From each object point, we consider that a single ray passes through the pinhole onto the film. The triangles on the object and image sides of the pinhole, formed of the ray of interest, the axis of the camera extending to the object plane or film, and the lines connecting their ends, are similar (in the formal geometric sense, all angles being the same and all side lengths being uniformly proportional) and this leads to the image array being formally similar to the object array.
Now that we have practiced our pinhole metaphor, we will use it to explain the definition of another projection, the cylindrical projection. (I mean here what I call the "classical" cylindrical projection, since other related projections are often categorized as "cylindrical" (including the other two we will visit here shortly).
Again imagine our pinhole. Her, we need to imagine that it has a supernatural property: light will pass through it in the same way whether "head on" of from a substantial angle to the side.
We bend out sheet of film into a partial cylinder, whose axis is vertical and passes through the pinhole. Again, we consider rays from various points in the scene.
Imagine that we have a very narrow "billboard", whose face is perpendicular to the line from the pinhole to its center. We can see that with respect to this narrow object field, the result will be just the same as with the flat film, and thus with respect to that narrow object the behavior of the system is just like with the rectilinear projection.
We find that, if we consider a set of equally spaced points in a vertical line on our billboard, they will be equally spaced on the film. Thus the cylindrical projection preserves the vertical "scale" of object points at any particular distance (where by distance I mean distance measured parallel to the "camera" axis, not radially from the pinhole to the point - this is a critical distinction here as we consider significant vertical fields of view).
We find that, if we had a building facade that was semicircular, with its center at the camera, the cylindrical projection would make the image (as we flattened out the film) similar (in the geometric sense to the actual facade of the building (if we flattened it out).
But if we had a "wide" building, with its facade perpendicular to a line to its center from the camera, its portrayal on the image will not be similar to the facade itself. The reason is that the ends of the building are farther from the camera than the center. As a result, the vertical scale for them in the image is less than the vertical scale for points in the "center" of the facade.
Thus the image of the building facade on the film will "droop" as we go away from the center.
Note that just as the rectilinear projection is not good for large fields of view (horizontal or vertical, in fact), the cylindrical projection is not good when really large vertical fields of view are involved. If we wanted to embrace a vertical half field of view of 90°, we would need an infinitely-high piece of film.
[continued in part 2]
****************
In this context, projection is the process by which object points in three-dimensional space are mapped onto locations on a (flat or flattenable) two dimensional surface. A projection is a particular scheme for doing so.
Projection is involved in any photography, but we most often hear the term in connection with panoramic photography. This is likely because the implications of different projections become most significant to us in the case of a large field of view.
The matter of projections was first highly developed in the field of cartography, that matter of making maps of the earth's surface. The issue arises here because the points on the earth surface (even if we ignore local changes in altitude) must be treated as a three dimensional field, while of course maps are expected to be flat and two dimensional.
Many of the projections we use were first developed in this contexts, and often their names reflect that history.
Four projections we often hear of used in this work are:
• Rectilinear
• Cylindrical ("classical")
• Mercator
• Equirectangular
The rectilinear projection is ordinarily the one we aspire to have followed in ordinary photography at "moderate" fields of view. Its two most important properties are:
• If we have an object surface that lies in a plane, perpendicular to the axis of the lens, then the image is just a "scaled" version of the object surface, with all distances between points uniformly proportional to the distances between the corresponding object points. All angles between lines we might construct between object points are preserved in the image. (We say that the image array of points is similar, in the formal geometric sense, to the object array of points.)
• All straight lines in the three-dimensional object space are represented as straight lines in the image.
The use of the rectilinear projection is useful in multi-image panoramic photography in such cases as the photography of a building with a "mostly planar" facade, spanning a substantial field of view from the camera location, when we want the image to look like an architect's "elevation" drawing of the facade.
A rectilinear projection is absolutely limited to a field of view of 180°, and various considerations make it unsuitable for fields of view even approaching that.
We can understand the concept of the rectilinear projection with a pinhole camera metaphor. We have a pinhole in an opaque plate. The film is flat and its plane is parallel to the plane of our "test object surface".
From each object point, we consider that a single ray passes through the pinhole onto the film. The triangles on the object and image sides of the pinhole, formed of the ray of interest, the axis of the camera extending to the object plane or film, and the lines connecting their ends, are similar (in the formal geometric sense, all angles being the same and all side lengths being uniformly proportional) and this leads to the image array being formally similar to the object array.
Now that we have practiced our pinhole metaphor, we will use it to explain the definition of another projection, the cylindrical projection. (I mean here what I call the "classical" cylindrical projection, since other related projections are often categorized as "cylindrical" (including the other two we will visit here shortly).
Again imagine our pinhole. Her, we need to imagine that it has a supernatural property: light will pass through it in the same way whether "head on" of from a substantial angle to the side.
We bend out sheet of film into a partial cylinder, whose axis is vertical and passes through the pinhole. Again, we consider rays from various points in the scene.
Imagine that we have a very narrow "billboard", whose face is perpendicular to the line from the pinhole to its center. We can see that with respect to this narrow object field, the result will be just the same as with the flat film, and thus with respect to that narrow object the behavior of the system is just like with the rectilinear projection.
We find that, if we consider a set of equally spaced points in a vertical line on our billboard, they will be equally spaced on the film. Thus the cylindrical projection preserves the vertical "scale" of object points at any particular distance (where by distance I mean distance measured parallel to the "camera" axis, not radially from the pinhole to the point - this is a critical distinction here as we consider significant vertical fields of view).
We find that, if we had a building facade that was semicircular, with its center at the camera, the cylindrical projection would make the image (as we flattened out the film) similar (in the geometric sense to the actual facade of the building (if we flattened it out).
But if we had a "wide" building, with its facade perpendicular to a line to its center from the camera, its portrayal on the image will not be similar to the facade itself. The reason is that the ends of the building are farther from the camera than the center. As a result, the vertical scale for them in the image is less than the vertical scale for points in the "center" of the facade.
Thus the image of the building facade on the film will "droop" as we go away from the center.
Note that just as the rectilinear projection is not good for large fields of view (horizontal or vertical, in fact), the cylindrical projection is not good when really large vertical fields of view are involved. If we wanted to embrace a vertical half field of view of 90°, we would need an infinitely-high piece of film.
[continued in part 2]