Ellipse

Conic section

PD Image
Fig. 2. Upper shaded (green) section: ellipse; lower shaded (red) section: circle.

In the work of the Greek mathematician Apollonius (c. 262–190 BC) the ellipse arose as the intersection of a plane with a cone. Apollonius gave the ellipse its name, though the term ἔλλειψις (elleipsis, meaning "falling short") was used earlier by Euclid (c. 300 BC) in the construction of parallelograms with areas that "fell short". Apollonius applied the word to the conic section that at present we call ellipse. See Ref.^[4] for the—in modern eyes—complicated reasoning by which Apollonius tied the shape of certain conic sections to Euclid's concept of deficient areas.

In figure 2 a cone with a circular base is shown. It has a vertical symmetry axis, an axis of revolution. A cone can be generated by revolving around the axis a line that intersects the axis of rotation under an angle α (strictly between 0 and 90 degree). A horizontal plane (plane perpendicular to the axis of the cone) — that does not contain the vertex — intersects the cone in a circle (a special ellipse). A plane that intersects the axis in an angle greater than α intersects the cone in an ellipse. (Otherwise, the intersection is either a parabola or a hyperbola.) If the plane contains the vertex, the ellipse degenerates to a point; if the plane is perpendicular to the axis the ellipse is a circle.

Eccentricity

The eccentricity e of an ellipse (usually denoted by e or ε) is the ratio of the distance OF₂ (cf. figure 3) to the length a (half the major axis), that is, e := OF₂ / a. Let ${\vec {a}}$ be a vector of length a along the x-axis, then

e{\vec {a}}:={\overrightarrow {\mathrm {OF} }}_{2}.

The following two vectors have common endpoint at P, see figure 3,

{\vec {r}}:={\overrightarrow {\mathrm {OP} }}={\begin{pmatrix}x\\y\end{pmatrix}}\quad {\hbox{and}}\quad {\vec {g}}:={\overrightarrow {\mathrm {F} _{2}\mathrm {P} }}={\overrightarrow {\mathrm {F} _{2}\mathrm {O} }}+{\overrightarrow {\mathrm {O} \mathrm {P} }}=-e{\vec {a}}+{\vec {r}}.

Now choose P as the intersection P₁ of the positive y-axis with the ellipse; then its position vector is:

{\vec {r}}_{1}={\begin{pmatrix}0\\b\end{pmatrix}}.

By symmetry, the distance of this point P₁ to either focus is equal, thus the length of the corresponding vector ${\vec {g}}_{1}$ (with endpoint on the y-axis) is equal to the length a of the semi-major axis. For the following two inner products (indicated by a centered dot) we find,

{\vec {r}}_{1}\cdot (e{\vec {a}})=0\quad {\hbox{and}}\quad {\vec {r}}_{1}\cdot {\vec {r}}_{1}=b^{2}.

PD Image
Fig. 3. An ellipse situated such that the major and minor axes are along Cartesian axes. The center of the ellipse coincides with the origin O.

Hence, (in fact the Pythagoras theorem applied to P₁OF₂),

a^{2}=|{\vec {g}}_{1}\;|^{2}=({\vec {r}}_{1}-e{\vec {a}}\;)\cdot ({\vec {r}}_{1}-e{\vec {a}}\;)={\vec {r}}_{1}\cdot {\vec {r}}_{1}-2({\vec {r}}_{1}\cdot e{\vec {a}})+e{\vec {a}}\cdot e{\vec {a}}=b^{2}+e^{2}a^{2},

so that the eccentricity is given by

e={\sqrt {\frac {a^{2}-b^{2}}{a^{2}}}}\quad {\hbox{with}}\quad 0\leq e\leq 1.

Remark: The two extreme values for the eccentricity correspond to the extreme forms of an ellipse: The vaule 0 corresponds to the circle, the value 1 to the line segment.

Algebraic form

Consider an ellipse that is located with respect to a Cartesian frame as in figure 3 (a ≥ b > 0, major axis on x-axis, minor axis on y-axis). Then:

(Canonical equation of an ellipse) A point P=(x,y) is a point of the ellipse if and only if

{\frac {x^{2}}{a^{2}}}+{\frac {y^{2}}{b^{2}}}=1.

Note that for a = b this is the equation of a circle. An ellipse may be seen as a unit circle in which the x and the y coordinates are scaled independently, by 1/a and 1/b, respectively. (An ellipse degenerated to a line segment cannot be described with such an equation.)

Proof

Part 1: We first consider an arbitrary point P of the ellipse. Introduce the vectors

{\begin{aligned}{\overrightarrow {\mathrm {F} _{1}\mathrm {P} }}={\overrightarrow {\mathrm {F} _{1}\mathrm {O} }}+{\overrightarrow {\mathrm {O} \mathrm {P} }}=&\;e{\vec {a}}+{\vec {r}}\\{\overrightarrow {\mathrm {F} _{2}\mathrm {P} }}={\overrightarrow {\mathrm {F} _{2}\mathrm {O} }}+{\overrightarrow {\mathrm {O} \mathrm {P} }}=&-e{\vec {a}}+{\vec {r}}={\vec {g}}\\\end{aligned}}

By definition of ellipse, the sum of the lengths is 2a

2a=|{\vec {r}}+e{\vec {a}}|+|{\vec {r}}-e{\vec {a}}|\qquad \qquad \qquad \qquad (1)

Multiplying equation (1) by

|{\vec {r}}+e{\vec {a}}|-|{\vec {r}}-e{\vec {a}}|

gives

2a\left(|{\vec {r}}+e{\vec {a}}|-|{\vec {r}}-e{\vec {a}}|\right)

=\left(|{\vec {r}}+e{\vec {a}}|+|{\vec {r}}-e{\vec {a}}|\right)\cdot \left(|{\vec {r}}+e{\vec {a}}|-|{\vec {r}}-e{\vec {a}}|\right)=|{\vec {r}}+e{\vec {a}}|^{2}-|{\vec {r}}-e{\vec {a}}|^{2}

=4e{\vec {r}}\cdot {\vec {a}}

Hence

|{\vec {r}}+e{\vec {a}}|-|{\vec {r}}-e{\vec {a}}|={2e{\vec {r}}\cdot {\vec {a}} \over a}

and since

{\frac {{\vec {r}}\cdot {\vec {a}}}{a}}=x

(the first coordinate of the vector ${\vec {r}}$ ) we obtain

|{\vec {r}}+e{\vec {a}}|-|{\vec {r}}-e{\vec {a}}|=2ex\qquad \qquad \qquad \qquad (2)

By adding and subtracting equations (1) and (2) we find expressions for the distance of P to the foci,

{\begin{aligned}|{\overrightarrow {\mathrm {F} _{1}\mathrm {P} }}|&=|{\vec {r}}+e{\vec {a}}|&=a+ex&\\|{\overrightarrow {\mathrm {F} _{2}\mathrm {P} }}|&=|{\vec {r}}-e{\vec {a}}|&=a-ex&\qquad \qquad \qquad \quad (3)\\\end{aligned}}

Squaring both equations

{\begin{aligned}r^{2}+e^{2}a^{2}+2e{\vec {r}}\cdot {\vec {a}}&=a^{2}+e^{2}x^{2}+2eax\\r^{2}+e^{2}a^{2}-2e{\vec {r}}\cdot {\vec {a}}&=a^{2}+e^{2}x^{2}-2eax\\\end{aligned}}

adding them, substituting the earlier derived value for e², and reworking gives

r^{2}+e^{2}a^{2}=a^{2}+e^{2}x^{2}

\Rightarrow r^{2}+{\frac {a^{2}-b^{2}}{a^{2}}}a^{2}=a^{2}+{\frac {a^{2}-b^{2}}{a^{2}}}x^{2}

\Rightarrow x^{2}+y^{2}=r^{2}=a^{2}+{\frac {a^{2}-b^{2}}{a^{2}}}(x^{2}-a^{2})=b^{2}+x^{2}-x^{2}{\frac {b^{2}}{a^{2}}}

\Rightarrow y^{2}=b^{2}-x^{2}{\frac {b^{2}}{a^{2}}}

Division by b² finally gives

{\frac {x^{2}}{a^{2}}}+{\frac {y^{2}}{b^{2}}}=1.

Part 2: Conversely, for any point P whose coordinates x and y satisfy this equation, the sum of its distances from the foci

\mathrm {F} _{1}=(-f,0)\quad {\textrm {and}}\quad \mathrm {F} _{2}=(f,0)\quad {\textrm {with}}\quad f:={\sqrt {a^{2}-b^{2}}}

is

\mathrm {P} \mathrm {F} _{1}+\mathrm {P} \mathrm {F} _{2}=2a

To show this we calculate

\mathrm {P} \mathrm {F} _{1}^{2}=(x+f)^{2}+y^{2}=x^{2}+y^{2}+f^{2}+2fx

and substitute for f and

y^{2}=b^{2}-{b^{2} \over a^{2}}x^{2}

and obtain

\mathrm {P} \mathrm {F} _{1}^{2}=x^{2}+\left(b^{2}-{b^{2} \over a^{2}}x^{2}\right)+(a^{2}-b^{2})+2x{\sqrt {a^{2}-b^{2}}}=a^{2}+2x{\sqrt {a^{2}-b^{2}}}+{a^{2}-b^{2} \over a^{2}}x^{2}=\left(a+{\frac {f}{a}}x\right)^{2}

After an analogous calculation for F₂ we get (note that $a\pm {\frac {f}{a}}x\geq 0$ because $-a\leq x\leq a$ and $0\leq f\leq a$ )

\mathrm {P} \mathrm {F} _{1}+\mathrm {P} \mathrm {F} _{2}=(a+{\frac {f}{a}}x)+(a-{\frac {f}{a}}x)=2a

as claimed.

Second degree equation

The algebraic form of the previous section describes an ellipse in a special position. Rotation and translation transforms it into an equation of second degree in x and y:

f(x,y):=Ax^{2}+2Bxy+Cy^{2}+2Dx+2Ey+F=0,\,

(all variables are real). Such an equation always describes a conic section.

It represents a non-degenerate ellipse (minor axis not 0) if and only if the following conditions are satisfied:

$AC-B^{2}>0$

$(A+C)f_{t}<0\quad \mathrm {with} \quad f_{t}:=f(t_{1},t_{2})\neq 0$

or, equivalently,

\quad f_{t}>0\Rightarrow A+C<0\quad \mathrm {and} \quad f_{t}<0\Rightarrow A+C>0

where t₁ and t₂ are defined as the solutions of the following system of linear equations:

{\begin{matrix}At_{1}+Bt_{2}&=-D\\Bt_{1}+Ct_{2}&=-E\end{matrix}}

(These equations have a unique solution since, by the first condition, the determinant AC − B² ≠ 0.)

Proof

We now switch to matrix-vector notation and write f(x,y) as

f(\mathbf {r} )=\mathbf {r} ^{\mathrm {T} }\mathbf {Q} \mathbf {r} +\mathbf {a} ^{\mathrm {T} }\mathbf {r} +\mathbf {r} ^{\mathrm {T} }\mathbf {a} +F,

with

\mathbf {r} :={\begin{pmatrix}x\\y\\\end{pmatrix}},\quad \mathbf {Q} :={\begin{pmatrix}A&B\\B&C\\\end{pmatrix}},\quad \mathbf {a} :={\begin{pmatrix}D\\E\\\end{pmatrix}}.

The superscript T stands for transposition (row vector becomes column vector and vice versa).

We first show that the conditions are sufficient:

Since, by assumption, the determinant det(Q) = AC−B² ≠ 0, the matrix Q is invertible. With the help of the inverse Q⁻¹ the equation for f can be rewritten to

f(\mathbf {r} )=\left(\mathbf {r} +\mathbf {Q} ^{-1}\mathbf {a} \right)^{\mathrm {T} }\mathbf {Q} \left(\mathbf {r} +\mathbf {Q} ^{-1}\mathbf {a} \right)-\mathbf {a} ^{\mathrm {T} }\mathbf {Q} ^{-1}\mathbf {a} +F\quad .

Note that this uses

\mathbf {Q} ^{\mathrm {T} }=\mathbf {Q} \quad \Longrightarrow \quad \left(\mathbf {Q} ^{-1}\right)^{\mathrm {T} }=\mathbf {Q} ^{-1},

i.e., that both the matrix Q and its inverse are symmetric.

PD Image
Fig. 4. r′ = r − t

Define

\mathbf {t} :=-\mathbf {Q} ^{-1}\mathbf {a} \quad \Longrightarrow \quad \mathbf {Q} \mathbf {t} =-\mathbf {a} ,

and

\mathbf {r} ':=\mathbf {r} -\mathbf {t} .

In the definition of t the minus sign is introduced to get the translation of the origin as depicted in figure 4.

Now we substitute r′ in the expression for f. (This corresponds to shifting the origin of the coordinate system to the center of the ellipse):

f(\mathbf {r} )=\left(\mathbf {r} -\mathbf {t} \right)^{\mathrm {T} }\mathbf {Q} \left(\mathbf {r} -\mathbf {t} \right)-\mathbf {a} ^{\mathrm {T} }\mathbf {Q} ^{-1}\mathbf {a} +F=\left(\mathbf {r} '\right)^{\mathrm {T} }\mathbf {Q} \mathbf {r} '+f_{t}

with

f_{t}:=f(\mathbf {t} )=-\mathbf {a} ^{\mathrm {T} }\mathbf {Q} ^{-1}\mathbf {a} +F.

Thus, by translation of the origin over t the linear terms in f(r) have been eliminated, only two quadratic terms (in x′ := x−t₁ and y′ := y−t₂), one bilinear term, and one constant term (f_t) appear in the equation for f. (The "price paid" for it is the requirement det(Q) ≠ 0.)

In the next step we rotate the coordinate system (around the origin in O') such that the coordinate axes coincide with the axes of the ellipse. This will eliminate the bilinear term and "decouple" x′ and y′, the components of r′.

Let us recall that any real symmetric matrix may be diagonalized by an orthogonal matrix. For the (2×2)-case:

\mathbf {R} ^{\mathrm {T} }\mathbf {Q} \mathbf {R} ={\begin{pmatrix}\alpha _{1}&0\\0&\alpha _{2}\end{pmatrix}}\quad {\hbox{with}}\quad \mathbf {R} ^{\mathrm {T} }\mathbf {R} =\mathbf {R} \mathbf {R} ^{\mathrm {T} }=\mathbf {I} ,

where the last matrix on the right is the identity matrix I. Now

f=\left(\left(\mathbf {r} '\right)^{\mathrm {T} }\;\mathbf {R} \right)\left(\mathbf {R} ^{\mathrm {T} }\;\mathbf {Q} \;\mathbf {R} \right)\left(\mathbf {R} ^{\mathrm {T} }\;\mathbf {r} '\right)+f_{t}=\left(\mathbf {r} ''\right)^{\mathrm {T} }{\boldsymbol {\alpha }}\mathbf {r} ''+f_{t}

with

{\boldsymbol {\alpha }}:={\begin{pmatrix}\alpha _{1}&0\\0&\alpha _{2}\end{pmatrix}},\quad {\hbox{and}}\quad \mathbf {r} '':=\mathbf {R} ^{\mathrm {T} }\mathbf {r} '.

Switching back to a quadratic equation

\left(\mathbf {r} ''\right)^{\mathrm {T} }{\boldsymbol {\alpha }}\mathbf {r} ''+f_{t}=\alpha _{1}(x'')^{2}+\alpha _{2}(y'')^{2}+f_{t}=0

we see that an ellipse is obtained if the parameters α₁, α₂, and f_t are non-zero and if the signs of α₁ and α₂ are equal and opposite to the sign of f_t.

It is known that the determinant of a matrix is invariant under similarity transformations, hence

0<\det(\mathbf {Q} )=AC-B^{2}=\alpha _{1}\alpha _{2}

and the signs of α₁ and α₂ are equal.

The trace A+C of the matrix is also invariant under similarity transformations. Thus

\alpha _{1}+\alpha _{2}=A+C

and we can apply the assumption

{\begin{aligned}0<\alpha _{1},\alpha _{2}&\quad {\hbox{i.e.}}\quad 0<\alpha _{1}+\alpha _{2}=A+C\quad \Rightarrow \quad f_{t}<0\\0>\alpha _{1},\alpha _{2}&\quad {\hbox{i.e.}}\quad 0>\alpha _{1}+\alpha _{2}=A+C\quad \Rightarrow \quad f_{t}>0\\\end{aligned}}

and conclude that in both cases the second order equation represents an ellipse. This shows that the conditions given are sufficient.

The conditions are also necessary:

In the coordinate system determined by its axes, the equation clearly satisfies the conditions, and — since determinant and trace are preserved — they stay satisfied if the system is rotated and shifted. Thus the conditions are necessary if the determinant is not equal to 0. In fact, it is necessary without this assumption on the determinant (see second-order curve).

Remark
Clearly, in order to determine a priori whether the quadratic equation represents an ellipse, it is not necessary to actually perform the diagonalization of Q. It is sufficient to check the condition and determine the sign of f_t = f(t) by solving the equation given for the vector t.

Polar representation relative to focus

PD Image
Fig. 5. Polar representation

The length g of a vector (cf. figure 5) from the focus F₂ to an endpoint P on the ellipse

{\overrightarrow {\mathrm {F} _{2}\mathrm {P} }}=:{\vec {g}}=(ea+g\cos \theta ,\;g\sin \theta )

is given by the polar equation of an ellipse (with eccentricity less than 1)

g={\frac {\ell }{1+e\cos \theta }}\quad {\hbox{with}}\quad \ell :={\frac {b^{2}}{a}},

where 2ℓ is known as the latus rectum (lit. erect side) of the ellipse; it is equal to 2g for θ = 90^° (twice the length of the vector ${\vec {g}}$ when it makes a right angle with the major axis).

Proof

Earlier [Eq. (3)] it was derived for the distance from the right focus F₂ to P that

|{\overrightarrow {\mathrm {F} _{2}\mathrm {P} }}|=|{\vec {g}}|=g=a-ex.

Expressing x from

x=ea+g\cos \theta =ea+(a-ex)\cos \theta \,

gives

x={\frac {ea+a\cos \theta }{1+e\cos \theta }},

so that

g=a-ex=a-{\frac {e^{2}a+ea\cos \theta }{1+e\cos \theta }}={\frac {a(1-e^{2})}{1+e\cos \theta }}.

Substitute

a(1-e^{2})=a\left(1-{\frac {a^{2}-b^{2}}{a^{2}}}\right)={\frac {b^{2}}{a}}=\ell

and the polar equation for the ellipse follows.

Trammel construction

PD Image
Fig. 6. A trammel in theory

Before drafting was done almost exclusively by the aid of computers, draftsmen used a simple device for drawing ellipses, a trammel. Basically, a trammel is a rigid bar of length a (semi-major axis). In the top drawing of figure 6 the bar is shown as a blue-red line segment bounded by a black and a blue bead. On this bar a segment of length b (semi-minor axis) is marked; this is the red segment on the bar. Two beads fixed to the rigid bar move back and forth along the x-axis and y-axis, respectively. The blue bead fixed at one end of the bar moves along the y-axis, the red bead, which marks the beginning of the red segment of length b, moves along the x-axis. The endpoint of the bar (the black bead in figure 6) moves along an ellipse with semi-major axis a and semi-minor axis b and typically has a pen fixed to it.

(PD) : http://chestofbooks.com/
Fig. 7. A trammel in practice

The fact that the trammel construction works is proved very easily, cf. the bottom drawing in figure 6,

x=a\;\cos \theta \quad {\hbox{and}}\quad y=b\;\sin \theta .

Hence

{\frac {x^{2}}{a^{2}}}+{\frac {y^{2}}{b^{2}}}=\cos ^{2}\theta +\sin ^{2}\theta =1,

which indeed is the equation for an ellipse.

A device called a trammel point is used to guide a woodworking router in making elliptical cuts.

Gardener's construction

(PD) : http://chestofbooks.com/
Fig. 8. Gardener's construction

It is possible to construct an ellipse of given major and minor axes by the aid of a compass, a ruler, three thumbtacks, and a piece of string, see figure 8.

First draw the major axis AB, and then obtain with the compass its perpendicular bisector intersecting AB in the midpoint E. Along the bisector one measures off the length of the minor axis CD. Given that the distances CF and CG are the semi-major axis (AB/2), one can determine the foci by drawing an arc with the compass using C as center and AB/2 as radius. One now pins the thumbtacks in the foci and the point C and fixes a piece of string around the triangle FGC (i.e, its length equals the perimeter of the triangle). Removing the thumbtack at C, and keeping the string taut, one draws the ellipse by moving the pencil from C to A, D, B, and back to C.

Clearly this procedure can be used in the garden to create an elliptic lawn or flowerbed, which is why the procedure is sometimes referred to as the gardener's construction.

Notes

↑ The points S₁ and S₂ are the main vertices of the ellipse.
↑ The quantities a and b are referred to as semi-major and semi-minor axis, respectively. Note that, just as diameter of a circle, semi-axis does not only refer to the line segment itself, but also to its length.
↑ The shortest distance of a focus to a point on the ellipse (= p, as can be seen from equation (3), for instance) is the periapsis of the ellipse; the longest distance, S₁F₂=S₂F₁=2a−p, is the apoapsis. These two (Greek) terms are mainly used in astronomy when orbits of planets are described.
↑ M. Kline, Mathematical Thought from Ancient to Modern Times, Oxford UP, New York (1972)

Figures 7 and 8 are from George Watson Kittredge, The New Metal Worker Pattern Book, David Williams Company, New York, (1901) Online

[1] The points S₁ and S₂ are the main vertices of the ellipse.

[2] The quantities a and b are referred to as semi-major and semi-minor axis, respectively. Note that, just as diameter of a circle, semi-axis does not only refer to the line segment itself, but also to its length.

[3] The shortest distance of a focus to a point on the ellipse (= p, as can be seen from equation (3), for instance) is the periapsis of the ellipse; the longest distance, S₁F₂=S₂F₁=2a−p, is the apoapsis. These two (Greek) terms are mainly used in astronomy when orbits of planets are described.

[4] M. Kline, Mathematical Thought from Ancient to Modern Times, Oxford UP, New York (1972)

[1]

[2]

[3]

[4]

Ellipse

Contents

Conic section

Eccentricity

Algebraic form

Proof

Second degree equation

Proof

Polar representation relative to focus

Proof

Trammel construction

Gardener's construction

Notes

Navigation menu

Ellipse

Conic section

Eccentricity

Algebraic form

Proof

Second degree equation

Proof

Polar representation relative to focus

Proof

Trammel construction

Gardener's construction

Notes

Navigation menu

Search