I’ve been reading Elliptic Tales, a fantastic book on elliptic curves by Avner Ash and Robert Gross, and I’d like to summarize what I’ve learned from the book in three long-ish posts corresponding to the three parts of the book. This post is the first one in this series.
We are going to consider polynomial equations and their degrees. There are two different ways to define the degree of a polynomial equation and of the curve it describes. The first definition is purely algebraic: the algebraic degree of a polynomial is the largest degree of any monomial appearing in the polynomial: e.g. the algebraic degree of is 6, because the monomial with the largest degree, i.e. , has degree 6. The second notion of degree is purely geometric: informally, we want to say that the geometric degree of a curve described by a polynomial equation is the number of intersection points it has with an arbitrary line . Of course, the problem is that this is not well-defined, because the number of intersection points depends on the line we pick! Consider the figure below (adapted from Figure 1.6 in the book):
The line intersects the parabola at two points, the lines and intersect at one point only and the line doesn’t intersect at all!
The uncreative and uninteresting way to deal with this nuisance is to define the geometric degree to be the maximum number of intersection points we could get by varying the line . But instead we’re going to be bold and demand that the geometric degree of a curve to be the same regardless of the line we choose! It turns out that this can be done and is the source of a lot of rich, beautiful and exciting mathematics. But, enforcing this demand will require us to reconsider which number system to adopt, and to carefully redefine what we mean by an “intersection point” and how to count them.
It turns out that we need to do three things: 1) define our polynomials over an algebraically closed field of numbers; 2) switch from the Euclidean plane to the projective plane; 3) count intersection points by their “multiplicities”. I will now explain what each of these means.
1) is an algebraically closed field means that any non-constant polynomial whose coefficients are in has a root in . We can see immediately that the reals is not algebraically closed, since e.g. the polynomial has no roots in . However, the fundamental theorem of algebra tells us that the complex numbers is algebraically closed. It is intuitively clear why we need to work with an algebraically closed field, hence why e.g. won’t do for us: we want to say that algebraically has degree 2, but geometrically does not even make sense in .
Switching to an algebraically closed field helps us deal with cases such as the line in the above figure. Although this line does not intersect the parabola in the real plane, it does intersect the parabola in the complex plane at exactly two points.
2) In the projective plane, we describe points by three coordinates instead of the usual two: . Setting gives us the usual (“finite”) plane. Points with correspond to “points at infinity”. The coordinates and with describe the same point. So, the points at infinity form a single line at infinity that can be parametrized by or by where can be any real number.
Polynomial curves in the usual “finite” plane might get points at infinity attached to them in the projective plane. To find out what specific points at infinity get attached to a particular curve, we need to define the homogenization of a polynomial: for a polynomial of degree in two variables, , this means multiplying each monomial in by an appropriate power of so that all monomials have the same degree . For example, the homogenization of is . Now, to find out which points at infinity get attached to the curve described by the polynomial equation , we simply set : to which the solutions are with . By the property above, these describe a single point which we might as well denote by .
Why do we need this homogenization of polynomials and these points at infinity? Intuitively, it is to deal with cases such as the line in the figure above, which intersects the parabola at a single point, whereas we desire every line to intersect the parabola at exactly two points. The second intersection point turns out to be a point at infinity: namely . This point at infinity belongs to both and the parabola, as you can easily check.
3) Finally, we have to count intersection points by their multiplicity. Suppose a given curve and a line intersect at a point . Suppose further that the line is parametrized by the parameter such that corresponds to . Then is a root of the polynomial obtained by plugging the parametric form of in the polynomial describing . If the degree of this root is , we say that the intersection point has multiplicity and express it by . This definition was a mouthful, so let’s do an example: suppose our curve is defined by the polynomial and the line is parametrized by and . If we plug these in , we get . The root appears with multiplicity in , so we say that the intersection point corresponding to , i.e. has multiplicity .
When can the multiplicity of an intersection point be larger than ? This can happen either when the line is tangent to the curve at the intersection point, or when the intersection point is singular, i.e. the tangent is not well-defined at the intersection point, i.e. the partial derivatives all vanish at the intersection point. Incidentally, because we are dealing with polynomials only, we can define the derivative of a polynomial purely procedurally (i.e. the derivative of is defined to be ), without having to worry about whether the standard analytic definition in terms of limits even makes sense in a given field. Ditto for partial derivatives.
It is again clear why we need to count intersection points by their multiplicity: it is to deal with cases such as the line in the above figure: this line is tangent to the parabola, hence intersects it at a single point only, but the intersection has multiplicity , as you can easily check.
We are now ready to state Bézout’s theorem which is the culmination of our desire to make the algebraic degree and the geometric degree of polynomial equations or curves to always match each other. Suppose and are two projective curves that intersect at a finite number of points: , , , . Then Bézout’s theorem states that the product of the degrees of the homogeneous polynomials defining and exactly matches what we call the global intersection multiplicity , i.e. the sum of the multiplicities of all intersection points. The fact that any line intersects a curve at the same number of points and that this number is equal to the algebraic degree of the polynomial describing are immediate consequences of Bézout’s theorem.