Here I will take a top-to-bottom approach to introduce the theory of general relativity. Thus, I present from start the field equations. The field equations are extremely compact in the usual form that they are presented. This means that the equations we have to deal with are encrypted and need to be made explicit.
In traditional courses on general relativity, a lot of time and effort is invested in learning the elements of differential geometry that are necessary to derive and justify the field equations of Einstein. Very often this is discouraging because the student tackles a lot of mathematical definitions that many times do not help to understand the physics behind the symbols but increases the feeling that, besides physics, there is something additional that the student doesn’t understand so well, namely, differential geometry.
I think that it is possible to start to understand the physics of the general theory of relativity and in parallel to acquire the necessary knowledge on differential geometry. The advantage is that the physics ideas help to identify what is necessary to know about differential geometry. In my opinion, this is better than just start learning a lot of complicated mathematics with the promise that you will need it and you will understand everything later.
The field equations in general relativity
In general relativity, the field equation that describes gravity was proposed by Einstein. It is usually written in the form
The goal of the present post is to unpack this equation and briefly explain (when possible) what each term means.
Let’s start by saying that and are constants. is the cosmological constant and is the coupling constant between matter-energy and space-time geometry.
Actually, Eq.(1) is a set of ten coupled partial differential equations. Let’s see why ten. Each index and runs from zero to three, so we can write the equation in the form of a matrix equation. The term , which is called Ricci tensor, for example, in matrix form looks like:
We see that refers generically to any entry of this matrix. The symbol “:=” means that the tensor is represented by the matrix on the right side. Don’t worry about the word tensor. We will learn its meaning later on. For the time being it is enough to consider a tensor as a matrix when it has two indexes.
There are sixteen entries in the Ricci tensor. The tensors in the gravitational equations are symmetric. This means that the six entries , , , , , and that appears at the right side of the diagonal of the matrix are, in the respective order, equal to the six entries , , , , , and , that appears on the left side of the diagonal. Thus, the independent terms are those at the diagonal and those at one side of the diagonal, i.e., ten independent terms. The same applies for the terms , and . The value , or refers to the time coordinate of the space-time, usually written as , while the values or refers to the space coordinates; thus, in Cartesian coordinates, for example, we have , , and .
The explicit form of will be given later. The geometric meaning of the Ricci tensor is not accessible without some knowledge of differential geometry that we will acquire in other lectures. We can loosely say that this tensor measures how far the space-time is of being euclidean.
The metric tensor
The unknowns in general relativity are the entries of the matrix whose elements are . The independent ones are
The matrix whose elements are , is called “the metric”, or “the metric tensor”. The reason is that its entries are necessary to calculate the length of arcs and distances in space-time. The square of length of an arc connecting two infinitesimally close points in space-time is given (in Cartesian coordinates) by the expression
The factor 2 comes from the symmetry of the metric tensor, for example: .
The elements are functions of the space-time coordinates in general. Their form depends on the coordinates used to express them. When there is a coordinate system in which all of them are constant the space-time is said to be flat. Later we will see why. For the time being, we recall here the expression for in special relativity, namely, . So, the metric tensor in special relativity, which is commonly called instead of , is
and therefore, in special relativity the space-time is flat.
The metric tensor is non degenerated, i.e., its determinant is different from zero at any point in space-time. This means that the inverse of the metric tensor exists. The entries of the inverse matrix are written with upper indexes, ie., in the form :
Upper indexes refer to the inverse matrix in the case of the tensor metric only. For any other tensor, the connection between the expression with lower indexes and upper indexes follows a rule that will be explained in other articles. Let me emphasize that, the Ricci tensor with super-indexes, for example, is not represented by the inverse of the matrix that represents the same tensor with sub-indexes .
The connection between the metric and gravity
The metric tensor determines the geometry of the space-time. Thus for example, if we know the metric we can say whether the geometry of the space-time is either Euclidean or not. To better understand what non-Euclidean means, let’s consider the surface of the sphere. On the sphere, the role of straight lines is played by circles that result from intersecting the sphere with planes that pass through the center. With these lines, we can construct a triangle on the sphere such that the sum of its inner angles is greater than 180 degrees. That is not possible in a plane, whose geometry is Euclidean, so the geometry of the surface of the sphere is non-Euclidean.
The generalization of straight lines to non-Euclidean geometry are the geodesics, which are completely determined by the metric tensor. But, what are geodesics? They are curves with zero acceleration. To better understand what that means think about a point particle moving through space. It describes a trajectory. At each point of the trajectory, there is the velocity vector that is tangent to it. If there is no acceleration of the body, then the velocity vector doesn’t change along the trajectory. In other words, the derivative of the velocity vector along the trajectory is zero. Now think for a while that the space is two-dimensional and is the surface of a sphere. A point particle moving on this surface will describe a certain curve on it and will have a velocity vector on each point of its trajectory. If the derivative of the velocity vector along the trajectory on the sphere is zero, the curve will be a geodesic. This geodesic will be the analog of a straight line in the case the surface was a plane instead of a sphere. Thus, the geodesics on the sphere are curved but have zero acceleration. The curvature of the geodesic is due to the fact that the underlying space is itself curved.
Why are geodesics important? The key is the principle of inertia, namely, the motion of a material particle, on which there are no forces acting on, is along a geodesic. If the space is flat, the geodesics are straight lines, otherwise, the geodesics are curved. The differential equations determining the geodesics are
There are four differential equations, one for each value of . The terms on the right side contain the factors which are called Christoffel symbols and are determined by the metric tensor as follows:
If there is a system of coordinates in which the metric tensor is constant, then the Christoffel symbols are zero (in that system) and we get the differential equations , whose solutions are straight lines. In other posts, we will derive the equation of the geodesics.
In Newtonian mechanics, it was assumed that the geometry of space-time is flat, and consequently, the motion of free bodies is along straight lines. In general relativity, instead of assuming the geometry of the space-time, it is calculated.
The role of the mass in the Newtonian theory of gravitation is to exert a force on other masses curving their trajectories in space. In general relativity, the role of the mass is to modify the geometry of the surrounding space-time curving the straight lines to geodesics in the new geometry.
In general relativity, the distribution of matter determines the metric tensor and the metric tensor determines the geodesics in space-time. Thus, a material body affects the trajectory of another body, not by exerting a force on it, but by changing the geometry of the space-time in the region where the other body is freely moving. We will say more about this point in another post.
The scalar of curvature
We have seen that the Ricci tensor and the metric can be represented by matrices and that the matrix representing the metric has an inverse. The scalar function is defined as the trace (sum of the diagonal elements of a squared matrix) of the product of the inverse of the metric and the matrix representing :
Like the Ricci tensor, the scalar function measures how far the space-time is of being euclidean. The function is the sum of sixteen terms:
In other posts, we will explicitly calculate the Ricci tensor and the scalar of curvature for some known surfaces in the tridimensional space in order to gain some intuition about their meaning.
The explicit form of the Ricci tensor
The explicit expression of the Ricci tensor is cumbersome. Here it is:
The Christoffel symbols were defined in Eq.(6).
The Ricci tensor contains the second derivatives of the metric tensor and therefore the gravitational equations given in Eq.(1) are second-order partial differential equations for the metric tensor. This is a generalization of the Poisson equation for the gravitational potential in the Newtonian theory of gravitation. Indeed, the metric tensor plays the role of the gravitational potential in general relativity.
The source of the gravitational field
In the Newtonian theory of gravitation, the source of the gravitational potential is the mass density . The gravitational potential satisfies the Poisson equation . In general relativity, the source of the gravitational potential is the mass-energy density tensor , and the analog of the Poisson equation is played by Einstein’s equations given in Eq.(1).
In the Newtonian theory, one is often interested in the gravitational effect produced by the distribution of matter in its surroundings, i.e., outside the matter, where and the potential satisfies the Laplace equation . Similarly, in general relativity, we are often interested in determining the gravitational effect produced by certain distribution of matter and energy (remember that mass and energy are related by ) in its surroundings. Thus, in free space, , and the metric tensor has to satisfy the analog of the Laplace equation:
The cosmological term is not relevant in this kind of problem. In cosmology, the cosmological term has to be included as well as the tensor which has to be given beforehand. Indeed, the expression “outside the universe” has no meaning and we are always inside the distribution of matter.
Some final words
The goal in general relativity is to calculate the components of the metric tensor. This is achieved by solving Eq.(1). Once the metric has been calculated one can investigate the motion of bodies and light rays through space.
Solving the gravitational equations is usually extremely difficult. Thus, one first learns how to solve them in very simple cases to gain experience and some intuition.
For a better understanding of the geometric ideas contained in the general theory of relativity, some times is convenient to study the physic of artificial spaces whose metric not necessarily satisfy Einstein’s equations but is given from start.