| Alen's Page | Einstein's Derivation of the Transformation Equations in Special Relativity | Page 1 of 2 |
EINSTEIN'S DERIVATION
OF THE TRANSFORMATION EQUATIONS
IN SPECIAL RELATIVITY
|
Anyone who reads Einstein's derivation of the transformation equations of Special Relativity in his 1905 paper may find it somewhat cryptic and unclear, especially on a first acquaintance, since he does not include all the intermediate mathematical steps in the argument. No doubt he could have supposed that his peers of the time would easily have been able to fill in the missing steps for themselves. Now, however, Special Relativity has a much wider audience, some of whom are subjecting it to doubts and controversy, or even, perhaps, saying that Einstein fudged some steps in the argument. Since the cryptic nature of the presentation makes it difficult for someone trying to read it for the first time, it seemed to me that it might be useful to have Einstein's 1905 argument, with relevant illustrations added, and with the missing mathematical steps filled in. This is what I have attempted to do in what follows. It deals, of course, only with the 'kinematical part' of Einstein's paper The derivation is grounded in the two postulates of Special Relativity, the first being that all inertial reference frames provide equally valid viewpoints from which to describe events in general, in the sense that no inertial reference frame can be given a special status in preference to any other, and the second being that the velocity of light is the same if measured within any inertial frame, independently of the reference frame in which the light was emitted. The validity of the transformation equations that result from the derivation depend on the correctness of these postulates, which are not proven by argument, but depend entirely on accepted experimental verification for their credibility. The 1905 paper deals with the case of two inertial reference frames in constant relative motion. We rigidly connect ourselves to one of the frames so that it is not moving relative to us, and so that we can thus call it the 'stationary' frame. The other frame then becomes the 'moving' frame. We could connect ourselves to either frame, or neither, in which case both frames would be moving relative to us. It is simpler, however, to arrange for one of the frames to be stationary relative to ourselves.
Einstein starts off with a context that is illustrated in the diagrams below. The event whose coordinates are to be considered is event B, and this is described in the moving frame by coordinates of the type, There are, however, two versions of this general transformation: a transformation of a first kind, in which x,y,z,t are coordinate values referred to the origin of the stationary frame, as in Figure 2, and a transformation of a second kind, in which they are referred to the origin of the moving frame, but remain stationary frame measurements, as in Figure 3. In both kinds of transformations the values of |

|
Einstein refers to the velocity of light being c-v or c+v, rather than c, in the moving frame, from the perspective of the stationary observer. The existence of two distances and two times as, A form such as X/t, where X is a distance within the moving frame, and moving with it, represents the stationary observer's attempt to see how the light, emitted in the moving frame, is travelling within the moving frame, rather than within his own stationary frame. This is illustrated in figure 3. That is, he tries to eliminate the relative motion from his viewpoint in order to see what the moving observer sees, but necessarily uses stationary frame calculations. But this causes him to interpret light as travelling within the moving frame at values of velocity other than c, ie (c-v), or (c+v). The postulates of relativity, however, declare that the moving observer does not see the light moving in accordance with these values, and the coordinate values of events therefore have to be transformed to allow the stationary observer to understand the values the moving observer would obtain. This is achieved by the transformation equations of the second kind. Einstein begins by constructing these equations and then uses them to obtain the desired transformation equations of the first kind. As indicated previously, the viewpoint on the basis of which the transformation equations of the second kind are obtained involves a relationship of figures 1 and 3, rather than figures 1 and 2. |
|
In figure 1, we can see that, since, within the moving frame, for the moving observer, the frame is the same as a stationary frame, within which the rod is also stationary, the light takes equal times to go in both directions, so that the time at event B must be half way between the times at events A and C on a moving frame clock, which can be expressed as
For the expression 1/2(
1/2{ if X is sufficiently small, this has the differential form
1/2{ Einstein proceeds to carry out the derivation using differential equations, rather than macroscopic equations, and uses a principle of linearity to convert to macroscopic equations, which I shall do at the end of the derivation. Thus, using the well known partial differential expression for the transformation of the total differential, we can say, in general:
d
and this can be applied separately to each term in the above differential equation, so as to get a differential transformation for each value of d
For the first term, at the start, at event A, in figures 1 and 3, there are as yet no d
d At event C, we have dt 1= X/(c-v) + X/(c+v), and thus
d
For the right hand side of the equation, at event B, we have values for d
d here, dt = X/(c-v), so we have:
d
It is worth examining more closely the nature of the term ( Since the differential equation has been written in the form
1/2(
we can eliminate
1/2 d
and, substituting values obtained above for d
1/2( From which
(
Putting this into the equation for d
d
Now we can say, putting
cd which are the first two transformation equations of the second kind.
1/2{
All the subscripts have the same relationship to one another as before, but now refer to a corresponding arrangement of events in the transverse case. Applying the transformation of the total differential, as previously, to d d and with, as before
1/2 d we get the equation
(
so we must have
thus
d or
cd and, by a similar argument for the z axis
cd
Einstein has (
d
Einstein now goes on to obtain the value of
He then argues that we have d Since we have, above
( Therefore
This shows that
To get the desired, macroscopic transformation equations of the first kind, we must use coordinate values in figure 2 and, for simplicity, set
|
While the above page contains nothing regarding Special Relativity and the transformation equations that is unorthodox,
the following links connect to pages that do contain an unorthodox interpretation of the transformation equations.
This is because I believe that, although the Minkowski equation is mathematically valid, it can be proven that
the Minkowski metric cannot represent a really existing spacetime.
A Lightlike Interpretation of Special Relativity
© Alen, March 2007. All rights reserved.
alen1@westserv.net.au
Material on this page may be reproduced
for personal use only.