RELATIVITY
THE SPECIAL & THE GENERAL THEORY
A POPULAR EXPOSITION
BY
ALBERT EINSTEIN, Ph.D.
PROFESSOR OF PHYSICS IN THE UNIVERSITY OF BERLIN
AUTHORISED TRANSLATION BY
ROBERT W. LAWSON, D.Sc.
UNIVERSITY OF SHEFFIELD
WITH FIVE DIAGRAMS
AND A PORTRAIT OF THE AUTHOR
THIRD EDITION
METHUEN & CO. LTD.
36 ESSEX STREET W.C.
LONDON
This Translation was first Published August 19th 1920
Second EditionSeptember 1920
Third Edition1920
PREFACE
THE present book is intended, as far as possible, to give an exact insight into the theory of Relativity to those readers who, from a general scientific and philosophical point of view, are interested in the theory, but who are not conversant with the mathematical apparatus[1] of theoretical physics. The work presumes a standard of education corresponding to that of a university matriculation examination, and, despite the shortness of the book, a fair amount of patience and force of will on the part of the reader. The author has spared himself no pains in his endeavour to present the main ideas in the simplest and most intelligible form, and on the whole, in the sequence and connection in which they actually originated. In the interest of clearness, it appeared to me inevitable that I should repeat myself frequently, without paying the slightest attention to the elegance of the presentation. I adhered scrupulously to the precept of that brilliant theoretical physicist L. Boltzmann, according to whom matters of elegance ought to be left to the tailor and to the cobbler. I make no pretence of having withheld from the reader difficulties which are inherent to the subject. On the other hand, I have purposely treated the empirical physical foundations of the theory in a "step-motherly" fashion, so that readers unfamiliar with physics may not feel like the wanderer who was unable to see the forest for trees. May the book bring some one a few happy hours of suggestive thought!
December, 1916
A. EINSTEIN
[1]The mathematical fundaments of the special theory of relativity are to be found in the original papers of H. A. Lorentz, A. Einstein, H. Minkowski, published under the title Das Relativitätsprinzip (The Principle of Relativity) in B. G. Teubner's collection of monographs Fortschritte der mathematischen Wissenschaften (Advances in the Mathematical Sciences), also in M. Laue's exhaustive book Das Relativitätsprinzip—published by Friedr. Vieweg & Son, Braunschweig. The general theory of relativity, together with the necessary parts of the theory of invariants, is dealt with in the author's book Die Grundlagen der allgemeinen Relativitätstheorie (The Foundations of the General Theory of Relativity) Joh. Ambr. Barth, 1916; this book assumes some familiarity with the special theory of relativity.
NOTE TO THE THIRD EDITION
IN the present year (1918) an excellent and detailed manual on the general theory of relativity, written by H. Weyl, was published by the firm Julius Springer (Berlin). This book, entitled Raum—Zeit—Materie (Space—Time—Matter), may be warmly recommended to mathematicians and physicists.
BIOGRAPHICAL NOTE
ALBERT EINSTEIN is the son of German-Jewish parents. He was born in 1879 in the town of Ulm, Würtemberg, Germany. His schooldays were spent in Munich, where he attended the Gymnasium until his sixteenth year. After leaving school at Munich, he accompanied his parents to Milan, whence he proceeded to Switzerland six months later to continue his studies.
From 1896 to 1900 Albert Einstein studied mathematics and physics at the Technical High School in Zurich, as he intended becoming a secondary school (Gymnasium) teacher. For some time afterwards he was a private tutor, and having meanwhile become naturalised, he obtained a post as engineer in the Swiss Patent Office in 1902 which position he occupied till 1909. The main ideas involved in the most important of Einstein's theories date back to this period. Amongst these may be mentioned: The Special Theory of Relativity, Inertia of Energy, Theory of the Brownian Movement, and the Quantum-Law of the Emission and Absorption of Light (1905). These were followed some years later by the Theory of the Specific Heat of Solid Bodies, and the fundamental idea of the General Theory of Relativity.
During the interval 1909 to 1911 he occupied the post of Professor Extraordinarius at the University of Zurich, afterwards being appointed to the University of Prague, Bohemia, where he remained as Professor Ordinarius until 1912. In the latter year Professor Einstein accepted a similar chair at the Polytechnikum, Zurich, and continued his activities there until 1914, when he received a call to the Prussian Academy of Science, Berlin, as successor to Van't Hoff. Professor Einstein is able to devote himself freely to his studies at the Berlin Academy, and it was here that he succeeded in completing his work on the General Theory of Relativity (1915-17). Professor Einstein also lectures on various special branches of physics at the University of Berlin, and, in addition, he is Director of the Institute for Physical Research of the Kaiser Wilhelm Gesellschaft.
Professor Einstein has been twice married. His first wife, whom he married at Berne in 1903, was a fellow-student from Serbia. There were two sons of this marriage, both of whom are living in Zurich, the elder being sixteen years of age. Recently Professor Einstein married a widowed cousin, with whom he is now living in Berlin.
R. W. L.
TRANSLATOR'S NOTE
IN presenting this translation to the English-reading public, it is hardly necessary for me to enlarge on the Author's prefatory remarks, except to draw attention to those additions to the book which do not appear in the original.
At my request, Professor Einstein kindly supplied me with a portrait of himself, by one of Germany's most celebrated artists. Appendix III, on "The Experimental Confirmation of the General Theory of Relativity," has been written specially for this translation. Apart from these valuable additions to the book, I have included a biographical note on the Author, and, at the end of the book, an Index and a list of English references to the subject. This list, which is more suggestive than exhaustive, is intended as a guide to those readers who wish to pursue the subject farther.
I desire to tender my best thanks to my colleagues Professor S. R. Milner, D.Sc., and Mr. W. E. Curtis, A.R.C.Sc., F.R.A.S., also to my friend Dr. Arthur Holmes, A.R.C.Sc., F.G.S., of the Imperial College, for their kindness in reading through the manuscript, for helpful criticism, and for numerous suggestions. I owe an expression of thanks also to Messrs. Methuen for their ready counsel and advice, and for the care they have bestowed on the work during the course of its publication.
ROBERT W. LAWSON
THE PHYSICS LABORATORY
THE UNIVERSITY OF SHEFFIELD
June 12, 1920
CONTENTS
[PART I
THE SPECIAL THEORY OF RELATIVITY]
I. [Physical Meaning of Geometrical Propositions]
II. [The System of Co-ordinates]
III. [Space and Time in Classical Mechanics]
IV. [The Galileian System of Co-ordinates]
V. [The Principle of Relativity (in the Restricted
Sense)]
VI. [The Theorem of the Addition of Velocities employed
in Classical Mechanics]
VII. [The Apparent Incompatibility of the Law of
Propagation of Light with the Principle of
Relativity]
VIII. [On the Idea of Time in Physics]
IX. [The Relativity of Simultaneity]
X. [On the Relativity of the Conception of Distance]
XI. [The Lorentz Transformation]
XII. [The Behaviour of Measuring-Rods and Clocks
in Motion]
XIII. [Theorem of the Addition of Velocities. The
Experiment of Fizeau]
XIV. [The Heuristic Value of the Theory of Relativity]
XV. [General Results of the Theory]
XVI. [Experience and the Special Theory of Relativity]
XVII. [Minkowski's Four-dimensional Space]
[PART II
THE GENERAL THEORY OF RELATIVITY]
XVIII. [Special and General Principle of Relativity]
XIX. [The Gravitational Field]
XX. [The Equality of Inertial and Gravitational Mass
as an Argument for the General Postulate
of Relativity]
XXI. [In what Respects are the Foundations of Classical
Mechanics and of the Special Theory
of Relativity unsatisfactory?]
XXII. [A Few Inferences from the General Principle of
Relativity]
XXIII. [Behaviour of Clocks and Measuring-Rods on a
Rotating Body of Reference]
XXIV. [Euclidean and Non-Euclidean Continuum]
XXV. [Gaussian Co-ordinates]
XXVI. [The Space-time Continuum of the Special
Theory of Relativity considered as a
Euclidean Continuum
] XXVII. [The Space-time Continuum of the General
Theory of Relativity is not a Euclidean
Continuum]
XXVIII. [Exact Formulation of the General Principle of
Relativity]
XXIX. [The Solution of the Problem of Gravitation on
the Basis of the General Principle of
Relativity]
[PART III
CONSIDERATIONS ON THE UNIVERSE
AS A WHOLE]
XXX. [Cosmological Difficulties of Newton's Theory]
XXXI. [The Possibility of a "Finite" and yet "Unbounded"
Universe]
XXXII. [The Structure of Space according to the
General Theory of Relativity]
[APPENDICES]
I. [Simple Derivation of the Lorentz Transformation
[Supplementary to Section XI.]
II. [Minkowski's Four-dimensional Space ("World")
[Supplementary to Section XVII.]
III. [The Experimental Confirmation of the General
Theory of Relativity
(a) Motion of the Perihelion of Mercury
(b) Deflection of Light by a Gravitational Field
(c) Displacement of Spectral Lines towards the
Red]
[BIBLIOGRAPHY]
[INDEX]
RELATIVITY
THE SPECIAL AND THE GENERAL THEORY
PART I
THE SPECIAL THEORY OF RELATIVITY
I
PHYSICAL MEANING OF GEOMETRICAL
PROPOSITIONS
IN your schooldays most of you who read this book made acquaintance with the noble building of Euclid's geometry, and you remember—perhaps with more respect than love—the magnificent structure, on the lofty staircase of which you were chased about for uncounted hours by conscientious teachers. By reason of your past experience, you would certainly regard everyone with disdain who should pronounce even the most out-of-the-way proposition of this science to be untrue. But perhaps this feeling of proud certainty would leave you immediately if some one were to ask you: "What, then, do you mean by the assertion that these propositions are true?" Let us proceed to give this question a little consideration.
Geometry sets out from certain conceptions such as "plane," "point," and "straight line," with which we are able to associate more or less definite ideas, and from certain simple propositions (axioms) which, in virtue of these ideas, we are inclined to accept as "true." Then, on the basis of a logical process, the justification of which we feel ourselves compelled to admit, all remaining propositions are shown to follow from those axioms, i.e. they are proven. A proposition is then correct ("true") when it has been derived in the recognised manner from the axioms. The question of the "truth" of the individual geometrical propositions is thus reduced to one of the "truth" of the axioms. Now it has long been known that the last question is not only unanswerable by the methods of geometry, but that it is in itself entirely without meaning. We cannot ask whether it is true that only one straight line goes through two points. We can only say that Euclidean geometry deals with things called "straight lines," to each of which is ascribed the property of being uniquely determined by two points situated on it. The concept "true" does not tally with the assertions of pure geometry, because by the word "true" we are eventually in the habit of designating always the correspondence with a "real" object; geometry, however, is not concerned with the relation of the ideas involved in it to objects of experience, but only with the logical connection of these ideas among themselves.
It is not difficult to understand why, in spite of this, we feel constrained to call the propositions of geometry "true." Geometrical ideas correspond to more or less exact objects in nature, and these last are undoubtedly the exclusive cause of the genesis of those ideas. Geometry ought to refrain from such a course, in order to give to its structure the largest possible logical unity. The practice, for example, of seeing in a "distance" two marked positions on a practically rigid body is something which is lodged deeply in our habit of thought. We are accustomed further to regard three points as being situated on a straight line, if their apparent positions can be made to coincide for observation with one eye, under suitable choice of our place of observation.
If, in pursuance of our habit of thought, we now supplement the propositions of Euclidean geometry by the single proposition that two points on a practically rigid body always correspond to the same distance (line-interval), independently of any changes in position to which we may subject the body, the propositions of Euclidean geometry then resolve themselves into propositions on the possible relative position of practically rigid bodies.[2] Geometry which has been supplemented in this way is then to be treated as a branch of physics. We can now legitimately ask as to the "truth" of geometrical propositions interpreted in this way, since we are justified in asking whether these propositions are satisfied for those real things we have associated with the geometrical ideas. In less exact terms we can express this by saying that by the "truth" of a geometrical proposition in this sense we understand its validity for a construction with ruler and compasses.
Of course the conviction of the "truth" of geometrical propositions in this sense is founded exclusively on rather incomplete experience. For the present we shall assume the "truth" of the geometrical propositions, then at a later stage (in the general theory of relativity) we shall see that this "truth" is limited, and we shall consider the extent of its limitation.
[2]It follows that a natural object is associated also with a straight line. Three points
,
and
on a rigid body thus lie in a straight line when, the points
and
being given,
is chosen such that the sum of the distances
and
is as short as possible. This incomplete suggestion will suffice for our present purpose.
II
THE SYSTEM OF CO-ORDINATES
ON the basis of the physical interpretation of distance which has been indicated, we are also in a position to establish the distance between two points on a rigid body by means of measurements. For this purpose we require a "distance" (rod
) which is to be used once and for all, and which we employ as a standard measure. If, now,
and
are two points on a rigid body, we can construct the line joining them according to the rules of geometry; then, starting from
, we can mark off the distance
time after time until we reach
. The number of these operations required is the numerical measure of the distance
. This is the basis of all measurement of length.[3]
Every description of the scene of an event or of the position of an object in space is based on the specification of the point on a rigid body (body of reference) with which that event or object coincides. This applies not only to scientific description, but also to everyday life. If I analyse the place specification "Trafalgar Square, London,"[4] I arrive at the following result. The earth is the rigid body to which the specification of place refers; "Trafalgar Square, London," is a well-defined point, to which a name has been assigned, and with which the event coincides in space.[5]
This primitive method of place specification deals only with places on the surface of rigid bodies, and is dependent on the existence of points on this surface which are distinguishable from each other. But we can free ourselves from both of these limitations without altering the nature of our specification of position. If, for instance, a cloud is hovering over Trafalgar Square, then we can determine its position relative to the surface of the earth by erecting a pole perpendicularly on the Square, so that it reaches the cloud. The length of the pole measured with the standard measuring-rod, combined with the specification of the position of the foot of the pole, supplies us with a complete place specification. On the basis of this illustration, we are able to see the manner in which a refinement of the conception of position has been developed.
(a) We imagine the rigid body, to which the place specification is referred, supplemented in such a manner that the object whose position we require is reached by the completed rigid body.
(b) In locating the position of the object, we make use of a number (here the length of the pole measured with the measuring-rod) instead of designated points of reference.
(c) We speak of the height of the cloud even when the pole which reaches the cloud has not been erected. By means of optical observations of the cloud from different positions on the ground, and taking into account the properties of the propagation of light, we determine the length of the pole we should have required in order to reach the cloud.
From this consideration we see that it will be advantageous if, in the description of position, it should be possible by means of numerical measures to make ourselves independent of the existence of marked positions (possessing names) on the rigid body of reference. In the physics of measurement this is attained by the application of the Cartesian system of co-ordinates.
This consists of three plane surfaces perpendicular to each other and rigidly attached to a rigid body. Referred to a system of co-ordinates, the scene of any event will be determined (for the main part) by the specification of the lengths of the three perpendiculars or co-ordinates (
) which can be dropped from the scene of the event to those three plane surfaces. The lengths of these three perpendiculars can be determined by a series of manipulations with rigid measuring-rods performed according to the rules and methods laid down by Euclidean geometry.
In practice, the rigid surfaces which constitute the system of co-ordinates are generally not available; furthermore, the magnitudes of the co-ordinates are not actually determined by constructions with rigid rods, but by indirect means. If the results of physics and astronomy are to maintain their clearness, the physical meaning of specifications of position must always be[6] sought in accordance with the above considerations.
We thus obtain the following result: Every description of events in space involves the use of a rigid body to which such events have to be referred. The resulting relationship takes for granted that the laws of Euclidean geometry hold for "distances," the "distance" being represented physically by means of the convention of two marks on a rigid body.
[3]Here we have assumed that there is nothing left over, i.e. that the measurement gives a whole number. This difficulty is got over by the use of divided measuring-rods, the introduction of which does not demand any fundamentally new method.
[4]I have chosen this as being more familiar to the English reader than the "Potsdamer Platz, Berlin," which is referred to in the original. (R. W. L.)
[5]It is not necessary here to investigate further the significance of the expression "coincidence in space." This conception is sufficiently obvious to ensure that differences of opinion are scarcely likely to arise as to its applicability in practice.
[6]A refinement and modification of these views does not become necessary until we come to deal with the general theory of relativity, treated in the second part of this book.
III
SPACE AND TIME IN CLASSICAL MECHANICS
"THE purpose of mechanics is to describe how bodies change their position in space with time." I should load my conscience with grave sins against the sacred spirit of lucidity were I to formulate the aims of mechanics in this way, without serious reflection and detailed explanations. Let us proceed to disclose these sins.
It is not clear what is to be understood here by "position" and "space." I stand at the window of a railway carriage which is travelling uniformly, and drop a stone on the embankment, without throwing it. Then, disregarding the influence of the air resistance, I see the stone descend in a straight line. A pedestrian who observes the misdeed from the footpath notices that the stone falls to earth in a parabolic curve. I now ask: Do the "positions" traversed by the stone lie "in reality" on a straight line or on a parabola? Moreover, what is meant here by motion "in space"? From the considerations of the previous section the answer is self-evident. In the first place, we entirely shun the vague word "space," of which, we must honestly acknowledge, we cannot form the slightest conception, and we replace it by "motion relative to a practically rigid body of reference." The positions relative to the body of reference (railway carriage or embankment) have already been defined in detail in the preceding section. If instead of "body of reference" we insert "system of co-ordinates," which is a useful idea for mathematical description, we are in a position to say: The stone traverses a straight line relative to a system of co-ordinates rigidly attached to the carriage, but relative to a system of co-ordinates rigidly attached to the ground (embankment) it describes a parabola. With the aid of this example it is clearly seen that there is no such thing as an independently existing trajectory (lit. "path-curve"[7]), but only a trajectory relative to a particular body of reference.
In order to have a complete description of the motion, we must specify how the body alters its position with time; i.e. for every point on the trajectory it must be stated at what time the body is situated there. These data must be supplemented by such a definition of time that, in virtue of this definition, these time-values can be regarded essentially as magnitudes (results of measurements) capable of observation. If we take our stand on the ground of classical mechanics, we can satisfy this requirement for our illustration in the following manner. We imagine two clocks of identical construction; the man at the railway-carriage window is holding one of them, and the man on the footpath the other. Each of the observers determines the position on his own reference-body occupied by the stone at each tick of the clock he is holding in his hand. In this connection we have not taken account of the inaccuracy involved by the finiteness of the velocity of propagation of light. With this and with a second difficulty prevailing here we shall have to deal in detail later.
[7]That is, a curve along which the body moves.
IV
THE GALILEIAN SYSTEM OF CO-ORDINATES
AS is well known, the fundamental law of the mechanics of Galilei-Newton, which is known as the law of inertia, can be stated thus: A body removed sufficiently far from other bodies continues in a state of rest or of uniform motion in a straight line. This law not only says something about the motion of the bodies, but it also indicates the reference-bodies or systems of co-ordinates, permissible in mechanics, which can be used in mechanical description. The visible fixed stars are bodies for which the law of inertia certainly holds to a high degree of approximation. Now if we use a system of co-ordinates which is rigidly attached to the earth, then, relative to this system, every fixed star describes a circle of immense radius in the course of an astronomical day, a result which is opposed to the statement of the law of inertia. So that if we adhere to this law we must refer these motions only to systems of co-ordinates relative to which the fixed stars do not move in a circle. A system of co-ordinates of which the state of motion is such that the law of inertia holds relative to it is called a "Galileian system of co-ordinates." The laws of the mechanics of Galilei-Newton can be regarded as valid only for a Galileian system of co-ordinates.
V
THE PRINCIPLE OF RELATIVITY (IN THE RESTRICTED SENSE)
IN order to attain the greatest possible clearness, let us return to our example of the railway carriage supposed to be travelling uniformly. We call its motion a uniform translation ("uniform" because it is of constant velocity and direction, "translation" because although the carriage changes its position relative to the embankment yet it does not rotate in so doing). Let us imagine a raven flying through the air in such a manner that its motion, as observed from the embankment, is uniform and in a straight line. If we were to observe the flying raven from the moving railway carriage, we should find that the motion of the raven would be one of different velocity and direction, but that it would still be uniform and in a straight line. Expressed in an abstract manner we may say: If a mass
is moving uniformly in a straight line with respect to a co-ordinate system
, then it will also be moving uniformly and in a straight line relative to a second co-ordinate system
', provided that the latter is executing a uniform translatory motion with respect to
. In accordance with the discussion contained in the preceding section, it follows that:
If
is a Galileian co-ordinate system, then every other co-ordinate system
' is a Galileian one, when, in relation to
, it is in a condition of uniform motion of translation. Relative to
' the mechanical laws of Galilei-Newton hold good exactly as they do with respect to
.
We advance a step farther in our generalisation when we express the tenet thus: If, relative to
,
' is a uniformly moving co-ordinate system devoid of rotation, then natural phenomena run their course with respect to
according to exactly the same general laws as with respect to
. This statement is called the principle of relativity (in the restricted sense).
As long as one was convinced that all natural phenomena were capable of representation with the help of classical mechanics, there was no need to doubt the validity of this principle of relativity. But in view of the more recent development of electrodynamics and optics it became more and more evident that classical mechanics affords an insufficient foundation for the physical description of all natural phenomena. At this juncture the question of the validity of the principle of relativity became ripe for discussion, and it did not appear impossible that the answer to this question might be in the negative.
Nevertheless, there are two general facts which at the outset speak very much in favour of the validity of the principle of relativity. Even though classical mechanics does not supply us with a sufficiently broad basis for the theoretical presentation of all physical phenomena, still we must grant it a considerable measure of "truth," since it supplies us with the actual motions of the heavenly bodies with a delicacy of detail little short of wonderful. The principle of relativity must therefore apply with great accuracy in the domain of mechanics. But that a principle of such broad generality should hold with such exactness in one domain of phenomena, and yet should be invalid for another, is a priori not very probable.
We now proceed to the second argument, to which, moreover, we shall return later. If the principle of relativity (in the restricted sense) does not hold, then the Galileian co-ordinate systems
,
',
'', etc., which are moving uniformly relative to each other, will not be equivalent for the description of natural phenomena. In this case we should be constrained to believe that natural laws are capable of being formulated in a particularly simple manner, and of course only on condition that, from amongst all possible Galileian co-ordinate systems, we should have chosen one
of a particular state of motion as our body of reference. We should then be justified (because of its merits for the description of natural phenomena) in calling this system "absolutely at rest," and all other Galileian systems
"in motion." If, for instance, our embankment were the system
, then our railway carriage would be a system
, relative to which less simple laws would hold than with respect to
. This diminished simplicity would be due to the fact that the carriage
would be in motion (i.e. "really") with respect to
. In the general laws of nature which have been formulated with reference to
, the magnitude and direction of the velocity of the carriage would necessarily play a part. We should expect, for instance, that the note emitted by an organ-pipe placed with its axis parallel to the direction of travel would be different from that emitted if the axis of the pipe were placed perpendicular to this direction. Now in virtue of its motion in an orbit round the sun, our earth is comparable with a railway carriage travelling with a velocity of about 30 kilometres per second. If the principle of relativity were not valid we should therefore expect that the direction of motion of the earth at any moment would enter into the laws of nature, and also that physical systems in their behaviour would be dependent on the orientation in space with respect to the earth. For owing to the alteration in direction of the velocity of revolution of the earth in the course of a year, the earth cannot be at rest relative to the hypothetical system
throughout the whole year. However, the most careful observations have never revealed such anisotropic properties in terrestrial physical space, i.e. a physical non-equivalence of different directions. This is very powerful argument in favour of the principle of relativity.
VI
THE THEOREM OF THE ADDITION OF VELOCITIES EMPLOYED IN CLASSICAL MECHANICS
LET us suppose our old friend the railway carriage to be travelling along the rails with a constant velocity
, and that a man traverses the length of the carriage in the direction of travel with a velocity
. How quickly or, in other words, with what velocity
does the man advance relative to the embankment during the process? The only possible answer seems to result from the following consideration: If the man were to stand still for a second, he would advance relative to the embankment through a distance
equal numerically to the velocity of the carriage. As a consequence of his walking, however, he traverses an additional distance
relative to the carriage, and hence also relative to the embankment, in this second, the distance
being numerically equal to the velocity with which he is walking. Thus in total he covers the distance
relative to the embankment in the second considered. We shall see later that this result, which expresses the theorem of the addition of velocities employed in classical mechanics, cannot be maintained; in other words, the law that we have just written down does not hold in reality. For the time being, however, we shall assume its correctness.
VII
THE APPARENT INCOMPATIBILITY OF THE LAW OF PROPAGATION OF LIGHT WITH THE PRINCIPLE OF RELATIVITY
THERE is hardly a simpler law in physics than that according to which light is propagated in empty space. Every child at school knows, or believes he knows, that this propagation takes place in straight lines with a velocity
km./sec. At all events we know with great exactness that this velocity is the same for all colours, because if this were not the case, the minimum of emission would not be observed simultaneously for different colours during the eclipse of a fixed star by its dark neighbour. By means of similar considerations based on observations of double stars, the Dutch astronomer De Sitter was also able to show that the velocity of propagation of light cannot depend on the velocity of motion of the body emitting the light. The assumption that this velocity of propagation is dependent on the direction "in space" is in itself improbable.
In short, let us assume that the simple law of the constancy of the velocity of light
(in vacuum) is justifiably believed by the child at school. Who would imagine that this simple law has plunged the conscientiously thoughtful physicist into the greatest intellectual difficulties? Let us consider how these difficulties arise.
Of course we must refer the process of the propagation of light (and indeed every other process) to a rigid reference-body (co-ordinate system). As such a system let us again choose our embankment. We shall imagine the air above it to have been removed. If a ray of light be sent along the embankment, we see from the above that the tip of the ray will be transmitted with the velocity
relative to the embankment. Now let us suppose that our railway carriage is again travelling along the railway lines with the velocity
, and that its direction is the same as that of the ray of light, but its velocity of course much less. Let us inquire about the velocity of propagation of the ray of light relative to the carriage. It is obvious that we can here apply the consideration of the previous section, since the ray of light plays the part of the man walking along relatively to the carriage. The velocity
of the man relative to the embankment is here replaced by the velocity of light relative to the embankment.
is the required velocity of light with respect to the carriage, and we have
The velocity of propagation of a ray of light relative to the carriage thus comes out smaller than
.
But this result comes into conflict with the principle of relativity set forth in Section V. For, like every other general law of nature, the law of the transmission of light in vacuo must, according to the principle of relativity, be the same for the railway carriage as reference-body as when the rails are the body of reference. But, from our above consideration, this would appear to be impossible. If every ray of light is propagated relative to the embankment with the velocity
, then for this reason it would appear that another law of propagation of light must necessarily hold with respect to the carriage—a result contradictory to the principle of relativity.
In view of this dilemma there appears to be nothing else for it than to abandon either the principle of relativity or the simple law of the propagation of light in vacuo. Those of you who have carefully followed the preceding discussion are almost sure to expect that we should retain the principle of relativity, which appeals so convincingly to the intellect because it is so natural and simple. The law of the propagation of light in vacuo would then have to be replaced by a more complicated law conformable to the principle of relativity. The development of theoretical physics shows, however, that we cannot pursue this course. The epoch-making theoretical investigations of H. A. Lorentz on the electrodynamical and optical phenomena connected with moving bodies show that experience in this domain leads conclusively to a theory of electromagnetic phenomena, of which the law of the constancy of the velocity of light in vacuo is a necessary consequence. Prominent theoretical physicists were therefore more inclined to reject the principle of relativity, in spite of the fact that no empirical data had been found which were contradictory to this principle.
At this juncture the theory of relativity entered the arena. As a result of an analysis of the physical conceptions of time and space, it became evident that in reality there is not the least incompatibility between the principle of relativity and the law of propagation of light, and that by systematically holding fast to both these laws a logically rigid theory could be arrived at. This theory has been called the special theory of relativity to distinguish it from the extended theory, with which we shall deal later. In the following pages we shall present the fundamental ideas of the special theory of relativity.
VIII
ON THE IDEA OF TIME IN PHYSICS
LIGHTNING has struck the rails on our railway embankment at two places
and
far distant from each other. I make the additional assertion that these two lightning flashes occurred simultaneously. If I ask you whether there is sense in this statement, you will answer my question with a decided "Yes." But if I now approach you with the request to explain to me the sense of the statement more precisely, you find after some consideration that the answer to this question is not so easy as it appears at first sight.
After some time perhaps the following answer would occur to you: "The significance of the statement is clear in itself and needs no further explanation; of course it would require some consideration if I were to be commissioned to determine by observations whether in the actual case the two events took place simultaneously or not." I cannot be satisfied with this answer for the following reason. Supposing that as a result of ingenious considerations an able meteorologist were to discover that the lightning must always strike the places
and
simultaneously, then we should be faced with the task of testing whether or not this theoretical result is in accordance with the reality. We encounter the same difficulty with all physical statements in which the conception "simultaneous" plays a part. The concept does not exist for the physicist until he has the possibility of discovering whether or not it is fulfilled in an actual case. We thus require a definition of simultaneity such that this definition supplies us with the method by means of which, in the present case, he can decide by experiment whether or not both the lightning strokes occurred simultaneously. As long as this requirement is not satisfied, I allow myself to be deceived as a physicist (and of course the same applies if I am not a physicist), when I imagine that I am able to attach a meaning to the statement of simultaneity. (I would ask the reader not to proceed farther until he is fully convinced on this point.)
After thinking the matter over for some time you then offer the following suggestion with which to test simultaneity. By measuring along the rails, the connecting line
should be measured up and an observer placed at the mid-point
of the distance
. This observer should be supplied with an arrangement (e.g. two mirrors inclined at
) which allows him visually to observe both places
and
at the same time. If the observer perceives the two flashes of lightning at the same time, then they are simultaneous.
I am very pleased with this suggestion, but for all that I cannot regard the matter as quite settled, because I feel constrained to raise the following objection: "Your definition would certainly be right, if I only knew that the light by means of which the observer at
perceives the lightning flashes travels along the length
with the same velocity as along the length
. But an examination of this supposition would only be possible if we already had at our disposal the means of measuring time. It would thus appear as though we were moving here in a logical circle."
After further consideration you cast a somewhat disdainful glance at me—and rightly so—and you declare: "I maintain my previous definition nevertheless, because in reality it assumes absolutely nothing about light. There is only one demand to be made of the definition of simultaneity, namely, that in every real case it must supply us with an empirical decision as to whether or not the conception that has to be defined is fulfilled. That my definition satisfies this demand is indisputable. That light requires the same time to traverse the path
as for the path
is in reality neither a supposition nor a hypothesis about the physical nature of light, but a stipulation which I can make of my own free will in order to arrive at a definition of simultaneity."
It is clear that this definition can be used to give an exact meaning not only to two events, but to as many events as we care to choose, and independently of the positions of the scenes of the events with respect to the body of reference[8] (here the railway embankment). We are thus led also to a definition of "time" in physics. For this purpose we suppose that clocks of identical construction are placed at the points
,
and
of the railway line (co-ordinate system), and that they are set in such a manner that the positions of their pointers are simultaneously (in the above sense) the same. Under these conditions we understand by the "time" of an event the reading (position of the hands) of that one of these clocks which is in the immediate vicinity (in space) of the event. In this manner a time-value is associated with every event which is essentially capable of observation.
This stipulation contains a further physical hypothesis, the validity of which will hardly be doubted without empirical evidence to the contrary. It has been assumed that all these clocks go at the same rate if they are of identical construction. Stated more exactly: When two clocks arranged at rest in different places of a reference-body are set in such a manner that a particular position of the pointers of the one clock is simultaneous (in the above sense) with the same position of the pointers of the other clock, then identical "settings" are always simultaneous (in the sense of the above definition).
[8]We suppose further, that, when three events
,
and
occur in different places in such a manner that
is simultaneous with
, and
is simultaneous with
(simultaneous in the sense of the above definition), then the criterion for the simultaneity of the pair of events
,
is also satisfied. This assumption is a physical hypothesis about the law of propagation of light; it must certainly be fulfilled if we are to maintain the law of the constancy of the velocity of light in vacuo.
IX
THE RELATIVITY OF SIMULTANEITY
UP to now our considerations have been referred to a particular body of reference, which we have styled a "railway embankment." We suppose a very long train travelling along the rails with the constant velocity
and in the direction indicated in Fig. 1.
FIG. 1.
People travelling in this train will with advantage use the train as a rigid reference-body (co-ordinate system); they regard all events in reference to the train. Then every event which takes place along the line also takes place at a particular point of the train. Also the definition of simultaneity can be given relative to the train in exactly the same way as with respect to the embankment. As a natural consequence, however, the following question arises:
Are two events (e.g. the two strokes of lightning
and
) which are simultaneous with reference to the railway embankment also simultaneous relatively to the train? We shall show directly that the answer must be in the negative.
When we say that the lightning strokes
and
are simultaneous with respect to the embankment, we mean: the rays of light emitted at the places
and
, where the lightning occurs, meet each other at the mid-point
of the length
of the embankment. But the events
and
also correspond to positions
and
on the train. Let
' be the mid-point of the distance
on the travelling train. Just when the flashes[9] of lightning occur, this point
' naturally coincides with the point
, but it moves towards the right in the diagram with the velocity