THE LOGIC OF
MODERN PHYSICS
BY
P. W. BRIDGMAN
HOLLIS PROFESSOR OF MATHEMATICS AND NATURAL HISTORY
IN HARVARD UNIVERSITY
NEW YORK THE MACMILLAN COMPANY
COPYRIGHT, 1927,
By THE MACMILLAN COMPANY.
[PREFACE]
This excursion into the field of fundamental criticism by one whose activities have hitherto been confined almost entirely to experiment is not evidence of senile decay, as might be cynically assumed. I have always, throughout all my experimental work, felt an imperative need of a better understanding of the foundations of our physical thought and have for a long time made more or less unsystematic attempts to reach such an understanding. Only now, however, has a half sabbatical year given me leisure to attempt a more or less orderly exposition.
In spite of previous writings on the broad fundamentals by Clifford, Stallo, Mach, and Poincaré, to mention only a few, I believe a new essay of this critical character needs no apology. For entirely apart from the question of whether many of the points of view of these essays can be maintained, the discovery of new facts in the domain of relativity and quantum theory has shifted the center of interest and emphasis. All the quite recent activity with the new quantum mechanics seems to call for a new examination of fundamental matters which shall recognize, at least by implication, the existence of the special phenomena of the quantum domain. However, the necessity for re-examination does not mean at all that many of the results of previous criticism may not still be accepted; some of these results have become so thoroughly incorporated into physical thinking that we can assume them without mention. Thus the fundamental attitude of this essay is empiricism, which is now justified as the attitude of the physicist in large part by the inquiry into the physiological origin of our concepts of space, time, and mechanics with which the previous essays were largely concerned.
None of the previous essays have consciously or immediately affected the details of this; in fact I have not read any of them within several years. If passages here recall passages already written, it is because the ideas have been assimilated and the precise origin forgotten; it is probably worth while to let such passages stand without revision, because such ideas gain in plausibility through having been found acceptable to independent thought.
I am much indebted to Professor R. F. Alfred Hoernlé of the Department of Philosophy of Johannesburg University, South Africa, for suggesting several modifications to make the text more acceptable to a philosopher, and slight amplifications for the benefit of readers not familiar with all the details of recent technical developments in physics.
[INTRODUCTION]
One of the most noteworthy movements in recent physics is a change of attitude toward what may be called the interpretative aspect of physics. It is being increasingly recognized, both in the writings and the conversation of physicists, that the world of experiment is not understandable without some examination of the purpose of physics and of the nature of its fundamental concepts. It is no new thing to attempt a more critical understanding of the nature of physics, but until recently all such attempts have been regarded with a certain suspicion or even sometimes contempt. The average physicist is likely to deprecate his own concern with such questions, and is inclined to dismiss the speculations of fellow physicists with the epithet "metaphysical." This attitude has no doubt had a certain justification in the utter unintelligibility to the physicist of many metaphysical speculations and the sterility of such speculations in yielding physical results. However, the growing reaction favoring a better understanding of the interpretative fundamentals of physics is not a pendulum swing of the fashion of thought toward metaphysics, originating in the upheaval of moral values produced by the great war, or anything of the sort, but is a reaction absolutely forced upon us by a rapidly increasing array of cold experimental facts.
This reaction, or rather new movement, was without doubt initiated by the restricted theory of relativity of Einstein. Before Einstein, an ever increasing number of experimental facts concerning bodies in rapid motion required increasingly complicated modifications in our naïve notions in order to preserve self-consistency, until Einstein showed that everything could be restored again to a wonderful simplicity by a slight change in some of our fundamental concepts. The concepts which were most obviously touched by Einstein were those of space and time, and much of the writing consciously inspired by Einstein has been concerned with these concepts. But that experiment compels a critique of much more than the concepts of space and time is made increasingly evident by all the new facts being discovered in the quantum realm.
The situation presented to us by these new quantum facts is twofold. In the first place, all these experiments are concerned with things so small as to be forever beyond the possibility of direct experience, so that we have the problem of translating the evidence of experiment into other language. Thus we observe an emission line in a spectroscope and may infer an electron jumping from one energy level to another in an atom. In the second place, we have the problem of understanding the translated experimental evidence. Now of course every one knows that this problem is making us the greatest difficulty. The experimental facts are so utterly different from those of our ordinary experience that not only do we apparently have to give up generalizations from past experience as broad as the field equations of electrodynamics, for instance, but it is even being questioned whether our ordinary forms of thought are applicable in the new domain; it is often suggested, for example, that the concepts of space and time break down.
The situation is rapidly becoming acute. Since I began writing this essay, there has been a striking increase in critical activity inspired by the new quantum mechanics of 1925-26, and it is common to hear expositions of the new ideas prefaced by analysis of what experiment really gives to us or what our fundamental concepts really mean. The change in ideas is now so rapid that a number of the statements of this essay are already antiquated as expressions of the best current opinion; however I have allowed these statements to stand, since the fundamental arguments are in nowise affected and we have no reason to think that present best opinions are in any way final. We have the impression of being in an important formative period; if we are, the complexion of physics for a long time in the future will be determined by our present attitude toward fundamental questions of interpretation. To meet this situation it seems to me that something more is needed than the hand-to-mouth philosophy that is now growing up to meet special emergencies, something approaching more closely to a systematic philosophy of all physics which shall cover the experimental domains already consolidated as well as those which are now making us so much trouble. It is the attempt of this essay to give a more or less inclusive critique of all physics. Our problem is the double one of understanding what we are trying to do and what our ideals should be in physics, and of understanding the nature of the structure of physics as it now exists. These two ends are together furthered by an analysis of the fundamental concepts of physics; an understanding of the concepts we how have discloses the present structure of physics and a realization of what the concepts should be involves the ideals of physics. This essay will be largely concerned with the fundamental concepts; it will appear that almost all the concepts can profit from re-examination.
The material of this essay is largely obtained by observation of the actual currents of opinion in physics; much of what I have to say is more or less common property and doubtless every reader will find passages that he will feel have been taken out of his own mouth. On certain broad tendencies in present day physics, however, I have put my own interpretation, and it is more than likely that this interpretation will be unacceptable to many. But even if not acceptable, I hope that the stimulus of combatting the ideas offered here may be of value.
Certain limitations will have to be set to our inquiry in order to keep it within manageable compass. It is of course the merest truism that all our experimental knowledge and our understanding of nature is impossible and non-existent apart from our own mental processes, so that strictly speaking no aspect of psychology or epistemology is without pertinence. Fortunately we shall be able to get along with a more or less naive attitude toward many of these matters. We shall accept as significant our common sense judgment that there is a world external to us, and shall limit as far as possible our inquiry to the behavior and interpretation of this "external" world. We shall rule out inquiries into our states of consciousness as such. In spite, however, of the best intentions, we shall not be able to eliminate completely considerations savoring of the metaphysical, because it is evident that the nature of our thinking mechanism essentially colors any picture that we can form of nature, and we shall have to recognize that unavoidable characteristics of any outlook of ours are imposed in this way.
CONTENTS
CHAPTER
I. [BROAD POINTS OF VIEW]
New Kinds of Experience Always Possible
The Operational Character of Concepts
Einstein's Contribution in Changing Our Attitude
toward Concepts
Detailed Discussion of the Concept of Length
The Relative Character of Knowledge
Meaningless Questions
General Comments on the Operational Point of
View
II. [OTHER GENERAL CONSIDERATIONS]
The Approximate Character of Empirical Knowledge
Explanations and Mechanisms
Models and Constructs
The Rôle of Mathematics in Physics
III. [DETAILED CONSIDERATION OF VARIOUS CONCEPTS
OF PHYSICS]
The Concept of Space
The Concept of Time
The Causality Concept
The Concept of Identity
The Concept of Velocity
The Concepts of Force and Mass
The Concept of Energy
The Concepts of Thermodynamics
Electrical Concepts
The Nature of Light and the Concepts of Relativity
Other Relativity Concepts
Rotational Motion and Relativity
Quantum Concepts
IV. [SPECIAL VIEWS OF NATURE]
The Simplicity of Nature
Determinism
On the Possibility of Describing Nature Completely
in Terms of Analysis
A Glimpse Ahead
[INDEX]
THE LOGIC OF MODERN PHYSICS
CHAPTER I
BROAD POINTS OF VIEW
WHATEVER may be one's opinion as to our permanent acceptance of the analytical details of Einstein's restricted and general theories of relativity, there can be no doubt that through these theories physics is permanently changed. It was a great shock to discover that classical concepts, accepted unquestioningly, were inadequate to meet the actual situation, and the shock of this discovery has resulted in a critical attitude toward our whole conceptual structure which must at least in part be permanent. Reflection on the situation after the event shows that it should not have needed the new experimental facts which led to relativity to convince us of the inadequacy of our previous concepts, but that a sufficiently shrewd analysis should have prepared us for at least the possibility of what Einstein did.
Looking now to the future, our ideas of what external nature is will always be subject to change as we gain new experimental knowledge, but there is a part of our attitude to nature which should not be subject to future change, namely that part which rests on the permanent basis of the character of our minds. It is precisely here, in an improved understanding of our mental relations to nature, that the permanent contribution of relativity is to be found. We should now make it our business to understand so thoroughly the character of our permanent mental relations to nature that another change in our attitude, such as that due to Einstein, shall be forever impossible. It was perhaps excusable that a revolution in mental attitude should occur once, because after all physics is a young science, and physicists have been very busy, but it would certainly be a reproach if such a revolution should ever prove necessary again.
NEW KINDS OF EXPERIENCE ALWAYS POSSIBLE
The first lesson of our recent experience with relativity is merely an intensification and emphasis of the lesson which all past experience has also taught, namely, that when experiment is pushed into new domains, we must be prepared for new facts, of an entirely different character from those of our former experience. This is taught not only by the discovery of those unsuspected properties of matter moving with high velocities, which inspired the theory of relativity, but also even more emphatically by the new facts in the quantum domain. To a certain extent, of course, the recognition of all this does not involve a change of former attitude; the fact has always been for the physicist the one ultimate thing from which there is no appeal, and in the face of which the only possible attitude is a humility almost religious. The new feature in the present situation is an intensified conviction that in reality new orders of experience do exist, and that we may expect to meet them continually. We have already encountered new phenomena in going to high velocities, and in going to small scales of magnitude: we may similarly expect to find them, for example, in dealing with relations of cosmic magnitudes, or in dealing with the properties of matter of enormous densities, such as is supposed to exist in the stars.
Implied in this recognition of the possibility of new experience beyond our present range, is the recognition that no element of a physical situation, no matter how apparently irrelevant or trivial, may be dismissed as without effect on the final result until proved to be without effect by actual experiment.
The attitude of the physicist must therefore be one of pure empiricism. He recognizes no a priori principles which determine or limit the possibilities of new experience. Experience is determined only by experience. This practically means that we must give up the demand that all nature be embraced in any formula, either simple or complicated. It may perhaps turn out eventually that as a matter of fact nature can be embraced in a formula, but we must so organize our thinking as not to demand it as a necessity.
THE OPERATIONAL CHARACTER OF CONCEPTS
Einstein's Contribution in Changing Our Attitude
Toward Concepts
Recognizing the essential unpredictability of experiment beyond our present range, the physicist, if he is to escape continually revising his attitude, must use in describing and correlating nature concepts of such a character that our present experience does not exact hostages of the future. Now here it seems to me is the greatest contribution of Einstein. Although he himself does not explicitly state or emphasize it, I believe that a study of what he has done will show that he has essentially modified our view of what the concepts useful in physics are and should be. Hitherto many of the concepts of physics have been defined in terms of their properties. An excellent example is afforded by Newton's concept of absolute time. The following quotation from the Scholium in Book I of the Principia is illuminating:
I do not define Time, Space, Place or Motion, as being well known to all. Only I must observe that the vulgar conceive those quantities under no other notions but from the relation they bear to sensible objects. And thence arise certain prejudices, for the removing of which, it will be convenient to distinguish them into Absolute and Relative, True and Apparent, Mathematical and Common.
(I) Absolute, True, and Mathematical Time, of itself, and from its own nature flows equably without regard to anything external, and by another name is called Duration.
Now there is no assurance whatever that there exists in nature anything with properties like those assumed in the definition, and physics, when reduced to concepts of this character, becomes as purely an abstract science and as far removed from reality as the abstract geometry of the mathematicians, built on postulates. It is a task for experiment to discover whether concepts so defined correspond to anything in nature, and we must always be prepared to find that the concepts correspond to nothing or only partially correspond. In particular, if we examine the definition of absolute time in the light of experiment, we find nothing in nature with such properties.
The new attitude toward a concept is entirely different. We may illustrate by considering the concept of length: what do we mean by the length of an object? We evidently know what we mean by length if we can tell what the length of any and every object is, and for the physicist nothing more is required. To find the length of an object, we have to perform certain physical operations. The concept of length is therefore fixed when the operations by which length is measured are fixed: that is, the concept of length involves as much as and nothing more than the set of operations by which length is determined. In general, we mean by any concept nothing more than a set of operations; the concept is synonymous with the corresponding set of operations. If the concept is physical, as of length, the operations are actual physical operations, namely, those by which length is measured; or if the concept is mental, as of mathematical continuity, the operations are mental operations, namely those by which we determine whether a given aggregate of magnitudes is continuous. It is not intended to imply that there is a hard and fast division between physical and mental concepts, or that one kind of concept does not always contain an element of the other; this classification of concept is not important for our future considerations.
We must demand that the set of operations equivalent to any concept be a unique set, for otherwise there are possibilities of ambiguity in practical applications which we cannot admit.
Applying this idea of "concept" to absolute time, we do not understand the meaning of absolute time unless we can tell how to determine the absolute time of any concrete event, i.e., unless we can measure absolute time. Now we merely have to examine any of the possible operations by which we measure time to see that all such operations are relative operations. Therefore the previous statement that absolute time does not exist is replaced by the statement that absolute time is meaningless. And in making this statement we are not saying something new about nature, but are merely bringing to light implications already contained in the physical operations used in measuring time.
It is evident that if we adopt this point of view toward concepts, namely that the proper definition of a concept is not in terms of its properties but in terms of actual operations, we need run no danger of having to revise our attitude toward nature. For if experience is always described in terms of experience, there must always be correspondence between experience and our description of it, and we need never be embarrassed, as we were in attempting to find in nature the prototype of Newton's absolute time. Furthermore, if we remember that the operations to which a physical concept are equivalent are actual physical operations, the concepts can be defined only in the range of actual experiment, and are undefined and meaningless in regions as yet untouched by experiment. It follows that strictly speaking we cannot make statements at all about regions as yet untouched, and that when we do make such statements, as we inevitably shall, we are making a conventionalized extrapolation, of the looseness of which we must be fully conscious, and the justification of which is in the experiment of the future.
There probably is no statement either in Einstein or other writers that the change described above in the use of "concept" has been self-consciously made, but that such is the case is proved, I believe, by an examination of the way concepts are now handled by Einstein and others. For of course the true meaning of a term is to be found by observing what a man does with it, not by what he says about it. We may show that this is the actual sense in which concept is coming to be used by examining in particular Einstein's treatment of simultaneity.
Before Einstein, the concept of simultaneity was defined in terms of properties. It was a property of two events, when described with respect to their relation in time, that one event was either before the other, or after it, or simultaneous with it. Simultaneity was a property of the two events alone and nothing else; either two events were simultaneous or they were not. The justification for using this term in this way was that it seemed to describe the behavior of actual things. But of course experience then was restricted to a narrow range. When the range of experience was broadened, as by going to high velocities, it was found that the concepts no longer applied, because there was no counterpart in experience for this absolute relation between two events. Einstein now subjected the concept of simultaneity to a critique, which consisted essentially in showing that the operations which enable two events to be described as simultaneous involve measurements on the two events made by an observer, so that "simultaneity" is, therefore, not an absolute property of the two events and nothing else, but must also involve the relation of the events to the observer. Until therefore we have experimental proof to the contrary, we must be prepared to find that the simultaneity of two events depends on their relation to the observer, and in particular on their velocity. Einstein, in thus analyzing what is involved in making a judgment of simultaneity, and in seizing on the act of the observer as the essence of the situation, is actually adopting a new point of view as to what the concepts of physics should be, namely, the operational view.
Of course Einstein actually went much further than this, and found precisely how the operations for judging simultaneity change when the observer moves, and obtained quantitative expressions for the effect of the motion of the observer on the relative time of two events. We may notice, parenthetically, that there is much freedom of choice in selecting the exact operations; those which Einstein chose were determined by convenience and simplicity with relation to light beams. Entirely apart from the precise quantitative relations of Einstein's theory, however, the important point for us is that if we had adopted the operational point of view, we would, before the discovery of the actual physical facts, have seen that simultaneity is essentially a relative concept, and would have left room in our thinking for the discovery of such effects as were later found.
Detailed Discussion of the Concept of Length
We may now gain further familiarity with the operational attitude toward a concept and some of its implications by examining from this point of view the concept of length. Our task is to find the operations by which we measure the length of any concrete physical object. We begin with objects of our commonest experience, such as a house or a house lot. What we do is sufficiently indicated by the following rough description. We start with a measuring rod, lay it on the object so that one of its ends coincides with one end of the object, mark on the object the position of the other end of the rod, then move the rod along in a straight line extension of its previous position until the first end coincides with the previous position of the second end, repeat this process as often as we can, and call the length the total number of times the rod was applied. This procedure, apparently so simple, is in practice exceedingly complicated, and doubtless a full description of all the precautions that must be taken would fill a large treatise. We must, for example, be sure that the temperature of the rod is the standard temperature at which its length is defined, or else we must make a correction for it; or we must correct for the gravitational distortion of the rod if we measure a vertical length; or we must be sure that the rod is not a magnet or is not subject to electrical forces. All these precautions would occur to every physicist. But we must also go further and specify all the details by which the rod is moved from one position to the next on the object—its precise path through space and its velocity and acceleration in getting from one position to another. Practically of course, precautions such as these are not mentioned, but the justification is in our experience that variations of procedure of this kind are without effect on the final result. But we always have to recognize that all our experience is subject to error, and that at some time in the future we may have to specify more carefully the acceleration, for example, of the rod in moving from one position to another, if experimental accuracy should be so increased as to show a measurable effect. In principle the operations by which length is measured should be uniquely specified. If we have more than one set of operations, we have more than one concept, and strictly there should be a separate name to correspond to each different set of operations.
So much for the length of a stationary object, which is complicated enough. Now suppose we have to measure a moving street car. The simplest, and what we may call the "naïve" procedure, is to board the car with our meter stick and repeat the operations we would apply to a stationary body. Notice that this procedure reduces to that already adopted in the limiting case when the velocity of the street car vanishes. But here there may be new questions of detail. How shall we jump on to the car with our stick in hand? Shall we run and jump on from behind, or shall we let it pick us up from in front? Or perhaps does now the material of which the stick is composed make a difference, although previously it did not? All these questions must be answered by experiment. We believe from present evidence that it makes no difference how we jump on to the car, or of what material the rod is made, and that the length of the car found in this way will be the same as if it were at rest. But the experiments are more difficult, and we are not so sure of our conclusions as before. Now there are very obvious limitations to the procedure just given. If the street car is going too fast, we can not board it directly, but must use devices, such as getting on from a moving automobile; and, more important still, there are limitations to the velocity that can be given to street cars or to meter sticks by any practical means in our control, so that the moving bodies which can be measured in this way are restricted to a low range of velocity. If we want to be able to measure the length of bodies moving with higher velocities such as we find existing in nature (stars or cathode particles), we must adopt another definition and other operations for measuring length, which also reduce to the operations already adopted in the static case. This is precisely what Einstein did. Since Einstein's operations were different from our operations above, his "length" does not mean the same as our "length." We must accordingly be prepared to find that the length of a moving body measured by the procedure of Einstein is not the same as that above; this of course is the fact, and the transformation formulas of relativity give the precise connection between the two lengths.
Einstein's procedure for measuring the length of bodies in motion was dictated not only by the consideration that it must be applicable to bodies with high velocities, but also by mathematical convenience, in that Einstein describes the world mathematically by a system of coördinate geometry, and the "length" of an object is connected simply with quantities in the analytic equations.
It is of interest to describe briefly Einstein's actual operations for measuring the length of a body in motion; it will show how operations which may be simple from a mathematical point of view may appear complicated from a physical viewpoint. The observer who is to measure the length of a moving object must first extend over his entire plane of reference (for simplicity the problem is considered two-dimensional) a system of time coordinates, i.e., at each point of his plane of reference there must be a clock, and all these clocks must be synchronized. At each clock an observer must be situated. Now to find the length of the moving object at a specified instant of time (it is a subject for later investigation to find whether its length is a function of time), the two observers who happen to coincide in position with the two ends of the object at the specified time on their clocks are required to find the distance between their two positions by the procedure for measuring the length of a stationary object, and this distance is by definition the length of the moving object in the given reference system. This procedure for measuring the length of a body in motion hence involves the idea of simultaneity, through the simultaneous position of the two ends of the rod, and we have seen that the operations by which simultaneity are determined are relative, changing when the motion of the system changes. We hence are prepared to find a change in the length of a body when the velocity of the measuring system changes, and this in fact is what happens. The precise numerical dependence is worked out by Einstein, and involves other considerations, in which we are not interested at present.
The two sorts of length, the naive one and that of Einstein, have certain features in common. In either case in the limit, as the velocity of the measuring system approaches zero, the operations approach those for measuring the length of a stationary object. This, of course, is a requirement in any good definition, imposed by considerations of convenience, and it is too obvious a matter to need elaboration. Another feature is that the operations equivalent to either concept both involve the motion of the system, so that we must recognize the possibility that the length of a moving object may be a function of its velocity. It is a matter of experiment, unpredictable until tried, that within the limits of present experimental error the naive length is not affected by motion, and Einstein's length is.
So far, we have extended the concept of length in only one way beyond the range of ordinary experience, namely to high velocities. The extension may obviously be made in other directions. Let us inquire what are the operations by which we measure the length of a very large object. In practice we probably first meet the desirability of a change of procedure in measuring large pieces of land. Here our procedure depends on measurements with a surveyor's theodolite. This involves extending over the surface of the land a system of coordinates, starting from a base line measured with a tape in the conventional way, sighting on distant points from the extremities of the line, and measuring the angles. Now in this extension we have made one very essential change: the angles between the lines connecting distant points are now angles between beams of light. We assume that a beam of light travels in a straight line. Furthermore, we assume in extending our system of triangulation over the surface of the earth that the geometry of light beams is Euclidean. We do the best we can to check the assumptions, but at most can never get more than a partial check.
Thus Gauss[1] checked whether the angles of a large terrestrial triangle add to two right angles and found agreement within experimental error. We now know from the experiments of Michelson[2] that if his measurements had been accurate enough he would not have got a check, but would have had an excess or defect according to the direction in which the beam of light travelled around the triangle with respect to the rotation of the earth. But if the geometry of light beams is Euclidean, then not only must the angles of a triangle add to two right angles, but there are definite relations between the lengths of the sides and the angles, and to check these relations the sides should be measured by the old procedure with a meter stick. Such a check on a large scale has never been attempted, and is not feasible. It seems, then, that our checks on the Euclidean character of optical space are all of restricted character. We have apparently proved that up to a certain scale of magnitude optical space is Euclidean with respect to measures of angle, but this may not necessarily involve that space is also Euclidean with respect to measures of length, so that space need not be completely Euclidean. There is a further most important restriction in that our studies of non-Euclidean geometry have shown that the percentage excess of the angles of a non-Euclidean triangle over 180° may depend on the magnitude of the triangle, so that it may well be that we have not detected the non-Euclidean character of space simply because our measurements have not been on a large enough scale.
[1]C. F. Gauss, Gesammelte Werke, especially vol. IV.
[2]See a discussion of the theory of this experiment by L. Silberstein, Jour. Opt. Soc. Amer. 5, 291-307. 1921.
We thus see that the concept of length has undergone a very essential change of character even within the range of terrestrial measurements, in that we have substituted for what I may call the tactual concept an optical concept, complicated by an assumption about the nature of our geometry. From a very direct concept we have come to a very indirect concept with a most complicated set of operations. Strictly speaking, length when measured in this way by light beams should be called by another name, since the operations are different. The practical justification for retaining the same name is that within our present experimental limits a numerical difference between the results of the two sorts of operations has not been detected.
We are still worse off when we make the extension to solar and stellar distances. Here space is entirely optical in character, and we never have an opportunity of even partially comparing tactual with optical space. No direct measures of length have ever been made, nor can we even measure the three angles of a triangle and so check our assumption that the use of Euclidean geometry in extending the concept of space is justified. We never have under observation more than two angles of a triangle, as when we measure the distance of the moon by observation from the two ends of the earth's diameter. To extend to still greater distance our measures of length, we have to make still further assumptions, such as that inferences from the Newtonian laws of mechanics are valid. The accuracy of our inferences about lengths from such measurements is not high. Astronomy is usually regarded as a science of extraordinarily high accuracy, but its accuracy is very restricted in character, namely to the measurement of angles. It is probably safe to say that no astronomical distance, except perhaps that of the moon, is known with an accuracy greater than 0.1%. When we push our estimates to distances beyond the confines of the solar system in which we are assisted by the laws of mechanics, we are reduced in the first place to measurements of parallax, which at best have a quite inferior accuracy, and which furthermore fail entirely outside a rather restricted range. For greater stellar distances we are driven to other and much rougher estimates, resting for instance on the extension to great distances of connections found within the range of parallax between brightness and spectral type of a star, or on such assumptions as that, because a group of stars looks as if it were all together in space and had a common origin, it actually is so. Thus at greater and greater distances not only does experimental accuracy become less, but the very nature of the operations by which length is to be determined becomes indefinite, so that the distances of the most remote stellar objects as estimated by different observers or by different methods may be very divergent. A particular consequence of the inaccuracy of the astronomical measures of great distances is that the question of whether large scale space is Euclidean or not is merely academic.
We thus see that in the extension from terrestrial to great stellar distances the concept of length has changed completely in character. To say that a certain star is 105 light years distant is actually and conceptually an entire different kind of thing from saying that a certain goal post is 100 meters distant. Because of our conviction that the character of our experience may change when the range of phenomena changes, we feel the importance of such a question as whether the space of distances of 105 light years is Euclidean or not, and are correspondingly dissatisfied that at present there seems no way of giving meaning to it.
We encounter difficulties similar to those above, and are also compelled to modify our procedures, when we go to small distances. Down to the scale of microscopic dimensions a fairly straightforward extension of the ordinary measuring procedure is sufficient, as when we measure a length in a micrometer eyepiece of a microscope. This is of course a combination of tactual and optical measurements, and certain assumptions, justified as far as possible by experience, have to be made about the behavior of light beams. These assumptions are of a quite different character from those which give us concern on the astronomical scale, because here we meet difficulty from interference effects due to the finite scale of the structure of light, and are not concerned with a possible curvature of light beams in the long reaches of space. Apart from the matter of convenience, we might also measure small distances by the tactual method.
As the dimensions become smaller, certain difficulties become increasingly important that were negligible on a larger scale. In carrying out physically the operations equivalent to our concepts, there are a host of practical precautions to be taken which could be explicitly enumerated with difficulty, but of which nevertheless any practical physicist is conscious. Suppose, for example, we measure length tactually by a combination of Johanssen gauges. In piling these together, we must be sure that they are clean, and are thus in actual contact. Particles of mechanical dirt first engage our attention. Then as we go to smaller dimensions we perhaps have to pay attention to adsorbed films of moisture, then at still smaller dimensions to adsorbed films of gas, until finally we have to work in a vacuum, which must be the more nearly complete the smaller the dimensions. About the time that we discover the necessity for a complete vacuum, we discover that the gauges themselves are atomic in structure, that they have no definite boundaries, and therefore no definite length, but that the length is a hazy thing, varying rapidly in time between certain limits. We treat this situation as best we can by taking a time average of the apparent positions of the boundaries, assuming that along with the decrease of dimensions we have acquired a corresponding extravagant increase in nimbleness. But as the dimensions get smaller continually, the difficulties due to this haziness increase indefinitely in percentage effect, and we are eventually driven to give up altogether. We have made the discovery that there are essential physical limitations to the operations which defined the concept of length. [We perhaps do not regard the substitution of optical for tactual space on the astronomical scale as compelled by the same sort of physical necessity, because I suppose the possible eventual landing of men in the moon will always be one of the dreams of humanity.] At the same time that we have come to the end of our rope with our Johanssen gauge procedure, our companion with the microscope has been encountering difficulties due to the finite wave length of light; this difficulty he has been able to minimize by using light of progressively shorter wave lengths, but he has eventually had to stop on reaching X-rays. Of course this optical procedure with the microscope is more convenient, and is therefore adopted in practice.
Let us now see what is implied in our concept of length extended to ultramicroscopic dimensions. What, for instance, is the meaning of the statement that the distance between the planes of atoms in a certain crystal is 3 x 10-8 cm.? What we would like to mean is that ⅓ x 108 of these planes piled on top of each other give a thickness of 1 cm.; but of course such a meaning is not the actual one. The actual meaning may be found by examining the operations by which we arrived at the number 3 x 10-8. As a matter of fact, 3 x 10-8 was the number obtained by solving a general equation derived from the wave theory of light, into which certain numerical data obtained by experiments with X-rays had been substituted. Thus not only has the character of the concept of length changed from tactual to optical, but we have gone much further in committing ourselves to a definite optical theory. If this were the whole story, we would be most uncomfortable with respect to this branch of physics, because we are so uncertain of the correctness of our optical theories, but actually a number of checks can be applied which greatly restore our confidence. For instance, from the density of the crystal and the grating space, the weight of the individual atoms may be computed, and these weights may then be combined with measurements of the dimensions of other sorts of crystal into which the same atoms enter to give values of the densities of these crystals, which may be checked against experiment. All such checks have succeeded within limits of accuracy which are fairly high. It is important to notice that, in spite of the checks, the character of the concept is changing, and begins to involve such things as the equations of optics and the assumption of the conservation of mass.
We are not content, however, to stop with dimensions of atomic order, but have to push on to the electron with a diameter of the order of 10-18 cm. What is the possible meaning of the statement that the diameter of an electron is 10-18 cm.? Again the only answer is found by examining the operations by which the number 10-18 was obtained. This number came by solving certain equations derived from the field equations of electrodynamics, into which certain numerical data obtained by experiment had been substituted. The concept of length has therefore now been so modified as to include that theory of electricity embodied in the field equations, and, most important, assumes the correctness of extending these equations from the dimensions in which they may be verified experimentally into a region in which their correctness is one of the most important and problematical of present-day questions in physics. To find whether the field equations are correct on a small scale, we must verify the relations demanded by the equations between the electric and magnetic forces and the space coordinates, to determine which involves measurement of lengths. But if these space coordinates cannot be given an independent meaning apart from the equations, not only is the attempted verification of the equations impossible, but the question itself is meaningless. If we stick to the concept of length by itself, we are landed in a vicious circle. As a matter of fact, the concept of length disappears as an independent thing, and fuses in a complicated way with other concepts, all of which are themselves altered thereby, with the result that the total number of concepts used in describing nature at this level is reduced in number. A precise analysis of the situation is difficult, and I suppose has never been attempted, but the general character of the situation is evident. Until at least a partial analysis is attempted, I do not see how any meaning can be attached to such questions as whether space is Euclidean in the small scale.
It is interesting to observe that any increased accuracy in knowledge of large scale phenomena must, as far as we now can see, arise from an increase in the accuracy of measurement of small things, that is, in the measurement of small angles or the analysis of minute differences of wave lengths in the spectra. To know the very large takes us into the same field of experiment as to know the very small, so that operationally the large and the small have features in common.
This somewhat detailed analysis of the concept of length brings out features common to all our concepts. If we deal with phenomena outside the domain in which we originally defined our concepts, we may find physical hindrances to performing the operations of the original definition, so that the original operations have to be replaced by others. These new operations are, of course, to be so chosen that they give, within experimental error, the same numerical results in the domain in which the two sets of operations may be both applied; but we must recognize in principle that in changing the operations we have really changed the concept, and that to use the same name for these different concepts over the entire range is dictated only by considerations of convenience, which may sometimes prove to have been purchased at too high a price in terms of unambiguity. We must always be prepared some day to find that an increase in experimental accuracy may show that the two different sets of operations which give the same results in the more ordinary part of the domain of experience, lead to measurably different results in the more unfamiliar parts of the domain. We must remain aware of these joints in our conceptual structure if we hope to render unnecessary the services of the unborn Einsteins.
The second feature common to all concepts brought out by the detailed discussion of length is that, as we approach the experimentally attainable limit, concepts lose their individuality, fuse together, and become fewer in number, as we have seen that at dimensions of the order of the diameter of an electron the concepts of length and the electric field vectors fuse into an amorphous whole. Not only does nature as experienced by us become different in character on its horizons, but it becomes simpler, and therefore our concepts, which are the building stones of our descriptions, become fewer in number. This seems to be an entirely natural state of affairs. How the number of concepts is often kept formally the same as we approach the horizon will be discussed later in special cases.
A precise analysis of our conceptual structure has never been attempted, except perhaps in very restricted domains, and it seems to me that there is room here for much important future work. Such an analysis is not to be attempted in this essay, but only some of the more important qualitative aspects are to be pointed out. It will never be possible to give a clean-cut logical analysis of the conceptual situation, for the nature of our concepts, according to our operational point of view, is the same as the nature of experimental knowledge, which is often hazy. Thus in the transition regions where nature is getting simpler and the number of operationally independent concepts changes, a certain haziness is inevitable, for the actual change in our conceptual structure in these transition regions is continuous, corresponding to the continuity of our experimental knowledge, whereas formally the number of concepts should be an integer.
The Relative Character of Knowledge
Two other consequences of the operational point of view must now be examined. First is the consequence that all our knowledge is relative. This may be understood in a general or a more particular sense. The general sense is illustrated in Haldane's book on the Reign of Relativity. Relativity in the general sense is the merest truism if the operational definition of concept is accepted, for experience is described in terms of concepts, and since our concepts are constructed of operations, all our knowledge must unescapably be relative to the operations selected. But knowledge is also relative in a narrower sense, as when we say there is no such thing as absolute rest (or motion) or absolute size, but rest and size are relative terms. Conclusions of this kind are involved in the specific character of the operations in terms of which rest or size are defined. An examination of the operations by which we determine whether a body is at rest or in motion shows that the operations are relative operations: rest or motion is determined with respect to some other body selected as the standard. In saying that there is no such thing as absolute rest or motion we are not making a statement about nature in the sense that might be supposed, but we are merely making a statement about the character of our descriptive processes. Similarly with regard to size: examination of the operations of the measuring process shows that size is measured relative to the fundamental measuring rod.
The "absolute" therefore disappears in the original meaning of the word. But the "absolute" may usefully return with an altered meaning, and we may say that a thing has absolute properties if the numerical magnitude is the same when measured with the same formal procedure by all observers. Whether a given property is absolute or not can be determined only by experiment, landing us in the paradoxical position that the absolute is absolute only relative to experiment. In some cases, the most superficial observation shows that a property is not absolute, as, for example, it is at once obvious that measured velocity changes with the motion of the observer. But in other cases the decision is more difficult. Thus Michelson thought he had an absolute procedure for measuring length, by referring to the wave length of the red cadmium line as standard;[3] it required difficult and accurate experiment to show that this length varies with the motion of the observer. Even then, by changing the definition of the length of a moving object, we believe that length might be made to re-assume its desired absolute character.
[3]A. A. Michelson, Light Waves and Their Uses, University of Chicago Press, 1903, Chap. V.
To stop the discussion at this point might leave the impression that this observation of the relative character of knowledge is of only a very tenuous and academic interest, since it appears to be concerned mostly with the character of our descriptive processes, and to say little about external nature. [What this means we leave to the metaphysician to decide.] But I believe there is a deeper significance to all this. It must be remembered that all our argument starts with the concepts as given. Now these concepts involve physical operations; in the discovery of what operations may be usefully employed in describing nature is buried almost all physical experience. In erecting our structure of physical science, we are building on the work of all the ages. There is then this purely physical significance in the statement that all motion is relative, namely that no operations of measuring motion have been found to be useful in describing simply the behavior of nature which are not operations relative to a single observer; in making this statement we are stating something about nature. It takes an enormous amount of real physical experience to discover relations of this sort. The discovery that the number obtained by counting the number of times a stick may be applied to an object can be simply used in describing natural phenomena was one of the most important and fundamental discoveries ever made by man.
Meaningless Questions
Another consequence of the operational character of our concepts, almost a corollary of that considered above, is that it is quite possible, nay even disquietingly easy, to invent expressions or to ask questions that are meaningless. It constitutes a great advance in our critical attitude toward nature to realize that a great many of the questions that we uncritically ask are without meaning. If a specific question has meaning, it must be possible to find operations by which an answer may be given to it. It will be found in many cases that the operations cannot exist, and the question therefore has no meaning. For instance, it means nothing to ask whether a star is at rest or not. Another example is a question proposed by Clifford, namely, whether it is not possible that as the solar system moves from one part of space to another the absolute scale of magnitude may be changing, but in such a way as to affect all things equally, so that the change of scale can never be detected. An examination of the operations by which length is measured in terms of measuring rods shows that the operations do not exist (because of the nature of our definition of length) for answering the question. The question can be given meaning only from the point of view of some imaginary superior being watching from an external point of vantage. But the operations by which such a being measures length are different from the operations of our definition of length, so that the question acquires meaning only by changing the significance of our terms—in the original sense the question means nothing.
To state that a certain question about nature is meaningless is to make a significant statement about nature itself, because the fundamental operations are determined by nature, and to state that nature cannot be described in terms of certain operations is a significant statement.
It must be recognized, however, that there is a sense in which no serious question is entirely without meaning, because doubtless the questioner had in mind some intention in asking the question. But to give meaning in this sense to a question, one must inquire into the meaning of the concepts as used by the questioner, and it will often be found that these concepts can be defined only in terms of fictitious properties, as Newton's absolute time was defined by its properties, so that the meaning to be ascribed to the question in this way has no connection with reality. I believe that it will enable us to make more significant and interesting statements, and therefore will be more useful, to adopt exclusively the operational view, and so admit the possibility of questions entirely without meaning.
This matter of meaningless questions is a very subtle thing which may poison much more of our thought than that dealing with purely physical phenomena. I believe that many of the questions asked about social and philosophical subjects will be found to be meaningless when examined from the point of view of operations. It would doubtless conduce greatly to clarity of thought if the operational mode of thinking were adopted in all fields of inquiry as well as in the physical. Just as in the physical domain, so in other domains, one is making a significant statement about his subject in stating that a certain question is meaningless.
In order to emphasize this matter of meaningless questions, I give here a list of questions, with which the reader may amuse himself by finding whether they have meaning or not.
(1) Was there ever a time when matter did not exist?
(2) May time have a beginning or an end?
(3) Why does time flow?
(4) May space be bounded?
(5) May space or time be discontinuous?
(6) May space have a fourth dimension, not directly detectible, but given indirectly by inference?
(7) Are there parts of nature forever beyond our detection?
(8) Is the sensation which I call blue really the same as that which my neighbor calls blue? Is it possible that a blue object may arouse in him the same sensation that a red object does in me and vice versa?
(9) May there be missing integers in the series of natural numbers as we know them?
(10) Is a universe possible in which 2 + 2 ≠ 4?
(11) Why does negative electricity attract positive?
(12) Why does nature obey laws?
(13) Is a universe possible in which the laws are different?
(14) If one part of our universe could be completely isolated from the rest, would it continue to obey the same laws?
(15) Can we be sure that our logical processes are valid?
GENERAL COMMENTS ON THE OPERATIONAL POINT
OF VIEW
To adopt the operational point of view involves much more than a mere restriction of the sense in which we understand "concept," but means a far-reaching change in all our habits of thought, in that we shall no longer permit ourselves to use as tools in our thinking concepts of which we cannot give an adequate account in terms of operations. In some respects thinking becomes simpler, because certain old generalizations and idealizations become incapable of use; for instance, many of the speculations of the early natural philosophers become simply unreadable. In other respects, however, thinking becomes much more difficult, because the operational implications of a concept are often very involved. For example, it is most difficult to grasp adequately all that is contained in the apparently simple concept of "time," and requires the continual correction of mental tendencies which we have long unquestioningly accepted.
Operational thinking will at first prove to be an unsocial virtue; one will find oneself perpetually unable to understand the simplest conversation of one's friends, and will make oneself universally unpopular by demanding the meaning of apparently the simplest terms of every argument. Possibly after every one has schooled himself to this better way, there will remain a permanent unsocial tendency, because doubtless much of our present conversation will then become unnecessary. The socially optimistic may venture to hope, however, that the ultimate effect will be to release one's energies for more stimulating and interesting interchange of ideas.
Not only will operational thinking reform the social art of conversation, but all our social relations will be liable to reform. Let any one examine in operational terms any popular present-day discussion of religious or moral questions to realize the magnitude of the reformation awaiting us. Wherever we temporize or compromise in applying our theories of conduct to practical life we may suspect a failure of operational thinking.
CHAPTER II
OTHER GENERAL CONSIDERATIONS
THE APPROXIMATE CHARACTER OF EMPIRICAL KNOWLEDGE
ALTHOUGH many aspects of the processes by which we obtain knowledge of the external physical world are much beyond the scope of our present inquiry, one matter must be mentioned in detail because it tacitly underlies all our discussion, the fact, namely, that all results of measurement are only approximate. That such is true is evident after the most superficial examination of any measuring process; any statement about numerical relations between measured quantities must always be subject to the qualification that the relation is valid only within limits. Furthermore, all experience seems to be of this character; we never have perfectly clean-cut knowledge of anything, but all our experience is surrounded by a twilight zone, a penumbra of uncertainty, into which we have not yet penetrated. This penumbra is as truly an unexplored region as any other region beyond experiment, such as the region of high velocities, for example, and we must hold no preconceived notions as to what will be found within the region. The penumbra is to be penetrated by improving the accuracy of measurement. Within what was at one time penumbra has been found the displacement of angular position of the stars near the edge of the solar disc, and within the penumbra as yet unpenetrated we look for such effects as the equivalence of mass and energy. Many of the great discoveries of the future will probably be made within the penumbra: we have already mentioned that increased knowledge of phenomena of a cosmic scale is to be obtained by increasing the accuracy of measurement of the very small.
It is a general consequence of the approximate character of all measurement that no empirical science can ever make exact statements. This was fairly obvious in the case of mechanics, but it required a Gauss[4] to convince us that the geometry in which we are interested as physicists is an empirical subject, and that one cannot say that actual space is Euclidean, but only that actual space approaches to ideal Euclidean space within a certain degree of approximation. I believe that we are compelled to go still further, and recognize that arithmetic, so far as it purports to deal with actual physical objects, is also affected with the same penumbra of uncertainty as all other empirical science. A typical statement of empirical arithmetic is that 2 objects plus 2 objects makes 4 objects. This statement acquires physical meaning only in terms of certain physical operations, and these operations must be performed in time.
[4]C. F. Gauss, Gesammelte Werke, especially vols. IV and VIII.
Now the penumbra gets into this situation through the concept of object. If the statement of arithmetic is to be an exact statement in the mathematical sense the "object" must be a definite clean-cut thing, which preserves its identity in time with no penumbra. But this sort of thing is never experienced, and as far as we know does not correspond exactly to anything in experience. It is of course true that in most experience the penumbra is so very thin and snug-fitting that it requires special effort to recognize its presence at all; but scrutiny, I believe, shows that it is always there. If our experience had been restricted to phenomena in a vacuum, and the objects we were trying to count had been spheres of a gas which expand and interpenetrate, it is obvious that the concept of "object" as a thing with identity would have been much more difficult to form. Or, if our objects are tumblers of water, we discover when our observation reaches a certain stage of refinement that the amount of water is continually changing by evaporation and condensation, and we are bothered by the question whether the object is still the same after it has waxed and waned. Coming to solids, we eventually discover that even solids evaporate, or condense gases on them, and we see that an object with identity is an abstraction corresponding exactly to nothing in nature. Of course the penumbra of uncertainty which surrounds our arithmetical statements because of this property of physical objects is so exceedingly tenuous that practically we are not aware of its existence, and expect never to find undiscovered phenomena within the penumbra. But in principle we must recognize its presence, and must further recognize that all empirical science must be of this character.
In most empirical sciences, the penumbra is at first prominent, and becomes less important and thinner as the accuracy of physical measurement is increased. In mechanics, for example, the penumbra is at first like a thick obscuring veil at the stage where we measure forces only by our muscular sensations, and gradually is attenuated, as the precision of measurements increases. But with the arithmetical concept of an individual identifiable object it is the exact reverse; a crude point of view does not suspect the existence of the penumbra at all, and we discover it only by highly refining our methods. Doubtless arithmetic owes its early development to this property.
We may now go still further. Operations themselves are, of course, derived from experience, and would be expected also to have a nebulous edge of uncertainty. We have to ask such questions as whether the operations of arithmetic are clean-cut things. Is the operation of multiplying 2 objects by 2 a definite operation, with no enveloping haze? All our physical experience convinces us that if there is a penumbra about the concept of operations of this sort it is so tenuous as to be negligible, at least for the present; but the question affords an interesting topic for speculation. We also have to ask whether mental operations may similarly be enveloped in a haze.
EXPLANATIONS AND MECHANISMS
Perhaps the climax of our task of interpreting and correlating nature is reached when we are able to find an explanation of phenomena; with the finding of the explanation we are inclined to feel that our understanding of the situation is complete. We now have to ask what is the nature of the explanation which we set as the goal of our efforts. The answer is not easy to give, and there may be difference of opinion about it. We shall get the best answer to this, as to so many other questions, by adopting the operational point of view, and examining what we do in giving an explanation. I believe that examination will show that the essence of an explanation consists in reducing a situation to elements with which we are so familiar that we accept them as a matter of course, so that our curiosity rests.[5] "Reducing a situation to elements" means, from the operational point of view, discovering familiar correlations between the phenomena of which the situation is composed.
[5]The ultimate elements of explanation are analogous to the axioms of formal mathematics.
There is involved here the thesis that it is possible to analyze nature into correlations, without, however, any assumption whatever as to the character of these correlations. It seems to me that such a thesis is the most general that can be made if nature is to be intelligible at all. This thesis underlies all the considerations of this essay, and we shall not try to find anything more general. We shall, however, recognize that any assumption as to the character of the correlations constitutes a special hypothesis which may restrict the future, and that therefore these special hypotheses are to be subjected to special examination. We return to this matter in more detail in discussing the causality concept, which is closely related to the concept of explanation.
In this view of explanation there is no implication that the "element" is either a smaller or a larger scale thing than the phenomenon being explained; thus we may explain the properties of a gas in terms of its constituent molecules, or perhaps some day we shall become so familiar with the idea of a non-Euclidean space that we shall explain (instead of describe) the gravitational attraction of a stone by the earth in terms of a space-time curvature imposed by all the rest of the matter in the universe.
If this is accepted as the true nature of explanation, we see that an explanation is not an absolute sort of thing, but what is satisfactory for one man will not be for another. The savage is satisfied by explaining the thunderstorm as the capricious act of an angry god. The physicist demands more, and requires that the familiar elements to which we reduce a situation be such that we can intuitively predict their behavior. Thus even if the physicist believed in the existence of the angry god, he would not be satisfied with this explanation of the thunderstorm because he is not so well acquainted with angry gods as to be able to predict when anger is followed by a storm. He would have to know why the god had become angry, and why making a thunderstorm eased his ire. But even with this additional qualification, scientific explanation is obviously still a relative affair—relative to the elements or axioms to which we make reduction and which we accept as ultimate. These elements depend to a certain extent on the purpose in view, and also on the range of our previous physical experience. If we are explaining the action of a machine, we are satisfied to reduce the action to the push and pull of the various members of the machine, it being accepted as an ultimate that these members transmit pushes or pulls. But the physicist who has extended his experimental knowledge further, may want to explain how the members transmit pushes or pulls in terms of the action on each other of the electrons in their orbits in the atoms. The character of our explanatory structure will depend on the character of our experimental knowledge, and will change as this changes.
Formally, there is no limit to the process of explanation, because we can always ask what is the explanation of the elements in terms of which we have given the last explanation. But the point of view of operations shows that this is mere formalism which ends only in meaningless jargon, for we soon arrive at the limit of our experimental knowledge, and beyond this the operations involved in the concepts of our explanations become impossible and the concepts become meaningless.
As we extend experimental knowledge and push our explanations further and further, we see that the explanatory sequence may be terminated in several possible ways. In the first place, we may never push our experiments beyond a stage into which the elements with which we are already familiar do not enter. In this case explanation is very simple: it involves nothing essentially new, but merely the disentanglement of complexities. The kinetic theory of gases, in explaining the thermal properties of a gas in terms of ordinary mechanical properties of the molecules, suggests such a situation. Or, secondly, our experiments may bring us into contact with situations novel to us, in which we can recognize no familiar elements, or at least must recognize that there is something in addition to the familiar elements. Such a situation constitutes an explanatory crisis and explanation has to stop by definition. Or thirdly, we may try to force our explanations into a predetermined mold, by formally erecting or inventing beyond the range of present experiment ultimates more or less like elements already familiar to us, and seek to explain all present experience in terms of these chosen ultimates.
Leaving for the present the third possibility, which is within our control to accept or reject, and is a formal matter, it is merely a question of experimental fact which of the first two possibilities corresponds to the actual state of affairs. The most perfunctory examination of the present state of physics shows that we are now facing the second of these possibilities, and that in the new experimental facts of relativity, and in all quantum phenomena, we are confronted with an explanatory crisis. It has often been emphasized that Einstein's theory of gravitation does not seek at all to give an explanation of gravitational phenomena, but merely describes and correlates these phenomena in comparatively simple mathematical language. No more attempt is made to reduce the gravitational attraction between the earth and the sun to simple terms than was made by Newton. In the realm of quantum phenomena it is of course the merest commonplace that our old ideas of mechanics and electrodynamics have failed, so that it is a matter of the greatest concern to find how many, or indeed whether any, elements of the old situations can be carried over into the new.
An examination of many of the so-called "explanations" of quantum theory constitutes at once a justification of the definition of explanation given above, and of the statement that in quantum phenomena we are at an explanatory crisis. For the endeavor of all these quantum explanations is to find in every new or more complicated situation the same elements which have already been met in simpler situations, and which are therefore relatively more familiar. For example, many quantum phenomena are made to involve the emission of energy when an electron jumps from one orbit to another. But always the elements to which reduction is made are themselves quantum phenomena, and these are still so new and unfamiliar that we feel an instinctive need for explanation in other terms. We seek to understand why the electron emits when it jumps.
The explanatory crisis which now confronts us in relativity and quantum phenomena is but a repetition of what has occurred many time in the past. A similar crisis confronted Prometheus when he discovered fire, and the first man who observed a straw sticking to a piece of rubbed amber, or a suspended lodestone seeking the north star. Every kitten is confronted with such a crisis at the end of nine days. Whenever experience takes us into new and unfamiliar realms, we are to be at least prepared for a new crisis.
Now what are we to do in such a crisis? It seems to me that the only sensible course is to do exactly what the kitten does, namely, to wait until we have amassed so much experience of the new kind that it is perfectly familiar to us, and then to resume the process of explanation with elements from our new experience included in our list of axioms. Not only will observation show that this is what is now actually being done with respect to quantum and gravitational phenomena, but it is in harmony with the entire spirit of our outlook on nature. All our knowledge is in terms of experience; we should not expect or desire to erect an explanatory structure different in character from that of experience. Our experience is finite; on the confines of the experimentally attainable it becomes hazy, and the concepts in terms of which we describe it fuse together and lose independent meaning. Furthermore, at every extension of our experimental range we must be prepared to find, and as a matter of fact we have often found, that we encounter phenomena of an entirely novel character for which previous experience has given us no preparation. The explanatory structure proposed above has all these properties; it is finite, being terminated by the edge of experiment, the final stages of our explanations are hazy in that it becomes more and more difficult to distinguish elements of familiar experience, and every now and then we must admit new elements into our explanations.
The first step in resuming our explanatory progress, after we have been confronted with such a crisis, is to seek for various sorts of correlation between the elements of our new experience, in the confident expectation that these elements will eventually become so familiar to us that they may be used as the ultimates of a new explanation. This is exactly what is now happening in quantum theory.
Diametrically opposed to the views above, there is another ideal of the explanatory process which is held by many physicists, and which has been mentioned above as the third possible way in which the explanatory sequence may be terminated, namely, the endeavor to devise beyond the limit of present experiment a structure built of elements like some of those of our present experience, in the action of which we endeavor to find the explanation of phenomena in the present range. Now a program such as this, as a serious program for the final correlation of nature, is entirely opposed to the spirit of the considerations expounded here. There is no warrant whatever in experience for the conviction that as we penetrate deeper and deeper we shall find the elements of previous experience repeated, although sometimes we do find such repetitions, as in the behavior of gases. Yet this has been the attitude of many eminent physicists, for example, Faraday and Maxwell, in seeking to explain distant electrical action by the propagation through a medium of a mechanical push or pull, or by Hertz, who sought in all phenomena the effect of concealed masses with ordinary mechanical inertia. Although as a general principle this program seems to be absolutely without justification, nevertheless it may be justified if the specific character of the physical facts seems to indicate a repetition at lower levels of elements familiar higher up. Hertz undoubtedly had this justification, as did also Maxwell to a certain extent, in the discovery that the general equations of electrodynamics are of the same form as the generalized Lagrangean equations of mechanics. For Faraday, however, there seems no such justification; the urge to this sort of thing in Faraday came from an uncritical acceptance of his own temperamental reactions.
From a less serious point of view it may, however, be quite justified to make such a working hypothesis as that in the action of electrical forces may be discovered the same elements with which we are familiar in the everyday experiences of mechanics. For such a hypothesis often enables us to make partial correlations which suggest new experimental tests, and thus gives the stimulus to an extension of our experimental horizon. Many physicists recognize the tentative character of such attempted explanations, but others apparently take them more seriously, as for example Lord Kelvin in his continuous life-long attempts to find a mechanical explanation of all physical phenomena. This quotation from Kelvin is illuminating. "I never satisfy myself until I can make a mechanical model of a thing. If I can make a mechanical model, I can understand it. As long as I cannot make a mechanical model all the way through, I cannot understand it.... But I want to understand light as well as I can without introducing things that we understand even less of."
So much for general considerations on the nature of explanation. Coming now to greater detail, many explanations involve what may be described as a mechanism. It is difficult to characterize exactly what we mean by mechanism, but it seems to be associated with an attitude of mind that strives to realize the third of the possibilities mentioned above. As a matter of fact, the mechanism sought for is usually of a particular type, in that the ultimate elements selected are mechanical elements. This point of view is particularly characteristic of the English school of physicists. Although "mechanism" usually implies mechanical elements, we may show by specific examples that we do actually use the word in a broader significance. If, for example, we could devise within the core of an atom a revolving system of electrical charges, acting on each other with the ordinary inverse square forces of electrostatics, such that every now and then the system becomes unstable and breaks up, we should doubtless say that we had found a mechanism for explaining radioactive disintegration.
However, the formulation of a precise definition of mechanism is of secondary concern to us; we are primarily interested in understanding the attitude of mind that feels a mechanism is necessary. A typical example of such an urge to a mechanism is afforded by the gravitational action between distant bodies. To many minds the concept of action at a distance is absolutely abhorrent, not to be tolerated for an instant. Such an intolerable situation is avoided by the invention of a medium filling all space, which transmits a force from one body to the other through the successive action on each other of its contiguous parts. Or the dilemma of action at a distance may be avoided in other ways, as by Boscovitch in the eighteenth century, who, in order to explain gravitation, filled space with a triply infinite horde of infinitesimal projectiles. Now of course it is a matter for experiment to decide whether any physical reality can be ascribed to a medium which makes gravitation possible by the action of its adjacent parts, but I can see no justification whatever for the attitude which refuses on purely a priori grounds to accept action at a distance as a possible axiom or ultimate of explanation. It is difficult to conceive anything more scientifically bigoted than to postulate that all possible experience conforms to the same type as that with which we are already familiar, and therefore to demand that explanation use only elements familiar in everyday experience. Such an attitude bespeaks an unimaginativeness, a mental obtuseness and obstinacy, which might be expected to have exhausted their pragmatic justification at a lower plane of mental activity.
Although it will probably be fairly easy to give intellectual assent to the strictures of the last paragraph, I believe many will discover in themselves a longing for mechanical explanation which has all the tenacity of original sin. The discovery of such a desire need not occasion any particular alarm, because it is easy to see how the demand for this sort of explanation has had its origin in the enormous preponderance of the mechanical in our physical experience. But nevertheless, just as the old monks struggled to subdue the flesh, so must the physicist struggle to subdue this sometimes nearly irresistible, but perfectly unjustifiable desire. One of the large purposes of this exposition will be attained if it carries the conviction that this longing is unjustifiable, and is worth making the effort to subdue.
The situation with respect to action at a distance is typical of the general situation. I believe the essence of the explanatory process is such that we must be prepared to accept as an ultimate for our explanations the mere statement of a correlation between phenomena or situations with which we are sufficiently familiar. Thus, in quantum theory, there is no reason why we should not be willing to accept as an ultimate the fundamental fact that when an electron jumps radiation is emitted, provided always that we can give independent meaning in terms of operations to the jumping of an electron. If there is no experiment suggesting other and intermediate phenomena, we ought to be able to rest intellectually satisfied with this. Of course it is quite a different matter, and entirely justified, to imagine what the assumption of finer details in the process would involve experimentally, and then to seek for these possible new experimental facts.
It is a consequence of this view that any correlation is adapted to be an absolutely final element of explanation, and can never be superseded by the discovery of new experimental facts, if the correlation is by definition beyond the reach of further experiment. Such a possibility, for example, is contained in a correlation between the numerical magnitude of the gravitational constant and the total mass of the universe. Something of this sort may be well attempted by those who desire their explanations to take a formally final shape. We shall return to this subject later.
The instinctive demand for a mechanism is fortified by observation of the many important cases in which mechanisms have been discovered or invented. However, the significance of such successful attempts must be subject to most careful scrutiny. The matter has been discussed by Poincaré,[6] who showed that not only is it always possible to find a mechanistic explanation of any phenomenon (Hertz's program was a perfectly possible one), but there are always an infinite number of such explanations. This is very unsatisfactory.
[6]Henri Poincaré. Wissenschaft and Hypothese, Translated into German by F. and L. Lindemann, Teubner, Leipzig, 1906. See especially p. 217.
We want to be able to find the real mechanism. Now an examination of specific proposed mechanisms will show that most mechanisms are more complicated than the simple physical phenomenon which they are invented to explain, in that they have more independently variable attributes than the phenomenon has been yet proved to have. An example is afforded by the mechanical models invented to facilitate the study of the properties of simple inductive electrical circuits. The great number of such models which have been proposed is sufficient indication of their possible infinite number. But if the mechanism has more independently variable attributes than the original phenomenon, it is obvious that the question is without meaning whether the mechanism is the real one or not, for in the mechanism there must be simple motions or combinations of motions which have no counterpart in features of the original phenomenon as yet discovered. Obviously, then, the operations do not exist by which we may set up a one to one correspondence between the properties of the mechanism and the natural phenomenon, and the question of reality has no meaning. If, then, a mechanism is to be taken seriously as actually corresponding to reality, we must demand that it have no more degrees of freedom than the original phenomenon, and we must also be sure that the phenomenon has no undiscovered features. Physical experience shows that such conditions are most difficult to meet, and indeed the probability is that they are impossible.
A mechanism with more independently variable attributes than the phenomenon may prove to be a very useful tool of thought, and therefore worth inventing and studying, but it is to be regarded no more seriously than is a mnemonic device, or any other artifice by which a man forces his mind to give him better service.
There is another possible program of explanation, the converse of that considered above, namely, to explain all familiar facts of ordinary experience in terms of less familiar facts found at a deeper level. The most striking example of this is the recent attempt to give a complete electrical explanation of the universe. The original attempt was to explain electrical effects in mechanical terms; this attempt failed. At about the same time the existence of the electron was experimentally established, so that it was evident that electricity is a very fundamental constituent of matter. The program of explanation was reversed, and an electrical explanation sought for all mechanical phenomena, including in particular mechanical mass. But this attempt has also failed; we recognize that part of mass may be non-electrical in character, we postulate non-electrical forces inside the electron, and further, we usually postulate for electrons and protons the property of impenetrability, a property derived from experience on a higher scale of magnitude.
A program of this general sort is likely to be regarded with considerable sympathy, and indeed the chances of success seem much greater than do those of the converse program, for in our experience large scale phenomena are more often built up from and analyzed into small scale phenomena than the converse. But as a matter of principle we must again recognize that the only appeal is to experiment, and that we have to ask just one question: "Is it true, as a matter of fact, that all large scale phenomena can be built up of elements of small scale phenomena?" It seems to me that the experimental warrant for this conviction has not yet been given. The failure of the attempted electrical explanation of the universe is a case in point. However, the failure to prove a proposition is no guarantee that some time it may not be proved, and many physicists are convinced of the ultimate feasibility of this program. Personally I feel that the large may not always be analyzed into the smaller; the subject will be discussed again.
A conviction of the significance of microscopic analysis has many features in common with the usual conviction of the ultimate simplicity of nature. The thesis of simplicity involves in addition the assumption that the kinds of small scale elements are few in number, but actually this involves no important difference between the two convictions, because we have seen that the elements of which we build our structure become fewer in number as we approach the limit of the experimentally attainable. We may properly grant to convictions of this sort pragmatic value in suggesting new correlations and experiments, but a recognition of the empirical basis of all physics will not allow us to go further.
MODELS AND CONSTRUCTS
In discussing the concept of length, we could find no meaning in questions such as: "Is space on a scale of 10-8 cm. Euclidean?" Nevertheless it will seem to many that they do attach a perfectly definite meaning to a question of this kind. Of course it must be agreed that magnitudes of 10-8 cm. cannot be thought of in terms of immediate sensation. When one thinks of an atom as a thing with any geometrical properties at all, I believe he will find that what he essentially does is to imagine a model, multiplying all the hypothetical dimensions by a factor large enough to bring it to a magnitude of ordinary experience. This large scale model is given properties corresponding to those of the physical thing. For example, the model of the atom which was accepted in the fall of 1925 contains electrons rotating in orbits, and every now and then an electron jumps from one orbit to another, and simultaneously energy is radiated from the atom. Such a model is satisfactory if it offers the counterpart of all the phenomena of the original atom. Now I believe the only meaning that any one can find in his statement that the space of the atom is Euclidean is that he believes that he can construct in Euclidean space a model with all the observed properties of the atom. This possibility may or may not be sufficient to give real physical significance to the statement that the space of the atom is Euclidean. The situation here is very much the same as it was with respect to mechanisms. The model may have many more properties than correspond to measurable properties of the atom, and in particular, the operations by which the space of the model is tested for its Euclidean character may [and as a matter of fact I believe do] not have any counterpart in operations which can be carried out on the atom. Further, we cannot attach any real significance to the statement that the space of the atom is Euclidean unless we can show that no model constructed in non-Euclidean space can reproduce the measurable properties of the atom.
In spite of all this, I believe that the model is a useful and indeed unescapable tool of thought, in that it enables us to think about the unfamiliar in terms of the familiar. There are, however, dangers in its use: it is the function of criticism to disclose these dangers, so that the tool may be used with confidence.
Closely related to the mental model are mental constructs, of which physics is full. There are many sorts of constructs: those in which we are interested are made by us to enable us to deal with physical situations which we cannot directly experience through our senses, but with which we have contact indirectly and by inference. Such constructs usually involve the element of invention to a greater or less degree. A construct containing very little of invention is that of the inside of an opaque solid body. We can never experience directly through our senses the inside of such a solid body, because the instant we directly experience it, it ceases by definition to be the inside. We have here a construct, but so natural a one as to be practically unavoidable. An example of a construct involving a greater amount of invention is the stress in an elastic body. A stress is by definition a property of the interior points of a body which is connected mathematically in a simple way with the forces acting across the free surface of the body. A stress is then, by its very nature, forever beyond the reach of direct experience, and it is therefore a construct. The entire structure of a stress corresponds to nothing in direct experience; it is related to force, but is itself a six-fold magnitude, whereas a force is only three-fold.
We have next to ask whether the stress, which we have invented to meet the situation in a body exposed to forces, is a good construct. In the first place, a stress has the same number of degrees of freedom as the observable phenomenon, for it is one of the propositions of the mathematical theory of elasticity that the boundary conditions, which are the experimental variables, uniquely determine the stress in a given body [i.e. a body of given elastic constants]; and of course it is at once obvious, by an inspection of the equations, that conversely a possible stress system uniquely determines the boundary conditions to the significant amount. There is, therefore, a unique one-to-one correspondence between a stress and the physical situation it was made to meet, and so far a stress is a good construct. Up to this point a stress, from the point of view of the operations in terms of which it is defined, is a purely mathematical invention, which is justified because it is convenient in describing the behavior of bodies subjected to the action of force. But we wish now to go farther and ascribe physical reality to a stress, meaning by this that a stress in a solid body shall correspond to some real physical state of the interior points. Let us examine, from the point of view of operations, what the meaning of a statement like this may be. Since we now wish to ascribe an additional physical meaning to a stress beyond that of the mathematical operations in terms of which the stress was determined, there must exist additional physical operations corresponding to this meaning, or else our statement is meaningless. Now of course it is a matter of the most elementary experience that physical phenomena do exist which allow these other independent operations. A body under stress is also in a state of strain, which may be determined from the external deformations, or the strain at internal points may be made more vividly real by those optical effects of double refraction in transparent bodies which are now so extensively used in model experiments, or if the stress is pushed beyond a certain point, we have such new phenomena as permanent set or finally, rupture.
We have, then, every reason to be satisfied with our construct of stress. In the first place, from the formal point of view, it is a good construct because there is a unique correspondence between it and the physical data in terms of which it is defined; and in the second place we have a right to ascribe physical reality to it because it is uniquely connected with other physical phenomena, independent of those which entered its definition. This last requirement, in fact, from the operational point of view, amounts to nothing more than a definition of what we mean by the reality of things not given directly by experience. Since now in addition to satisfying the formal requirements, experience shows that a stress is most useful in correlating phenomena, we are justified in giving to this construct of stress a prominent place among our concepts.
Consider now another construct, one of the most important of physics, that of the electric field. In the first place, an examination of the operations by which we determine the electric field at any point will show that it is a construct, in that it is not a direct datum of experience. To determine the electric field at a point, we place an exploring charge at the point, measure the force on it, and then calculate the ratio of the force to the charge. We then allow the exploring charge to become smaller and smaller, repeating our measurement of force on each smaller charge, and define the limit of the ratio of the force to the charge as the electric field intensity at the point in question, and the limiting direction of the force on a small charge as the direction of the field. We may extend this process to every point of space, and so obtain the concept of a field of force, by which every point of the space surrounding electric charges is tagged with the appropriate number and direction, the exploring charge having completely disappeared. The field is, then, clearly a construct. Next, from the formal point of view of mathematics, it is a good construct, because there is a one to one correspondence between the electric field and the electric charges in terms of which it is defined, the field being uniquely determined by the charges, and conversely there being only one possible set of charges corresponding to a given field. Now nearly every physicist takes the next step, and ascribes physical reality to the electric field, in that he thinks that at every point of the field there is some real physical phenomenon taking place which is connected in a way not yet precisely determined with the number and direction which tag the point. At first this view most naturally involved as a corollary the existence of a medium, but lately it has become the fashion to say that the medium does not exist, and that only the field is real. The reality of the field is self-consciously inculcated in our elementary teaching, often with considerable difficulty for the student. This view is usually credited to Faraday, and is considered the most fundamental concept of all modern electrical theory. Yet in spite of this, I believe that a critical examination will show that the ascription of physical reality to the electric field is entirely without justification. I cannot find a single physical phenomenon or a single physical operation by which evidence of the existence of the field may be obtained independent of the operations which entered the definition. The only physical evidence we ever have of the existence of a field is obtained by going there with an electric charge and observing the action on the charge [when the charges are inside atoms we may have optical phenomena], which is precisely the operation of the definition. It is then either meaningless to say that a field has physical reality, or we are guilty of adopting a definition of reality which is the crassest tautology.
There can be no question whatever of the tremendous importance of the concept of the electric field as a tool in thinking about, describing, correlating, and predicting the properties of electrical systems; electrical science is inconceivable without this or something equivalent. But in addition to this aspect of the field concept, the further tacit implication of physical reality is almost always present, and has had the greatest influence on the character of physical thought and experiment. Yet I do not believe that the additional implication of physical reality has justified itself by bringing to light a single positive result, or can offer more than the pragmatic plea of having stimulated a large number of experiments, all with persistently negative results. It is sufficient to mention the fate of the attempt of Faraday and Maxwell to ascribe a stress like that of ordinary matter to the ether, which failed because, among other reasons, nothing can exist in the ether analogous to the strain of ordinary matter, to indicate the unfruitfulness of the idea of physical reality. It seems to me that any pragmatic justification in postulating reality for the electric field has now been exhausted, and that we have reached a stage where we should attempt to get closer to the actual facts by ridding the field concept of the implications of reality.
Another indispensable and most interesting construct is that of the atom. This is evidently a construct, because no one ever directly experienced an atom, and its existence is entirely inferential. The atom was invented to explain constant combining weights in chemistry. For a long time there was no other experimental evidence of its existence, and it remained a pure invention, without physical reality, useful in discussing a certain group of phenomena. It is one of the most fascinating things in physics to trace the accumulation of independent new physical information all pointing to the atom, until now we are as convinced of its physical reality as of our hands and feet.
A construct which had to be abandoned because it did not turn out to have physical reality, and which furthermore was not sufficiently useful in the light of newly discovered phenomena, was that of a caloric fluid.
The notion of "physical reality" is not of prime importance to this discussion of the character of our constructs; our definition of the meaning of physical reality may not appeal to everyone. The essential point is that our constructs fall into two classes: those to which no physical operations correspond other than those which enter the definition of the construct, and those which admit of other operations, or which could be defined in several alternative ways in terms of physically distinct operations. This difference in the character of constructs may be expected to correspond to essential physical differences, and these physical differences are much too likely to be overlooked in the thinking of physicists. We must always be on our guard not to forget the physical differences between a thing like a stress in an elastic body and an electromagnetic field.
The moral of all this is that constructs are most useful and even unavoidable things, but that they may have great dangers, and that a careful critique may be necessary to avoid reading into them implications for which there is no warrant in experience, and which may most profoundly affect our physical outlook and course of action.
THE RÔLE OF MATHEMATICS IN PHYSICS
Practically all the formulations of theoretical physics are made in mathematical terms; in fact to obtain such formulations is generally felt to be the goal of theoretical physics. It is then evidently pertinent to consider what the nature of the mathematics is to which we assign so prominent a rôle.
We have in the first place to understand why it is possible to express physical relations in mathematical language at all. I am not sure that there is much meaning in this question. It is the merest truism, evident at once to unsophisticated observation, that mathematics is a human invention. Furthermore, the mathematics in which the physicist is interested was developed for the explicit purpose of describing the behavior of the external world, so that it is certainly no accident that there is a correspondence between mathematics and nature. The correspondence is not by any means perfect, however, but there is always in mathematics a precise quality to which none of our information about nature ever attains. The theorems of Euclid's geometry illustrate this in a preeminent degree. The statement that there is just one straight line between two points and that this is the shortest possible path between the points is entirely different in character from any information ever given by physical measurement, for all our measurements are subject to error. It is possible, nevertheless, to give a certain real physical meaning to the ideally precise statements of geometry, because it is a result of everyday experience that as we refine the accuracy of our physical measurements the quantitative statements of geometry are verified within an ever decreasing margin of error. From this arises that view of the nature of mathematics which apparently is most commonly held; namely that if we could eliminate the imperfections of our measurements, the relations of mathematics would be exactly verified. Abstract mathematical principles are supposed to be active in nature, controlling natural phenomena, as Pythagoras long ago tried to express with his harmony of the spheres and the mystic relations of numbers.
This idealized view of the connection of mathematics with nature could be maintained only during that historical period when the accuracy of physical measurement was low, and must now be abandoned. For it is no longer true that the precise relations of Euclid's geometry may be indefinitely approximated to by increasing the refinements of the measuring process, but there are essential physical limitations to the very concepts of length, etc., which enter the geometrical formulations, set by the discrete structure of matter and of radiation. This is no academic matter, but touches the essence of the situation. There is no longer any basis for the idealization of mathematics, and for the view that our imperfect knowledge of nature is responsible for failure to find in nature the precise relations of mathematics. It is the mathematics made by us which is imperfect and not our knowledge of nature. [From the operational point of view it is meaningless to attempt to separate "nature" from "knowledge of nature".] The concepts of mathematics are inventions made by us in the attempt to describe nature. Now we shall repeatedly see that it is the most difficult thing in the world to invent concepts which exactly correspond to what we know about nature, and we apparently never achieve success. Mathematics is no exception; we doubtless come closer to the ideal here than anywhere else, but we have seen that even arithmetic does not completely reproduce the physical situation.
Mathematics appears to fail to correspond exactly to the physical situation in at least two respects. In the first place, there is the matter of errors of measurement in the range of ordinary experience. Now mathematics can deal with this situation, although somewhat clumsily, and only approximately, by specifically supplementing its equations by statements about the limit of error, or replacing equations by inequalities—in short, the sort of thing done in every discussion of the propagation of error of measurement. In the second place, and much more important, mathematics does not recognize that as the physical range increases, the fundamental concepts become hazy, and eventually cease entirely to have physical meaning, and therefore must be replaced by other concepts which are operationally quite different. For instance, the equations of motion make no distinction between the motion of a star into our galaxy from external space, and the motion of an electron about the nucleus, although physically the meaning in terms of operations of the quantities in the equations is entirely different in the two cases. The structure of our mathematics is such that we are almost forced, whether we want to or not, to talk about the inside of an electron, although physically we cannot assign any meaning to such statements. As at present constructed, mathematics reminds one of the loquacious and not always coherent orator, who was said to be able to set his mouth going and go off and leave it. What we would like is some development of mathematics by which the equations could be made to cease to have meaning outside the range of numerical magnitude in which the physical concepts themselves have meaning. In other words, the problem is to make our equations correspond more closely to the physical experience back of them; it evidently needs some sort of new invention to accomplish this.
We return later, in discussing Lorentz's equations of electrodynamics, to the disadvantages arising from the present undiscriminating character of mathematics. In the meantime, we must recognize that there are very important advantages here, as well as disadvantages. All experience justifies the expectation that the laws of nature with which we are already familiar hold at least approximately and without violent change in the unexplored regions immediately beyond our present reach. By assuming an unlimited validity for the laws as we now know them, mathematics enables us to penetrate the twilight zone, and make predictions which may be later verified. It is only when we are carried too far afield that we must deprecate this characteristic of our mathematics.
There is another aspect of the use of mathematics in describing nature that is often lost sight of; namely, that any system of equations can contain only a very small part of the actual physical situation; there is behind the equations an enormous descriptive background through which the equations make connection with nature. This background includes a description of all the physical operations by which the data are obtained which enter the equations. For instance, when Einstein formulates the behavior of the universe in terms of the world lines of events, the events as they enter the equations are entirely colorless things, merely 3 space and 1 time coördinate. To make connection with experience there must be a descriptive background giving the physical contents of the events; for example, there may be the statement that some of the events are light signals. This descriptive background is supposed to remain fixed, unaffected by any operations to which the equations themselves are subject. If, for example, the frame of reference of the equations is altered by changing its velocity, the physical significance of the descriptive background is supposed to remain unaltered, or rather no mention is usually made of this question at all. It would seem, however, that this matter needs some discussion. The descriptive background gets its meaning only in terms of certain physical operations. If the descriptive background remains unaltered when the uniform velocity of the frame of reference is changed, for instance, this means that the motion of the frame of reference does not at all affect the possibility of carrying out certain operations. This is pretty close to the restricted principle of relativity itself, which states that the form of natural laws is not affected by uniform velocity. Until a more careful analysis of the situation is made it would seem therefore that there is some ground for the suspicion that the principle of relativity is involved in the possibility of giving to physical phenomena a complete mathematical formulation, understanding "complete" to mean "including the descriptive background."
CHAPTER III
DETAILED CONSIDERATION OF VARIOUS CONCEPTS OF PHYSICS
WE now begin our detailed consideration of the most important concepts of physics. It is entirely beyond the scope of this essay to attempt more than an indication of some of the most important matters. Neither is it to be expected that the parts of this discussion will always have a very close connection with each other; the purpose of the discussion is to aid in acquiring the greatest possible self-consciousness of the whole structure of physics.
THE CONCEPT OF SPACE
A logically satisfying definition of what we understand by the concept of space is doubtless difficult to give, but we shall not be far from the mark if we think of it as the aggregate of all those concepts which have to do with position. Position means position of something. The position of things is determined by some system of measurement; perhaps the simplest is that implied in a Cartesian coördinate system with its three measurements of length. Hence much of the essential discussion of space has already been given in connection with the concept of length. We have seen that measurements of length are made with physical measuring rods applied to some physical object. We cannot measure the distance between two points in empty space, because if space were empty there would be nothing to identify the position of the ends of the measuring rod when we move it from one position to the next. We see, then, from the point of view of operations that the framework of Cartesian geometry, often imagined in an ideal mathematical sense, is really a physical framework, and that what we mean by spatial properties is nothing but the properties of this framework. When we say that space is Euclidean, we mean that the physical space of meter sticks is Euclidean: it is meaningless to ask whether empty space is Euclidean. Geometry, therefore, in so far as its results are expected to apply to the external physical world, and in as far as it is not a logical system built up from postulates, is an experimental science. This view is now well understood and accepted, but there was a time when it was not accepted, but vigorously attacked; the change of attitude toward this question is symptomatic of a change of attitude toward many other similar questions.
We have already emphasized that the space of astronomy is not a physical space of meter sticks, but is a space of light waves. We may, therefore, have different kinds of space, depending on the fundamental operations. The space of meter sticks we have called "tactual space", and the space of light beams "optical space". If we ask whether astronomical space is Euclidean, we mean merely to ask whether those features of optical space which are within the reach of astronomical measurement are Euclidean. The only possible attitude with respect to this question, or such related questions as whether the total volume of space is finite, or whether space has curvature, is that it is entirely for experiment to decide, and that we have no right to form any preconceived notion whatever. It is therefore beyond the scope of this discussion.
It is interesting to notice that the restricted theory of relativity virtually assumes, although often without making the explicit statement, that tactual and optical space are the same. This equivalence results from the properties assumed for light beams. The distance of a mirror may be found equally well by measuring it with meter sticks, or by determining the time required by a light signal to travel there and back. This situation is, however, logically unsatisfying, because it must be assumed that the operations for measuring time are independently defined, and we shall see that they are not. It is a consequence of the assumed equivalence of tactual and optical space that the path of a beam of light is a straight line, as a straight line is determined by operations with meter sticks. When we come to astronomical phenomena, the physical operations with meter sticks can no longer be carried out, and it is meaningless to ascribe to beams of light on an astronomical scale the same geometrical properties that we do on a small scale.
THE CONCEPT OF TIME
According to our viewpoint, the concept of time is determined by the operations by which it is measured. We have to distinguish two sorts of time; the time of events taking place near each other in space, or local time, and the time of events taking place at considerably separated points in space, or extended time. As we now know, the concept of extended time is inextricably mixed up with that of space. This is not primarily a statement about nature at all, and might have been made simply by the observation that the operations by which extended time is measured involve those for measuring space. Of course historically the doctrine of relativity was responsible for the critical attitude which led to an examination of the operations of measuring time, but relativity was not necessary for a realization of the spatial implications of time, any more than the discovery of Planck's quantum unit h was necessary for the invention by Planck of his absolute units of measurement, although historically he was inspired to make this invention by discovering h, and in his own mind seems to have thought of the connection as a necessary one.[7]
[7]Max Planck, The Theory of Heat Radiation, translated by Masius, P. Blakiston's Son & Co., 1914 edition, p. 174.
The physical operations at the basis of the measurement of time have never been subjected to the critical examination which seems to be required. One method of measurement, for instance, involves the properties of light.
A meter stick is set up with mirrors at the two ends, and a light beam travels back and forth between the two mirrors without absorption. The time required for a single passage back and forth is defined as the unit of time, and time is measured simply by counting these intervals. But such a procedure is unsatisfactory if we are to permit ourselves all those operations which are demanded by even the simplest postulate of relativity, for we must be able to move our clock from place to place, transfer it from one system to another in relative motion, and with it determine the properties of light beams in the stationary or moving system. We recognize in principle that the length of the meter stick may be different when it is in motion, that it may change also during the acceleration incident to moving it from one place to another, and that until proved to the contrary the velocity of light may be a function of velocity or acceleration. The complicated interplay of all these possibilities leaves us in much doubt as to the physical significance of such postulates as, for example, that the velocity of light is the same in the moving system and the stationary system. In order to ascribe any simple significance to postulates about the velocity of light, it would seem that we must have an instrument for measuring this velocity, and therefore for measuring time, which does not itself involve the properties of light. To do this we might seek to specify the measurement of time in purely mechanical terms, as for instance in terms of the vibration of a tuning fork, or the rotation of a flywheel. But here again we encounter great difficulties, because we recognize that the dimensions of our mechanical clock may change when it is set in motion, and that the mass of its parts may also change. We want to use the clock as a physical instrument in determining the laws of mechanics, which of course are not determined until we can measure time, and we find that the laws of mechanics enter into the operation of the clock.
The dilemma which confronts us here is not an impossible one, and is in fact of the same nature as that which confronted the first physicist who had to discover simultaneously the approximate laws of mechanics and geometry with a string which stretched when he pulled it. We must first guess at what the laws are approximately, then design an experiment so that, in accordance with this guess, the effect of motion on some phenomenon is much greater than the expected effect on the clock, then from measurements with uncorrected clock time find an approximate expression for the effect of motion on mass or length, with which we correct the clock, and so on ad infinitum. However, so far as I know, the possibility of such a procedure has not been analyzed, and until the analysis is given, our complacency is troubled by a real disquietude, the intensity of which depends on the natural skepticism of our temperament.
In practice, the difficulties of such a logical treatment are so great that the matter has been entirely glossed over. It is convenient to postulate a clock, of unknown construction, but such that the velocity of light, when measured in terms of it, has certain properties. Such, for example, is the point of view in Birkhoff's recent book.[8]
[8]G. D. Birkhoff. Relativity and Modern Physics, Harvard University Press, 1923.
The difficulty with this method is that the resulting edifice is as divorced from physical reality as is the logical geometry of postulates. We cannot be at all sure that the properties of light as measured with our physical clocks are the same as the theoretical properties. The difficulty is particularly important and fundamental in the general theory of relativity; the basis of the whole theory is the infinitesimal interval ds, which is supposed to be given. Once given, the mathematics follows. But in a physical world, ds is not given, but must be found by physical operations, and these operations involve measurements of length and of time with clocks whose construction is not specified. In any actual physical application the question must be answered whether the physical instrument used in measuring the temporal part of ds is really a clock or not. There is at present no criterion by which this question can be answered. If the vibrating atom is a clock, then the light of the sun is shifted toward the infra-red, but how do we know that the atom is a clock (some say yes, others no)? If we find the displacement physically have we thereby proved that general relativity is physically true, or have we proved that the atom is a clock, or have we merely proved that there is a particular kind of connection between the atom and the rest of nature, leaving the possibility open that neither is the atom a clock nor general relativity true? In practice, of course, we shall adopt the solution which is simplest and most satisfying aesthetically, and doubtless shall say that the atom is a clock and relativity true. But if we adopt this simple view, we must also cultivate the abiding consciousness that at some time in the future troubles may have their origin here.
It seems to me that the logical position of general relativity theory is merely this: Given any physical system, then it is possible to assign values to ds such that relations mathematically deduced by the principle of relativity correspond to relations between measurable quantities in the physical system; but that the things that we physically call ds are anything more than approximately connected with the ds's required to give the mathematical relations, is at present no more than a pious faith.
To return to the concept of time, we have already stated that there are two main problems, that of measuring time at a single point of space, and that of spreading a time system over all space. The second aspect of the problem is that to which attention has been directed by relativity theory; the following detailed examination shows how the operations of relativity for setting and synchronizing clocks at distant places involve the measurement of space. It is a fundamental postulate that the adjustment of the clocks is to be accomplished by light signals. The synchronization of the clocks is now simple enough. We merely demand that light signals sent from the master clock at intervals of one second arrive at any distant clock at intervals of one second as measured by it, and we change the rate of the distant clock until it measures these intervals as one second. After its rate has been adjusted, the distant clock is to be so set that when a light signal is despatched from the master clock at its indicated zero of time the time of arrival recorded at the distant clock shall be such that the distance of the clock from the master clock divided by the time of arrival shall give the velocity of light, assumed already known. This operation involves a measurement of the distance of the distant clock, so that in spreading the time coordinates over space the measurement of space is involved by definition, and the measurement of time is, therefore, not a self-contained thing. This is the physical basis for the treatment of space and time as a four-dimensional manifold. Although mathematically the numbers measuring space and time enter the formulas symmetrically, nevertheless the physical operations by which these numbers are obtained are entirely distinct and never fuse, and I believe it can lead only to confusion to see in the possibility of a four dimensional treatment anything more than a purely formal matter.
The notion of extended time, therefore, involves the measurement of space. It is an interesting question whether the notion of local time also involves the measurement of space. A rigorous answer to this question involves giving the specifications for the construction of a clock, which we have seen has not yet been done. It seems to me probable, however, that the construction of even a single local clock involves in some way the measurement of space. If, for example, we use a vibrating tuning fork, we must find how the time of vibration depends on the amplitude of vibration, and this involves space measurement, or if we use a rotating flywheel, we have to correct for the change of moment of inertia due to the change of dimensions when it is set into motion or brought into a gravitational field, and all this involves space measurement. However, these considerations are not certain, and perhaps the question is not important.
There is now the further consideration that actually in practice the concept of local time is not entirely divorced from that of extended time, for two bodies cannot occupy the same space at the same time, and the time of any event is actually measured on an instrument at some distance, communication being maintained by light or elastic signals. But experience convinces us that in the limit, as the phenomenon to be measured gets closer to the clock, there is no measurable difference, whether communication with the clock is maintained by light, or acoustical or tactual signals, so that we have come in physical practice to accept measurement of the time of events in the immediate neighborhood of the clock (local time) as one of the ultimately simple things behind which we do not attempt to go.
Local time is, therefore, a concept treated by the physicist even now as simple and unanalyzable. This concept is what most people have in mind when they think of time. Time, according to this concept, is something with the properties of local time; it was something of this kind that Newton must have meant by his absolute time, and it is the tacit retention of this sort of concept that is responsible for the difficulty so often found in grasping the idea of the relativity of simultaneity, which is of course entirely foreign to our experience of simultaneity in local time. An examination of the operations involved in extending time has shown how the concept of extended time is different from that of simple local time; this difference leads to appreciably different numerical relations when we are dealing with high velocities or great distances. Local time is proved by experience not to be a satisfactory concept for dealing with events separated by great distances in space or with phenomena involving high velocities. For instance, we must not talk about the age of a beam of light, although the concept of age is one of the simplest derivatives of the concept of local time. Neither must we allow ourselves to think of events taking place in Arcturus now with all the connotations attached to events taking place here now. It is difficult to inhibit this habit of thought, but we must learn to do it. The naïve feeling is very strong that it does mean something to talk about the entire present state of the universe independent of the process by which news of the condition of distant parts is determined by us. I believe that an examination of this feeling will show that it is psychological in character; what we mean by the totality of the present is merely the entire present content of our consciousness. This is apparently a simple direct thing; we do not appreciate until we make further analysis that our present consciousness of the existence of the moon or a star is due to light signals, and that therefore the apparently simple immediate consciousness of events distant in space involves complicated physical operations.
Similarly, if we continue to use local time, we get into trouble, when we go to high velocities, with our simple concept of velocity, which may be defined in terms of a combination of space and time concepts. The concept of local time thus loses its value and becomes merely a blunted tool when we try to carry it out of its original range. But the concept of extended time, with which we have to replace local time, is a complicated thing, to which we have not yet got ourselves accustomed; it may perhaps prove to be so complicated as never to be a very useful intuitive tool of thought.
All these considerations about time have been concerned only with intervals of such an order of magnitude that they are readily experienced by any individual. If we have to deal with intervals either very long or very short, it is obvious that our entire procedure changes, and consequently the concept changes. In extending the time concept to eras remote in the past, for example, we try as always, to choose the new operations so as to piece on continuously with those of ordinary experience. A precise analysis of the change in the concept of time when applied to the remote past does not seem to be of great significance for our present physical purpose, and will not be attempted here. It is perhaps worth while to point out, however, that all our other concepts, as well as that of time, must be modified when applied to the remote past; an example is the concept of truth. It is amusing to try to discover what is the precise meaning in terms of operations of a statement like this: "It is true that Darius the Mede arose at 6:30 on the morning of his thirtieth birthday."
Of more concern for our physical purposes is the modification which the time concept undergoes when applied to very short intervals. What is the meaning, for example, in saying that an electron when colliding with a certain atom is brought to rest in 10-18 seconds? Here I believe the situation is very similar to that with regard to short lengths. The nature of the physical operations changes entirely, and as before, comes to contain operations of an electrical and optical character. The immediate significance of 10-18 is that of a number, which when substituted into the equations of optics, produces agreement with observed facts. Thus short intervals of time acquire meaning only in connection with the equations of electrodynamics, whose validity is doubtful and which can be tested only in terms of the space and time coordinates which enter them. Here is the same vicious circle that we found before. Once again we find that concepts fuse together on the limit of the experimentally attainable.
This discussion of the concept of time will doubtless be felt by some to be superficial in that it makes no mention of the properties of the physical time to which the concept is designed to apply. For instance, we do not discuss the one dimensional flow of time, or the irrevocability of the past. Such a discussion, however, is beyond our present purpose, and would take us deeper than I feel competent to go, and perhaps beyond the verge of meaning itself. Our discussion here is from the point of view of operations: we assume the operations to be given, and do not attempt to ask why precisely these operations were chosen, or whether others might not be more suitable. Such properties of time as its irrevocability are implicitly contained in the operations themselves, and the physical essence of time is buried in that long physical experience that taught us what operations are adapted to describing and correlating nature. We may digress, however, to consider one question. It is quite common to talk about a reversal of the direction of flow of time. Particularly, for example, in discussing the equations of mechanics, it is shown that if the direction of flow of time is reversed, the whole history of the system is retraced. The statement is sometimes added that such a reversal is actually impossible, because it is one of the properties of physical time to flow always forward. If this last statement is subjected to an operational analysis, I believe that it will be found not to be a statement about nature at all, but merely a statement about operations. It is meaningless to talk about time moving backward: by definition, forward is the direction in which time flows.
THE CAUSALITY CONCEPT
The causality concept is unquestionably one of the most fundamental, perhaps as fundamental as that of space and time, and therefore at least equally entitled to a first place in the discussion. But as ordinarily understood, there are certain spatial and temporal implications in the causality concept, so that it can best be discussed in this order after our examination of space and time.
There is an aspect of the causality concept that in many respects is closely related to the question of "explanation", for to find the causes of an event usually involves at the same time finding its explanation. But there are nevertheless sufficient differences to warrant a separate discussion.
It seems fairly evident that there was originally in the causality concept an animistic element much like that in the concept of force to be discussed later. The physical essence of the concept as we now have it, freed as much as possible from the animistic element, seems to be somewhat as follows. We assume in the first place an isolated system on which we can perform unlimited identical experiments, that is, the system may be started over again from a definite initial condition as often as desired.[9]
[9]We must include in general in the concept of "initial condition" the past history of the system. In order not to make this condition so broad as to defeat itself, we have to add the observation that actually identity of past history is necessary over only a comparatively short interval of time. Logical precision seems unattainable here—the physical concepts themselves have not the necessary precision.
We assume further that when so started, the system always runs through exactly the same sequence of events in all its parts. This contains the assumption that the course of events runs independent of the absolute time at which they occur—there is no change with time of the properties of the universe.[10] It is a result of experience that systems with these properties actually exist. An alternative way of stating our fundamental hypothesis is that two or more isolated similar systems started from the same initial condition run through the same future course of events. Upon the system given in this way, which by itself runs a definite course of events, we assume that we can superpose from the outside certain changes, which have no connection with the previous history of the system, and are completely arbitrary. Now of course in nature, as we observe it, there is no such thing as an arbitrary change, without connection with past history, so that strictly our assumption is a pure fiction. It is here that the animistic element still seems to persist, although perhaps not necessarily.
[10]As so often in physics, we appear to be doing two things at once here. It is doubtful whether we can give a meaning to "definite initial condition" apart from the future behavior of the system, so that we have no real right to infer from uniform future behavior both a constancy of the laws of nature, independent of time, and a constancy of initial condition. I very much question whether a thoroughgoing operational analysis would show that there are really two independent concepts here, and whether the use of two formally quite different concepts is anything more than a convenience in expression. It seems to me that it may be just as meaningless to ask whether the laws of nature are independent of time as it was to ask with Clifford whether the absolute scale of magnitude may not be changing as the solar system travels through space.
We regard our acts as not determined by the external world, so that changes produced in the external world by acts of our wills are, to a certain degree of approximation, arbitrary. The system, then, on which we are experimenting, is one capable of isolation from us in that we may regard ourselves as outside the system, and having no connection with it. The system, furthermore, is capable of isolation from the rest of the physical universe, in that events taking place outside the system have no connection with those taking place inside.[11] Experience gives the justification for assuming that physical isolation of this sort is possible. Actually, of course, isolation is never complete, but only partial, up to presumably any desired degree of approximation.
[11]Here again, the concept of "isolation" or "connection" is defined only in terms of the behavior of the system, and it is not clear whether this is really an operationally independent concept or not.
The statement that two exactly similar isolated systems, starting from the same initial conditions (including past history in the general idea of initial condition) will run through the same future course of events involves as a corollary that if differences develop in the behavior of two such apparently similar systems these differences are evidence of other previous differences. The thesis that this corresponds to experience may be called the thesis of essential connectivity and is perhaps the broadest we have: it is the thesis that differences between the behavior of systems do not occur isolated but are associated with other differences. It is essentially the same thesis as that already mentioned in connection with "explanation", namely that it is possible to correlate any of the phenomena of nature with other phenomena.
If now the connectivity or correlation between phenomena is of a special kind, we have a causal connection; namely, if whenever we arbitrarily impress event A on a system we find that event B, always occurs, whereas if we had not impressed A, B would not have occurred, then we say that A is the cause of B, and B the effect of A. By suitably choosing the event A, we may find the effect of any event of which the system is susceptible.
The relation between A and B is an unsymmetrical one, by the very nature of the definition, the cause being the arbitrary variable element, and the effect that which accompanies it. Furthermore, A may obviously be the cause of more than one event B, and may cause a whole train of events.
The causal concept analyzed in this way is not simple by any means. We do not have a simple event A causally connected with a simple event B, but the whole background of the system in which the events occur is included in the concept, and is a vital part of it. If the system, including its past history, were different, the nature of the relation between A and B might change entirely. The causality concept is therefore a relative one, in that it involves the whole system in which the events take place.
In practice we now take an exceedingly pregnant step and seek to extend the concept, and rid ourselves as much as possible of its relativity. It is a matter of experience that there are often a great number of systems in which A is the cause of B. In many cases the causal relation persists through such a very wide range of systems that we lose sight entirely of the system, and come to assume that we have an absolute causal connection between A and B. For instance, when I strike a bell, and hear the sound, the causal connection persists through such a great number of different kinds of system that I might think that here is an absolute causal connection. Such an absolute causal connection would mean that always under all circumstances, the striking of the bell is accompanied by a sound. But all conditions means only all those conditions covered by experiment. Thus in the case of the bell, all our experiments were made in the presence of the atmosphere. The causal connection between the striking of the bell and the sound should have been always recognized in principle as relative to the presence of the atmosphere. Indeed, later experiments in the absence of the atmosphere show that the atmosphere does play an essential part. Now as a matter of fact, the atmosphere is so comparatively easy to remove that we very readily include the atmosphere in the chain of causal connection. But if the atmosphere had been impossible to remove, like the old ether of space, our idea of the causal connections between the striking of the bell and its sound might have been quite different. In actual physical applications of the causality concept, the constant background which is maintained during all the variations by which the causal connection is established usually has to be inferred from the context.
It is a matter of perhaps universal experience that the event A is accompanied by not only one event, which is the effect of A by definition, but A entails a whole causal train of events. It seems to be a generalization from experience that the causally connected train of events started by A is a never ending train, provided the system is large enough. This is perhaps not necessary in the general case, but if the event A involves imparting external energy to the system, or the action of external force (momentum change), there can be no question.
That there is a causal train started by A is particularly evident if A and B are separated in space. Thus in the case of the bell, the impulse given to the air by the vibration of the bell is propagated through the air as an elastic wave, which thus constitutes the causal train of events. The phenomenon of propagation is characteristic of causal connections of a mechanical character, and is the justification for the introduction of the time concept in connection with the causality concept, where it now appears for the first time. It is evident that when a disturbance is propagated to a distant point, the effect follows the cause in time, as time is usually measured.
We extend this result, and usually think that the effect necessarily follows the cause. We now examine whether this is a necessary result of the causality concept. If we are to talk about the time of events at different places, we must have some way of setting clocks all over space. If this is done arbitrarily, there is no necessary connection between the local clock times of a cause and its effect, but nevertheless the causality concept involves a certain temporal relation even in this most general case. Suppose that event A takes place at point 1 and its effect, event B, at point 2. We station a confederate at 2 who sends a light signal (or any other sort of signal) to 1, as soon as the event B occurs at 2. Then it is a consequence of the nature of the causality concept that the signal cannot arrive at 1 before event A occurs. For if it did arrive before A, we should merely omit to perform A, which by hypothesis is arbitrary, and entirely in our control, and then our assumption would be violated that the system is such that the event B occurs only when A also occurs. The same argument shows a fortiori that if the effect B occurs at the same place as its cause A, it cannot precede it in time. I cannot see that the nature of the causality concept imposes any further restriction on the time of B. The restricted principle of relativity, however, in postulating that no signal can be propagated faster than a light signal, virtually makes a further assumption about the temporal connection of causally connected events, namely, that the event B at 2 cannot occur before the arrival at 2 of a light signal which started from 1 at the instant that A occurred at 1. For if B did occur earlier, we could use events A and B as a signaling code, thus violating our hypothesis.
There is thus a closest connection in time, when time is extended over space as the theory of relativity directs, between cause and effect, depending on their separation in space; from this arises the relativity concept of the causal cone, which in the four dimensional manifold of space-time divides the aggregate of all those events which may be causally related from the aggregate of those which are separated by such a small interval in time and such a large interval in space that communication by light signals and therefore a causal connection is not possible. Given now two events A and B which are related as cause to effect in one system of reference, then they must be causally related also in every other system of reference. For if they were not, we could by definition of causality suppress the event A in one of the systems in which the causal relation does not hold, and this, because of the nature of the concept of event, involves suppressing A in all the systems, thus violating our hypothesis of a causal connection in the original system. The concept of event involved in this argument will be examined later. It appears then, that the fundamental postulate of relativity (that the form of natural laws is the same in all reference systems) demands that the temporal order of events causally connected be the same in all reference systems.
The whole universe at this present moment is often supposed to be causally connected with all succeeding states. This means that if we could repeat experience, starting from the same initial conditions, the future course of events would always be found to be the same. The truth of this conviction can never be tested by direct experiment, but it is something at which we arrive by the usual physical process of successive approximation. It is difficult to formulate precisely what we mean by "present" state of the universe, and there is every reason to think that such a formulation is not unique, but the concept contains the necessary implication that none of the events constituting the "present" can be causally connected. The events in distant places which constitute the present must be separated by an interval of time less than time required by light to travel between the two places.
The conviction, arising from experience, that the future is determined by the present and correspondingly the present by the past, is often phrased differently by saying that the present causally determines the future. This is in a certain sense a generalization of the causality concept. It is one of the principal jobs of physics to analyze this complex causal connection into components, representing as far as possible the future state of the system as the sum of independent trains of events started by each individual event of the present. How far such an analysis is possible must be decided by experiment. It is certainly possible to a very large extent in most cases, but there seems to be no reason to expect that a complete analysis is possible. So far as the system is describable in terms of linear differential equations, the causal trains started by different events propagate themselves in space and time without interference and with simple addition of effects, and conversely the present may be analyzed back into the simple sum of elementary events in the past, but if the equations governing the motion of the system are not linear, effects are not additive, and such a causal analysis into elements is not possible. No emphasis is to be laid here on the differential aspect of the equations: it is quite possible that finite difference equations may have the same property of additivity. Although there can be no question that linear equations enormously preponderate, neither can there be any doubt that some phenomena cannot be described in terms of linear equations (e.g., ferro-magnetism), so that there seems no reason to think that a causal analysis is always possible. I believe, however, that the assumption that such an analysis into small scale elements is possible is tacitly made in the thought of many physicists. If the analysis is not possible, we may expect to find results following the cooperation of several events which cannot be built up from the results of the events occurring individually.
When a causal analysis is possible, finding the simplest events which act as the origin of independent causal trains is equivalent to finding the ultimate elements in a scheme of explanation, so that here we merge with the concept of explanation, as already mentioned. As was true of the explanatory sequence, so here there can be no formal end of the causal sequence, because we can always ask for the cause of the last member. But it may be physically meaningless to extend the causal sequence beyond a certain point. We have seen from the point of view of operations that the causal concept demands the possibility of variation in the system. It is therefore meaningless to say that A is the cause of B unless we can experience systems in which A does not occur. Now if in extending the causal sequence, we eventually arrive at a condition so broad that physically no further variation can be made, our causal sequence has to stop.
Corresponding to this property of the causality concept, the causal sequence may be terminated either formally, by postulate, or naturally, by the intrinsic physical nature of the elements of the sequence. Thus if we say that light gets from point to point because it is propagated by a medium of unalterable properties, which fills all space, which is always present and can never be eliminated physically, we have by the postulated properties of the medium brought the possibility of further inquiry to a close, because to take the next step and ask the cause of the properties of the ether, demands that we be able to perform experiments with the ether altered or absent. Such an ending of the sequence is evidently pure formalism, without physical significance. But other considerations may give physical significance. Thus if there are other sorts of experiment that can be explained by assuming a universal medium of the same properties, the concept proves not only to be useful, but to have a certain degree of physical significance. An example of an inevitable termination of the causal sequence is afforded by the possibility, already mentioned, that the value of the gravitational constant may be determined by the total quantity of matter in the universe. Without further qualification, this is an entirely sterile statement, but if it can be shown that there is a simple numerical connection, the matter takes on interest, and we may seek further for a correlation between the numerical relation and other things.
This analysis of the causality concept does not pretend to be complete and leaves many interesting questions untouched. Perhaps one of the most interesting of these questions is whether we can separate into cause and effect two phenomena which always accompany each other, and whether therefore the classification of phenomena into causally connected groups is an exhaustive classification. But the discussion is broad enough for our purpose here; the most important points of view to acquire are that the causality concept is relative to the whole background of the system which contains the causally connected events, and that we must assume the possibility of an unlimited number of identical experiments, so that the causality concept applies only to sub-groups of events separated out from the aggregate of all events.
THE CONCEPT OF IDENTITY
One of the most fundamental of all the concepts with which we describe the external world is that of identity; in fact, thinking would be almost inconceivable without such a concept. By this concept we bridge the passage of time; it enables us to say that a particular object in our present experience is the same as an object of our past experience. From the point of view of operations, the meaning of identity is determined by the operations by which we make the judgment that this object is the same as that one of my past experience. In practice there are many indirect ways of making this judgment, but I believe the essence of the situation lies in the possibility of continuous connection between the object of the present and the past by continuous observation (either direct or indirect) through all intermediate time. We must, for example, be able to look continuously at the object, and state that while we look at it, it remains the same. This involves the possession by the object of certain characteristics—it must be a discrete thing, separated from its surroundings by physical discontinuities which persist. The concept of identifiability applies, therefore, only to certain classes of physical objects; no one thinks of trying to identify the wind of to-day with the wind of yesterday. It is somewhat easier to identify a liquid such as water in its flow in a stream, because we can make the motion of the water visible by solid particles suspended in it, but even here it is not easy to prove to a captious critic that it is really the water and not the suspended particles of solid that we are identifying. Even solids, when our measurements are sufficiently refined, seem to lose their discontinuous edges, as has been suggested in the discussion of the approximate character of experimental arithmetic, and the identity concept becomes hazy.
There can be no question that the concept of identity is a tool perfectly well adapted to deal approximately with nature in the region of our ordinary experience, but we have to ask a more serious question. Does not the apparent demand of our thinking apparatus to be furnished with discrete and identifiable things to think about impose a very essential restriction on any picture of the physical universe which we are able to form? We are continually surprising ourselves in the invention of discrete structure further and further down in the scale of things, whole sole raison d'être is to be found entirely within ourselves. Thus Osborne Reynolds[12] has speculated seriously and most elaborately about an atomic structure in the ether, and we find Eddington[13] hinting at the existence of structure of an order of magnitude of 10-40 cm. On a much larger scale of magnitude we also think in the same terms, and conceive positive and negative elementary charges with hard and impenetrable cores, which involves a complete change in the law of force at points sufficiently close. What physical assurance have we that an electron in jumping about in an atom preserves its identifiability in anything like the way that we suppose, or that the identity concept applies here at all?
[12]Osborne Reynolds, The Sub-Mechanics of the Universe, 254 pp., Cambridge University Press, 1903.
[13]A. S. Eddington, Report on Gravitation, Lon. Phys. Soc., 1918, p. 91.
In fact, the identity concept seems to lose all meaning in terms of actual operations on this level of experience.
The mind seems essentially incapable of dealing with continuity as a property of physical things; it is not even able to talk about continuity except in negative terms. To each attempted description of the properties of a truly continuous substance, it can say "No, it is not that", but cannot imagine experience which corresponds to what it conceives a really continuous thing ought to feel like. In terms of operations, continuity has only a sort of negative meaning. Now certain implications of this inability of the mind can be removed by appropriate postulates, as, for example, we can postulate the complete annihilation of a negative by a positive charge, as is now being done in certain speculations.[14] There is point in doing this, because the annihilation of two charges has physical meaning. But it is a question whether all implications of this habit of thought can be removed, and whether any picture that we can form of nature will not be tinged—sickbed o'er with the pale cast of thought.
[14]For example, J. H. Jeans, Nat. 114, 828-829, 1924.
The operational view suggests that in this last we are coming perilously close to a meaningless question, although there is a certain sense in which there is meaning here. It may turn out as a matter of fact that we shall not be able to carry our delving into small-scale phenomena deeper than a certain point, and that nature will appear to be finite downward, so that we shall bring up against a wall of some kind. But to ask in such a situation whether we have come to the end because nature is really finite, or whether we only appear to be at an end because of some property of our minds, such as inability to deal with continuity, is, I believe, a meaningless question.
In actual use the identity concept is extended, and identity is used in other senses than the fundamental one examined above. For instance, we speak of two observers seeing the same object, or if the object moves or does something, we may speak of two observers perceiving the same happening. A happening about which the judgment of sameness is possible when perceived by different observers (or mathematically expressed when observed in two reference frames) is what we mean by an event, which is one of the fundamental concepts of relativity theory. What now is involved in this concept of event, or what do we mean when we say that two observers experience the same event? A first crude attempt might say that the event is the same if it is described in the same way by the two observers. But this leads us into all the complicated questions of the meaning of language, which we would gladly avoid, and is furthermore not true, because the whistle of a locomotive, for example, does not have the same pitch for two observers moving with different velocities. A satisfactory analysis of the situation is difficult to give, but I believe the essence lies in the discrete character of the event, just as the identity concept when applied only to objects involved discreteness. The event is bounded on all sides by discontinuities, both in space and time. Now it seems to be a result of experience that discontinuities have a certain absolute significance, in that there is a one-to-one correspondence between the discontinuities observed in any one reference system and those observed in any other. Corresponding discontinuities in two reference systems are by definition the same. An event is by definition the aggregate of all phenomena bounded by certain discontinuities, and two reference systems are by definition describing the same event if the discontinuous boundaries of the event are the same, irrespective of the appearance of the event in the two systems. The emission of a light signal, for example, is an event according to this definition, although it may appear as red light in one reference system and green in another.
We now see that the concept of event is only an approximate concept, as was also that of identity, and for the same reason, namely, there are no such things in experience as sharp discontinuities, but as our measurements become more refined, the edges of supposed discontinuities become blurred. As we go to smaller scales of magnitude this blurring becomes more important, until the physical possibility of performing those operations by which the discontinuities are detected entirely disappears, and the concept of event acquires, in terms of operations, an entirely different meaning. We continue to think of the event in the same way as before in terms of a mental model, but the true operational significance now depends on the particular phenomenon under consideration. The concept of event is really not the same sort of thing when applied to the emission of a quantum of radiation from an atom, or the emission of gamma radiation from a radioactive disintegration, or the flashing of a signal from a dark lantern by opening and closing a shutter. Here as always, when our range of experience is extended, we must be prepared at some future time to find that, by extending the ordinary concept of event to small-scale phenomena by the device of the mental model, we have by implication smuggled into our picture phenomena which do not exist, so that it will be necessary to revise our thinking, casting it into terms corresponding to direct experience.
THE CONCEPT OF VELOCITY
The concept of velocity, as ordinarily defined, involves the two concepts of space and time. The operations by which we measure the velocity of an object are these: we first observe the time at which the object is at one position, and then later observe the time at which it is at a second position, divide the distance between the two positions by the time interval, and if necessary, when the velocity is variable, take the limit. As long as we deal with fairly low velocities we do not have to inquire carefully as to the kind of time we use in these operations, but when the velocities become high, we do have to take care to use the local times at the two positions of the body, which means that we must have a time system spread over space, or, in other words, the "extended" time system. This velocity concept, defined in this way, may be used as a tool in describing nature, and it will be found that nature has certain properties; for example, the velocity of light is 3 x 1010 cm./sec. Further, no material thing can be given a velocity as high as this, but as its velocity is made to approach this value, increments of energy increasing without limit are required.
But now it is very much a question for examination whether the velocity concept defined in this particular way has been chosen wisely as a tool for describing natural phenomena. It is quite possible to modify the velocity concept, that is, to set up other operations which correspond to our instinctive feeling of what velocity is in terms of immediate sensation and such that all numerical measures are unmodified at low velocities.[15] For example, a traveller in an automobile measures his velocity by observing the clock on his instrument board and the mile stones which he passes on the road. This operation differs from that of the definition above in that the time is no longer extended time, but is the local time of the moving object. The space coordinates used in this alternative operation at first seem a hybrid sort of thing, but they are what the observer would actually most naturally use: they are what he would measure with a tape measure fixed to a point of the road and allowed to unwind as he proceeds, or what is measured by a vessel at sea with a log line let out behind.
[15]It is an interesting question for the psychologist whether the velocity concept is not a more primitive thing in order of apprehension than that of time, and whether the concept of time is not derived from observing things in motion, or whether indeed there is any necessary connection at all between velocity and time in terms of untutored experience.
Or there is still another most interesting way of defining velocity, in which the analysis into space and time is not made at all, but velocity is directly measured by building up the given velocity by physical addition of a unit velocity selected arbitrarily. This matter is discussed at some length in my book "Dimensional Analysis",[16] but is of sufficient pertinence here to describe briefly. We may in the first place construct a concrete standard for velocity, as, for example, by stretching a string between two pegs on a board with a fixed weight. If we strike the string, a disturbance travels along the string which we can follow with the eye, and we define unit velocity as the velocity of this disturbance. An object has greater than unit velocity if it outruns the disturbance, and less if it lags behind. We may now duplicate our standard, making another board with pegs and stretched string, and check the equality of the two velocities by observing that the two disturbances run together. We now define two units of velocity as the velocity of anything which runs with the disturbance of the string of the second board, when the second board is made to move bodily with such a velocity that it runs with the disturbance of the first string. The process may be extended indefinitely, and any velocity measured.
[16]Yale University Press, 1922.
If either of these two alternative definitions of velocity were adopted, it would be found that the velocity of light is infinite. Further, there would be no limit to the velocity which can be imparted to material bodies on giving them unlimited energy, which is what we are prepared to regard from ordinary experience is natural and simple. The infinite velocity for light, on the other hand, is most unnatural, particularly if we favor a medium point of view. We are here faced with a dilemma—all sorts of phenomena cannot at the same time be treated simply. If we attach the most fundamental significance to the behavior of material bodies, we shall do well to adopt one of the alternative definitions of velocity. If, on the other hand, we regard the phenomena of light as the most fundamental, we shall endeavor to form our definition so that the properties of light are simple. This was precisely the point of view of Einstein; it is characteristic of his entire scheme of restricted relativity that light is the fundamental thing, and this influenced him in adopting the first definition of velocity. Now one can have no quarrel with this desire to make light fundamental (the wisdom of doing this is to be justified by the results), and if the properties of light are to be treated mathematically, one can easily see the desirability of getting rid of infinite attributes, and so admit the desirability of making the velocity of light finite. But all this involves another very insidious assumption which we ourselves have tacitly used in all our preceding discussion, namely, that the notion of velocity properly pertains to light at all. Einstein has very definitely adopted this point of view, and so determined the character of the entire structure of relativity. I believe, on the contrary, that it is very gravely to be questioned whether the identification of light with a thing travelling, which is involved in applying the velocity concept, should be made. This discussion must be postponed, however, until we deal with the properties of light. The important points for us to notice at present are that the definition of velocity actually used involves the concept of extended time, and that it would be possible to define velocity in different ways, which would give quite a different complexion to phenomena at high velocities, but which would leave untouched our ordinary experience.
The velocities at which the precise form of definition becomes important are higher than can be reached in ordinary mechanical experiments. Such velocities can be attained in terrestrial laboratories only with electrified particles, as in experiments in high vacua or with radioactive disintegrations. It is interesting to notice that we very seldom attempt a direct measurement of velocity in such experiments by following a discrete particle in its flight and finding the time required to pass over a measured distance, but the velocities are measured indirectly, by calculation from the equations of electrodynamics and in terms of such immediately observed things as curvature of path. It is true that one or two experiments have attempted a more direct measure of velocity, but it seems there is room for more work here.
THE CONCEPTS OF FORCE AND MASS
Another concept of great importance is that of force. Since the usual analysis finds a connection between force and acceleration, and acceleration involves velocity, this is a natural place for the discussion of force. This concept has been subjected to much analysis by various writers. In origin the concept doubtless arises from the muscular sensations of resistance experienced from external bodies. This crude concept may at once be put on a quantitative basis by substituting a spring balance for our muscles, or instead of the spring balance we may use any elastic body, and measure the force exerted by it in terms of its deformation. Of course, the various precautions which must be taken in carrying out this idea physically are complex; the matter of precautions against temperature changes, for example, is one of the most easily understood. The concept of force so defined is limited to static systems; it is the task of statics to find the relation between the forces in systems at rest. We next extend the force concept to systems not in equilibrium, in which there are accelerations, and we must conceive that at first all our experiments are made in an isolated laboratory far out in empty space, where there is no gravitational field. We here encounter a new concept, that of mass, which as it is originally met is entangled with the force concept, but may later be disentangled by a process of successive approximations. The details of the various steps in the process of approximation are very instructive as typical of all methods in physics, but need not be elaborated here. Suffice it to say that we are eventually able to give to each rigid material body a numerical tag characteristic of the body, such that the product of this number and the acceleration it receives under the action of any given force applied to it by a spring balance is numerically equal to the force, the force being defined, except for a correction, in terms of the deformation of the balance, exactly as it was in the static case. In particularly, the relation found between mass, force, and acceleration applies to the spring balance itself by which the force is applied, so that a correction has to be applied for a diminution of the force exerted by the balance arising from its own acceleration.
We now extend the scope of our measurements by bringing our laboratory into the gravitational field of the earth, and immediately our experience is extended, in that we continually see bodies accelerated with no spring balance (that is, no force) acting on them. We extend the concept of force, and say that any body accelerated is acted on by a force, and the magnitude of this force is defined as that which would have been necessary to produce in the same body the same acceleration with a spring balance in empty space. There is physical justification for this extension in that we find we can remove the acceleration which a body acquires in a gravitational field by exerting on it with a spring balance a force of exactly the specified amount in the opposite direction. This extended idea of force may also be applied to systems in which there are electrical actions.
We thus see that in extending the notion of force from bodies in rest to bodies in motion, the character of the concept has changed, because the operations by which force is measured change—the force acting on a body is now measured in terms of its acceleration. But in determining the force from the acceleration, we have to know the mass. This mass has to be independently measured with the original concept of force; otherwise we have no basis for such simple statements as that the force of gravity on a body is proportional to its mass. All this applies to the ordinary range of experiments with low velocities. If now we extend the range of measurements, we find phenomena which we had not expected; for example, there seem to be difficulties in the way of indefinitely increasing the velocity of a material body, as of a charged atom. We begin to ask searching questions: is the force of gravity independent of velocity at high velocities, or is the mass independent of velocity under the same conditions or independent of the gravitational field, etc.?
In attempting to answer these new questions, we find difficulty with the concepts in terms of which they are formulated. There are no operations by which we can find whether force is independent of velocity unless we first know the mass, or any operations by which a mass can be measured unless we know a force. The purely mechanical systems with the highest velocities of which we have any experimental knowledge are the heavenly bodies. The motion of these is, with the important exception of Mercury, that predicted by the ordinary laws of mechanics, so that at first it might appear that we have here confirmation of the laws of mechanics for bodies with comparatively high velocities. But it must be remembered that all we can observe of the heavenly bodies is their positions, and that we cannot perform on these bodies all the operations by which we can check the laws of mechanics for terrestrial phenomena. If, for example, mass and the force with which gravity acts on mass were both equally affected by velocity, the motion of the heavenly bodies would be exactly the same as that observed now. Hence as we increase the range of velocity, the concepts of force and mass simultaneously lose their definiteness, and become partially fused. This is typical of what we have now come always to expect near the limit of the experimentally attainable; experience becomes less rich, the choice of physical operations more restricted, concepts change and become fewer in number. If we are to retain the same formal number of concepts, we must introduce arbitrary conventions or definitions. These definitions are to be determined largely by convenience. In the case of mechanical systems, this motive of convenience is supplied by considerations from outside the domain of mechanical phenomena. The highest velocities of practice are not reached in mechanical, but in electrical systems, in experiments with vacuum tubes, etc. Considerations of convenience are therefore dictated from the electrical point of view. These considerations will be gone into in much more detail later; the conclusion is all that we need here, which is that it is convenient to assume for the charge of the electron a constant number, independent of the velocity, and this involves making its mass variable in a definite way with velocity. Now if the principle of relativity is accepted, the mass of mechanical objects must vary with velocity in the same way as the mass of electrical charges. Since the variability of this latter is fixed, mechanical mass becomes a definite function of velocity, and the force is therefore also fixed in any specific physical case.
The fundamental definition of force given above is highly academic, involving as it does hypothetical experiments in laboratories situated far out in empty space. Some sort of procedure like this seems to correspond to more or less explicit statements to be found in the literature of mechanics. The meaning in terms of actual operations to be given to such definitions involves complicated inferential reasoning. We would make much closer connection with the conditions of actual experiment if in the definition we substituted for the hypothetical operations in empty space more or less approximately realizable operations on bodies sliding on level table tops without friction. I suppose our instinctive feeling for the laws of mechanics is such that we are convinced that definitions in terms of an interstellar space laboratory or a level table top are actually the same. But in principle we must recognize that when the operations are different, the concepts are different, and if we adopt something equivalent to the table top definition, as it seems we are physically forced to do, we must leave open in our thinking the possibility of finding in the present penumbra, when our accuracy is sufficiently increased, such phenomena perhaps as directional attributes of mass in a gravitational field.
We have just considered the sort of problem that we encounter on ordinary scales of magnitude on going from low to high velocities; what becomes of the concepts of force and mass when we go to a very small scale? Down to the atomic scale we may at least slur over the new physical difficulties, for although we cannot of course experiment with actual atoms, we can nevertheless make measurements of the Brownian[17] movement of suspensions in liquids settling in a gravitational field, for example, and the extrapolation to the atom is not a very great one. The mass of each individual atom is obtained by what is equivalent to a process of counting, assuming the law of conservation of mass on an atomic scale. This is justified by all chemical experience. The mass of the component parts of the atoms, the electrons, may also perhaps be given a unique significance after we have decided on the laws of the electrical field, by experiments on acceleration in electrical fields. The question which interests in principle here is what meaning, if any, shall be attached to the mass of the elements of the electron.
[17]This phenomenon is discussed at length in the book by J. Perrin, Brownian Movement and Molecular Reality, translated by F. Soddy, Taylor and Francis, London, 1909.
It is evident that we here go beyond any possible experience, at least for the present, and that experience has again become poorer and our concepts fewer in number. All that we can now demand is that certain combinations of numbers, some of which represent mechanical mass and others electrical charge, have proper relations to each other when integrated throughout the entire body of the electron. Similar questions confront us when we ask what are the forces which the parts of the electron exert on each other. We return to this question in considering the nature of the electrical concepts. In any event, the concepts of both force and mass are entirely altered in this domain.
It is interesting to note, in passing, that present electrical theory gives no meaning to the mass of the elements of the electron, since the total electromagnetic mass of the electron is built up from the mutual terms in the action of the elements—the total mass is not a linear resultant of the action of the elements.
THE CONCEPT OF ENERGY
In examining the concept of energy, we start with purely mechanical energy. In isolated mechanical systems, in which there are only conservative forces, the sum of kinetic and potential energy is constant. The kinetic energy may be defined as ∑ ½ mv2, formed for all parts of the body. The potential energy is determined by the position of the parts of the system, and has physical significance only with reference to a datum position, that is, only changes of potential energy have meaning in terms of operations. The total energy ascribed to the system has therefore an element of arbitrariness in that the datum position may be chosen at random, and energy acquires meaning only on tracing the history back to the epoch of the datum position.
The concept of energy may be extended from mechanical systems to all systems with which we are acquainted; the operations by which meaning is given to the extended energy concept involve the generalized conservation principle, or the first law of thermodynamics. The extension to thermal systems is immediate; the inclusion of optical and electrical systems in the scheme was a most important physical step, which of course required careful experimental justification. Because of its wide range of application, the energy concept has now come to be regarded as one of the most important in physics; this idea was held by Ostwald[18] twenty and more years ago, and is now much to the front because of the connection between mass and energy indicated by the theory of relativity, and the important rôle assigned to energy levels in spectrum analysis.
[18]W. Ostwald, Die Energie, Barth, Leipzig, 1908.
What now is the precise nature and significance of the general energy concept? In the first place the conservation property of energy is one of the simplest and most obvious of the properties of matter, so that in this property of energy is seen a reason for ascribing to it certain of the properties of matter, in particular and most important, that of localization in space. We must recognize, however, that this idea of a location in space is injected into the situation entirely by ourselves, and corresponds to nothing directly given by the operations of experiment. The idea has had a most important effect, however. Witness, for instance, the importance ascribed to the discovery by Kelvin of a function by which the total energy of an electric field can be represented as distributed through space;[19] this was one of the most important props of the medium point of view.
[19]This function is ⅛π times the scalar product of electric force and displacement. If Maxwell's definition of displacement is adopted, the factor ⅛π is replaced by ½, and an accurate analogy results between the energy stored in the ether and the elastic energy stored in a bent spring.
A more critical examination is likely to diminish considerably our satisfaction with this naive analogy drawn between matter and energy. With regard to matter, we may still be tolerably satisfied with our ascription to matter of location in space, but it is quite different with regard to conservation of matter. In just what sense is matter conserved? Certainly not in terms of mass, as we at one time thought. Nevertheless we undeniably have a feeling that there is some sort of conservation property here, and are driven to formulate it badly in terms of a hypothetically constant number of protons and electrons. I have long thought that Newton was groping after some very similar idea when he so far forgot himself as to define mass as quantity of matter, a definition perfectly meaningless to a rigorous and unsympathetic interpretation. On the other hand, whatever meaning may reside in our idea of conservation of matter, it certainly is not, in at least one important respect, like the conservation of energy. For the energy of an isolated mechanical system is a function of the frame of reference in which it is described; merely by giving velocity to the reference frame and altering in no way the mechanical system we may change its kinetic, and so its total, energy by any direct amount. This does not even remotely resemble ordinary matter. I cannot see that the operations which are equivalent to the energy concept justify us in saying more than that energy is a property of a material system; the operations do not seem to give any unique meaning to a location associated with energy.
We now ask what significance is to be ascribed to the sort of conservation that energy does have. We restrict ourselves first to mechanical systems. The motions of a mechanical system satisfy certain differential equations of the second order, and the actual motion is to be found by an integration of the equations. In the integral of a differential equation certain constants appear which are determined by the initial conditions, and are therefore the same during all the future motion of the system; obviously these constants of motion correspond to conservative properties. This reasoning can of course be at once extended. Any system, mechanical or not, whose motion is determined by differential equations, will have certain conservative properties. For the systems of mechanics energy is one of the conservative functions; others are momentum and moment of momentum. Energy is particularly simple, in that it is connected with measurable properties of the system by a simple formula (∑ ½ mv2), and is furthermore scalar, which is also a property of quantity of matter. But to go further and ascribe to energy other properties of matter, such as localization in space, is entirely overlooking the essential difference in the character of the operation by which matter and quantity of energy are measured, that is, overlooking the essential difference in their physical character.
The possible extension of the energy concept from mechanics to thermodynamics receives a sufficient physical explanation in terms of our views of the essentially mechanical character of thermal phenomena. That the idea can be extended also to simple electrical or magnetic systems, in which the effect of velocity of propagation is neglected, is a consequence of the fact that in these systems the equations of motion remain of the same general mechanical type, it having been shown by Maxwell that the equations of such systems may be written in the generalized Lagrangean form. When, however, we extend our formulas to systems in which the velocity of propagation is important (that is, when we consider the field equations in their general form) we find that the Lagrangean equations no longer apply to matter taken by itself, and energy is no longer conserved in the original sense. A new function appears, however, which behaves mathematically in the same way that the energy did before. The equations of motion of the system remain Lagrangean in form if the mechanical parts of the system are supplemented by the electric and magnetic fields in space. In this extended form we have, therefore, a conservative function as before, and the energy concept may be retained in this enlarged aspect. The physical operations by which energy is determined are entirely altered, however, and the physical character of the concept is changed. No more than before is there justification for localizing energy in space, or ascribing to it other properties of matter. Yet the materialization of the energy concept, and the consequent desire that energy be localized in space, is one of the strongest arguments in many minds for the existence of a medium.
As far as I can see, therefore, the existence of conservative functions is involved in the possibility of describing natural phenomena with differential equations. That further there is a conservative function of the precise form found in mechanics is a consequence of the particular form of the equations and the nature of the forces. The question of the significance of the fact that the forces of nature appear to be conservative, with respect to this particular function of mechanics, is of much interest, but it is not our immediate concern now. We are interested rather to ask under what general conditions we shall have conservative functions. Quantum theory strongly suggests that when we pass to phenomena on a small enough scale, we may no longer be able to employ differential equations in our descriptions, and hence the previous reason for the existence of constants connected with the motion disappears. Now there is one obvious remark to be made about this more general situation. Whenever the future history of a system is so connected with its present condition that we can retrace our way to the present from any future configuration, we shall always have conservative functions. For any future configuration contains certain fixed (or conservative) features, in that we can reconstruct the unique present from any future state. There is no reason to expect that the operations by which we find the fixed features will always be simple, as in the mechanical case. Now the determination of the future by the present, and conversely the possibility of reconstructing the present from the future (or the past from the present), is, we are convinced, a property which is at least approximately true of phenomena down to a smaller scale of magnitude than we have yet reached, and so we expect to find these conservative functions in systems whose ultimate laws of motion are much more general than any with which we are yet familiar. The particular form of the conservative function depends on the character of the system. That there is a scalar conservative function for ordinary systems depends of course on particular properties of the system, but we are at least prepared to find that a scalar conservative function does not necessarily mean a differential equation of the second order.
The potential energy of a system has a particular significance with respect to this point of view. In an ordinary mechanical system, the potential energy simply measures the work done by the applied forces in being displaced from the initial to the final positions; that is, the potential energy is a measure of the deviation from the initial position, and so measures a certain feature of the history of the system. In the more general system, in which we may not have differential equations, we may look for something analogous to the potential energy which shall measure the displacement of the system from its initial configuration. Such a measure is always possible as long as the past can be reconstructed from the present (or the present from the future). We recall a remark of Poincaré's[20] to the effect that we of necessity must always have conservation, because if we have a system in which conservation apparently fails, we merely have to invent a new form of potential energy. This remark is obviously not of entire generality, but applies only to such systems as those considered here, in which the past may be reconstructed from the present.
[20]Henri Poincaré, Wissenschaft und Hypothese, translated into German by F. and L. Lindemann, Teubner, Leipzig, 1906, Chap. VIII.
Of late there has been much discussion of the advisability, on the basis of certain quantum phenomena, of giving up conservation as a principle applied to the details of the emission and absorption of light, retaining it only in a statistical sense. It seems to me that the question here in the minds of physicists was always merely one of convenience, and that few, if any, doubted the ultimate applicability of the principle of Poincaré, or thought that we were here concerned with a system of such great generality that the past could not be reconstructed from the present. The question was merely whether those variables in terms of which the potential energy is defined make close enough connection with other things of immediate experimental significance, or whether on the whole the retention of a potential energy is not more trouble than is justified by its convenience, making a treatment from the statistical point of view preferable. However, this is all now a matter of more or less past history, because with the recent extension of the experiments of Compton,[21] we seem to have experimental evidence for the validity of the conservation law in detail for elementary quantum processes, with a corresponding simple potential energy.
[21]W. Bothe and H. Geiger, 2S. f. Phys. 32, 639-663, 1925. A. H. Compton, Proc. Nat. Acad. Soc., II, 303-306, 1925.
Going still deeper, however, there are quantum phenomena which still may have to be treated by statistical methods, and this may mean giving up conservation in detail. We have no experimental evidence, for example, of what the electron is doing while jumping from one quantum orbit to another. A situation like this merely means that those details which determine the future in terms of the past may lie so deep in the structure that at present we have no immediate experimental knowledge of them, and we may for the present be compelled to give a treatment from a statistical point of view based on considerations of probability. But I suppose that no one, except perhaps Norman Campbell,[22] will maintain that such a situation is more than temporary, or will cease to search for consequences of these details of structure which may be open to experimental verification.
[22]Norman Campbell, Time and Chance, Phil. Mag. I, 1106-1117, 1926.
Similarly, we cannot permanently be satisfied with a picture of radioactive phenomena which represents radioactive disintegration as a matter of chance.
The general conclusion to which all this discussion leads is that energy is probably not entitled to the fundamental position that physical thought is inclined to give it, but that it is a more or less incidental consequence of more deep-seated properties, and that the character of these deep-seated properties is subject to only the most general restrictions, so that very little about the nature of the details can be inferred from the existence of any energy function.
THE CONCEPTS OF THERMODYNAMICS
We shall not be concerned here with the many technical questions which are the proper subject of treatises on thermodynamics, but shall attempt an examination only of some fundamental concepts.
The most fundamental of these, which sets thermodynamics off apart from the simpler subjects, is probably that of temperature. In origin this concept was without question physiological, in much the same way as the mechanical concept of force was physiological. But just as the force concept was made more precise, so the temperature concept may be more or less divorced from its crude significance in terms of immediate sensation and be given a more precise meaning. This precision may be obtained through the notion of equilibrium states. We have in the first place the fundamental experimental fact that when a small body is placed inside a large system, which we recognize by crude means as comparatively invariable in temperature as time goes on, the small body very soon acquires a steady condition, that is, it comes to equilibrium with its surroundings. We now have the further experimental fact that if the small body A is in equilibrium with its environment, and body B is also in equilibrium with the same environment, there will be no change of condition of A and B when they are brought into contact with each other—that is, A and B are each in equilibrium with the other and also with the environment and therefore, by definition, at the same temperature as the environment. The temperature of the environment is now measured in terms of some of the properties of A and B which crude experience has shown change with the physiological temperature of A and B. The physiological notion of temperature is thus made more precise by being connected with the physical phenomenon of equilibrium.
Now it is at once evident that stated in this way without qualification we have said things that are not true. It is not true in general that, when A is in equilibrium with an environment and B is in equilibrium with the same environment, A will be in equilibrium with B. Suppose, for example, that the environment is a stream of water and A is a tiny water wheel moving freely in its bearings, and that B is a similar wheel with much friction. Then we know that B will become warm, and will not be in equilibrium with A when brought into contact with it. Or we may choose for A a mercury thermometer with bulb covered with putty, and for B a similar thermometer with bulb sheathed in platinum, and we know that the two thermometers will not register the same temperature in the water stream. Or still more simply, we may try to read the temperature of the air in our garden on any bright day with a silvered and with a blackened bulb thermometer; we know that the two thermometers will read different temperatures. It is evident, therefore, that we shall have to specify much more carefully the conditions under which equilibrium holds if we are to give precise significance to the temperature concept.
It seems fairly evident in the first place that we shall have to rule out systems in which there is large scale mechanical motion; the simple notion of temperature does not apply to a system moving with respect to us. Only when the two thermometers A and B move with the same velocity as the stream do we have three-fold equilibrium between the stream, A and B. We may state this in another way by saying that the temperature of a moving body must be measured on a thermometer stationary with respect to the body, but this is only a matter of words, and properly speaking the temperature concept applies only to a certain aspect of the relation between two bodies mutually at rest. We here entirely neglect relativity questions such, for example, as the proper way of correcting for the change of dimensions of moving thermometers.
If now the body whose temperature we are measuring does not move with the same velocity in all its parts, we may still give a meaning to local temperature by dividing the body into parts so small that the velocity of each part is essentially uniform, and measuring the temperature of each part with a thermometer stationary with respect to it. We are now confronted with the question of how far to carry the process of subdivision. Suppose we have a fluid whose motion is completely turbulent when measured with instruments of the ordinary scale of magnitude. For such a fluid the fundamental equilibrium proportions hold between two measuring bodies A and B and the fluid, provided that the bodies A and B are so large that the motion is completely turbulent on their scale of magnitude. We may then define the temperature of the turbulent fluid from the standpoint of these large scale bodies. But we may also define the temperature from the small scale point of view as the average of the temperatures recorded by sufficiently small thermometers, each moving with the velocity of a local bit of the fluid. These two temperatures will in general be different, and we must more or less arbitrarily select one which we define as the true temperature. It would seem that the small scale temperature is the better one to choose, because there is a certain degree of arbitrariness in specifying the scale from which the motion shall be judged completely turbulent But on the other hand, there are difficulties in the small scale definition, because the turbulence may become more and more fine grained, until we end with the motion of the molecules themselves, when the operations certainly fail which give meaning to the temperature concept. In this case of molecular turbulence, we are driven back to the large scale definition, which obviously corresponds to ordinary physical practice.
It appears then that the temperature concept is not a clean cut thing, which can be made to apply to all experience, but that it is more or less arbitrary, involving the scale of our measuring instruments. In any special case, the meaning of the temperature concept must be set by special convention. In practice this does not often make difficulty, because in the majority of cases there is no large scale motion with respect to the thermometers.
Consider now the other aspect of the equilibrium problem suggested by the thermometers with blackened and silvered bulbs in the sunshine. Our common experience tells us how to deal with this situation effectively enough for ordinary purposes. We recognize that the possibility of temperature equilibrium is disturbed by the radiation, and we protect the bulbs of the thermometers from the sun's radiation by appropriate shields. But this only minimizes the difficulty. For the shield is warmed by the sun, and in turn warms to a less degree by its radiation the bulb of the thermometer within. We must recognize that every body, no matter what its temperature, is always emitting radiation, so that the bulb of our thermometer is always in a radiation field. At first this puts us in a serious quandary as to the whole question of equilibrium and the meaning of temperature. The situation is saved by the experimental observation that there is a particular radiation field which affects all thermometers equally, namely, the field inside an infinite body all at the same temperature. Logically this looks like the vicious circle again, for we have not yet defined what we mean by the same temperature. But actually we avoid the circle here, as in so many other physical cases, by a process of asymptotic approximation. The procedure is perhaps something like this: we find that if we experiment with larger and larger bodies, isolated and at great distances from other bodies at approximately the same temperature as judged by crude physiological sensations, two thermometers, identical except that the bulb of one is blackened and that of the other is silvered, record more nearly the same temperature as time goes on and as the thermometers are sunk to greater depths in the body. In actual practice, of course, the radiational opacity of most bodies is so high that these precautions against the effects of external radiation can usually be entirely ignored. At high temperatures, on the other hand, radiation has to be explicitly dealt with.
The conclusion for us from these considerations is that operationally the concept of temperature is tied up with that of radiation—the equilibrium concept of temperature is strictly never exactly applicable; it is only a limiting sort of concept applicable when the radiation field is of a special sort, namely, that of a black body.
In spite of the explicit recognition which we have to give radiation in defining temperature, we usually entirely lose sight of it in thinking about the mechanism of ordinary physical processes, as for instance when we picture the temperature of a gas as determined by the kinetic energy of its molecules. Now I have no doubt that negligence of this sort can be justified, but the necessary logical analysis is apparently complicated, and involves a great many different sorts of experiment by methods of asymptotic approximation, by which we establish the existence of various sorts of physical constants, such as constants of emission and absorption and reflection and scattering and fluorescence and thermal conductivity. We do not need to make the analysis here, but I believe that some time it would be worth while to attempt it. Such an analysis will justify the principle so often used: that if a body is in thermal equilibrium the various processes involved, such as radiation or thermal conductivity, must when taken separately also be in equilibrium. Doubtless, if our experience had been confined to higher temperatures, like that of the sun, this notion of different mechanisms acting independently would have been more difficult to acquire.
We next consider another fundamental concept of thermodynamics, that of quantity of heat. We are at first perhaps inclined to think of this as a comparatively straightforward concept, given immediately in terms of experience, but an analysis of the operations by which we measure quantity of heat will show that the situation is really most complicated. Consider, for example, Joule's experiment in which the mechanical equivalent of heat was measured by determining the rise of temperature of the water in a container when stirred by paddles driven by a falling weight. We do not question that the rise of temperature of the water has its origin in the mechanical work done on it by the paddles. But what about the rise of temperature of the container? We shall doubtless say that part of this rise comes from heat communicated to it by the warmer water in contact with it, and part from mechanical work done on it by turbulent impact of the water. But by what operations shall we measure what part of the energy communicated to the container is heat and what part mechanical work? We try to give an idealized answer to this question in terms of Maxwell demons stationed at all parts of the boundary of the containing vessel with small scale measuring instruments. To measure the heat entering at any point I can see nothing else for the Lilliputian observers to do but to determine the temperature gradient at every point of the boundary from temperature observations at two different levels, and calculate the heat inflow from the gradient and the thermal conductivity of the material of the walls—there seems no way of measuring a flow of heat as such. The inflow of mechanical energy must be calculated from a detailed knowledge of the elastic waves and other large scale deformations of the walls. Here again there is an arbitrary element in our procedure; if our mechanical measuring instruments are on too gross a scale, we may miss mechanical energy which we would catch with finer instruments.
This situation which we have just submitted to detailed analysis is, I believe, typical of the general situation; it is not possible in the general case to find anything which we can call heat as such. Without further explicit examination, we can unambiguously speak of a body losing or gaining heat only when there has been no energy interchange of any other sort with other bodies. In such a case the heat is measured in terms of the temperature change of the body. The heat concept is in the general case a sort of wastebasket concept, defined negatively in terms of the energy left over when all other forms of energy have been allowed for.
The essential fact that a quantity of heat can by itself be defined only in terms of a drop of temperature is somewhat obscured by the usual method of thermodynamic analysis. In describing a Carnot engine, for example, it is specified that the engine shall work between a source and a sink so large that their temperature is not changed by the heat given out or absorbed by them, so that the impression is natural that heat may in some way be measured apart from temperature changes. This of course is not the case; we merely require that the source and sink be so large that their temperature changes are of a different order of magnitude from those in the working substance itself, so that with respect to the working substance, source and sink may be considered to be at constant temperature.
Assuming now that we are able to measure quantity of heat in those cases in which the concept has meaning, let us examine the first law of thermodynamics, which we write in the form:
dQ + dW = dE
Here dQ is the heat imparted to a given body by other bodies, dW is the work of all kinds done on it from outside, and dE is the increase of internal energy. Now if this equation says what appears at a naïve first glance, it should say that we find experimentally that the relation written always holds between the measured quantities dQ, dW, and dE. We have seen that in the general case it is not possible to assign a unique operational significance to dQ and dW, and presumably not to their sum. We ignore for the present difficulties of this kind and confine attention on dE; how shall we measure it? I believe it does not take much examination to convince us that there are no physical operations for measuring dE as such, and that therefore the equation expressing the first law must have a different significance from that which appears on the surface. This is often recognized in the statement that the essence of the first law is that dE is an exact differential determined only by the variables which fix the internal condition of the body, and not a function of the path by which the body is carried from one condition to another. But what shall we mean by internal condition, and how shall we be sure that we have found all the variables required to specify it completely? Internal condition may be a most complicated thing and require many variables, as shown by a piece of iron with a complicated magnetic history or by a piece of aluminum about to undergo recrystallization after overstrain. Here again I believe there is no physical procedure by which general meaning can be given to this concept of internal condition. In specific cases we can state what the variables are which determine internal condition, and the criterion that we have found the correct internal variables is that dE shall be a complete differential in terms of them. The first law of thermodynamics properly understood is not at all a statement that energy is conserved, for the energy concept without conservation is meaningless. The essence of the first law is contained in the statement that the energy concept exists (or has meaning in terms of operations).
The first law is often thought to be one of the most general of physics, but in a paradoxical sense it is the most special of all laws, because no general meaning can be given to the energy concept, but only specific meaning in special cases. The first law owes its complete generality to the fact that no specific case has yet been found of so broad a character that it cannot be included under one or another special case.
Examination will at once justify this view. Thus we find a great many systems which are adequately described in terms of two variables, pressure and temperature, in that a function of p and t can be found such that its differential equals dQ + dW. There are other systems in which the six components of stress and t completely fix the internal condition in the sense that they determine a dE. In other systems the specification of a magnetic field may be necessary, or an electric, or a gravitational field. No case is known which cannot be handled in terms of the action of external forces of the proper kind, but there is no general procedure, and the first law owes its generality to the exhaustive cataloging of special cases.
We may now return to the question left in abeyance above of the ambiguity in dQ + dW. In all the cases in which the specific variables can be found which define dE, dQ and dW also have meaning. Consider, for example, a gas, the internal condition of which may be characterized in terms of t and p. The mere fact that the internal condition can be specified in terms of two variables, one a mechanical variable, shows that the substance is mechanically homogeneous. Being mechanically homogeneous, we do not have the possibility of ambiguous values of dW varying with the scale of the measuring instruments, and in fact we know that dW = p dv. Similarly the gas being homogeneous and at rest as a whole allows unique values for dQ. Of course this cannot obscure the physical fact that even in such a gas, when we go to a small enough scale, we find inhomogeneities arising from the Brownian movement, etc. Practically our statement means that the inhomogeneities are so fine grained that over a very wide range of scale of the measuring instruments we find the same definite results. The same sort of considerations apply to more complicated systems. If dE is a complete differential in terms of t and six stress components, this means again that the body is homogeneous, its condition is determined by temperature and stress, which are the same throughout the body, and again there is no possible ambiguity from the scale of the instruments which measure dW and dQ. It seems in general, then, that if the body allows operations by which dE acquires meaning, at the same time dQ and dW are provided for. In working out this idea in full detail, some care must be given to the question of order of differentials. dQ, for unit time and unit volume, is strictly equal to k∇2t, where k is thermal conductivity, so that in determining dQ the second derivatives of temperature are involved.
If the body is obviously not homogeneous, it is still a matter of experience that it can be divided into small pieces, each of which are by themselves sufficiently homogeneous, and the first law in its usual form may be applied to each of the pieces.
Finally, we emphasize a fact already implicitly mentioned, namely, that no physical significance can be directly given to flow of heat, and there are no operations for measuring it. All we can measure are temperature distributions and rates of rise of temperature. As at present defined, a heat current is a pure invention, without physical reality, for any determined heat flow may always be modified by the addition of a solenoidal vector, with change in no measurable quantity. If someone states that throughout all space there is a uniform heat current of 106 cal./cm.2 sec. in the direction of Polaris, no disproof can be given, for such a stream is solenoidal, and as much heat flows out of every closed surface in unit time as flows in. Such a solenoidal flow is meaningless in terms of operations; we could give meaning to such a flow only in terms of some slight modification of the solenoidal condition introduced by the measuring instrument. In all ordinary conditions the flow of heat given by the simple relation q = k Grad t corresponds exactly to what our atomic pictures lead to expect in those cases where the details of the picture can be worked out. But there may be cases where it is advantageous to supplement the ordinary heat flow (= k Grad t) by the pure fiction of a solenoidal flow, because in this way it may be possible to account for new phenomena which appear when the solenoidal conditions are slightly departed from. Thus if in a conductor at uniform temperature carrying a steady electric current we say that a heat current is also flowing proportional to the electric current and therefore solenoidal, we may provide the possibility for a simple correlation of phenomena found under those more complicated conditions when an electric current flows in a conductor of non-uniform temperature in a magnetic field. If it should turn out that the heat current is uniquely determined by considerations of this character, then we have taken the first step away from the pure formalism which this sort of thing otherwise is in the direction of giving physical reality to the invention of "heat current."
There are other interesting questions of a fundamental thermodynamic character, such for example, as whether the entropy concept has any general significance apart from the scale of our measuring instruments, and what is the operational significance of applying thermodynamic concepts to radiation, but we shall not consider these questions here.
ELECTRICAL CONCEPTS
We now set ourselves the problem of finding the meaning of the various concepts in terms of which we describe the behavior of electrical systems, assuming that we understand what we mean by "electrical." We start with the simplest electrical systems, namely, those in which we deal with static phenomena on a large scale. In such systems there are independent physical operations by which we may find the magnitude of any charge, provided that it is effectively concentrated in a geometrical point. The measurements involved in these operations are measurements of ordinary mechanical forces; we assume that our knowledge of mechanics has already taught us how to make such measurements. An electrically charged body experiences forces, which may be measured by tying a string to it and pulling on the string with a spring balance hard enough to keep the body in equilibrium. Three charges are numerically equal if when each is placed at unit distance from another, in the absence of the third (or other charge), the forces are always the same. If furthermore the forces are of unit magnitude, the charges are defined as unit charge. Having obtained unit charge, we define the magnitude of any other charge as equal to the force which it experiences when placed at unit distance from unit charge. This of course is all very trite; the important thing for us is merely that magnitude of charge, or quantity of electricity, is an independent physical concept, and that unique operations exist for determining it. These operations presuppose the ability to perform certain operations of mechanics. Having now learned how to measure electrical quantities, we discover experimentally the inverse square law of force, and later arrive at the concept of the electric field. As we have seen, the field is an invention; here we shall use this concept only for the purpose for which the invention was made, and shall not involve ourselves in any of the implications of ascribing physical reality to the field. Notice that as long as we deal only with point charges we do not have to define field strength in terms of the limiting procedure of making the exploring charge smaller, for the limiting small charge is necessary only to avoid the reaction of the exploring charge on the positions of the charges which generate the field. All this again is trite; the important point is that the operations by which the inverse square law and the concept of the field are established presuppose that the charge is given as an independent concept, since the operations involve a knowledge of charges. The operations also involve the measurement of forces by the ordinary static procedure of mechanics with spring balances. With the means now at our command we establish one very important property of electric charges, namely that the total amount of charge on an isolated body of finite size is conserved, no matter how the charge is forced to rearrange itself by the motion of charges on adjacent bodies.
By procedures exactly like those outlined above, we may treat all the corresponding magnetic quantities; there is formal parallelism between the two sets of phenomena, but there is the physical difference that we have to realize a single magnetic pole by the device of using a very long slender magnet.
We now give our electrical system more freedom, in that we allow the charges to be in motion with respect to each other. Perhaps the most immediate question which we now have to ask is whether charge continues to be conserved when set in motion, or whether the total charge on an isolated body is a function of its velocity? To answer this question we must generalize the procedure by which we assigned a numerical value to a stationary charge. Perhaps the simplest way is to allow two unit charges each to move with constant velocity, remaining at unit distance apart, and measure with a spring balance the force required to keep them at constant distance apart. Now we immediately find that the force is altered under these conditions, so that our first impulse is to say that the charge is a function of the velocity. But as we experiment further, we find that the state of affairs is very complicated; the force between the two charges at any moment of their motion depends not only on the charges, their distance apart, and their velocities, but also on the angle between the line joining them and the direction of motion in the lines. Further experiment of other kinds yields other information; it requires a force to maintain a charge in uniform motion in a magnetic field, or to maintain a magnetic pole in motion in an electric field. A moving electric charge exerts a force on a stationary magnetic pole, so that by definition the moving charge is surrounded by a magnetic field, and similarly a moving magnetic pole is surrounded by an electric field. Returning to our two moving electric charges, we are impelled to ask whether, if all these complications are possible, the numerical constant (unity for static charges) in the inverse square law of force is a function of velocity as well as the magnitude of the charges themselves? If we broaden the question in this way, as we apparently must, our problem becomes indeterminate, for we are trying to answer two different questions with a single kind of measurement, namely of the force between moving charges. I have had no better luck on trying other methods of measurement. Apparently the operations do not exist by which unique meaning can be given to the question of whether the magnitude of a charge is a function of its velocity. On realizing this situation, we are at first embarrassed to know how to proceed, but we reflect that the embarrassment is not of our own making, but corresponds to a physical fact. The concept of charge as a unique and independent thing essentially pertains only to static systems. We may extend the concept to moving systems if we wish, as a matter of convenience to ourselves, but must recognize that such an extension is an invention of ours and not a reality of nature. Now we do make such an extension, and we make it in the simplest possible way, that is, we define the charge on an isolated body in motion as that which we should find on it if we reduced it to rest and made measurements according to the regular static procedure. That this is a convenient thing to do depends on the experimental result that the charge so found is independent of the way in which velocity is imparted to or removed from the body; in other words, whenever the body is reduced to rest, the same charge is always found on it.
Although this is pure definition on our part, it turns out to have a most simple and convenient connection with experimental facts which were discovered after the decision to treat a moving charge in this way was made; the discovery is of the atomic structure of electricity. If then we agree to call each elementary charge a constant independent of the velocity, the total charge on a body becomes merely proportional to the count of the total number of atomic charges on the body, which is certainly highly convenient and suggestive.
Having now fixed what we mean by the magnitude of a moving charge, we are ready to turn to the general problem of the behavior of any system of charged bodies in motion. For the present we consider only phenomena of the scale of everyday experience. The most general problem that has meaning here is to determine all measurable properties of the system in terms of those data which experiment shows can be arbitrarily specified. Now we have already emphasized that the electromagnetic field itself is an invention, and is never subject to direct observation. What we observe are material bodies, with or without charges (including eventually in this category electrons), their positions, motions, and the forces to which they are subject. The forces are to be measured according to definition in mechanical terms, either by the strains in members of a framework if the system is in equilibrium, or in terms of accelerations and masses if it is not in equilibrium. The electromagnetic field as such is not the final object of our calculations, but the calculation of it is only an intermediate auxiliary step, convenient to make because our mathematical formulation gives so simple a connection between electromagnetic field, charges, and mechanical action that the latter can be calculated at once in terms of the former. In fact the connection is so simple that in many cases we have come to regard our problem as solved if we can compute the electromagnetic field, overlooking the fact that the field has no immediate meaning in terms of experience.
Electromagnetic theory now presents us with a solution of the general problem; this solution is contained in the four-field equations of Maxwell, the constitutive equations, and those additional equations (quite often lost sight of) which give the forces exerted by the field on electric charges, or currents, or dielectrics. Let us inquire how we may set about testing the physical correctness of these equations. We may begin with one of the simplest possible tests, and inquire whether the equations are correct in stating that the force acting on a charge moving in an electric field is simply the product of the charge and the field strength. This, on the face of it, is a surprising statement. The field itself is affected by the motion of the charges which generate it, and it is natural to expect a converse effect. If, furthermore, we have sympathy with the medium point of view, it is easy to think that whatever it is in the medium that gets hold of a charge and exerts a force on it will find it harder to take hold when the charge is in motion.
In attempting to check our statement experimentally, the only additional complication, as compared with the static case which we have already checked, is afforded by the motion of the charge, for we have defined the magnitude of a charge in motion, so there is no difficulty here, and we may furthermore suppose that the field is generated by stationary charges, so that we need not trouble to inquire whether the procedure by which the field was originally defined is here applicable. The task of checking the equation then reduces to the simple physical task of measuring the force on the moving charge. How shall we do this? If the velocity is low, we may tie a string to the charge and measure the force with a spring balance (or its equivalent). But now an examination of the equations shows that in more complicated phenomena perceptible deviations from the static behavior are to be expected only at much higher velocities than can be attained by towing charges with a string and a spring balance, so that it is evidently necessary to check the simple equation for the force on a moving charge also at high velocity. Since at high velocity the spring balance method for measuring forces fails, we are driven to the only procedure that we have, namely a measurement in terms of the resultant acceleration, calculating the force by Newton's first law of mechanics. But this involves a knowledge of the mass of the moving body, which we recognize in general may be a function of the velocity. Now we have already seen, in discussing the concepts of mechanics, that the operations by which mechanical mass is defined cannot be carried out at high velocities, so that either the concept of mechanical mass becomes meaningless at high velocities, or we must adopt another definition. In attempting to give this new definition of mass at high velocities, we are driven to a result of special relativity theory, namely that all mass, mechanical or electrical, must be the same function of velocity. If now electrical mass can be found in terms of velocity, our immediate problem is solved and we shall be in a position to complete the experimental check of the equation. But as a matter of fact, in order to determine electrical mass, we have to use that equation which we are now engaged in trying to establish. Logically we have again the vicious circle, the physical significance of which is that independent operations do not exist for giving unique meaning to the concept of force on a charge at high velocity.
We seemed so close to our goal a minute ago; that we may allow ourselves to jump the logical chasm, and assume that the equation is correct. Electrical mass now becomes a definite function of velocity, mechanical mass the same function, and we are in a position to compare the actual acceleration received by a charge in a field with that calculated by the equation. Our conviction, on the basis of all experience up to the present, is that the two accelerations will be found to agree.
The equation then does somehow make correct connection with experience in that a consequence of the equation can be verified experimentally, in spite of the fact that as the equation stands it is meaningless, because the operations do not exist by which meaning can be given to the individual terms. At low velocities the equation really says what it seems to say, because the individual terms have meaning in terms of operations; and, what is more, what the equation says agrees with experiment. At high velocities the equation does not mean at all what appears on the surface; by itself it has no meaning; it has meaning only when considered as a member of a system of equations, and only in so far as the system of equations makes by implication statements about nature that have meaning in terms of operations that can be carried out physically. The individual terms of the equation of the system do not have meaning at high velocities, and in fact there are more terms than there are independent physical operations.
An exact analysis from the operational point of view of the significance of the equations at high velocities has perhaps never been made, and is not necessary for our immediate purpose. The discussion has brought out, however, that the number of physically independent concepts has been cut down by two at least, in that we have made purely formal definitions of the meaning of quantity of electricity, and of the force exerted by a field on a charge at high velocity. There is no reason to think that there is anything unique about this analysis, or that formal definitions might not have been given to other concepts than charge and force. We can only state that as far as physical content goes the equations have at least two degrees of freedom. It should then be possible to find quite different sorts of equations which agree equally well with experience. In particular, since we have seen that the force on a moving charge has no meaning in terms of independent operations, it should be possible by arbitrary definition to make this force any function of velocity that we please (of course reducing to the proper value at low velocities), and then to determine the other equations so that the entire group of equations is consistent with experiment. So far as I know, no one has tried to give such a modified set of equations, and indeed there is no particular reason why anyone should bother to do this, because the present equations are simple enough, and the modified equations, although perhaps differing greatly in appearance from the present ones, would have no advantage in any greater or different physical content.
But there is no reason to think that the present state of affairs will always continue. We have seen that the decrease in the number of concepts corresponds to our inability to measure as many sorts of physical things at high velocities as at low. Now it is the task of the future experimenter so to refine the possibilities of measurement at high velocities as to restore these two degrees of freedom. In particular, mass should be made measurable in mechanical terms at high velocities. When this restoration has been made, and all the quantities in our equations receive independent physical meaning, the significance of the equations in terms of operations will be quite altered, although the formal appearance will be unchanged. We must then be prepared to find, as always when we change the range of phenomena, that the equations in their present form do not correspond to the facts at all, and that one of the alternative forms allowed by our present two degrees of freedom is the correct form. But until the new experimental facts have been obtained, it seems hardly worth while to attempt to specify the doubly infinite variety of forms which the equations might have consistently with present experiment.[23]
[23]Since this was written, a paper has appeared by V. Bush, Jour. of Math. and Phys., vol. V., No. 3, 1926, in which it is shown that there are advantages in supposing the charge of an electron to change when it is set in motion.
So far we have discussed the extension of ordinary electric phenomena in only one direction, to high velocities. There is another extension which is much more important physically, namely to very small scales of magnitude. This extension is necessary to an understanding of the properties of matter in bulk, the electrical nature of the atom having been once established. Our problem is to show how the statistical average of the behavior of a large number of electrons gives the large scale effects which are within the reach of observation, and which are described by the equations we have just discussed. To get this statistical average we must be able to calculate at least certain features of the behavior of the individual electrons, which means that we must know the form of the equations down to dimensions of the order of those of an electron, or smaller. Now if one contrasts the scale of the supposed dimensions of the electron with the smallest dimensions on which we can make independent experimental verification of those equations, he must admit that there is an enormous chance for change in the type of equation beyond the limit that we can reach by direct experiment, and the chances of guessing the correct extension of the equation to small dimensions are correspondingly almost vanishingly small. (We may perhaps say that experiments on the Brownian motion on a scale a good many atoms in diameter bring us the closest possible directly, which means that we are 106 or 107 fold away from electronic dimensions.) In spite, however, of the apparently enormous chances against it, this program of extending the field equations to small dimensions and following out the consequences was exactly the program which Lorentz set himself.[24] That Lorentz saw that such a program might be carried through must be recognized as a vision of extraordinary genius, and that he was willing to devote to it the years of arduous and detailed calculation that he did is evidence of a pertinacity of purpose of the highest moral order.
[24]See for example, H. A. Lorentz, The Theory of Electrons, B. G. Teubner, 1916.
We now have to examine critically this program and to inquire what is the significance of the measure of success that Lorentz attained. The precise extension of the equations that he made was very simple, for the large scale equations of Maxwell were taken over with as little change as possible. The equations are so familiar that it is not necessary for us to write them in detail; they express relations between the electric and magnetic force vectors (force and induction now becoming the same thing, the difference between them in ponderable bodies being one of the things that is to be explained in terms of the electrons), the space density of electric charge, its velocity, and the force acting on elementary charge. We have to notice that although formally the equations have changed little in appearance, nevertheless the physical content, as judged by the operations, has changed a great deal. Consider, for instance, the meaning of charge density. In the Maxwell equations, ρ was merely the number of discrete elementary charges per unit volume, the distances between these charges being supposed so small compared with the scale of the phenomena involved that their average effect could be fairly represented in terms of their numbers. In the Lorentz equations, on the other hand, ρ has a value different from 0 only inside the electron; everywhere else ρ = 0. Now an examination of the previous discussion, in which we questioned whether the magnitude of the charge might be a function of its velocity, will show that there are no physical operations whatever by which meaning can be given to ρ at individual points inside an electron. There is a single condition on this ρ, namely, that its integral throughout the total volume assigned to the electron shall equal the total static charge of the electron. Obviously a single scalar condition is a pretty blunt tool with which to attempt to determine a point function throughout a volume. Again, the equations talk about the velocity of the charge at interior points of the electron; what possible physical operations are there by which meaning can be assigned to the velocity of an amorphous structureless substance in regions inaccessible to experiment? Here again, the concept as a detailed description of the behavior at a point has become meaningless, and again there is a single integral condition, namely, that the v associated with every ρ must be such that when integrated over the volume of the electron it will give a total transport of charge equal to that carried by the electron in its motion. This again is a single condition on a function distributed through space. Still again, the equations contain the electric and magnetic vectors at points inside the electron. What is the possible meaning of these field vectors in terms of operations? Our procedure for finding the field at a point involves by definition finding the force on an electric charge placed at that point. But there is no charge smaller than an electron, and the procedure degenerates into a fiction. Again there is a single integral condition on the field vectors; the integral of the force on the assumed charge density when taken over the total volume of the electron must give a value corresponding to experiment. Except for this single condition, the concept of the field at points inside the electron is an invention without physical reality. Not only is the field concept meaningless at points inside the electron, but it is meaningless at points outside within a certain distance, because the exploring charge can never be made smaller than the electron itself, and so can never come closer than a certain distance.
The actual state of affairs is much worse than has already appeared. It was shown in the discussion of space and time that no independent physical meaning can be attached to lengths and times as small as have to be assumed in describing the behavior of the individual electrons. The operations Div, Curl, d/dt which enter the field equations are, therefore, physically meaningless as they stand; they have only a mathematical meaning which begins to acquire physical complexion in a most complicated way when the equations are integrated over large enough volumes.
It is evident, therefore, that the concepts which enter the field equations have entirely lost their large scale significance; they have become blurred, fused together, and fewer in number. A precise analysis of this situation has probably never been attempted and would obviously be difficult: it would be interesting to know at least how many really independent concepts there are at this order of phenomena. An attempt at an analysis would probably be worth while from a physical point of view in suggesting possible experiments by which the number of physically independent concepts could be extended.
Since the quantities in the field equations are meaningless in the naked form in which they enter the equations, it is meaningless to inquire whether the equations as they stand are true or not. In our present state of experimental knowledge it is also meaningless to ask whether, for example, the inverse square law between electric charges continues to hold, or whether an accelerated charge radiates. These questions have meaning only when applied to phenomena on a scale large enough to correspond to possible experiment.
There is a rather interesting obverse to the statement that it is meaningless to ask whether the field equations are true, namely, that it may not be meaningless to state that they are false. A statement is not true unless it is true in every particular, but it is false if it is false in a single particular. If we can show that a single consequence of the field equations of Lorentz, when integrated or averaged in such a way as to correspond to experimental possibilities, is false, then the equations must be false. It seems that, regarded as a complete description of physical behavior on a small scale, the equations must be judged false, because they contain no suggestion of quantum phenomena.
Even if we have to recognize that the equations are false, there can be no question that they correspond to an important part of reality, and that they have been of the greatest service to physics. What is the significance of the success that they have attained? It is to be noticed that all the phenomena to which the Lorentz equations have been successfully applied, although not large scale phenomena in the ordinary sense of the word, are nevertheless phenomena involving the coöperation of a number of atoms, and that the equations unquestionably fail when applied to phenomena involving single electrons. It appears from our best present evidence that on a small scale the behavior of nature is governed by quantum principles and is therefore quite different from large scale behavior, Which we have seen is governed by the Maxwell equations. There must of course be a transition zone in which the character of phenomena changes from quantum to Maxwell. Now any program like that of Lorentz is almost inevitably bound to begin to give correct results when we get up as far as the transition zone, for the simple reason that the relations of Maxwell have been put into the equations and are always there ready to appear as soon as the quantum relations begin to give way. The physical significance of the success of the Lorentz program seems to be that the transition from Maxwell to quantum takes place at a stage pretty far down toward the individual atoms. To find the precise details of the transition from Maxwell to quantum phenomena constitutes a large part of the program of the immediate future.
All this skepticism about the classical work of Lorentz is likely to be rather irritating or depressing, particularly if one attempts to imagine what other course could have been adopted. Indeed it does seem that we find ourselves in a real quandary; Lorentz was practically forced, because of the character of the mathematical tools at his command, to take the course that he did, in spite of any recognition of the physical meaninglessness of the mathematical operations. We have already seen that conventional mathematics does not correspond to the physical reality; it cannot easily make a qualified statement subject to limitations, and it recognizes no difference between the physically big and the physically little and the corresponding change in the operational meaning of its symbols. It begins by being a most useful servant when dealing with phenomena of the ordinary scale of magnitude, but ends by dragging us by the scruff of the neck willy nilly into the inside of the electron where it forces us to repeat meaningless gibberish. Larmor recognized this, and in his electron theory, developed practically contemporaneously with that of Lorentz, endeavored to treat electrons as wholes, and not to make meaningless statements about their insides.[25] But he was much less successful than Lorentz in making his analysis give physical results, and one may suspect that it was at least in part due to difficulty with his tools.
[25]Joseph Larmor, Æther and Matter, Cambridge University, Press, 1900. In this book the electron is treated as a point singularity in the ether.
What we should like to be able to do is easy to see. The things that go into our equations must have independent physical meaning, and the character of our mathematical formulation should change to keep pace with the change in the physical operations which give meaning to the terms. For example, electrical density has a meaning for large scale phenomena, but means nothing on a small scale. Our ultimate electric unit is the electron; when we get down to this scale of magnitude, our mathematics ought to be making statements about the relative behavior of discrete electrons, and not mention so much as by implication the density at points inside an electron. But this sort of thing we apparently cannot yet do; the proper mathematical language has not been developed. Such a language, when developed, must not only be able to resist the temptation to burrow inside the electron, but must also try to get along without the field concept, which we have seen is liable to so much physical abuse, and must reduce effects in complicated electrical systems to the ultimate elements that have physical meaning, namely, a dual action between pairs of electrical charges, with no implications about physical action where the charges are not.
THE NATURE OF LIGHT AND THE CONCEPTS OF
RELATIVITY
We have already discussed several aspects of the theory of relativity in connection with the relation to it of some of our fundamental concepts. There are still other topics connected with relativity which demand attention; most of these involve the properties of light. It will now be convenient to discuss together the properties of light and these concepts of relativity. We restrict our discussion of light to those simple properties which bear on the theory of relativity.
Practically all our thinking about optical phenomena is done in terms of an invention, by means of which these phenomena are assimilated to those of ordinary mechanical experience, and so made easier to think about. To realize that invention has been active here, we must think ourselves back into that naive frame of mind in which experience is given directly in terms of sensation. The most elementary examination of what light means in terms of direct experience shows that we never experience light itself, but our experience deals only with things lighted. This fundamental fact is never modified by the most complicated or refined physical experiments that have ever been devised; from the point of view of operations, light means nothing more than things lighted. Now experience shows that these things lighted may stand to each other in varied relations; in attempting to reduce these relations to order and understandability we make a certain invention. This is prompted by several cardinal experimental facts: in the first place, things lighted have a simple geometrical relation to each other, in that screens placed on straight lines between the lighted objects may suppress the illumination of one or the other and themselves become illuminated. This leads to the concept of rectilinear beams of light, which is no more than a description of the geometrical relation between lighted objects. Then we have the experimental fact of the asymmetrical relation of the lighted objects, described in terms of sources and sinks. Finally, we have the discovery made at a much later stage, and not possible until physical measurements had reached a high refinement, that light has properties analogous to the velocity of material things. This was first discovered in connection with astronomical phenomena in the shift of the time of eclipse of Jupiter's satellites and in aberration, but was later found to hold for purely terrestrial phenomena, in that a beam of light reflected from a distant mirror does not return to the source until after the lapse of a time interval that can be measured with means sufficiently refined. This property of return after the lapse of time is precisely like that of material things, such as a messenger despatched for an answer, or a ball or a water wave bouncing from a wall. These various properties of light lead quite naturally and almost inevitably to the invention of light as a thing that travels, "thing" not necessarily connoting material thing.
The question now for us is whether we shall regard this as a mere invention, made for convenience in thinking, or shall go further and ascribe physical reality to it, that is, shall we think of light as capable of independent physical existence in the space between the matter that constitutes the source and the mirror? Now in spite of the resemblances pointed out above, there is at least one universal and fundamental difference between a thing that travels and light. We have independent physical evidence of the continued existence of the ball, for example, at all intermediate points of space; we can see it, or hear it, or feel the wind in the air as it passes, or even touch it. All these phenomena are independent of the initial and terminal phenomena, and hence by our criterion for the physical reality of an invention, we are justified in ascribing physical reality to the ball in transit. But with the beam of light it is entirely different; the only way by which we can obtain physical evidence of the intermediate existence of the beam is by interposing some sort of a screen, and this act destroys just that part of the beam whose existence we have thereby detected. There is no physical phenomenon, whatever by which light may be detected apart from the phenomena of the source and the sink (understanding that a mirror is included in the idea of, sink); that is, no phenomenon exists independent of the phenomenon which led us to the invention of a thing travelling. Hence from the point of view of operations it is meaningless or trivial to ascribe physical reality to light in intermediate space, and light as a thing travelling must be recognized to be a pure invention.
The status of light is exactly the same as that of an electric field; there is not the slightest warrant for ascribing physical reality to either at points of empty space—light and field-at-a-point have no meaning until we go there and make experiments with some material thing. Of course the electromagnetic theory of light makes this resemblance inevitable, provided the theory and our views of the nature of light and the field are correct.
It cannot be denied that there are some phenomena which uncritically considered appear to justify thinking of light as a thing that travels; these will now be discussed. Probably the argument to which most significance is usually ascribed is derived from the phenomena of energy. The passing of light from source to sink is accompanied by the transfer of energy. But energy is conserved, so that we have to ask where the energy is in the time interval between the emission of light from the source and its absorption by the sink. There is an obvious answer: the energy is in transit, of course, somewhere in the intermediate space between source and sink. If we think of light as propagated through a medium, then the medium is such that energy may reside in it, as in the electromagnetic theory of light, or if light is more material and ballistic in character, the thing that travels has itself energy. We notice in the first place that the conservation principle involves the time concept, because what we mean by conservation is that the total energy of the universe, at a fixed instant of time, is constant. That is, we have to integrate over all space the local energy at a definite instant of time, and this involves spreading the time concept over all space. It is further evident that unless we spread the time concept over space in the right way we shall not get conservation. The proof that it is possible to spread the time concept over space in such a way as to give conservation involves a knowledge of the properties of light. It would seem, then, that we ought not to assume conservation in deducing the properties of light, when a knowledge of the properties of light is necessary to establish conservation. These considerations cannot be accepted as final, however, until a detailed analysis has been made, and this would be most complicated. But there is a more important consideration derived from our previous critique of the energy concept, namely, that there is no basis for asserting that energy is localized in space at all; energy is not a physical thing, but rather what we would call a property of a system as a whole. If this view of energy be granted, the whole energy argument for light as a thing travelling, and also for the existence of a medium, falls. I believe that similar considerations apply to any arguments from the conservation of momentum.
The possibility of detecting light in apparently empty space by a screen constitutes perhaps the most immediate reason for considering light as a thing that travels. This point of view I believe is characteristic of the entire attitude of Einstein in deducing the theorems of the special theory of relativity. Einstein's light signal is for the purposes of the deduction thought of as a simple spherical wave spreading from the source and capable of being watched as it spreads by an observer outside the system, in much the same way that a water wave can be watched. Of course the light signal cannot actually be watched in transit, but we can come fairly close to this ideal by placing screens at any point we please to make the wave visible. It is true that the mere act of showing the existence of the light destroys that part of the beam whose existence is detected, but the screen needs only an infinitesimal amount of light to make it visible, and so by the usual physical argument we may suppose that the detecting screen produces only an infinitesimal modification of the total original light.
Our satisfaction with this picture evaporates if our present quantum views of the nature of light are correct. We can no longer think of the spherical light pulse as of irreducible simplicity, but it is an exceedingly complicated thing, perhaps more complicated than a gas from the point of view of kinetic theory, and simulates simplicity by some sort of averaging of the effects of the elementary quantum processes of which it is composed. If the principles of relativity are to continue to be regarded as fundamental, or even if they are to remain intelligible, we must apply our reasoning, not to spherical wavelets, but to the elementary process of which these wavelets are composed. Now the elementary quantum act is essentially a twofold thing: there is a discrete act of emission at some discrete material particle, and the act is consummated by another discrete act (absorption or scattering) at some other discrete particle. We cannot yet fully characterize the details of this twofold process, but have to connect the place at which absorption takes place with the place of emission by statistical considerations. It is evident, however, that to think of emission as starting some process like a spherical wavelet travelling like a thing through space presents an entirely incorrect view, because in the wave there is no hint of the discrete place which is to terminate it. We may say crudely that there is no way by which the wave can know what discrete material particle is to complete the emission process. We may perhaps try to save the situation by remembering that a spherical wave is polarized and so has a unique direction associated with it; but further examination shows that this does not help, because the unique direction is that of no energy flow, and absorption can take place in any direction except this. It appears then that instead of being a help, the thing travelling point of view is a positive embarrassment when we try to picture by means of it the essentially twofold nature of the elementary quantum act.
Another plausible argument for light as a thing travelling may be deduced from our principle of connectivity. Imagine a dark lantern with a shutter that can be opened or closed so as to emit a momentary flash of light, a distant mirror, and a receiving instrument near the source. One of the properties of light that we always assume is that no permanent trace of the act of emission is left behind in the source. The most minute examination of all the details of the lantern and its surroundings at some time after the emission of the flash has not yet shown any phenomenon that betrays a remembrance of the emission of the flash, unless perhaps we measure the total energy or momentum and have some way of knowing what the energy or momentum would have been if the flash had not been emitted, and in any event we cannot specify the moment in past time when the signal was emitted. In the same way we cannot tell from an examination of a mirror whether it has at any time in the past reflected a beam of light. Consider now two systems, each consisting of a source and a mirror distant 3 x 1010 cm., identical in all respects, except that in one a light signal was flashed from the source 1.5 seconds ago, and in the other only 0.5 seconds ago. According to our hypothesis, the most complete examination of source and mirror in either system fails to show the slightest difference, but nevertheless there is something essentially different about the two systems, for in one a light signal arrives at the screen in 0.5 seconds, and in the other not until 1.5 seconds. This violates what we have suggested might be regarded as the cardinal and most general principle of all physics, the principle of essential connectivity, which states that differences between two systems must be associated with other differences. A most obvious and simple way of maintaining our principle is merely to point out that the system really included more than we investigated: the system properly consists of source, mirror, screen, and all intermediate space, so that if we had examined intermediate space we should have found light there in transit in different positions in the two systems, thus correlating with the differences in subsequent history. This argument appeals to me as perhaps the strongest that can be advanced for the view of light as a thing travelling. But it seems in no way conclusive. Our principle of essential connectivity made no mention of the time concept, but we have somehow smuggled it in making the application above. We sought to give a complete description of our system at some one instant of time, and this involved spreading the time concept over space. This itself is a questionable operation and may be done in different ways. But, more important, what is the justification for supposing that the system can be completely described by giving a complete description of all the measurable parts of it at some one epoch? We have seen that in the most general case the principle of essential connectivity must recognize that the concept of "initial condition" of a system involves all the past history, and we may have here a case in point. The answer can be given only by experiment. In dealing with ordinary experience, when we do not have to distinguish between local and extended time, and are not dealing with optical phenomena, there can be little question that experience at least approximately justifies the expectation that future behavior is determined in terms of the present condition and that present condition may be specified in terms of the results of present operations performed in the system. But before extending this principle to phenomena in which we have to distinguish between local and extended time, we have to answer just that question which we are now considering; namely, whether there are physical phenomena taking place in apparently empty space, and whether therefore empty space has to be included in the system. We find ourselves again treading the vicious circle. Perhaps experience will show that the extension of the principle of connectivity to optical phenomena involves something like this: namely, the future at any point in a material system is determined by a complete description of the present state of the system in the immediate vicinity, and by a history of the behavior at more distant points, this history extending over longer and longer intervals of time as the point becomes more remote.
I believe, however, that these possibilities will not seem very satisfactory, and that most physicists will discover in themselves a very strong disposition to feel that the future is determined in terms of a complete description of some sort of instantaneous configuration, time being extended in some suitable way over space. This instinctive demand that the future be determined in terms of the present may easily be consistent with the optical phenomena in our two systems consisting of source, mirror, and screen, without involving the material existence of light in empty space, provided that our assumption that the emission of light leaves behind it no permanent record in the source was incorrect. It may be that detailed examination of a source after emission will disclose permanent traces, from which the instant of emission may be found by extrapolation. If the conviction of determinism of the future is strong, the physicist may well be impelled to search here for new phenomena indicating such a memory of emission.
Let us now inquire how our physical structure might be affected if we should give up the identification of light with a thing travelling. One consequence is that light need no longer be thought of as having the property of velocity, since velocity, in terms of immediate experience, is a property of things moving from place to place. Giving up the concept of light as a thing travelling would enable us, then, to adopt an alternative method of describing nature with a different concept of velocity; we have seen that it is possible to define velocity in terms of operations different from the usual ones, in such a way as to give the usual numerical results at small velocities, but different results at high velocities, and in particular to give an infinite velocity for light.[26]
[26]No difficulty arises from the asymmetric character of light in assigning an infinite velocity to light because those physical operations by which we discover which is the source and which the sink are entirely distinct from the operations by which a velocity is measured; or in other words, even in the limit, it still has meaning to say that an infinite velocity has a direction associated with it.
There is now no objection to an infinite number associated with light, if we no longer think of light as having physical velocity. We may, if we like, continue for the sake of convenience to talk of the velocity of light, clearly understanding that the infinite value which must be ascribed to this velocity corresponds to the fact that the physical concept of velocity does not apply in all respects to light. We should now have to revise our process for extending the time concept over space, because this was formerly so done as to give light a finite velocity. We are now to make this velocity infinite, which is obviously to be done simply by setting a distant clock on zero at the instant it receives a light signal flashed from our clock at its zero. The behavior of material things now takes on a simple aspect—there is no longer a finite upper limit to the velocity that can be given a material thing, and light has no longer the paradoxical property of bearing the same finite relation to each of two material systems which differ from each other by a finite amount (that is, the first postulate of relativity that the velocity of light is 3 x 1010 in all reference systems). Light instead now bears the relation of infinitude to each of two systems which differ from each other by only a finite amount, and this is natural from the mathematical point of view.