Translation:On the Theory of the Michelson Experiment

From testwiki
Jump to navigation Jump to search

Template:Translation header


Template:Center


The goal of the following investigation is to discuss the objections against the Michelson experiment given by Template:Sc[1] in his Karlsruhe lecture, in a more closely way as it was possible in the discussion immediately after his lecture. The most severe of these objections concern the question, as to how to calculate the phase differences shown by the light ray in its given path at different points, in a system of moving bodies. This criticism of him is placed by us at the beginning. Part II is then concerned with the influence exerted by the thickness of the separation plate; part III is concerned with the location (occupied by the interferometer during the experiments) relative to Earth's velocity.


Template:Center

Template:Sc objects against the previously given representations (based on the one given by Template:Sc) of the Template:Sc experiment, that there is a principal mistake in the calculation of the phase difference at which the interference takes place. Since these representations mostly neglect the thickness of the separating plate, we also want to allow us to proceed the same way here; the principal things won't be changed by that. Additionally, part II will rectify the things missed here.

In his book concerning the relativity principle[2], the author of this response has based the criticized representation on the theorem that light traverses a distance moving with velocity 𝔮, where the relative velocity 𝔠r is the vector difference from its absolute velocity 𝔠 (c=31010cmsec1) and from 𝔮. This theorem shall at first be proven extensively here. There, we use a reference system x,y,z,t, which is to be referred to as the aether in the sense of the absolute theory, and as a valid system in the sense of relativity theory. Besides, we introduce a coordinate system

ξ=xqt, η=y, ζ=z Template:Optional style|(1)

moving with velocity 𝔮 in x-direction. It may not confused with a second valid reference system in the sense of relativity theory; all length and time indications are rather exclusively to be related to the measures valid in the resting system x,y,z,t. Therefore, the following considerations are also valid in all electromagnetic theories for moving bodies. The moving distance s, for which the light propagation shall be studied, shall be extended from the origin of the axes-cross ξηζ to a point P(ξPηPζp). Its direction is that of the relative velocity 𝔠r.

As mentioned above, it shall be

𝔠r=𝔠𝔮; Template:Optional style|(2)

if we represent these vectors by distances, then the latter are mutually located as in Fig. 1. Now, if the periodic term of light excitation for a plane wave (related to x,y,z,t) is

eiν(t1c(xα+yβ+zγ) Template:Optional style|(3)

then α,β,γ are direction cosines of 𝔠, so that Template:Pagenum

α=cosϑ Template:Optional style|(4)
File:LaueMichelson12a.png
Fig.1

when ϑ is the angle between 𝔠 and the x-direction. Yet according to (1) it is

eiν(t1c(xα+yβ+zγ)=eiν{t(1qαc)1c(ξα+ηβ+ζγ)} Template:Optional style|(5)

At the starting point of distance s, the oscillation is thus represented by

eiνt(1qαc), Template:Optional style|(6)

at endpoint P by

eiν{t(1qαc)1c(ξPα+ηPβ+ζPγ)} Template:Optional style|(7)

According to a known theorem of analytical geometric, it is however

ξPα+ηPβ+ζPγ=scosψ Template:Optional style|(8)

where ψ denotes the angle between s and 𝔠. Furthermore it can be seen from the figure, that

cosψ=cqcosϑcr=cqαcr Template:Optional style|(9)

Thus expression (7) becomes equal to

eiνr(tscr) Template:Optional style|(10)

when

νr=ν(1qαc) Template:Optional style|(11)

denotes the relative oscillation number (relative to a co-moving point). The comparison of (6) and (10) shows, that the phases traverse the moving distance s with relative velocity 𝔠r defined by (2). It is after all quite illustrative, that the displacement of a "wave crest" (sit venia verbo) happens with respect to the moving distance with this relative velocity.

Now, the ray shall be reflected in P by a mirror, so that it now traverses in its relative optical path the distance s in the opposite direction. The returning wave is represented at the origin of the moving system ξ,η,ζ in analogy to (6) by

ei(νri+δ), Template:Optional style|(12)

at point P in analogy to (10) by

ei{νr(t+sc'r)+δ}, Template:Optional style|(13)

𝔠'r is given from the construction of Fig. 1, except that its direction is opposite to that of 𝔠r; δ is a number still to be specified. That the relative frequency νr is maintained at the reflection, follows from the presence of linear limiting conditions, which connect the incident wave with the reflected one. They can only be satisfied, when the periodic functions (10) and (13) representing both waves at point P, are mutually connected by[3]

ei{νr(t+sc'r)+δ+ϵ}=eiνr(tscr) Template:Optional style|(14)

ϵ would be the phase transition at the reflection; if one neglects this, as it is done by Template:Sc – this assumption is without influence on the result –, one additionally finds from (14)

δ=νrs(1cr+1c'r) Template:Optional style|(15)

Thus according to (12), the returning oscillation is represented in the origin by

eiνr(ts(1cr+1c'r)) Template:Optional style|(16)

i.e., the phases require the time s(1cr+1c'r) to traverse distance s back and forth. Herein lies in my view the irrefutable justification of the present theory.

The question remains undecided, whether (for the calculation of the phase difference) one has to divide the difference of the times calculated for both in this way, by the relative period:

τr=2πνr Template:Optional style|(17)

or by the absolute period present at the end:

τ=2πν; Template:Optional style|(18)

as to this effect of second order, the difference between them plays no role.

It is completely identical with the given procedure for time calculations, when one speaks about traversing the absolute optical path l with velocity 𝔠. Because the absolute optical path l emerges from distance s and velocity 𝔮 by the construction in Fig. 2., which is similar to Fig 1, so that

scr=lc Template:Optional style|(19)

A third procedure would be,

Template:Pagenum

File:LaueMichelson12b.png
Fig.2

to divide distance s by the relative wave length

λr=2π𝔠rvr Template:Optional style|(20)

According to (6) and (16),

Template:Center

directly gives the phase difference present in the zero point between the waves traveling for- and backwards. Since νr is conserved at every reflection, λr varies proportional to cr.

Instead of these three valid procedures, Template:Sc employs a fourth one; he divides any absolute optical path ln by the corresponding wave length

λn=2πcνn Template:Optional style|(21)

The sum of two of such ratios shall represent the phase difference between the waves traveling for- and backwards for one of the points of the separating plate. νn and λn vary from distance to distance in accordance with Template:Sc's principle.

Already in the supplement to my discussion remarks[4] (of the two possibilities mentioned there, the second one applies here) I have alluded to the fact, that such a sum is not proportional to the traversing time of the phases, and that therefore the difference of two such sums (at that place simply written as lnλn, in which the distances ln related to the second ray, are calculated negatively) are not in any relation to the phase difference. The reply of Template:Sc[5], that it is always lnλn=tnτn, is indeed correctly per se, but it doesn't affect my objection; not every traversing time tn is to be divided by the corresponding period τn, but the division is to be executed in all of them through period τ or τr being present at the end (see above).

In the discussion, however, Template:Sc has supported his way of calculation by a figure not printed, which should have given a scheme for the appearance of two wave trains, which (starting from one point) eventually encounter again on different ways in terms of their absolute optical path and with some changes of wave length. For example, if one wave train has a length of 166 and the other one 166½ wave lengths, then they interfere with phase difference π; thus is his conclusion. However, what shall this figure represent? Evidently the spatial distribution of the oscillations in a certain moment. Yet this is not the case, since the absolute optical path is replaced by the same velocity as the apparatus. The different parts of the paths traversed by the wave train, therefore have no meaning for the representation of the oscillation state in a certain moment. It is not even necessary, that light excitation is simultaneously present on all these parts at all. At great arm lengths and great velocity, this wouldn't be the case.[6]

If one wants to correctly describe the thought on which the figure is based, then one has to draw the relative optical paths with the wave trains lying upon them, and by that one comes to the third of the previous procedures, at which one divides every distance s by the relative wave length λr lying upon them, and to sum up the ratios.

Besides this principal objection against Template:Sc's way of calculation, we also must express still another objection against a single point of this calculation, from which (despite its subordinate importance) nevertheless the correctness of the result is depending.

Template:Sc namely sets in equation (1) the ratio of the absolute wave lengths before and after the reflection of the moving mirror (λ1 and λ2)[7], up to terms of third and higher order.

Template:Pagenum Template:Center

there, φ is the angle formed by the motion of the mirror with its normal directed in the direction of the side of light, and α is the angle of incidence. As to how this equation came about, cannot exactly be seen from Template:Sc's specification. It seems to me, that it is already incorrect in the terms of second order, as well as for the special values of α appearing at the interferometer of Template:Sc.

To show this, we start from the formulas given by Template:Sc[8] for the Doppler effect and the reflection law at a moving mirror. Template:Sc's equation (11b) then says:

λ2λ1=1βα21+βα1=(1βα2)(1βα1+β2α2) Template:Optional style|(22)

while according to his equation (15e)

α1+β1α12=α2β1α22 Template:Optional style|(23)

There, β means the ratio from the velocity component of the mirror in the direction of its normal, and c, α1 and α2 are the cosines of the angles of incidence and reflection. Thus it is

β=qccosφ, a1=cosα Template:Optional style|(24)

From (23) we take by differentiation with respect to β at constant α1, that

Template:Center

and in case β=0, in which α1=α2,

Template:Center

Thus the series expansion holds:

α2=α1+2(1α12)β+ Template:Optional style|(25)

However, from (22) and from (24) and (25) it follows:

Template:Center

which doesn't agree with Template:Sc's equation – also in the special case that α = 45°.


Template:Center

We now come to speak about the influence exerted by the finite thickness δ of the separating plate. It is certainly to be appreciated, that Template:Sc alludes to the necessity to study its influence as well. However, he surely overlooks at this occasion an essential part in the installation of the interferometer. In his Fig. 1 he namely shows, that (in his opinion) both surfaces of these plates are reflecting uniformly, and when he explains that the rays (emerging from one ray at the separation) are passing at a certain distance b from each other at the end, then also this is based on the same opinion. Yet this is by no means correct. The reflection ability with respect to any interferometer at one side of the plate (the side can change depending on the circumstances), is rather increased[9] by silvering thus far, so that the reflected and passing ray have approximately the same brightness. The reflection at the other side which is not silvered, is comparatively unimportant. The purpose of this measure mainly consists in suppressing (as far as possible) the plan-parallel rings occurring otherwise at the plates, which would superimpose the planned interference fringes in a disturbing way. However, in this way, one of both beams traverses the plate two times after the separation, the other one not at all. The compensating plate which must be traversed two times by the latter, serves to balance this. And although we (as with Template:Sc) aren't required to include it to the essential parts of the apparatus, it may be at least included in our considerations, since it was actually there.

Before this, we have to allude to a point in Template:Sc's calculation, from which the result is essentially depending, and which seems problematic to us. Already when Template:Sc begins:[10] "Then the relative velocity in glass, at which the first beam arrives in O, is cu. The relative velocity in glass is thus 23(cu)", then this is surely an unjustified generalization of the theorem being valid for resting bodies, that the speed of light within bodies is related to that in vacuum inversely as the refraction index (32 namely is presupposed as the value for the refraction index in glass). If it is assumed that this were correct, then one shall change (maintaining the direction of the beam) the direction of the limiting surface; by that, also the direction of the beam incident from vacuum is changing, thus also the relative velocity of light upon it is changing; thus this is also the case for the velocity in glass according to the Template:Pagenum generalized theorem, which is evidently impossible when the beam direction is maintained. This theorem is actually not in agreement with any of the electromagnetic theories of moving bodies.

If we now want to compare with each other the statements of the absolute theory and of relativity theory concerning the influence of the plate thickness δ under consideration of terms of second order in q/c, then this would be a quite difficult task, because the absolute theory is only developed under consideration of terms of first oder.[11] However, the fortunate remark of Template:Sc helps us here, by which the ratio of the plate thickness to the length of the arms of the interferometer, is of the same magnitude as qc. Thus if one wants to calculate the times required by the phases to traverse both plates, then one only needs to consider the terms proportional to qc themselves. Because in all statements concerning terms of first order in qc, both theories are in full agreement.[12] The influence of plate thickness thus cannot lead to a decision between them. Additionally, one understands without further ado, that it doesn't change anything according to relativity theory, when one replaces an infinitely thin separating plate in the apparatus by a plate of thickness δ, and simultaneously turns on the compensating plate.

That all relevant reflections at the separating plate occur at the same surface, has by the way the success, that the beam displacement b, of which Template:Sc speaks, vanishes. By that, all consequences and modifying proposals connected with that seem irrelevant to me. By another reason, however, all mutually corresponding beams don't coincide at the end; namely because in the experiment discussed, not fringes of equal inclination, but such ones of equal thickness are produced at a layer of air, whose one surface is formed by one mirror, and whose other surface is formed by a mirror image of the other mirror drawn at the separating plate. Nevertheless these beams come to interfere when one adjusts the telescope to this layer of air.[13]


Template:Center

Contrary to both remarks previously discussed, we have to admit the validity of the third remark of Template:Sc. The velocity of Earth against the aether indeed has an unknown component: the velocity by which Earth is moving against the aether. That this one is sometimes identified in the literature with the known motion of the sun against the fixed stars, is an assumption which only finds weak support by the fact, that there is presently no reason to ascribe to the complete system of fixed stars a motion relative to the aether. Nevertheless, in the sense of the absolute theory, we can say something about the amount of the velocity in question: it cannot be essentially greater than the velocity of Earth against the Sun. Otherwise, some of the effects of second order must have been observed at terrestrial or astronomical observations in the planetary system, which according to this theory must be present at almost all electromagnetic and optical phenomena. Thus the two possibilities remain to discuss, first, that the velocity of sun–aether is of the same magnitude as the velocity earth–planetary system, and second, that the latter is essentially greater.

In the second of these cases, we can evidently totally neglect the first velocity. However, in accordance with the first of these two possibilities, one has to add a component of same magnitude to the velocity between earth and the planetary-system; then in general a velocity of same order arises again, so that the ordinary assumption concerning the amount of the velocity earth–aether remains correct in terms of magnitude. The direction of the resulting velocity is, however, unknown to a large extent, we can only say that it must considerably change in the course of a year. Although Template:Sc and Template:Sc[14] have apparently overlooked this, there is still no reason to doubt the conclusiveness of their experiment. Because one hundredth of the expected fringe displacement couldn't be missed in the observation. To explain the lack of this in the course of so many repeated experiments, by an unfortunate accident with respect to the direction of the velocity earth–aether, Template:Pagenum is an assumption of much too low probability.

Munich, Institute for Theoretical Physics, March 1912.

Template:Center


  1. Template:Sc, this journal. 12. 979, 1911.
  2. Template:Sc, Das Relativitätsprinzip (Braunschweig 1911), p. 14.
  3. Template:Sc, Ann. d. Phys. 14, 236, 1904.
  4. This journal, 12, 990, 1911.
  5. The same page, last line.
  6. For example, one shall try to represent the path traversed by a phase, not only schematically in one dimension, but with the direction corresponding to the device. The separating line of the apparatus must necessarily be drawn in two positions, so that the figure impossibly can represent the state in this moment.
  7. The change in the mode of denotation with respect to Template:Sc's lecture, has the purpose to avoid confusions with the path lengths l.
  8. Template:Sc, Ann. d. Phys. 14, 236, 1904. Here, relativity theory is not used, yet all results also apply to it.
  9. Template:Sc, Light waves and their uses, pag. 40 (Chicago 1903). The silvering is also indicated in Fig. 1. in Template:Sc a. Template:Sc, Phil. Mag. 9, 680, 1905.
  10. l. c., p. 983, left below.
  11. Template:Sc, Versuch einer Theorie der electrischen und optischen Erscheinungen in bewegten Körpern. Leiden 1895. (Reprint: Leipzig 1906.)
  12. Only at magnetizable bodies a difference exists, which possibly stems, however, from a incompleteness of the formulation of the absolute theory, see. Template:Sc, Gött. Nachr. 1908, p. 53, § 9.
  13. Template:Sc, Ann. d. Phys. 33, 186, 1910.
  14. Template:Sc a. Template:Sc, Phil. Mag. 9, 680, 1905.

Template:Translation-license