Translation:On the Theory of Relativity I: Four-dimensional Vector Algebra

From testwiki
Jump to navigation Jump to search

Template:Translation header


Template:C/sTemplate:X-larger Template:X-larger

by. Template:ScTemplate:C/e

In this and some subsequent studies I want to illustrate, how remarkably simplified the electrodynamic concepts and calculations become, when we allow ourselves to be led by the deep space-time understanding of Template:Sc. For the friend who suddenly passed away, the following would hardly provide any new insight; yet as elucidation of Template:Sc's ideas it may be welcomed by some.

The content of the relativity principle can be formulated according to Template:Sc: In the physical equations only space-time vectors are allowed to occur, i.e. quantities that have vector-character in the four-times manifold of space and time, the Template:Scan "world". Therefore when passing from one to a new reference system, its components can be transformed according to the scheme of the coordinate transformation for that manifold ("Template:Sc transformation"). Instead of absolute space of the older theory, the absolute world takes its place, i.e. the connection of space and time by the speed of light c, whose invariability now constitutes the absolute substrate of electrodynamics.

In this first part I confine myself to the algebraic relations of the space-time vectors. A second part "vector analysis" should illustrate the differential properties of the four-dimensional vector-fields. In particular I attached great importance to the geometric description (as far as possible) of the definition and the transformation of the vectors. The required concepts and calculations, that give also a full replacement for the matrix calculus used by Template:Sc, are (when admitting imaginary coordinates) direct generalizations of the known three-dimensional vector-procedure. Template:Pagenum

§ 1. Four- and Six-vectors.

It's well known that in space of three dimensions we have to distinguish between two kinds of vectors, vectors of first kind or polar vectors and vectors of second kind or axial vectors, also denoted as rotors or plan-quantities. A vector of first kind is a straight line provided with a direction-sense, its components are the perpendicular projections upon the coordinate-axes. A vector of second kind is a plane surface provided with a rotation sense, its components are the perpendicular projections of the surface upon the coordinate-planes. The components of the first ones are thus related to one axis for each component, those of the second ones are related to two axes for each component, thus the first has to be written with one and the latter with two indices. In mechanics, force and velocity belong to the first, torque and angular velocity to the second. E. g. the components of an angular velocity were to be written as ωyz, ωzx, ωxy; only after passing to Template:Sc's "supplement", i.e. the perpendicular upon the rotating plane, the ordinary notation ωx, ωy, ωz is justified. Also the magnetic field strength belongs to the same class of vectors, as it was emphasized by Template:Sc. The ordinary notation x that is also maintained in the following, should consequently be replaced by yz.

In Template:Sc[1] space of four dimensions the vectors of first kind have four components, "four-vectors", those of second kind have six components, "six-vectors". There also exist vectors of third kind, that again become a four-component vector.[2]

Template:Pagenum A four-vector is e.g. a position vector from the origin to a space-time point (x y z l) with components

Template:Center

l as well as x, y, z is a length; the expression shall remind of a "way of light". The magnitude of that vector, i.e. the length of the corresponding four-dimensional distance, is:

Template:Center

Two vectors (x1y1z1l1) and (x2y2z2l2) shall be called perpendicular to each other, if

Template:Center

The coordinate vector (x y z l) is the type of all four-vectors. A quadruple of quantities only then deserves the name four-vector, when it is transformed by a coordinate change in the same manner as the coordinate vector, i.e. covariant with it. Here, the length of any four-vector is invariant; it is its unique invariant. Especially concerning the coordinate vector, the invariance of its length expresses the principle of the constant propagation of light. By the way, due to the imaginary character of its fourth components, the length of its four-vector can be zero, without that the vector and its components vanish.

A vector of first kind is also the "four-density P", that summarizes the concept of velocity 𝔳 and density ϱ of a moving charge, with the components:

Template:Optional style|(1) Px=ϱ𝔳xc, Py=ϱ𝔳yc, Pz=ϱ𝔳zc, Pl=iϱ,

its length becomes

Template:Optional style|(1a) |P|=iϱ1β2 where β2=1c2(𝔳x2+𝔳y2+𝔳z2)

The vector-character of P is given as follows: We consider the worldline of a charge element de, i.e. the successive space-time locations of it in the xyzl-manifold. The worldline element between to adjacent points 1 and 2

Template:Center

is surely a four-vector covariant with the coordinate-vector. Afterwards we use the important principle Template:Pagenum of the reference-frame independence of charge, by which the electric mass is fundamentally distinguished before the material mass. This principle is of course in accordance with Template:Sc's equations and is mentioned by Template:Sc as a consequence of it. By that:

Template:Center

dS = dx dy dz means the three-dimensional space-element, as well as dS is the same in a new "primed" reference system, that emerges from the original by a mere rotation around 0. In Fig. 1, dS and dS appear as two arbitrary bisections through the world-line-canal, that corresponds to the limits of de. If we put each of such bisection-pairs through the points 1 and 2, then one cuts out two equal four-dimensional space elements :

Template:Center

where dl,dl are the heights of the two space elements , and at the same time the components of the worldline-element 12 into the axes l,l, which as the fourth axes of our unprimed and primed reference system must stand perpendicularly upon dS and dS.[3] By division of the two preceding equations it is given, that

Template:Center

is a reference-frame-independent scalar quantity. If we thus multiply the four-vector (dx, dy, dz, dl) by iϱ/dl, then it keeps its vector-character; exactly the four-vector P from equation (1) occurs by that.

When passing to the space-time vectors of second kind, we consider in space of four dimensions a plane-section Template:Pagenum of specific magnitude and location, where we are only concerned with the area and the orientation, but not with the form or the absolute location; the location can be changed by a parallel displacement, the form can be arbitrarily changed by a transformation of same content. Yet this geometric image is still too special; it has only five independent specification-parts, where its magnitude is given by one, and its location by four[4] numbers. But we have to consider, that to a plane in space of four dimensions, also the perpendicular plane (die ensemble of all those lines that are perpendicular to all lines of the first plane) are uniquely[5] specified. In this normal plane we now imagine a second plane-section of specific magnitude. The quintessence of two such plane-sections perpendicular to each other, is the geometric image of a vector of second kind. It depends on six specification-sections, by adding to the five mentioned parts, the magnitude of the plane-section (that is perpendicular to it) as the sixth specification-section. We therefore speak about a six-vector. As its pure right-angled components in a xyzl coordinate space, we denote the sum of the perpendicular projections of the two plane-sections upon one of the six coordinate planes xy, yz, .. zl; the components of a six-vector shall therefore be written by two indices each, Template:Pagenum that must be mutually different, and their order is of importance.[6]

To the special space-time vector of second kind, the simple plane-section, we can kinematically relate a simple rotation; to the general six-vector, the plane-pair, we relate a general rotation or a helix. Namely we have to note, that for the simplest rotation operation that let unchanged all points of a point in four dimensions, we have to speak not about a rotation axis, but about a rotation plane; that rotation plane it then given by the plane of our plane section, the rotation quantity is given by the magnitude of it. As in three dimensions, a helix[7] is composed of a certain rotation and a translation to the axis of this translation, i.e. a rotation around the infinitely distant axis perpendicular to it, likewise the general rotation in four dimensions emerges from the composition of two simple rotations around two mutually perpendicular planes, in agreement with the constitution of our general six-vector from two perpendicular plane-sections. By this remark the expression "helix" for the general rotation may be justified.

A vector of second kind is e.g. the six-vector f of the electromagnetic field with the components

Template:Optional style|(2) {fyz=x,fzx=y,fxy=z,fxl=i𝔈x,fyl=i𝔈y,fzl=i𝔈z,

The meaning of the six-vector and the relation of its components to two coordinate axis chosen in a certain order, corresponds to the general definition:

Template:Optional style|(2a) fik=fki, fii=0 (i,k=x,y,z,l)

That the electric and magnetic forces 𝔈 and are combined by Template:Sc to the higher unity of the six-vector f, Template:Pagenum is characterizing for the relativity theory, in which (depending on the reference system) the same field can appear as purely electric or as electromagnetic. E.g a uniform moving charge shows the co-moving observer only the "electric side" of the six-vector, but to the non-co-moving observer also its "magnetic side".

That the electromagnetic field is indeed to be understood as a six-vector, i.e. that when changing the reference frame its components are transformed accordingly to the type of the four-dimensional plane-section or plane-pair, follows from the fact, that due to Template:Sc's equations the field is connected with the four-density P by the divergence condition (see later).

To better become acquainted with the six-vector, we consider the special case of a single plane-section φ of area 1 and give it the form of a parallelogram that emerges from the origin. The two sides of it may be the four-vectors u, v. Then the projection of that plane-section upon the xy-plane is specified by the projections uxuyvxvy of the four-vectors, namely

Template:Optional style|(3) φxy=uxvyvxuy

Between the thus formed six components of this specific six-vector, of which only five are independent from each other by the counting on p. 753, the identity exists:

Template:Optional style|(3a) φyzφxl+φzxφyl+φxyφzl=0

as it can directly be proven by multiplication of the relevant determinants. Furthermore[8] the area |φ| of the surface section must be defined as the square root of the square sum of its six components. In our case we thus have:

Template:Optional style|(3b) |φ|2=φyz2+φzx2+φxy2+φxl2+φyl2+φzl2=1

Template:Pagenum On the other hand we consider the surface-section φ* (perpendicular to φ) of area 1, that we can also denote as the "supplement" of that, and claim that its components are

Template:Optional style|(3c) φyz*=φxl, φzx*=φyl, φxy*=φzl, φxl*=φyz

or in general, when we indicate the indices that are nor illustrated by brackets

Template:Optional style|(3d) φik*=φ(ik)

where in respect to the order of the indices ik(ik) it must be defined, that it must emerge from the order x y z l by an even number of permutations.

For verification we represent φ* again by two four-vectors u*v*, which individually are perpendicular to uv. That means (see the above definition of perpendicularity) the existence of the following equations:

Template:C/sux*ux+uy*uy+uz*uz+ul*ul=0,ux*vx+uy*vy+uz*vz+ul*vl=0,vx*ux+vy*uy+vz*uz+vl*ul=0,vx*vx+vy*vy+vz*vz+vl*vl=0.Template:C/e

By multiplying them by vl, ul, vl, ul respectively, and subtracting the second from the first and the fourth from the third, we obtain:

Template:C/sux*φxl+uy*φyl+uz*φzl=0,vx*φxl+vy*φyl+vz*φzl=0.Template:C/e

Furthermore, if we multiply these two equations by vz*uz* or vy*uy* respectively, and subtract them from each other, then it follows

Template:Center

Due to φxz*=φzx* we can write the proportion:

Template:Center

Accordingly it is given:

Template:Center

and thus, when λ means the factor of proportionality:

Template:Center

Template:Pagenum However, as φ* as well as φ should have the same area 1, then λ2=1 in accordance with the claim before (3d).

The identity (3a) can be written, by using the expression of § 3A, also in this way:

Template:Optional style|(3a) {(φφ*)=φyzφyz*+φxyφxy*++φzlφzl*=2(φyzφxl+φzxφyl+φxyφzl)=0

The general six-vector f now emerges from the two special ones φ and φ* in the form

Template:Optional style|(4) f=ϱφ+ϱ*φ*

ϱ and ϱ* are the areas of the two mutually perpendicular surface-sections of orientation φ and φ*, of which f is composed. To f we form the "dual vector f*", its "supplement" by permutation of the surfaces ϱ and ϱ*. Namely, f* may be defined in consequence of (4):

Template:Optional style| f*=ϱ*φ+ϱφ*

From that it follows e.g. for the yz-components of this dual vector

Template:Optional style|(4b) fyz*=ϱ*φyz+ϱφyz*=ϱ*φxl*+ϱφxl=fxl

in the same way

Template:Optional style|(4b) fzx*=fyl, fxy*=fzl, fxl*=fyz, fyl*=fzx, fzl*=fxy

and in general

Template:Optional style|(4c) fik*=f(ik), fki*=fik*, fii*=0

so that f and f* are, with respect to each other, in the same relation, as the special surface-section φ with φ* perpendicular to it.

Also the quantities |f|2 and (ff*) can be expressed by ϱ and ϱ* in accordance with (3b, e).

Template:Optional style|(5a) {|f|2=fyz2+fzx2+fzl2=ϱ2|φ|2+ϱ*2|φ*|2+2ϱϱ*(φφ*)=ϱ2+ϱ*2
Template:Optional style|(5b) {(ff*)=fyzfyz*+fxyfxy*++fzlfzl*=ϱ2(φφ*)2+ϱ*2(φ*φ)+ϱϱ*(|φ|2+|φ*|2)=2ϱϱ*

Both have to be denoted as invariants of the six-vector, as they have, like the surfaces ϱ and ϱ*, a geometric meaning independent of the location of the coordinate system. We can, for example, denote |f| as (geometric) surface sum, and (ff*) as (double) surface product. In case Template:Pagenum f means the electromagnetic force, we have by (2) and (5a, b)

Template:Optional style|(5c) |f|2=2𝔈2=2L, (ff*)=2i(𝔈)

The expression L means "Lagrangian function of unit volume".

However, besides the components related to the coordinate planes, we can also speak of its components related to the coordinate spaces and coordinate axes.

By the component related to coordinate space, we understand a surface section that lies in this space, which we obtain when we project the two surface sections ϱ and ϱ* upon this space, and form the geometric sum of their two projections. E.g. the component of f in relation to the space of x y z, is the three-dimensional vector of second kind:

Template:Center

By the component related to a coordinate axis[9], we understand a surface-section obtained by splitting off (from the surface sections ϱ and ϱ*) its components in relation to the space perpendicular with this axis, and the two remaining surface-sections are combined geometrically. If we intersect the resulting surface with the space perpendicular to the axis, then a three-dimensional vector of first kind in this space arises. E.g. the component of f in relation to the axis of l, is the three-dimensional vector of first kind:

Template:Optional style|(6) fl=(flx, fly, flz)=i𝔈

that also can be interpreted as the special four-vector of the vanishing l-component. In general fj shall be the four-vector that is perpendicular to the j-axis:

Template:Optional style|(6a) fj=(fjx, fjy, fjz, fjl)

The component of the supplement f* related to the j-axis has the corresponding meaning:

Template:Optional style|(6b) fj*=(fjx*, fjy*, fjz*, fjl*)

Template:Pagenum and in particular

Template:Optional style|(6c) fl*=(fyz, fzx, fxy, 0)=

Besides vectors of first and second kind, also vectors of third kind exist in four-dimensional space, which are again four-vectors given not by a distance, but by a three-dimensional space-section, and which were written with three indices as projections of the space-section upon the four coordinate spaces, e.g.

Template:Center

Geometrically, to such a vector of third kind we can attribute a vector of first kind, its "supplement", namely the line perpendicular to the space section whose length is equal to the size of the space-section. The components of the latter

Template:Center

are one after the other equal to the components of the vector of third kind. The replaceability of the vector of third kind by its supplement is reaching even further, like the relation of the supplement with the vector of second kind in the case of three dimensions, in so far as in this case the supplement (due to the even dimension number) correctly reproduces the behavior of the vector of third kind, not only regarding coordinate transformations of determinant +1, but also regarding such of determinant -1 (mirroring, inversion). As the summarizing denotation for the vector of first and third kind, the name "four-vector" is recommended.

Since we can geometrically compose any two space sections by vectorial addition of their supplements, we don't obtain a more general vector of third kind through a space-pair as through a single space-section. Indeed the latter has already four mutually independent specification-sections - its size and the three data that are necessary for the definition of its location.

An important example (compare § 3B and §4) for a vector of third kind, we will find in the electrodynamic force (Pf). Template:Pagenum

§ 2. Component formation into arbitrary directions and planes, and its connection with the Lorentz transformation.

It's characteristic for the vector-concept and its independence of the coordinate system, that we can speak of its components into arbitrary directions (planes as regards the six-vector). By the component of a four-vector P into the x-axis we have to think of the perpendicular projection upon this axis, i.e.

Template:Optional style|(7) Px=Pxcos(xx)+Pycos(xy)+Pzcos(xz)+Plcos(xl)

If x is a "space-like" axis, then[10] cos(x,x), cos(x,y), cos(x,z) become real, cos(x,l) becomes purely imaginary, so that (see (1) and 7)) Px becomes real. On the other hand, Pl is purely imaginary when l is a "time-like" axis.

By the component of a six-vector f into an arbitrary plane, which we think as given by two mutually perpendicular directions xy contained in it, we understand the sum of the perpendicular projections of the surface sections (that forms the vector f) upon this plane; it can be calculated from the two-row determinant of the direction cosine:

Template:Optional style|(8) {fxy=fxy|cos(xx)cos(xy)cos(yx)cos(yy)|+fyz|cos(xy)cos(xz)cos(yy)cos(yz)|+fzx|cos(xz)cos(xx)cos(yz)cos(yx)|+fxl|cos(xx)cos(xl)cos(yz)cos(yl)|+fyl|cos(xy)cos(xl)cos(yy)cos(yl)|+fzl|cos(xz)cos(xl)cos(yz)cos(yl)|

For verification we consider at first the special six-vector φ (pure surface section of magnitude 1) Template:Pagenum and illustrate those as in §1 in the form of a parallelogram by two four-vectors u,v. Then the surface-section (projected upon the x'y'-plane, i.e. the sought component φxy, is specified by the projections of the four-vectors ux,uy,vx,vy upon the x- and y-axis as in equation (3) by:

Template:Center

If we substitute into ux, its value from (7) and write, according to (3), uxvyuyvx and φxy for the sub-determinants, then exactly the equation (8) for the special case f=φ is given. The same equations also arise for the surface-section φ* (perpendicular to φ) and therefore also for the sum ϱφ+ϱ*φ*, that means the general six-vector by equation (4). The component rule (8) for the six-vector is also a necessary consequence of that for the four-vector (7).

Regarding the reality-relations, it is similar for the components of the four-vector: fxy consists of only real, fxl of only imaginary terms, when the axes x,y are space-like, and l is time-like.

Furthermore we consider the quadruple of four-vectors fh derived from the six-vector f, and ask for the expression of the components fx of f into the new x-axis (which thus means a surface-section parallel to x', or a four-vector perpendicular to this axis) from the components fx,fy,fz,fl into the original axes. Indeed, by stating fx,fy,fz,fl, also fx must be given. We claim, that the following connection exists:

Template:Optional style|(9) fx=fxcos(xx)+fycos(xy)+fzcos(xz)+flcos(xl)

Equation (9) is formally in agreement with (7), but also essentially differs from it, because on the right-hand side of (9) there are four directed four-vectors, on the side of (7) there are four undirected components. Equation (9) is of course to be proved due to the vector-property of f, equation (8). We denote the vector-sum that occurs in (9) by Σ:

Template:Center

Template:Pagenum and form the components into the axis x y z l by equation (6a):

Template:C/sΣx=fyxcos(xy)+fzxcos(xz)+flxcos(xl)Σy=fxycos(xx)+fzycos(xz)+flycos(xl)Σz=fxzcos(xx)+fyzcos(xy)+flzcos(xl)Σt=fxlcos(xx)+fylcos(xy)+fzlcos(xz)Template:C/e

If we multiply one after the other by cos(jx), cos(jy), cos(jz), cos(jl) (by j we understand an arbitrary axis at first), and sum up, then by (7) the components of Σ into this axis emerge, and namely it is given:

for j=x the sum 0,
for j=y the expression (8),
for j=z the same expression by permutation of y with a new direction z (perpendicular to x) etc.

The right-hand side of (9) indeed represents the quadruple of that magnitude, denoted by us as fx by equation (6), namely the quadruple

Template:Center

For the sake of completeness, also the behavior of a vector of third kind in its projection upon an arbitrary space xyz may be mentioned. Its component into space xyz would be calculated by a four-parametric expression, in which the three-row sub-determinants of the direction cosine occur as coefficients, yet for which we can put (by the conditions of orthogonality that exist between them) the direction cosine of the common perpendicular l. The four-vector of third kind is also projected the same way as the four-vector of first kind forming its supplement.

Now, we prove to ourselves, that the projection- and component-formulas developed here, are identical with the transformation formulas of relativity theory. For this sake we consider, as usual, the special case, that the new reference system of xyzl corresponds in the two middle axes with the original one of x y z l, so that one from the other can obtained by a rotation in the xl-plane (around the yz-plane) Template:Page break (rotation angle φ); Template:Pagenum namely the x-axis may be space-like, thus the l-axis is time-like. Then

Template:Optional style|(10) {cos(xx)=cos(ll)=cosφcos(xl)=cos(lx)=sinφcos(yy)=cos(zz)=1cos(yx)==0

If we apply the component formula (7) to the coordinate vector (x y z l), then we obtain

Template:Optional style|(10a) {x=xcosφ+lsinφ,y=yl=xsinφ+lcosφ,z=z

Now cosφ>1, since by the remark on p. 760 φ=x1/x12+l12 and l12<0, thus φ, sinφ and tgφ are purely imaginary, and |tgφ|<1. Thus we have, when β denotes a real fraction:

Template:Optional style|(10b) tgφ=iβ, cosφ=11β2, sinφ=iβ1β2

Consequently the equations (10a) become identical with the Template:Sc transformation and show, that v=βc means the velocity, by which (in space of xyz) the new system of axis is uniformly displaced in the x-direction against the original one.

At the same time equation (8) shows, as to how the six-vector of the field is transformed (or "projected"). We have by (8) and (10):

Template:C/sfxy=fxycosφfylsinφ,fzx=fzxcosφ+fzlsinφ,fyz=fyz,fxl=fxl,fyl=fylcosφ+fxysinφ,fzl=fzlcosφfzxsinφ,Template:C/e

or because of (10b) and (2):

Template:Optional style|(10c) {x=x,y=y+β𝔈z1β2,z=z+β𝔈y1β2,𝔈x=𝔈x,𝔈y=𝔈y+βz1β2,𝔈z=𝔈z+βy1β2.

By that also these well-known transformation formulas of the field appear to be the immediate consequence of Template:Pagenum our geometrical interpretation of the six-vector and its component formation derived from the projection of four-dimensional lines. About Template:Sc's equation, from which these formulas were derived by Template:Sc and Template:Sc, we didn't speak at all (except in so far, as we have denoted f as six-vector by referring to those equations). Generally, according to Template:Sc and Template:Sc the relativity principle is to be seen as a superior principle above electrodynamics.

§ 3. Product of four- and six-vectors.

A. Scalar products.

Two four-vectors P and Φ will be scalar-multiplied by the rule

Template:Center

E.g., if P means the four-density, Φ the four-potential (combination of scalar and vector potential, see later), then (PΦ) becomes the "electro-kinetic potential" of Template:Sc. The square of the length of a four-vector is equal to the scalar product by itself, e.g. |P|2=(PP) see (1a).

In case of arbitrary motion of P and Ψ, the formula taken from ordinary vector calculus

Template:Center

defines the direction cosine of two arbitrary directions; it was used in this sense already at the beginning of § 2, footnote. The definition of perpendicularity from the beginning of § 1 is a special case of this.

Also two six-vectors f and F can be scalar-multiplied by the rule

Template:Center

Into this rule belong the two invariants of the six-vector f, see (5a, b)

Template:Center

B. Vector products.

Two four-vectors u, v, will be vector-multiplied by a rule, which is analogous to the ordinary three-dimensional vector Template:Pagenum product. Like in that place a vector of second case emerges from two vectors of second kind, here a six-vector results from two four-vectors, namely a special six-vector φ (simple surface section) with the invariant (φφ*)=0. The multiplication rule already used in § 1 reads

Template:Center

φ means the parallelogram formed by u and v in terms of size and location.

We can also vector-unify a four-vector and a six-vector, namely to a four-vector of third kind. If the six-vector is in particular a simple surface section, then the vector of third kinds means the parallelepiped directly formed by the four-vector and the supplement of that surface-section. In the general case it will be constructed as the geometric sum of two such parallelepipeds, yet it can always be replaced (see the end of §1) by a four-vector of first kind. This state of facts is the same as in ordinary vector calculus, where a vector of first kind emerges by vector-multiplication of a vector of first and second kind. The formal scheme of that multiplication is taken by us from this three-dimensional case.

𝔄=(𝔄x𝔄y𝔄z) be the vector of first kind and 𝔅=(𝔅yz,𝔅zx,𝔅xy) the vector of second kind. From the latter we form three special vectors of first kind, namely

Template:C/s𝔅x=(𝔅xx,𝔅xy,𝔅xz),𝔅y=(𝔅yx,𝔅yy,𝔅yz),𝔅ik=𝔅ki, 𝔅ii=0,𝔅z=(𝔅zx,𝔅zy,𝔅zz),Template:C/e

in general 𝔅j, from which all are perpendicular to the j-axis because of 𝔅jj=0. Like in §1 equation (6a), they can also be called the components of the surface section 𝔅 into the j-axis, and originally they denote the surface sections parallel to this axis. As j-component of the vector product of 𝔄 and 𝔅, we now have to define the scalar product of the two vectors of first kind 𝔄 and 𝔅j, namely

Template:Center

Template:Pagenum This is in agreement with the ordinary rule of the vector product and gives e.g. for j=x:

Template:Center

By that we define in the four-dimensional case the product (PF) from an arbitrary four-vector P and a six-vector f by its four components for j = x, y, z, l, by means of formula:

Template:Optional style|(11) (Pfj)=Pxfjx+Pyfjy+Pzfjz+Plfjl

Any of this components it thus given by scalar-multiplication of the four-vector P with the four-vector fj (perpendicular to the j-axis, defined in (6a), and derived from f). In the next paragraph we consider the electrodynamic force 𝔉 as an important example of this. In this place, at first the already mentioned geometric meaning of this vector of third kind may be derived, in the special case, where f=φ represents a simple surface-section.

We replace φ as a parallelogram by two four-vectors u, v and go over to the dual surface section φ* consisting of two four-vectors u*, v* parallel to u, v. Then the components of (Pφ) prove to be equal to the four three-row sub-determinants Dx,Dy,Dz,Dl of the scheme

Template:Optional style|(11a) |PxPyPzPlux*uy*uz*ul*vx*vy*vz*vl*|

When neglecting the first colon it is namely given:

Template:C/sDx=Py(uz*vl*ul*vz*)+Pz(ul*vy*uy*vl*)+Pl(uy*vz*uz*vy*)=Pyφzl*+Pzφly*+Plφyz*Template:C/e

For that we can write by (5a):

Template:Center

which is in agreement with our definition (11) of Pφxl.

Dx means by the known formulas of space geometry, a parallelepiped situated in the yzl-space formed by the vectors (Py,Pz,Pl), (uy*, uz*, ul*), (vy*, vz*, vl*), i.e. by the projections of the four-vectors P, u*, v* upon the yzl-space; Dx is at the same time also the projection of the parallelepiped (formed by these four-vectors, situated in four-dimensional space) Template:Pagenum upon the coordinate space of yzl. By the definition of the end of §1, Dx, Dy, Dz, Dl indeed represent the components of a vector of third kind. By denoting this components with Dx, instead of the more extensive Dyzl, we already executed the passage from the space section to its supplement (the line perpendicular to the space section). That the quantity (Dx, Dy, Dz, Dl) considered as a vector of first kind, is perpendicular to the parallelepiped (formed by the vectors P, u*, v*), can directly be seen. We only have to supplement the above 3 × 4-row determinant scheme by addition of the four components of P or u* or v* to a vanishing four-row determinant, and obtain:

Template:Optional style| (PD)=(u*D)=(v*D)=0

If we are dealing with the product of P with a general six-vector f=ϱφ+ϱ*φ*, then we shall consider in the same way the two parallelepipeds D and D*, which are formed by the vectors P, u*, v* and P, u, v, respectively. Then (Pf)=ϱD+ϱ*D* represents the geometric sum of these two space sections, which we can imagine as to be constructed by the supplements (perpendicular to that space section), as the resultant of the two lines ϱD and ϱ*D*.

C. Tensor products.

A vector of second kind gives, completely multiplied by itself, a tensor. We explain the way of constitution at first in the three-dimensional case.

If 𝔅 is a three-dimensional vector of second kind with components 𝔅yz, 𝔅zx, 𝔅xy, and 𝔅j, 𝔅h are whatever two of the three special vectors derived from it (see p. 765), then it is in general for j, h = x, y, z

Template:Center

From that it directly follows

Template:Center

The thus defined three-dimensional tensor quantity has six components, and differs essentially from our four-dimensional six-vector, for which fik=fki, fii=0 is given. Template:Pagenum The six components of T can be most conveniently, geometrically interpreted as coefficients of the equation of a surface of second degree.

In the four-dimensional case we accordingly start with a six-vector f, and define a tensor (ff) by all possible scalar products of the special four-vectors fj,fh (derived by equation (6a) from f) for h, j = x, y, z, l. From that, at first 16 tensor components follow

Template:Center

which, however, are reduced to 10 because of (fjfh)=(fhfj). The calculation gives, if f=(,i𝔈) especially means the field vector, for j = h:

Template:C/s(fxfx)=y2+z2𝔈x2(fyfy)=z2+x2𝔈y2(fzfz)=x2+y2𝔈z2(flfl)=𝔈x2𝔈y2𝔈z2Template:C/e

and for jh

Template:C/s(fxfy)=(fyfx)=(xy+𝔈x𝔈y)(fyfz)=(fzfy)=(yz+𝔈y𝔈u)(fzfx)=(fxfz)=(zx+𝔈z𝔈x)(fxfl)=(flfx)=i(𝔈yz𝔈zy)(fyfl)=(flfy)=i(𝔈zx𝔈xz)(fzfl)=(flfz)=i(𝔈xy𝔈yx)Template:C/e

Besides we also consider the tensor (f*f*) which is formed by the dual six-vector f*=(i𝔈,), whose components emerge form the preceding by permutation of by i𝔈 and 𝔈 by +i, e.g.

Template:C/s(fx*fx*)=𝔈y2𝔈z2+x2(fx*fy*)=(fx*fy*)=+(𝔈x𝔈y+𝔈x𝔈y)(fx*fl*)=(fl*fx*)=i(y𝔈zz𝔈y)Template:C/e

and eventually go over to the tensor[11] (composed by both):

Template:Optional style|(12) T=12((ff)(f*f*))

Template:Pagenum with the components

Template:Optional style|(12a) {Txx=12(x2+y2+z2𝔈x2+𝔈y2+𝔈z2)Tyy=12(y2+z2+x2𝔈y2+𝔈z2+𝔈x2)Tzz=12(z2+x2+y2𝔈z2+𝔈x2+𝔈y2)Tll=12(x2y2z2𝔈x2𝔈y2𝔈z2)
Template:Optional style|(12b) {Txy=Tyx=xy+𝔈x𝔈yTyz=Tzx=yz+𝔈y𝔈zTzx=Txz=zx+𝔈z𝔈x
Template:Optional style|(12c) {Txl=Tlx=i(𝔈yz𝔈zy)Tyl=Tly=i(𝔈zx𝔈xz)Tzl=Tlz=i(𝔈xy𝔈yx)

As it can be seen, the components Tik are identical with the above components (fifk); the components (Tii) that are different from (fifi), satisfy the relation Txx+Tyy+Tzz+Tll=0. The three components (12c) are, after multiplication by -ic, equal to the components of Template:Sc's energy flux, the first three components (12a) and the three components (12b) represent Template:Sc's stresses, while the fourth component (12a) is the negatively taken energy density.

Also now, it is most convenient to interpret the 16 tensor components as coefficients in the equation of a three-dimensional "space of second degree" located in four-dimensional space.

The product formations discussed here, do all correspond to Template:Sc's matrix scheme. Without doubt this offers (in mathematical view) the advantage of greatest generality and simplicity. On the contrary, in the preceding it was tried to emphasize the special geometric meaning of the single products by ordinary vector calculus.

All these products are, by their geometric meaning, independent of the choice of reference frame, namely in the following way: The scalar products (considered in A) are pure invariants of the Lorentz transformation. The vector products (considered in B) were transformed covariantly Template:Pagenum as a six- or a four-vector; their components in the new reference frame are thus only connected to those of the old reference frame by the formulas of §2.

For the vector product (Pf) we can calculate this directly from equation (9). Namely, if we scalar-multiply the four-vectors located at the left and right sides with the four-vector P, then

Template:Center

i.e. that formula which, by §2 equation (7), expresses the x-component of a four-vector by the old components. (Pf) is thus a true four-vector and (Pfx) its projection upon the x-direction.

At the same time also the new tensor components are calculated from the old tensor components by the simple scheme of the tensor transformation, whose coefficients are quadratically formed from the direction cosine of the new against the old coordinate axes.

§ 4. The electrodynamic force 𝔉.

The covariant formation (Pf) from the four-density P and the field-vector f (discussed in the previous paragraph under (11)) requires a special interest. Its formal calculation gives:

Template:C/s𝔉x=(Pfx)=ϱ(𝔳xcfxx+𝔳ycfxy+𝔳zcfxz+ifxl)=ϱ(𝔳ycz𝔳zcy+𝔈x)Template:C/e

and similarly for the y,z-components; in contrast for the l-component:

Template:C/s𝔉l=(Pfl)=ϱ(𝔳xcflx+𝔳ycfly+𝔳zcflz+ifll)=iϱc(𝔳x𝔈x+𝔳y𝔈y+𝔳z𝔈z)Template:C/e

In the expression of the ordinary three-dimensional vector calculation we thus find for j = x, y, z or l:

Template:Optional style|(13) 𝔉j=ϱ(𝔈+1c[𝔳]), 𝔉l=iϱc(𝔈𝔳)

Template:Pagenum 𝔉j thus represents the j-component of the electrodynamic force (acting on the unit volume), and 𝔉l the electrical work (multiplied with i/c) on the moving charge per unit volume and time. The latter shall be denoted energetic, the first dynamic components.

If we decompose f=ϱφ+ϱ*φ* into two simple surface sections, then F=ϱD+ϱ*D*, as discussed in the previous paragraph, is decomposed into two simple space sections, whose suplements are perpendicular to the four-density P. Since the latter (see the beginning of §1) has the direction of the worldline element of the charge (its components were proportional to dx, dy, dz, dl), then also the force 𝔉 is perpendicular to the worldline of its point of contact. The corresponding requirement is to be extended to an arbitrary force, as long as we exclude dissipative processes like the generation of Template:Scan heat etc.

The fourth energetic component of 𝔉 has in no way only a formal meaning, as it might appear at first. Rather (when changing frames) it is located directly at the real dynamic components. If it is (e.g. like in equation (10a)) a reference frame xyzl emerging from the original one by a rotation in the xl-plane, then

Template:Optional style|(14) 𝔉x=𝔉xcosφ+𝔉lsinφ and 𝔉x=𝔉xcosφ𝔉lsinφ

The energetic supplementary terms 𝔉lsinφ (or 𝔉lsinφ) are small and of order 1/c2, since 𝔉l (or 𝔉l) as well as sinφ contain the factor 1/c. Nevertheless this supplementary terms are indispensable for carrying out the principle of relativity, as it will be shown later by the example of the simple Template:Sc law.

If the dynamic components 𝔉x and 𝔉'x for any force 𝔉 in two relatively moving systems are known, e.g. by sufficiently exact measurements, then we can (by comparison according to the preceding formulas) specify the energetic components 𝔉l and 𝔉l. Conversely, we can also say: A force (e.g. also gravitation) is only then physically known, i.e. in the sense of the relativity principle specifiable for an arbitrary reference frame, when also its energetic component 𝔉l is known.

Template:Pagenum If we especially choose the primed system as "co-moving", so that the point of contact of force 𝔉 is at rest in this reference frame, then dx=dy=dz=0 and also 𝔉l=0, since the force on the worldline of its point of contact is perpendicular. Only in this case the energetic component vanishes out of the formulas for the determination of 𝔉x𝔉l, thus having the simple form (x-direction = direction of relative motion):

Template:Optional style|(14a) 𝔉x=cosφ𝔉x, 𝔉y=𝔉y, 𝔉z=𝔉z, 𝔉l=sinφ𝔉x

These are the typical transformation formulas for the passage from the "co-moving" to the "resting system", valid for any vector of first (or third) kind, especially for the electrodynamic force related to unit volume. We assume, that any "specific" force, i.e. calculated per unit volume, also behaves as a vector of first kind.

It is different for the entire force 𝔎 or 𝔎 in the stationary or co-moving system, of which the first, due to its relation to an arbitrary coordinate system, has actually no place in relativity theory. By the specific force is can be thus defined:

Template:Optional style|(15) 𝔎=𝔉dS, 𝔎=𝔉dS

if dS = dy dy dz is the three-dimensional volume element in the stationary, ds=dydydz is that in the co-moving system. The latter differs from the first, depending on the mutual location of both reference frames. By addition of dS, dS, the vector character of the right-hand side will thus be disturbed. The entire force doesn't simply behave as a vector of first kind.

From Fig. 1 it follows

Template:Optional style|(15a) dS=dScosφ=dS1β2

If, what we don't presuppose, the primed system with its l-axis is orientated into the world-line of the contact point of dS/dS=1β2, then the xyz-space and its volume element dS is perpendicular to this worldline, and dS is the perpendicular projection of dS in this space. The projection angle is equal to the angle of the two Template:Pagenum considered perpendiculars l and l, thus equal to φ. The circumstance, that φ is imaginary, has the consequence that the projection dS is greater than the projected element dS, opposing to our image in Fig. 1, that operates with a real φ. The ratio dS/dS=1β2 is the known Template:Sc contraction.

From (15), (15a) and (14a) it directly follows:

Template:C/s𝔎x=𝔉xdS=cosφ𝔉xdS=𝔉xdS=𝔎x𝔎y=𝔉ydS=𝔉ydS=1cosφ𝔉ydS1cosφ𝔎y etc.Template:C/e

and summarizing:

Template:Optional style|(14b) 𝔎x=𝔎x, 𝔎y=1cosφ𝔎y, 𝔎z=1cosφ𝔎z, 𝔎l=sinφcosφ𝔎x

These formulas (14b), when compared with (14a), prove the preceding explanation, that 𝔎 doesn't behave like a vector. Namely, if we denote, in formal analogy to our vector notation, by |𝔎|2 and |𝔎|2 the square sum of the unprimed and primed components (where 𝔎'l is zero according to definition equation (15)), then because of (14b), |𝔎| is not becoming equal to |𝔎|, but

Template:Center

while from (14a), as it must be the case for a four-vector, it follows of course |𝔉|=|𝔉|.

We also especially consider the case of a normal pressure acting in the primed system, that acts upon a surface σ at rest in the primed system, which gives rise to pressure p that coincides with the perpendicular n and which is measured per unit area. We can interpret this case, either as a distribution of infinitely small total-forces 𝔎 or infinitely great specific forces 𝔉, according to one or the other of the following two schemes

Template:Center

The first standpoint was taken by Template:Sc[12]. The latter Template:Pagenum should be preferable because of the vivid vectorial behavior of 𝔉. From this standpoint we think of 𝔉 as being distributed over a layer of thickness dn which is adjacent to surface σ, similarly as we interpret the surface charge as the limit of a spatial distribution.

If we erect upon this three-dimensional layer σdn (located in xyz-space) a four-dimensional cylinder parallel to the l-axis, then this encloses the entirety of all worldlines of the layer points. The force 𝔉, that should be perpendicular to σ in xyz-space, also coincides with the perpendicular of that four-dimensional cylinder, since it is also perpendicular to the worldline at its contact point. If we lay any inclined intersection as xyz-space through the cylinder, then it intersects (from the cylinder) the image σdn of the layer that appears to an observer in the xyzl-system. In Fig. 2, the (by p. 753, Note 2 completely specified) normal plane (laid through the considered point of σ) to the tangential plane of σ, is chosen as drawing plane. This not only contains the normal n of σ in xyz-space, but also the cylinder-normal n as well as the l-axis, since all three directions are perpendicular to σ. In the n-direction falls the direction of vector 𝔉; by direction and size we have 𝔉=𝔉n. For the reference frame xyzl it is decomposed in two components, which both can be seen in the figure: the energetic component 𝔉l and the dynamic 𝔉n.

On the other hand, the component of 𝔉 perpendicular to the drawing plane in the tangential plane of σ doesn't exist, i.e. also in the unprimed system (after separation of its energetic component) the force 𝔉 and its corresponding pressure p normal to its contact-area σ. Regarding the magnitude of p, it directly follows from the figure: If α is the angle between n and n (where α will lie between 0 and φ and will assume these most extreme values, when n lies especially perpendicular or parallel to the direction of relative motion of Template:Pagenum the primed and unprimed system). Then it is obviously given

Template:Center

thus

Template:Center

Because the unprimed observer defines its pressure p by the dynamic component 𝔉n of 𝔉, and the layer-thickness dn (seen by him) by 𝔉ndn in the same way as the primed by 𝔉ndn. The equality of these two pressures, fundamental for Template:Sc's thermodynamics of moving systems, thus follows here again as purely geometric statement concerning the behavior of four-dimensional vectors and its projection.

When operating with the total-force 𝔎, such a direct geometric reasoning can hardly be carried out, because 𝔎 doesn't behave like a vector.

We again come back to the electrodynamic meaning of 𝔉=(Pf). While this was, at the beginning of this paragraph, explained by the formal agreement with Lorentz's force-definition, equation (13), it should be directly derived from the relativity principle now.

We consider a unit volume at rest in the primed system, in which the electric charge is distributed by the density ϱ. Since the electric field-strength is defined as the moving force upon the charge 1, then the force ϱ𝔈 is acting on our unit volume. By this the three dynamic components of the specific force 𝔉 are specified; the energetic component is to be put equal to zero, since the contact-point of 𝔉 is at rest in the primed system, and its worldline is perpendicular to 𝔉. The four components of 𝔉 thus read:

Template:Center

On the other hand, the vector (Pf) for the xyzl-axis gives by equation (11), since Px=Py=Pz=0, Pl=iϱ (see equation (1)) and fxl=i𝔈x etc. (see equation (2));

Template:C/s(Pfx)=Plfxl=ϱ𝔈x,(Pfy)=Plfyl=ϱ𝔈y,(Pfz)=Plfzl=ϱ𝔈z,(Pfl)=Plfll=0.Template:C/e

Template:Pagenum This vector thus agrees for the primed system in all four components with the specific force 𝔉. Under the single assumption, that the force behaves like a four-vector, 𝔉 must be given for any reference system by the four-vector (Pf). In relativity theory, Template:Sc's force-definition directly follows from the definition of the electric field-strength by the single assumption, that the specific force behaves as a four-vector perpendicular to the worldline of the contact-point. In the original absolute-theory of Template:Sc, in contrast, this force-definition had to be introduced as a new fact of experience besides the field-equations. It seems to me a particularly nice achievement of relativity theory, that it makes this fact of experience unnecessary as such, by deriving it from much more general principles.

Following this, a remark of a more critical nature is appended. The relativity principle should be valid only for stationary or quasi-stationary motions of both reference systems against each other. A derivation of Template:Sc's force-concept from relativity theory, consequently guarantees it only for the realm of these motions. While in relativity theory this concept was postulated as generally valid, we have (when we consider the relativity principle as its true source) to let it open at all, whether in the expression of 𝔉 for a real accelerated motion there might occur supplementary terms that might depend on the acceleration of first and higher order. As it seems, all realizable motions fall under the class of quasi-stationary motions, thus such supplementary terms would hardly have a practical interest. Yet in a principle view it seems useful to me, to consider its possibility.

Template:Center


  1. The two papers of Template:Sc are: The Fundamental Equations for Electromagnetic Processes in Moving Bodies, Göttinger Nachr. 1908. p. 1 or Mathematische Ann. 68, p. 472. 1910, also separately to be published soon by B.G. Teubner; Space and Time, Physik. Zeitschr. 10. p. 104. 1909, also separately by B.G. Teubner, with a portrait of Template:Sc.
  2. Corresponding to the number of coordinate axes, -planes and -spaces calculated by the number of combinations of four elements to each 1, 2 or 3, namely: Template:Center
  3. I think it is allowed to arrange the image and the expression, as if l is real and the perpendicularity is euclidean. Of course, in reality an imaginary l and a non-euclidean perpendicularity is meant, which according to Template:Sc can be constructed by a hyperbola or a hyperboloid. It is possible, but not recommended, to reinterpret all that follows accordingly in a non-euclidean way.
  4. We can imagine that the location of the plane-section is given by two four-vectors that emerge from two arbitrary points of the plane, and which are arbitrarily located within the plane. Their location is at first specified by three magnitudes, e.g. the relation of its four components. However, any of the two four-vectors can arbitrarily rotated within the plane. By that we can give to each of their three specification-sections an arbitrary value. Thus 2(3-1) = 4 specification-sections for the location of the plane remain only.
  5. We think of two lines drawn in the first plane, and we construct the entirety of the lines that are perpendicular to them, where each line fills a linear space. These two spaces intersect each other in a plane containing all common perpendiculars of both starting lines, and which is thus the normal plane of the first plane. The two planes perpendicular to each other, as well as any two planes of general location, have only one point in common. Because any plane in a space of four dimensions is given by two linear equations, and four linear equations uniquely specify a point.
  6. Permutation of xy by yx means a projection upon the back of the xy-plane and therefore, as with three dimensions, a sign reversal.
  7. To this analogy also Template:Sc refers: Space and Time § V.
  8. Its square is at first given by the formula of ordinary planimetry: Template:Center compare § 3A, from which equation (3b) directly follows by a suitable combination of the components.
  9. This at first somehow complicated term appears indispensable to me for the geometrical interpretation of the subsequent differential vector-operations.
  10. If x1y1z1l1 are the coordinates of an arbitrary point upon the axis of x in the system of x y z l and if we put R12=x12+y12+z12+l12, then by the general definition of § 3 A: Template:Center The axis is space-like if R12>0, that is x12+y12+z12>|l1|2, and time-like if R12<0. In the first case the first three cosines are real and the last is imaginary, in the second case the first three are imaginary and the last is real.
  11. According to a friendly suggestion of my colleague M. Template:Sc.
  12. A. Einstein, Jahrbuch f. Radioakt. 4. 448.

Template:Translation-license