Translation:Supplementum theoriae combinationis observationum erroribus minimis obnoxiae

From testwiki
Revision as of 00:37, 12 March 2025 by imported>ShakespeareFan00 (Dark mode compatibility)
(diff) ← Older revision | Latest revision (diff) | Newer revision β†’ (diff)
Jump to navigation Jump to search

Template:Translation header

1.

In the previous Memoir, we assumed that the quantities to be determined, with the help of imperfect observations, depended on certain unknown elements, in terms of which they could be expressed: the problem then consisted in deducing from the observations, as accurately as possible, the values of these elements.

In most cases, this is indeed how the question arises. But sometimes the problem is presented differently, and at first glance it is even doubtful whether or not it can be reduced to a problem of the required form. Indeed, frequently the quantities under observation are not given in the form of a function of certain elements, and they seem to only be reducible to such a form by difficult or even ambiguous operations; whereas the nature of the problem does provide certain conditions that the observed values must rigorously satisfy.

Upon closer inspection, however, it can be seen that this case does not differ essentially from the previous one, and that it can be reduced to it. Indeed, if π denotes the number of observed quantities, and σ denotes the number of constraint equations, then nothing prevents us from choosing, among the former, πσ quantities, and considering them as our only unknowns, the others being regarded as functions of these, defined by the constraint equations. By this artifice, we can return to the case of the previous Memoir.

Nevertheless, although this approach often leads to the result in a quite convenient manner, one cannot deny that it is less natural, and it is therefore desirable to treat the problem in another form, which admits a very elegant solution. Moreover, since this new solution leads to faster calculations than the previous one whenever σ is less than 12π, or equivalently, whenever the multitude of elements that we have denoted by ρ in the previous commentary, is greater than π2, it is preferable to use the new solution, even if it would be easy, given the nature of the problem, to unambiguously eliminate the constraint equations.

2.

Let v, v, Template:Nobr denote the π quantities whose values are provided by observation. Suppose an unknown depends on these and is expressed by a known function u; let l, l, Template:Nobr be the values of the differential quotients Template:C corresponding to the true values of v, v, Template:Nobr. If one substituted the true values for v, v, Template:Nobr in the function u, one would also obtain the true value of u; but if the observations are affected by errors e, e, Template:Nobr, there resulting total error for u will be represented by Template:C provided (when u is not linear) that one can neglect the squares and products of errors e, e, e, etc., as we will always assume.

Although the magnitude of errors e, e, Template:Nobr, is uncertain, the uncertainty attached to the value found for u can generally be measured by the mean error to be feared in the adopted determination. According to the principles developed in the previous commentary, this average error is Template:C where m, m, Template:Nobr are the average errors of the various observations. If all observations are affected by the same degree of uncertainty, this expression becomes Template:C and it is clear that in this calculation, it is permissible to replace l, l, Template:Nobr, with the values taken by the differential coefficients Template:C when one replaces v, v, Template:Nobr with their observed values.

3.

When the quantities v, v, Template:Nobr are independent, the unknown can only be determined in one way, and the uncertainty attached to the result cannot be avoided or reduced. In this case, the observations provide a value of the unknown which is not arbitrary.

It is quite different when the quantities v, v, Template:Nobr are subject to certain relations, which we will suppose are expressed by σ conditional equations Template:C where X, Y, Template:Nobr are given functions of the unknowns v, v, Template:Nobr In this case, there are infinitely many ways to represent our unknown as a combination of the quantities v, v, Template:Nobr since it is clear that the function u can be replaced by any other function U such that the difference Uu vanishes identically upon setting Template:C

If the observations were rigorously exact, this substitution would not change the result at all; but, due to inevitable errors, each form adopted for u will correspond to a different result, and the error committed, instead of being Template:C will be Template:C where L, L', Template:Nobr denote the differential quotients Template:C Although it is impossible to assign the value of the various errors, we can, however, compare the mean errors to be feared in the various combinations. The most advantageous combination will be the one that gives the minimum value to the mean error. This error being Template:C we must seek to make the sum Template:C as small as possible.

4.

The infinite number of functions U, by which u can be replaced, will differ from each other in our investigations only by the values they provide for L, L', Template:Nobr: therefore, we must first seek the relationships that exist between the systems of values that these coefficients can take. Let Template:C denote the values taken by the coefficients Template:C when we substitute the true values of v, v, Template:Nobr It is clear that if increments dv, dv, Template:Nobr are given to v, v, Template:Nobr which do not change X, Y, Template:Nobr and thus leave each of them with a value of zero, these increments, which satisfy the equations Template:C will not change the value of Uu, and consequently, we will have Template:C We easily conclude that L, L', Template:Nobr, must have the form Template:C where x, y, Template:Nobr are determined multipliers. Conversely, it is clear that for all values of x, y, Template:Nobr, we can form a function U, for which the values L, L, Template:Nobr, will precisely be those provided by these equations, and this function can, according to the above, be substituted for u. The simplest form we can give it is Template:C but the most general is Template:C where u denotes a function of v, v, Template:Nobr which vanishes identically when X, Y, Template:Nobr, are zero, and whose value, in the current case, will be maximum or minimum. However, this makes no difference for our purposes.

5.

It is now easy to assign values to x, y, Template:Nobr such that the sum Template:C achieves its minimum value. It is clear that to achieve this goal, the knowledge of the absolute mean errors m, m, Template:Nobr is not necessary. Rather, it suffices to know only their ratios. Indeed, instead of these quantities, let us introduce the weights of the observations, p, p, Template:Nobr, i.e. numbers inversely proportional to the squares m2, m2, Template:Nobr The quantities x, y, Template:Nobr must then be determined in such a way that the polynomial Template:C achieves its minimum value. Suppose that x0, y0, Template:Nobr, are the determined values of x, y, Template:Nobr that correspond to this minimum.

We now introduce the following notation: Template:C The minimality condition clearly requires that we have

0=(aa)x0+(ab)y0+(ac)z0++(al),0=(ba)x0+(bb)y0+(bc)z0++(bl),0=(ca)x0+(cb)y0+(cc)z0++(cl),} Template:Optional style|(1)

After these equations have provided the values of x0, y0, Template:Nobr, we set

ax0+by0+cz0++l=L,ax0+by0+cz0++l=L,ax0+by0+cz0++l=L,} Template:Optional style|(2)

and the most suitable function to determine our unknown, corresponding to the minimum mean error, will be one whose differential coefficients, for the considered values of the variables, are equal to L, L, Template:Nobr The weight of the determination thus obtained will be

P=1L2p+L2p+L2p+, Template:Optional style|(3)

meaning that 1P will precisely be the value taken by the polynomial considered above, for the values of the variables x, y, Template:Nobr which satisfy the equations (1).

6.

In the previous article, we showed how to determine the function U that provides the most suitable determination of the unknown u. Let us now examine what value results from this. Let's denote it by K: it will be obtained by substituting, in U, the observed values of the quantities v, v, Template:Nobr Let k be the value that u takes when the same substitutions are made, and let ϰ be the true value of the unknown, obtained by substituting the true values of v, v, Template:Nobr, either in u, or in U. We have Template:C and, consequently, Template:C Replacing Ll, Ll, Template:Nobr with their values provided by (2), and setting

{ae+ae+ae+=𝔄,be+be+be+=𝔅,ce+ce+ce+=β„­, Template:Optional style|(4)

we will have

K=k+𝔄x0+𝔅y0+β„­z0+. Template:Optional style|(5)

It is not possible to calculate 𝔄, 𝔅, Template:Nobr by means of the formulas (4), as the errors e, e, Template:Nobr appearing in them have unknown values, but it is easy to see that the quantities 𝔄, 𝔅, Template:Nobr, are none other than the values of X, Y, Template:Nobr, corresponding to the observed values of v, v, Template:Nobr In this way, the systems of equations (1), (3), (5) exhibit the complete solution to our problem. In fact, it is clear that the remark made at the end of art. 2, regarding the quantities l, l, Template:Nobr, can be applied to the calculation of a, a, Template:Nobr b, b, Template:Nobr, or in other words, the true values of v, v, Template:Nobr can be replaced with the observed values.

7.

In place of formula (3), which represents the weight of the most probable determination, several other expressions can be given, which are worth developing.

First, observe that if the equations (2) are multiplied by ap, ap, Template:Nobr resp. and then added, the result is Template:C The left hand side is =0, and therefore, denoting the right hand side by (aL), we have Template:C and similarly Template:C Next, multiplying the equations (2) by Lp, Lp, Template:Nobr and adding, we find that Template:C and thus we obtain a second expression for the weight, Template:C

If, finally, if we multiply the same equations (2) by lp, lp, Template:Nobr and add, we obtain a third expression for the weight, Template:C where, with the same notation as before, Template:C One can easily derive from this a fourth expression for the weight, Template:C

8.

The general solution we have just outlined is particularly adapted to the case where there is only one unknown to determine. However, when seeking the most plausible values for several unknowns, all of which depend on the same observations, or when it is unknown which unknowns should be derived from the observations, it is necessary to proceed differently, as we will now explain.

Let us consider the quantities x, y, Template:Nobr as indeterminate, and set

(aa)x+(ab)y+(ac)z+=ξ,(ab)x+(bb)y+(bc)z+=η,(ac)x+(bc)y+(cc)z+=ζ,} Template:Optional style|(6)

Suppose that we deduce, by elimination, that

(αα)ξ+(αβ)η+(αγ)ζ+=x,(βα)ξ+(ββ)η+(βγ)ζ+=y,(γα)ξ+(γβ)η+(γγ)ζ+=z,} Template:Optional style|(7)

Let us first of all observe that the symmetrically placed coefficients are necessarily equal, i.e. Template:C This follows from the theory of elimination, but we will also give a direct demonstration below.

Thus we have

x0=(αα)(al)(αβ)(bl)(αγ)(cl),y0=(βα)(al)(ββ)(bl)(βγ)(cl),z0=(γα)(al)(γβ)(bl)(γγ)(cl),} Template:Optional style|(8)

and so, by setting

(αα)𝔄+(αβ)𝔅+(αγ)β„­+=A,(βα)𝔄+(ββ)𝔅+(βγ)β„­+=B,(γα)𝔄+(γβ)𝔅+(γγ)β„­+=C,} Template:Optional style|(9)

we obtain Template:C and if we further set

aA+bB+cC+=pε,aA+bB+cC+=pε,aA+bB+cC+=pε,} Template:Optional style|(10)

it follows that

K=klεlεlε. Template:Optional style|(11)

9.

Comparing equations (7) and (9) shows that the auxiliary quantities A, B, Template:Nobr, are the values taken by the indeterminates x, y, Template:Nobr, when we set ξ=𝔄, η=𝔅, Template:Nobr and thus we have

(aa)A+(ab)B+(ac)C+=𝔄,(ba)A+(bb)B+(bc)C+=𝔅,(ca)A+(cb)B+(cc)C+=β„­,} Template:Optional style|(12)

Multiplying equations (10) by ap, ap, Template:Nobr respectively, and adding, we obtain

Template:Optional style|Template:Nobr


𝔄=aε+aε+aε+,𝔅=bε+bε+bε+,β„­=cε+cε+cε+,} Template:Optional style|(13)

Now, since 𝔄 is the value taken by X after the observed values are substituted for v, v, Template:Nobr, it is easy to see that if we apply the corrections ε, ε, Template:Nobr, to each of these quantities, the value of X becomes 0, and similarly Y, Template:Nobr, vanish under this assumption. In a similar way, equation (11) also shows that K is the value taken by u after the same substitutions have been made.

We will refer to the application of the corrections ε, ε, Template:Nobr as compensation of observations. This evidently leads us to the following very important consequence. Namely, the observations, compensated in the way we have describde, exactly satisfy all of the conditional equations, and cause any function of the observed quantities to take the value resulting from the most suitable combination of the unaltered observations. Since the conditional equations are too few to deduce the exact values of the errors, we will have found, by the above, the most plausible errors. We will henceforth refer to the quantities ε, ε, Template:Nobr by this name.

10.

Since the number of observations is greater than the number of conditional equations, an infinite number of systems can be found, other than the system of the most plausible corrections, which exactly satisfy the condition equations.

It is important to examine the relationships that link these various systems. Let E, E, Template:Nobr, be such a system of corrections, different from the most plausible system. Then we have Template:C Multiplying these equations by A, B, Template:Nobr, and adding, we obtain, with the help of the equations (10), Template:C But combining equations (13) in the same manner gives

pε2+pε2+pε2+=A𝔄+B𝔅+Cβ„­+; Template:Optional style|(14)

From the combination of these results, one easily deduces Template:C Therefore, the sum Template:C is necessarily greater than the sum Template:C which can be stated as follows:

Template:Sc The sum of squares of the corrections by which the observations can be reconciled with the condition equations, being multiplied by the weights of the corresponding observations respectively, gives a minimum, if the most plausible corrections are adopted.

This is the very principle of least squares, from which equations (12) and (10) can be easily and immediately derived. The minimum value of the sum, which we will now denote by S, is equal, by equation (14), to Template:C

11.

The determination of the most plausible errors, being independent of l, l, Template:Nobr, clearly provides the most convenient preparation, regardless of how the observations are to be subsequently used. Furthermore, it is easy to see that, to achieve this goal, it is not necessary to perform elimination indefinitely, i.e. to calculate (αα), Template:Nobr, it suffices to deduce from the equations (12), by a definite elimination, the auxiliary quantities A, B, Template:Nobr, which we will call, in what follows, the correlates of the condition equations Template:C and to substitute these quantities into equation (10).

This method leaves nothing to be desired when only the most plausible values of the quantities provided by the observation are required, but it is different when one also seeks the weights of each of the found values. Whichever of the four previous formulas one wants to use, it is essential to know L, L, Template:Nobr, or equivalently, x0, y0, Template:Nobr for this reason, it will be useful to more closely study the elimination that provides these quantities, and to obtain a more convenient method for determining the weights.

12.

The relations between the quantities we are concerned with are notably simplified by considering the indefinite second-degree function Template:C which we will denote by T. This function is obviously equal to

(ax+by+cz+)2p+(ax+by+cz+)2p+ Template:Optional style|(15)

Moreover, we clearly have

T=ξx+ηy+ζz+; Template:Optional style|(16)

and if, with the help of the equations (7), one expresses x, y, Template:Nobr in terms of ξ, η, Template:Nobr, Template:C

The theory developed above provides two sets of determined values for the quantities x, y, Template:Nobr, ξ, η, Template:Nobr

The first one is Template:C whose corresponding value of T is Template:C as can be seen by comparing the third form of the weight P with equation (16), or by directly considering the fourth form.

The second set of values is Template:C and the corresponding value of T is Template:C as is evident from formulas (10) and (15), and also from formulas (14) and (16).

13.

We must now subject the function T to a transformation similar to that which has been indicated in Theoria Motus Corporum Coelestium art. 182, and was further developed in Disquisitione de elementis ellipticis Palladis. To this end, let us set

(bb,1)=(bb)(ab)2(aa),(bc,1)=(bc)(ab)(ac)(aa),(bd,1)=(bd)(ab)(ad)(aa),(cc,2)=(cc)(ac)2(aa)(bc,1)2(bb,1),(cd,2)=(cd)(ac)(ad)(aa)(bc,1)(bd,1)(bb,1),(dd,3)=(dd)(ad)2(aa)(bd,1)2(bb,1)(cd,2)2(cc,2), Template:Optional style|(17)

After also setting[1]

Template:C we will have Template:C where η, ζ, Template:Nobr are derived from ξ, η, ζ, Template:Nobr using the following equations: Template:C

We can then easily derive all the formulas required for our purpose. Namely, to determine the correlates A, B, Template:Nobr, we set

𝔅=𝔅(ab)(aa)𝔄,β„­=β„­(ac)(aa)𝔄(bc,1)(bb,1)𝔅,𝔇=𝔇(ad)(aa)𝔄(bd,1)(bb,1)𝔅(cd,2)(cc,2)β„­,} Template:Optional style|(18)

and then A, B, C, Template:Nobr are obtained from the following formulas, starting with the last one:

(aa)A+(ab)B+(ac)C+(ad)D+=𝔄,(bb,1)B+(bc,1)C+(bd,1)D+=𝔅,(cc,2)C+(cd,2)D+=C,(dd,3)D+=𝔇,} Template:Optional style|(19)

For the sum S, we have the new formula

S=𝔄2(aa)+𝔅2(bb,1)+β„­2(cc,2)+𝔇2(dd,3)+, Template:Optional style|(20)

and finally, the weight P which is to be assigned to the most plausible determination of the quantity u, is given by the formula

1P=(ll)(al)2(aa)(bl,1)2(bb,1)(cl,2)2(cc,2)(dl,3)2(dd,3); Template:Optional style|(21)

In this formula, we have

(bl,1)=(bl)(ab)(al)(aa),(cl,2)=(cl)(ac)(al)(aa)(bc,1)(bl,1)(bb,1),(dl,3)=(dl)(ad)(al)(aa)(bd,1)(bl,1)(bb,1)(cd,2)(cl,2)(cc,2)} Template:Optional style|(22)

Formulas (17)....(21), whose simplicity leaves nothing to be desired, provide the complete solution to our problem.

14.

Having solved our primary problem, we will now address some secondary questions, which will shed more light on the subject.

We will first investigate whether it is possible for the elimination that provides x, y, Template:Nobr, as functions of ξ, η, Template:Nobr, to become impossible in certain cases. This would obviously happen if the functions ξ, η, Template:Nobr, were not independent of each other. Suppose, for a moment, that this is the case, and that one of them can be expressed as a function of the others, such that we have an identity Template:C where α, β, Template:Nobr are determined numbers. Then we will have Template:C and if we set Template:C it follows automatically that Template:C and consequently, Template:C where p, p, Template:Nobr are, by their nature, all positive, this equation requires Template:C

Let us now consider the complete differentials dX, dY, Template:Nobr corresponding to the values of v, v, Template:Nobr, immediately provided by the observations. By our previous results, these differentials Template:C will be related to each other in such a way that when they are multiplied by α, β, Template:Nobr respectively and added, the sum is identically zero, so that among the equations Template:C at least one can be regarded as superfluous, because it will be satisfied as soon as the others are.

Upon closer examination of the question, it is seen that this conclusion is only applicable to values of the variables that differ infinitesimally from those provided by observation. In particular, there are two cases to distinguish: first, the case where one of the equations Template:C is generally and absolutely contained within the others, and can therefore be eliminated; and second, the case where, for the particular values of v, v, Template:Nobr provided by the observations, one of the functions X, Y, Template:Nobr, X for example, achieves its maximum or minimum value, or, more generally, a value where its differential vanishes, while the other equations remain satisfied. However, since we only consider variations of our variables whose squares are negligible, this second case (which in practice will rarely occur) can be assimilated into the first, and one of the constraint equations can be removed as redundant. If the remaining equations are independent in the sense we have indicated, it is certain, according to the above, that the elimination is possible. However, we reserve the right to return to this matter, which deserves to be examined as a theoretical subtlety rather than as a question of practical utility.

15.

In the previous commentary, arts. 37 sqq., we have shown how to approximate the weight of a determination a posteriori. If approximate values of Template:Nobr are provided by equally precise observations, and they are compared with the values resulting from the most plausible assumptions one can make about the Template:Nobr on which they depend, it has been seen that one must add the squares of the obtained differences, divide the sum by πρ, and then the resulting quotient can be regarded as an approximate value of the square of the average error inherent in this kind of observation. If the observations are of unequal precision, the only modification that must be made is that one must multiply the squares of the differences by the respective weights of the corresponding observations, and the mean error obtained in this way relates to observations whose weight is taken to be unity.

In the current case, the sum of the squares we are talking about obviously coincides with the Template:Nobr, and the difference πρ with the Template:Nobr of the condition equations. Consequently, for the average error of observations whose weight is 1, we will have the expression Sσ, so that the determination will be increasingly reliable for larger values of σ.

However, it is worthwhile to establish this result independently of the reasoning in the first investigation. To this end, it will be helpful to introduce some new notation. Suppose that, corresponding to the values Template:C we have Template:C so that Template:C and furthermore, corresponding to the values Template:C we have Template:C and finally, corresponding to the values Template:C we have Template:C and so on.

Combining equations (4) and (9) then yields Template:C and, since Template:C we will clearly have Template:C

16.

The series of observations that provide the quantities v, v, Template:Nobr, affected by random errors e, e, Template:Nobr, can be considered as a test that does not reveal the magnitude of each error, but instead, by means of the rules explained above, allows us to determine the quantity S as a function of the errors. In such a test, some errors may be larger and others smaller; but the greater the number of errors used, the greater the probability that S differs little from its average value. The difficulty thus boils down to finding the average value of S.

By the principles outlined in the first Memoir, which it is unnecessary to reproduce here, we find for this average value Template:C Let μ denote the average error corresponding to observations with weight 1, so that we have Template:C Then the previous expression can be written as follows: Template:C but we have found Template:C and thus the right-hand side is unity, as can be easily recognized by comparing equations (6) and (7). Similarly, we find that Template:C and so on.

Hence the average value of S is =σμ2, and insofar as it is permissible to regard the random value of S as equal to the average value, we conclude that μ=Sσ.

17.

One can assess the confidence warranted by this determination by calculating either the mean error to be feared, or its square. The latter will be the square root of the average value of the expression (Sσμ2)2, which can be developed by reasoning similar to that which was explained in the first commentary (arts. 39 sqq.). We will omit these for the sake of brevity, and simply give the result.

The mean error to be feared in determining the square μ2 is expressed by Template:C where ν4 is the average value of the fourth powers of the errors with weight 1, and N is the sum Template:C This sum cannot generally be simplified; however it can be shown, by a method similar to that used in art. 40 of the previous commentary, that its value is between π and σ2π. With the hypothesis on which we originally established the method of least squares, the term containing this sum disappears, because ν4=3μ4, and the precision that must be attributed to the determination μ=Sσ is therefore the same as if one had operated on σ observations affected by exactly known errors, following the precepts of arts. 15, 16 of the previous commentary.

18.

For the compensation of observations, we have seen above that there are two operations to perform: first, it is necessary to determine the correlates of the conditional equations, that is, A, B, C, etc., which satisfy equations (12); secondly, we must substitute these quantities into equation (10). The compensation thus obtained can be called perfect or complete, as opposed to an imperfect or incomplete compensation. We will use the latter term to describe those resulting from the same equations (10), in which values of A, B, C are substituted which do not satisfy the equations (12), i.e. which satisfy only some or none. We will not consider such a system of corrections here, and we will not even describe them as a compensation. When equations (10) are satisfied, systems (12) and (13) become equivalent, and the difference of which we speak can then be stated as follows: Fully compensated observations satisfy the conditional equations Template:C Partially compensated observations satisfy only some of these equations, and perhaps none; a compensation for which all equations are satisfied is necessarily complete.

19.

It follows from the very definition of a compensation that a combination of two compensations is again a compensation, and it does not matter whether the rules given to obtain a perfect compensation are applied to the raw observations or to imperfectly compensated observations.

Let θ, θ, θ, etc., be a system of incomplete compensations resulting from the formulas

{θp=A0a+B0b+C0c+,θp=A0a+B0b+C0c+,θp=A0a+B0b+C0c+, Template:Optional style|(I)

Since the observations thus changed do not satisfy all of the conditional equations, let 𝔄, 𝔅, β„­, etc., be the values taken by X, Y, Z, etc., when the values obtained for v, v, v, etc., are substituted into them. We must then find the values A, B, C, etc., which satisfy the equations

{𝔄=A(aa)+B(ab)+C(ac)+,𝔅=A(ab)+B(bb)+C(bc)+,β„­=A(ac)+B(bc)+C(cc)+, Template:Optional style|(II)

Once this is done, the complete compensation of the observations thus modified will be accomplished by the new changes ϰ, ϰ, ϰ, etc.; ϰ, ϰ, ϰ, etc., being deduced from the formulas

{ϰp=Aa+Bb+Cc+,ϰp=Aa+Bb+Cc+,ϰp=Aa+Bb+Cc+, Template:Optional style|(III)

Let us now examine how these corrections agree with the complete compensation of the raw observations. First of all, it is clear that we have

Template:Optional style| π’œ=π’œaθaθaθ,ℬ=ℬbθbθbθ,π’ž=π’žcθcθcθ,

In these equations, if we replace θ, θ, θ, etc. with their values from (I), and if we replace π’œ, ℬ, π’ž, etc., with their values from (II), then we obtain

Template:Optional style| π’œ=(A0+A)(aa)+(B0+B)(ab)+,ℬ=(A0+A)(ab)+(B0+B)(bb)+,π’ž=(A0+A)(ac)+(B0+B)(bc)+,

It follows that the correlates of the conditional equations (12) are

Template:Optional style| A=A0+A,B=B0+B,C=C0+C,,

and then equations (10), (I), and (III) show that

Template:Optional style| ε=θ+ϰ,ε=θ+ϰ,ε=θ+ϰ,;

Hence the perfect compensation produces the same value for each unknown, whether calculated directly or obtained through an incomplete compensation.

20.

When there are a large number of conditional equations, determining the correlates A, B, C, etc. may require calculations so lengthy that the calculator may be deterred: in such cases, it may be advantageous to obtain a complete compensation using a series of approximations based on the theorem of the preceding article. For this purpose, the conditional equations will be divided into two or more groups, and first, a compensation will be sought that satisfies the equations of the first group. Then, the values modified by this initial calculation will be treated, and they will be corrected again, considering only the equations of the second group. This second calculation will generally yield results that no longer satisfy the equations of the first group, and if only two groups were formed, one must then return to the first group and satisfy its equations with new corrections. The observations will then undergo a fourth compensation, in which only the conditions of the second group are considered, and by alternating in this way between the two groups of equations, corrections will be formed that will necessarily become smaller and smaller. If the choice of groups has been made skillfully, one will arrive at stable values after a few iterations.

When forming more than two groups, one must proceed in the same manner, with the various groups being used successively until the last, after which one returns to the first group to repeat the process in the same order. We have only indicated this method, the success of which will depend greatly on the skill of the calculator.

21.

We have yet to provide the proof of the lemma which was assumed in art. 8. For the sake of perspicuity, let us use notation which is better adapted to this matter.

Let x0, x, x, etc., be indeterminates, and suppose that the equations Template:C have, through elimination, led to the following: Template:C By substituting, in the first two equations of the second system, the values of X0, X', X', etc., provided by the first, we obtain two identical equations: Template:C These equations being identical, we can substitute any quantities we wish for x0, x, x, etc. Let's choose in the first Template:C and in the second Template:C Subtracting the two identities element-wise yields N10N01=(N00N11N10N01)(n01n10)+(N00N12N10N02)(n02n20)+(N00N13N10N03)(n03n30)++(N01N12N11N02)(n12n21)+(N01N13N11N03)(n13n31)++(N02N13N12N03)(n23n32)+ This can be written more succinctly as: N10N01=(N0αN1βN1αN0β)(nαβnβα), where α, β denote two arbitrarily chosen indices; from this, we conclude that the equalities n01=n10,n02=n20, and more generally, nαβ=nβα,, and so N01=N10; Moreover, since the order of the indeterminates is arbitrary, it is evident that, under the assumed hypothesis, we have Nαβ=Nβα in general.

22.

Since the method outlined in this commentary is intended primarily for useful application in high geodesy, the reader may appreciate it if we include some examples drawn from this branch of science.

The conditional equations that exist among the angles of a system of triangles can generally be divided into three categories.

I. The sum of the horizontal angles formed around the same vertex, encompassing the entire horizon, must be equal to four right angles.

II. The sum of the angles of each triangle can always be considered as known; for even when the triangle is situated on a curved surface, the excess of the sum of its angles over two right angles can be calculated with such approximation that it is permissible to consider the result as absolutely exact.

III. Finally, a third kind of relation is obtained by examining the ratios of the sides in triangles that form a closed network. If, indeed, the triangles are placed in such a way that the second triangle has a side a in common with the first, and a side b in common with the third; if the fourth triangle has two sides c and d respectively in common with the third and the fifth, and so on, until the last triangle, which has a common side k with the previous one, and with the first of all a common side l, then the quotients Template:C can be calculated by means of the angles opposite to them in the triangle whose two compared sides are part of it, and since the product of these fractions is obviously unity, there will be a relation between the sines of the various measured angles (reduced by one-third of the spherical or spheroidal excess when operating on a curved surface).

In somewhat complicated networks, it often happens that there are too many equations of the second and third category, and some of them are contained in the rest. On the other hand, it may happen in rare cases that some new equations must be added to those of the second category. For example, this will occur when the network contains polygons not divided into triangles by measurements; one must then introduce equations related to figures with more than three sides. On another occasion, we will return to this and provide more details on these various circumstances, but examining them now would divert us from our purpose. However, we cannot refrain from making an essential remark here for those who would like to make the rigorous application of our theory: we always assume that the quantities denoted by v, v, Template:Nobr, have been observed directly, or deduced from observations, in such a way that their determinations are independent of each other, or at least can be regarded as such. In common practice, one observes the angles that can be considered as the elements v, v, Template:Nobr themselves. But one must not forget that if the system contains, in addition, triangles whose angles have not been directly observed and have been deduced from those that are known, by additions or subtractions, then these angles should not be counted among the quantities determined by the observation, and they must be included in the calculation as functions of the elements used to form them. It will be different if one adopts the method of observations of Template:Sc (Astronomische Nachrichten II, p. 431), which consists of determining all directions around the same vertex, by relating them all to a single arbitrary direction. The angles measured in this way will then be taken for v, v, Template:Nobr, and the angles of the triangles will all appear as differences. The equations of the first category should, in this case, be suppressed as superfluous, for they will be identically satisfied. The method that I myself followed in the triangulations carried out during the last few years differs from the two previous methods; however, it can be reconciled, as to the result, with Struve's method, in the sense that, at each station, one must regard v, v, Template:Nobr, as the angles formed by the directions emanating from it, with a single arbitrarily chosen line.

We will give two examples: the first relates to the first mode of operation, and the second is related to observations made according to the second method.

23.

The first example comes from the work of Template:Sc, PrΓ©cis historique des opΓ©rations trigonomΓ©triques faites en Hollande. We will seek to compensate the part of the observations related to the system of triangles that are contained between Harlingen, Sneek, Oldeholtpade, Ballum, Leeuwarden, Dockum, Drachten, Oosterwolde, and GrΓΆningen. Between these points, nine triangles numbered 121, 122, 123, 124, 125, 127, 128, 131, 132 were formed in the cited work. The observed angles are as follows:

Template:Sc
0. Harlingen Template:T 50Β° 58β€² 15β€³ 238
1. Leeuwarden Template:T 82Β° 47β€² 15β€³ 351
2. Ballum Template:T 46Β° 14β€² 27β€³ 202
Template:Sc
3. Harlingen Template:T 51Β° 05β€² 39β€³ 717
4. Sneek Template:T 70Β° 48β€² 33β€³ 445
5. Leeuwarden Template:T 58Β° 05β€² 48β€³ 707
Template:Sc
6. Sneek Template:T 49Β° 30β€² 40β€³ 051
7. Drachten Template:T 42Β° 52β€² 59β€³ 382
8. Leeuwarden Template:T 87Β° 36β€² 21β€³ 057
Template:Sc
9. Sneek Template:T 45Β° 36β€² 07β€³ 492
10. Oldeholtpade Template:T 67Β° 52β€² 00β€³ 048
11. Drachten Template:T 66Β° 31β€² 56β€³ 513
Template:Sc
12. Drachten Template:T 53Β° 55β€² 24β€³ 745
13. Oldeholtpade Template:T 47° 48′  52β€³ 580
14. Oosterwolde Template:T 78° 15′  42β€³ 347
Template:Sc
15. Leeuwarden Template:T 59° 24′  00β€³ 645
16. Dockum Template:T 76° 34′  09β€³ 021
17. Ballum Template:T 44° 01′  51β€³ 040
Template:Sc
18. Leeuwarden Template:T 72° 06′  32β€³ 043
19. Drachten Template:T 46° 53′  27β€³ 163
20. Dockum Template:T 61° 00′  04β€³ 494
Template:Sc
21. Dockum Template:T 57° 01′  55β€³ 292
22. Drachten Template:T 83° 33′  14β€³ 515
23. GrΓΆningen Template:T 39° 24′  52β€³ 397
Template:Sc
24. Oosterwolde Template:T 81° 54′  17β€³ 447
25. GrΓΆningen Template:T 31° 52′  49β€³ 094
26. Drachten Template:T 66° 12′  57β€³ 246

The consideration of these triangles shows that the twenty-seven angles directly provided by observation have thirteen necessary relationships among them, namely: two of the first kind, nine of the second kind, and two of the third kind. However, it is not necessary to write down all these equations here in their final form, because for the calculation we only need the quantities which were denoted in the general theory by π’œ, a, a, etc., ℬ, b, b, etc. etc. Therefore, we can immediately write down the equations (13), which bring out these quantities. Instead of ε, ε, etc., we will simply write here (0), (1), (2), etc.

In this way, the two equations of the first kind correspond to the following: Template:C

Next, we find, for the spheroidal excesses of the nine triangles: 1.749; 1.147; 1.243; 1.698; 0.873; 1.167; 1.104; 2.161; 1.403. Then, we have the conditional equation of the second kind: Template:C and so on for the others, and we have the following nine equations: Template:C The conditional equations of the third kind are more easily expressed using logarithms. The first one is: Template:C It seems unnecessary to develop the other one in finite form. To these two equations correspond the following, in which the coefficients refer to the seventh decimal place of Brigg's logarithms: Template:C

No reason leads us to attribute unequal weights to the various observations, so we will assume Template:C Denoting the correlates of the condition equations, in the same order as these equations were written, by Template:C we determine them by the following equations: Template:C We then deduce by elimination:

A=0,598B=0,255C=1,234D=+0,086E=0,447F=+1,351G=+0,271 H=+0,659I=+1,050K=+0,577L=1,351M=0,109792N=+0,119681.

Finally, the most plausible errors are given by the formulas Template:C and we obtain the following numerical values, to which we add, for comparison, the corrections adopted by Template:Sc:

Template:Sc Template:Sc
(0)=3,108(1)=1,832(2)=+0,981(3)=+1,952(4)=0,719(5)=0,512(6)=+3,648(7)=3,221(8)=1,180(9)=1,116(10)=+2,376(11)=+1,096(12)=+0,016(13)=2,013 2,090+0,1161,982+1,722+2,8483,8480,137+1,0001,6140+5,9283,570+2,4146,014 (14)=+0,795(15)=+0,061(16)=+1,211(17)=1,732(18)=+1,265(19)=+2,959(20)=1,628(21)=+2,211(22)=+0,322(23)=2,489(24)=1,709(25)=+2,701(26)=1,606 +2,400+1,273+5,9457,674+1,876+6,2515,530+3,4863,4540+0,400+2,0543,077

|}

The sum of the squares of our corrections is 97.8845; the average error, as indicated by the 27 observed angles, is therefore Template:C

The sum of the squares of the corrections by Template:Sc is 341.4201.

Template:C

The triangles whose vertices, in the triangulation of Hanover, are placed at Falkenberg, Breithorn, Hauselberg, Wulfsode, and Wilsede, will provide us with a second example.

The following directions have been observed:[2]

At the station in Falkenberg
0. Wilsede Template:T 187Β° 47β€² 30β€³β€Š311
1. Wulfsode Template:T 225° 09′ 39β€³β€Š676
2. Hauselberg Template:T 266° 13′ 56β€³β€Š239
3. Breithorn Template:T 274° 14′ 43β€³β€Š634
Γ€ la station de Template:Sc
4. Falkenberg Template:T 094° 33′ 40β€³β€Š755
5. Hauselberg Template:T 122° 51′ 23β€³β€Š054
6. Wilsede Template:T 150° 18′ 35β€³β€Š100
Γ€ la station de Template:Sc
7. Falkenberg Template:T 086° 29′ 06β€³β€Š872
8. Wilsede Template:T 154° 37′ 09β€³β€Š624
9. Wulfsode Template:T 189° 02′ 56β€³β€Š376
10. Breithorn Template:T 302° 47′ 37β€³β€Š732
At the station in Wulfsode.
11. Hauselberg Template:T 009° 05′ 36β€³β€Š593
12. Falkenberg Template:T 045° 27′ 33β€³β€Š556
13. Wilsede Template:T 118° 44′ 13β€³β€Š159
At the station in Wilsede.
14. Falkenberg Template:T 007° 51′ 01β€³β€Š027
15. Wulfsode Template:T 298° 29′ 49β€³β€Š519
16. Breithorn Template:T 330° 03′ 07β€³β€Š392
17. Hauselberg Template:T 334° 25′ 26β€³β€Š746

These observations allow us to form seven triangles.

Template:Sc
Falkenberg Template:T 008Β° 00′ 47β€³ 395
Breithorn Template:T 028° 17′ 42β€³ 299
Hauselberg Template:T 143° 41′ 29β€³ 140
Template:Sc
Falkenberg Template:T 086° 27′ 13β€³ 323
Breithorn Template:T 055° 44′ 54β€³ 345
Wilsede Template:T 037° 47′ 53β€³ 635
Template:Sc
Falkenberg Template:T 041° 04′ 16β€³ 563
Hauselberg Template:T 102° 33′ 49β€³ 504
Wulfsode Template:T 036° 21′ 56β€³ 963
Template:Sc
Falkenberg Template:T 078° 26′ 25β€³ 928
Hauselberg Template:T 068° 08′ 02β€³ 752
Wilsede Template:T 035° 25′ 34β€³ 281
Template:Sc
Falkenberg Template:T 037° 22′ 09β€³ 365
Wulfsode Template:T 073° 16β€² 39β€³ 603
Wilsede Template:T 069° 21′ 11β€³ 508
Template:Sc
Breithorn Template:T 027° 27′ 12β€³ 046
Hauselberg Template:T 148° 10′ 28β€³ 108
Wilsede Template:T 004° 22′ 19β€³ 354
Template:Sc
Hauselberg Template:T 034° 25′ 46β€³ 752
Wulfsode Template:T 109° 38′ 36β€³ 566
Wilsede Template:T 035° 55′ 37β€³ 227

We have here seven conditional equations of the second kind (there is obviously no need to form any of the first); to form them, we must first find the spheroidal excesses of the seven triangles, and for this it is essential to know the length of one side. The one that connects Wilsede to Wulfsode is 22877.94 meters. We conclude that the spheroidal excesses of the various triangles are:

Template:C

If we denote the angles that determine the directions indicated above by v(0), v(1), v(2), v(3), etc., then the angles of the first triangle, marked with the same indices, will be Template:C and the first condition equation is therefore Template:C The remaining six triangles will yield six similar equations, but a little attention shows that these equations are not independent. Indeed, the second is identical to the sum of the first, fourth, and sixth; the sum of the third and fifth is identical to that of the fourth and seventh: for this reason we will ignore the second and fifth. Instead of writing the remaining equations in finite form, we will write here the corresponding equations (13), with symbols (0), (1), (2), etc. taking the place of ε, ε, ε, etc.: Template:C

We can obtain eight conditional equations of the third kind from the triangles of the system, to which end one may combine three of the four triangles I, II, IV, VI, or three of the triangles III, IV, V, VII; however, a little attention shows that it suffices to consider two belonging to each of these two systems of triangles, and those will encompass all the others. Thus, for the sixth conditional equation, we have Template:C and for the seventh, Template:C to which correspond the equations (13), Template:C If we attribute the same certainty to the various directions, setting Template:C and if we denote the correlates of the seven conditional equations by A, B, C, D, E, F, G, then their determination will depend on the following equations: Template:C From this, by elimination, we deduce Template:C and the most probable errors are given by the formulas Template:C from which we derive the following numerical values:

(0)=+0,065(1)=0,212(2)=+0,339(3)=0,193(4)=+0,233(5)=0,071(6)=0,162(7)=0,481(8)=+0,406 (9)=+0,021(10)=+0,054(11)=0,219(12)=+0,501(13)=0,282(14)=0,256(15)=+0,164(16)=+0,230(17)=0,139.

The sum of the squares of these errors is equal to 1,2288; hence, the average error resulting from the 18 observed directions is Template:C

25.

To give an example of the last part of our theory, let us determine the precision with which the compensated observations determine the Falkenberg-Breithorn side, using the Wilsede-Wulfsode side. The function u, by which it is expressed, in this case, is: Template:C

Its value, deduced from the corrected observations, is Template:C

Differentiation of this equation provides the following, with dv(0), Template:Nobr expressed in seconds, Template:C

From this we find that: Template:C Taking the meter as the unit of length, the methods indicated above yield, Template:C We conclude that the mean error to be feared in the value of the Falkenberg-Breithorn side is Template:Nobr (where m denotes the average error to be feared in the observed directions, this error being expressed in seconds), and consequently, if we adopt the value of m announced above, this mean error to be feared is 0m1209.

Moreover, inspecting the system of triangles immediately shows that one cans completely disregard the Hauselberg station without breaking the network that connects the other four. However, it would not be good practice to eliminate the operations related to this point,[3] as they certainly contribute to increasing the precision of the whole. To show more clearly what increase in precision results from this, we will finish by again performing the calculation, after excluding all results related to the Hauselberg point. Of the eighteen directions mentioned above, eight then cease to be used, and the most plausible errors on those that remain are:

(0)=+0,327(1)=0,206(3)=0,121(4)=+0,121(6)=0,121 (12)=+0,206(13)=0,206(14)=+0,327(15)=+0,206(16)=+0,121

The value of the Falkenberg-Breithorn side then becomes 26766m,63, a result little different from that obtained earlier. But calculating the weight yields Template:C and the mean error to be feared, in meters, is Template:C We see that by adding the operations related to Hauselberg, the weight of the determination of the Falkenberg-Breithorn side is increased in the ratio of 7,644 to 12,006, that is, in the ratio of unity to 1,571.

  1. ↑ In the previous calculations, it was sufficient to use three letters from each series to clarify the formulas' law; it seemed necessary here to include a fourth one to make the algorithm more evident. We have followed the series of letters a, b, c; A, B, C; 𝔄, 𝔅, β„­; in the natural way, with d, D, 𝔇. However, lacking the required letters, we have followed the series x, y, z with w, and the series ξ, η, ζ with φ.
  2. ↑ The starting points to which the individual directions are referred are here regarded as arbitrary, although in reality they coincide with the meridian lines of the stations. The observations will be published in full at a later date; meanwhile, the figure can be found in "Astronomische Nachrichten" Vol. I. p. 441
  3. ↑ The greater part of these observations had already been made before Breithorn's point was discovered and incorporated into the system.