In most experiments, it turns out that if one were to take thousands of measurements, and from
all of these compute the probability
(called the PDF or probability distribution
function of x) that any one measurement results in a value for x between ξ and ξ + dξ, one
would get a probability distribution like the following;

. This is in fact the average measurement

), we
estimate the mean as

= 2.0cm and σ = 1.0cm. If you were to make a
measurement of x, what is the probability that your resulting value would be between 1.9 and
1.91?

The actual distribution is the Bell-curve which looks like the following;

This type of random number is important for a second reason that is very deeply connected with lab measurements; averages of collections of random numbers are always themselves Normally distributed. This is called the Central Limit Theorem of statistics. It means that we do not need to know the probabilities of actually getting a particular measurement result, averages of many measurements will always be Normal random numbers, which is why any proper experiment will always involve many measurements from which averages are computed. Another corollary of the theorem is that averages come closer to the “true” value than single measurements in the absence of systematic errors. If quantity x is randomly distributed with standard deviation (precision) σx, then averages of sets of N measurements of x are normally distributed with standard deviation

The standard deviation σ is a measure of the precision (not accuracy) of the measurement set; the smaller σ, the “tighter” the distribution of outcomes. This means that your chances of making a measurement close to the true value (assumed to be the mean in the absence of systematic errors) is higher as σ gets smaller. The calculation of σ is simple, as it measures the average deviation of each measurement from the mean value; if we actually knew the PDF of x we would use


- σ and
+ σ.
If you measure a quantity x in the lab, and get N values {x1,x2,
,xN} with mean
= ∫
ξ
dξ and standard deviation (squared) σx2 = ∫ (ξ -
)2
dξ, and then use this data
to compute some function of x, say g(x), how do you report your results?
This is the topic of error propagation, what are the means and standard deviations of functions
of random numbers?
The answer comes from a major theorem in the mathematics of random numbers (statistics); the
Law of the Unconscious Statistician;

)2
);






i.
If in the lab you measure several quantities a, b, and c each N times, obtaining the data sets

Step 1. Estimate the means and standard deviations (estimate since you do not know the true PDFs for a,b and c)


Step 2. Compute the derivatives

Step 3. Compute the mean f;



Suppose that we measure the critical angle for a static equilibrium on an inclined plane, and discover that motion begins at angle θ = 0.120, 0.123, 0.119, 0.121, 0.120, 0.121 radians for six trials. The mean of these is estimated to be







Example The error in the area of a circle whose radius measurements have average
and
standard deviation σR; use


Example An experiment measures three quantities a, b, and c repeatedly, getting averages
,
,
and
and standard deviations σa, σb, and σc for them. These data are used to obtain an
experimental value for a quantity d supposedly given by the formula





Remember that you can use the binomial theorem

Example Obtain an error formula for A =
Partial derivatives of multi-variable functions are the same as ordinary derivatives





Several of the experiments test a hypothesis that some quantity y is related to another quantity x
according to a linear relation y = ax + b. The experiment is usually about testing the
hypothesis by taking a set of data {(xi,yi)∣i = 1,2,
,N} and using the data to
compute a the slope and b the intercept of the relation and comparing the results to the
theoretical values. We perform such an analysis by computing a and b for the line that most
closely conforms to the set of data taken in the lab, through the method of linear
regression.
Consider the two collections of points {(xi,yi)∣i = 1,2,
,N}, which is your lab data, and
{(xi,axi + b)∣i = 1,2,
,N} which is the “true” set of points gotten by using the actual
theoretical relation between x and y, which is assumed to be the straight line y = ax + b.
Consider then the sum of squares of the distances between corresponding points in the two
sets;


To find the slope a and b of the best fit line we simply minimize σ2 with respect to a and b; set the derivatives to zero





= (1,1,
,1). The formulas for a and b define them to be linear functions of the random
deviates
= (y1,y2,
,yN)



i, so that all non-diagonal correlation
coefficients vanish,







If you have a collection of data points

Step 1. Compute

Step 2. The best fit slope and intercept are

Step 3. Compute


Simple command-line linear fit (regression).