3.4 Error Estimates for the Monte Carlo Method

3.4.2 The Error In Estimating Probabilities

Measurable Outcome 3.8, Measurable Outcome 3.11, Measurable Outcome 3.12

Often, Monte Carlo simulations are used to estimate the probability of an event occurring. For example, in the turbine blade example, we might be interested in the probability that the hot metal temperature exceeds a critical value. Generically, suppose that the event of interest is \(A\). Then, an estimate of \(P\{ A\}\) is the fraction of times the event \(A\) occurs out of the total number of trials,

\[\hat{p}(A) = \frac{N_ A}{N}\] (3.45)

where \(N_ A\) is the number of times \(A\) occurred in the Monte Carlo simulation of sample size \(N\).

\(\hat{p}(A)\) is an unbiased estimate of \(P\{ A\}\). To see this, define a function \(I(A_ i)\) which equals 1 if event \(A\) occurred on the \(i\)-th trial, and equals zero if \(A\) did not occur. For example, if the event \(A\) is defined as \(y > y_{limit}\), \(I(A_ i)\) would be defined as,

\[I(A_ i) = I(y_ i > y_{limit}) = \left\{ \begin{array}{cc} 1 & \mbox{if } y_ i > y_{limit}, \\ 0 & \mbox{if } y_ i \leq y_{limit}. \end{array}\right.\] (3.46)

Using this definition, the number of times that \(A\) occurred can be written as

\[N_ A = \sum _{i=1}^ N I(A_ i).\] (3.47)

Finding the expectation of \(N_ A\) gives,

\[E[N_ A] = E[\sum _{i=1}^ N I(A_ i)] = \sum _{i=1}^ N E[I(A_ i)].\] (3.48)

Since we assume that the Monte Carlo trials are drawn at random and independently from each other, then \(E[I(A_ i)] = P\{ A\}\). Thus,

\[E[N_ A] = N P\{ A\}\] (3.49)

Finally, using this result we see that

\[E[\hat{p}(A)] = \frac{E[N_ A]}{N} = P\{ A\} .\] (3.50)

We can also use the central limit theorem to show that \(\hat{p}(A)\) is normally distributed for large \(N\) with mean \(P\{ A\}\) and standard error,

\[\sigma _{\hat{p}} = \sqrt {\frac{P\{ A\} (1-P\{ A\} )}{N}}\] (3.51)

 

In the exposition above, we calculated the standard error in estimating the probability of an event \(A\) defined as \(A=\{ y:y > y_{limit} \}\) using Monte Carlo sampling as \(\sqrt {\frac{P\{ A\} (1-P\{ A\} )}{N}}\). Now suppose we estimate the probability of the event \(B\), which is defined as \(B = \{ y:y > \frac{y_{limit}}{2} \}\) using the Monte Carlo method, with the same number of samples. The standard error in estimating this probability will be given by,

Exercise 1

Answer:

For estimating the standard error associated with computing \(P\{ B\}\), the exposition above remains the same, except \(P\{ A\}\) is replaced by \(P\{ B\}\).