Estimation of Distributional Parameters

For a chosen distributional model, its shape and position are completely defined by the associated parameters. By referring to Eq. (3.5), determination of the quantile also requires knowing the values of the parameters в.

There are several methods for estimating the parameters of a distribution model on the basis of available data. In frequency analysis, the commonly used parameter-estimation procedures are the method of maximum likelihood and the methods of moments (Kite, 1988; Haan, 1977). Other methods, such as method of maximum entropy (Li et al., 1986), have been applied.

3.6.1 Maximum-likelihood (ML) method

This method determines the values of parameters of a distribution model that maximizes the likelihood of the sample data at hand. For a sample of n indepen­dent random observations, x = (x1, x2,…, xn)1, from an identical distribution, that is,

Xi ~ fx(x | в) for i = 1,2,…, n

in which в = (0i, 02,…, 6m) a vector of m distribution model parameters, the likelihood of occurrence of the samples is equal to the joint probability

of {xi}i=1,2,…,n calculable by

n

L(x | в) = Ц fx(xi | в) (3.10)

i=1

in which L(x |в) is called the likelihood function. The ML method determines the distribution parameters by solving

Max L(x | в) = maxln[L(x | в)]

в в

or more specifically

nn

Подпись: (3.11)Maxll fx(Xi | в) = ln[fx(Xi | в)]

i=1 i=1

Estimation of Distributional Parameters
Подпись: Example 3.4 Referring to Eq. (2.79) for the exponential distribution as
Подпись: fx(x | в) = exp(-x/в)/в for x > 0, в > 0 determine the maximum likelihood estimate for в based on n independent random samples {xi }i=1,2,..., n Solution The log-likelihood function for the exponential distribution is

As can be seen, solving for distribution-model parameters by the ML principle is an unconstrained optimization problem. The unknown model parameters can be obtained by solving the following necessary conditions for the maximum:

Подпись: i=1

which is the sample mean.

Подпись: fx(x | а, в) = Подпись: 1 Подпись: _ 1 / x-а  2 PH в ) Подпись: for — 00 < x <00

Example 3.5 Consider a set of n independent samples, x = (xi, x%,…, xn)1, from a normal distribution with the following PDF:

Determine the ML estimators for the parameters a and в.

Подпись: L(x | а, в) = Estimation of Distributional Parameters Estimation of Distributional Parameters

Solution The likelihood function for the n independent normal samples is

Подпись:n, Y^n=1(xi — a)2

2в2

Estimation of Distributional Parameters Estimation of Distributional Parameters

Taking the partial derivatives of the preceding log-likelihood function with respect to a and в2 and setting them equal to zero results in

After some algebraic manipulations, one can easily obtain the ML estimates of normal distribution parameters a and в as

a Ei=1 xi в2 Ei=1(xi — a)2

aML = —n~ ^L =——————- n————

As can be seen, the ML estimation of the normal parameters for a is the sample mean and for в2 is a biased variance.

3.6.2 Product-moments-based method

By the moment-based parameter-estimation methods, parameters of a distribu­tion are related to the statistical moments of the random variable. The conven­tional method of moments uses the product moments of the random variable. Example 3.3 for frequency analysis is typical of this approach. When sample data are available, sample product moments are used to solve for the model parameters. The main concern with the use of product moments is that their reliabilities owing to sampling errors deteriorate rapidly as the order of moment
increases, especially when sample size is small (see Sec. 3.1), which is often the case in many geophysical applications. Hence, in practice only, the first few statistical moments are used. Relationships between product-moments and pa­rameters of distribution models commonly used in frequency analysis are listed in Table 3.4.

3.6.3 L-moments-based method

As described in Sec. 2.4.1, the L-moments are linear combinations of order statistics (Hosking, 1986). In theory, the estimators of L-moments are less sen­sitive to the presence of outliers in the sample and hence are more robust than the conventional product moments. Furthermore, estimators of L-moments are less biased and approach the asymptotic normal distributions more rapidly and closely. Hosking (1986) shows that parameter estimates from the L-moments are sometimes more accurate in small samples than are the maximum — likelihood estimates.

To calculate sample L-moments, one can refer to the probability-weighted moments as

вr = M1,r, o = E{X[Fx(X)]r} for r = 0,1,… (3.13)

which is defined on the basis of nonexceedance probability or CDF. The esti­mation of вг then is hinged on how Fx(X) is estimated on the basis of sample data.

Consider n independent samples arranged in ascending order as X(n) < X(n-1) < ■ ■ ■ < X(2) < X(1). The estimator for Fx(X(m>) for the mth — order statistic can use an appropriate plotting-position formula as shown in Table 3.2, that is,

m ___ a

F( X (m)) = 1———————- —— T for m = 1,2,…, n

n + 1 — b

with а > 0 and b > 0. The Weibull plotting-position formula (a = 0, b = 0) is a probability-unbiased estimator of Fx(X(m>). Hosking et al. (1985a, 1985b) show that a smaller mean square error in the quantile estimate can be achieved by using a biased plotting-position formula with a = 0.35 and b = 1. According to the definition of the в-moment вг in Eq. (3.13), its sample estimate br can be obtained easily as

1n

br = — J^X(m)[F (X(m))]r for Г = 0,1, … (3.14)

П i=1

Stedinger et al. (1993) recommend the use of the quantile-unbiased estimator of Fx(X(m)) for calculating the L-moment ratios in at-site and regional frequency analyses.

Distribution

 

Estimation of Distributional Parameters

Range

 

Product moments

 

L-Moments

 

*1 — M; *2 — O/

тз — 0;Т4 — 0.1226

 

Normal

 

— Ж < x < Ж

 

Min x — ln(Mx) x/2;

o2n x — ln(^2 +1);

Yx — 3^x +

 

2

 

1 /ln(x)-A1n x

2 V Oln x

 

Estimation of Distributional Parameters

Lognormal

 

x > 0

 

Eq. (2.68); Eq. (2.70)

 

*1 — % + aJ n / 2;

*2 — 2 а^ж(л/2 — 1) т3 — 0.1140; т4 — 0.1054

 

Estimation of Distributional Parameters

Rayleigh

 

% < x < Ж

 

a > 0

for в > 0: x > %; for в < 0: x < %

 

M — % + ав; о2 — ав2; Y — sign(в) JJS

 

Estimation of Distributional Parameters

Pearson 3

 

-(х—%)/в

 

*1 — в; *2 — в/2; тз — 1/3; Т4 — 1/6

 

Exponential fx(x) — e х/в/в

 

x > 0 m — в

 

M — % + 0.5772в; о2 — 2І6!; y — 1.1396

 

*1 — % + 0.5772в; *2 — в ln(2); т3 — 0.1699; т4 — 0.1504

 

Gumbel (EV1 fx(x) — 1 exp j — (xj^) — exp — (x^—^^ —ж < x < ж

for maxima)

 

M — вГ (1 + 1);

о2 — в2 [Г (l + D — г2(і + 1)]

 

*1 — % + вГ (1 +±);

*2 — в (1 — 2—1/рГ (1 + і)

 

fx (X) — a (XP)“ —1exP — ( ^

 

Weibull

 

а, в > 0; x > 0

 

*1 — % + (§) [1 — Г(1 + а)]; *2 — § (1 — 2—а )Г(1 + а);

 

а> 0: х < (% + §); а < 0: х > (% + І)

 

M — % + ) [1 — Г(1 + а)]; о2 — (в)2 [Г(1 + 2а) — Г2(1 + а)]

 

Generalized Fx (х) — exp| — (l — а{‘Х— %) ] 1/а|

extreme-value ^ *

(GEV)

 

2(1—3-°) 3.

(1—2—“) ’

1—5(4—“ )+10(3—“)—6(2—“)
1—2-“

 

тз

т4

 

Estimation of Distributional Parameters
Estimation of Distributional Parameters

а > 0:

z <х < (% +1); а < 0: Z < х < ж;

 

Generalized Fx (х) — 1 — (l — а (Х—%) ]1/а

Pareto (GPA)

 

Подпись: 123

For any distribution, the L-moments can be expressed in terms of the probability-weighted moments as shown in Eq. (2.28). To compute the sample L-moments, the sample probability-weighted moments can be obtained as

11 = 60

12 = 261 — 60

13 = 662 — 661 + 60 (3.15)

14 = 2063 — 3062 + 1261 — 60

Estimation of Distributional Parameters Подпись: t _ l3 t3 = І2 Подпись: t l4 t4 = І2 Подпись: (3.16)

where the lr s are sample estimates of the corresponding L-moments, the Xr s, respectively. Accordingly, the sample L-moment ratios can be computed as

where t2, t3, and t4 are the sample L-coefficient of variation, L-skewness co­efficient, and L-kurtosis, respectively. Relationships between L-moments and parameters of distribution models commonly used in frequency analysis are shown in the last column of Table 3.4.

Example 3.6 Referring to Example 3.3, estimate the parameters of a generalized Pareto (GPA) distribution by the L-moment method.

Solution Since the GPA is a three-parameter distribution, the calculation of the first three sample L-moments is shown in the following table:

Year

qi (ft3/s)

Ordered q(i) (ft3/s)

Rank

(i)

F (q(i)) =

(i — 0.35)/n

q(i) x F(q(i))

q(i) x F (q(i))2

q(i) x F (q(i))3

1961

390

342

1

0.0433

14.82

0.642

0.0278

1962

374

374

2

0.1100

41.14

4.525

0.4978

1963

342

390

3

0.1767

68.90

12.172

2.1504

1964

507

414

4

0.2433

100.74

24.513

5.9649

1965

596

416

5

0.3100

128.96

39.978

12.3931

1966

416

447

6

0.3767

168.37

63.419

23.8880

1967

533

505

7

0.4433

223.88

99.255

44.0030

1968

505

505

8

0.5100

257.55

131.351

66.9888

1969

549

507

9

0.5767

292.37

168.600

97.2260

1970

414

524

10

0.6433

337.11

216.872

139.5210

1971

524

533

11

0.7100

378.43

268.685

190.7666

1972

505

543

12

0.7767

421.73

327.544

254.3922

1973

447

549

13

0.8433

462.99

390.455

329.2836

1974

543

591

14

0.9100

537.81

489.407

445.3605

1975

591

596

15

0.9767

582.09

568.511

555.2459

Sum =

7236

4016.89

2805.930

2167.710

Note that the plotting-position formula used in the preceding calculation is that pro­posed by Hosking et al. (1985a) with a = 0.35 and b = 1.

Based on Eq. (3.14), the sample estimates of Pj, for j = 0,1, 2, 3, are Ьз = 428.4, b1 = 267.80, Ьз = 187.06, and b4 = 144.51. Hence, by Eq. (3.15), the sample estimates of к j, j = 1,2, 3,4, are I1 = 482.40, I2 = 53.19, I3 = -1.99, and I4 = 9.53, and the corresponding sample L-moment ratios Tj, for j = 2, 3, 4, are t2 = 0.110, t3 = —0.037, and t4 = 0.179.

Estimation of Distributional Parameters

By referring to Table 3.4, the preceding sample h = 482.40, I2 = 53.19, and t3 = —0.037 can be used in the corresponding L-moment and parameter relations, that is,

Updated: 16 ноября, 2015 — 6:58 дп