{"id":477,"date":"2021-04-30T14:16:22","date_gmt":"2021-04-30T14:16:22","guid":{"rendered":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/conor-murphy\/?p=477"},"modified":"2022-01-31T14:36:57","modified_gmt":"2022-01-31T14:36:57","slug":"dependence-in-extremes","status":"publish","type":"post","link":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/conor-murphy\/2021\/04\/30\/dependence-in-extremes\/","title":{"rendered":"Dependence in Extremes"},"content":{"rendered":"\n

In my last post, I briefly discussed the two standard approaches to modelling of extremes events, block maxima and threshold models. Both models were introduced with the assumption of observations being independent and identically distributed. This assumption of temporal independence is unrealistic as most extreme events occur over several consecutive observations. This may make you question the appropriateness of the two models I have mentioned previously and rightly so! <\/p>\n\n\n\n

Stationarity<\/h2>\n\n\n\n

Stationarity is more realistic for many physical processes. This allows variables to be mutually dependent but the stochastic properties remain the same throughout time. So, the distribution of X_1<\/span> is the same as that of X_{41}<\/span>. When extreme observations of a process exhibit short-range dependence, i.e. several consecutive observations are classified as extreme, they are said to form a cluster. In order to fit the models previously mentioned, we need some way to obtain observations from clusters which we can then deem independent. This is called declustering.<\/p>\n\n\n\n


\n\n\n\n

A more precise definition is as follows:<\/p>\n\n\n\n

Declustering corresponds to a filtering of the dependent observations to obtain a set of threshold excesses that are approximately independent.<\/p>\n\n\n\n


\n\n\n\n

Inference for clusters of extreme values of a time series usually requires the identification of independent clusters of exceedances over a high threshold. There are many different methods used to “decluster” a stationary series. The choice of declustering scheme can have a significant effect on estimates of within-cluster characteristics and return levels.<\/p>\n\n\n\n

A common approach is to identify independent clusters of exceedances above a high threshold, to evaluate the characteristic of interest for each cluster and to form estimates from these values. The two most common methods used to identify clusters are blocks and runs declustering. Runs declustering, for example, assumes that exceedances belong to the same cluster if they are separated by fewer than a certain number of observations below the threshold, known as the run length. The potential problem then arises in the selection of this run length parameter as this choice is somewhat arbitrary.<\/p>\n\n\n\n

Peaks over Threshold<\/h2>\n\n\n\n

The standard approach to declustering is called Peaks over Threshold (POT). Once a definition of clusters has been decided, the maximum excess in each cluster is recorded and these cluster maxima are then assumed to be independent. The GPD can then be fitted to these independent cluster maxima.<\/p>\n\n\n\n

However, there are some problems with this approach.<\/p>\n\n\n\n