Complexity Science Seminar Series - Abstracts
Presenter: Dr. Michael McKay, Los Alamos National Laboratory (ret.)
Date and Time: Monday, April 09, 2007, 3:00-4:00 PM
Note: This is the first seminar in a series of three to be held by Dr. Michael McKay during the week of April 9, 2007. The other two seminars are:
Abstract:
Quantifying and understanding uncertainty of predictions made by combining computer simulations, i.e., theoretical predictions, with physical data involve many statistical issues. Generally, relevant statistical methods can be grouped into categories related to (1) designs and sampling plans for computer experiments, (2) measures of model-input importance, (3) model calibration and prediction, (4) model evaluation (testing) and (5) model validation.References
This talk focuses on assessing model-input importance and the role of uncertainty in input values in prediction uncertainty. Input uncertainty refers to incomplete knowledge of "correct" values of model inputs, including model parameters. When it is characterized by a probability distribution, prediction uncertainty is characterized by the induced prediction distribution. Comparison of a model predictor based on a subset of model inputs to that of the full model leads to a natural decomposition of the prediction variance and to Pearson's (1903) correlation ratio as a measure of importance. Additive variance decompositions similar to those from classical analysis of variance (ANOVA) have been used by several authors to evaluate computer models, including, Cukier, Levine and Shuler (1978), Cox (1982) and Sobol' (1990). A most general decomposition due to Panjer (1973), and discussed by McKay (1999), provides a (not unique) variance decomposition useful when model inputs cannot be treated as statistically independent. The correlation ratio has been used by Iman and Hora (1990), Krzykacz (1990), McKay (1995, 1997) and in Saltelli et al. (2000).
Elements of this lecture include a mathematical background for uncertainty quantification and uncertainty importance, as well as a scheme for estimation of the correlation ratio. Concepts are demonstrated in an application with a compartmental model that simulates the movement of contaminants through an ecosystem. The 84 model inputs represent parameters in differential equations as well as initial conditions. The model is studied with a set of 343 computer runs, selected using Orthogonal array sampling (OAS). Addition examples are used to demonstrate interpretation of sample values of R^2, used as an estimate of the correlation ratio. For these examples, replicated Latin hypercube sampling (rLHS) is used.
Virginia Tech, Corporate Research Center
1880 Pratt Drive, Building XV
Seminar Room 2018, Second Floor
Directions: Map (PDF)
Back to: NDSSL Seminar Page
