We present a new method of modelling numerical systems where there are two distinct output solution classes, for example tipping points or bifurcations. Gaussian process emulation is a useful tool in understanding these complex systems and provides estimates of uncertainty, but we aim to include systems where there are discontinuities between the two output solutions. Due to continuity assumptions, we consider current methods of classification to split our input space into two output regions. Classification and logistic regression methods currently rely on drawing from an independent Bernoulli distribution, which neglects any information known in the neighbouring area. We build on this by including correlation between our input points. Gaussian processes are still a vital element, but used in latent space to model the two regions. Using the input values and an associated output class label, the latent variable is estimated using MCMC sampling and a unique likelihood. A threshold (usually at zero) defines the boundary. We apply our method to a motivating example provided by the hormones associated with the reproductive system in mammals, where the two solutions are associated with high and low rates of reproduction.
翻译:我们提出了一个新的数字系统模型方法,其中有两个不同的产出解决方案类别,例如倾角点或双形。高斯进程模拟是了解这些复杂系统的有用工具,提供了不确定性的估计,但我们的目标是将两种产出解决方案之间不连续的系统包括在内。由于连续性假设,我们考虑目前的分类方法,将输入空间分成两个产出区域。分类和后勤回归方法目前依赖于从独立的伯努利分布中提取的信息,而该分布忽略了附近地区已知的任何信息。我们以此为基础,纳入了我们输入点之间的关联。高斯进程仍然是一个关键要素,但用于暗地空间以模拟这两个区域。利用输入值和相关的产出类别标签,潜在变量估计使用MCMC抽样和独特的可能性。一个阈值(通常为零)界定了边界。我们运用了与哺乳动物生殖系统相关的激素提供的激励范例,因为这两个解决方案与高低繁殖率相关。