论文标题
辅助促进的参数模型的正交抽样
Quadrature Sampling of Parametric Models with Bi-fidelity Boosting
论文作者
论文摘要
最小二乘回归是用于在设计空间探索和不确定性量化之类的目的跨越科学和工程问题的仿真器(又称替代模型)的无处不在的工具。当使用实验设计过程(例如,正交网格)生成回归数据(涉及计算昂贵的模型),或者当数据大小较大时,素描技术已显示出有望降低回归模型的成本,同时确保准确的准确性与完整数据相当。但是,随机素描策略(例如基于杠杆得分的策略)导致随机的回归误差,并且可能显示出很大的可变性。为了减轻此问题,我们提出了一种新颖的提升方法,该方法利用了手头问题的更便宜,低保真的数据来确定一组候选草图中的最佳草图。反过来,这指定了预期的高保真模型和相关数据的草图。我们提供了这种双重性提升(BFB)方法的理论分析,并讨论了低保真数据必须满足成功提升的条件。在此过程中,我们得出了BFB素描解决方案的剩余规范的限制,该解决方案将其与理想但计算昂贵,高保真相关的相关性相关。与非促进溶液相比,对制造和PDE数据的经验结果证实了理论分析,并说明了BFB解决方案减少回归误差的功效。
Least squares regression is a ubiquitous tool for building emulators (a.k.a. surrogate models) of problems across science and engineering for purposes such as design space exploration and uncertainty quantification. When the regression data are generated using an experimental design process (e.g., a quadrature grid) involving computationally expensive models, or when the data size is large, sketching techniques have shown promise to reduce the cost of the construction of the regression model while ensuring accuracy comparable to that of the full data. However, random sketching strategies, such as those based on leverage scores, lead to regression errors that are random and may exhibit large variability. To mitigate this issue, we present a novel boosting approach that leverages cheaper, lower-fidelity data of the problem at hand to identify the best sketch among a set of candidate sketches. This in turn specifies the sketch of the intended high-fidelity model and the associated data. We provide theoretical analyses of this bi-fidelity boosting (BFB) approach and discuss the conditions the low- and high-fidelity data must satisfy for a successful boosting. In doing so, we derive a bound on the residual norm of the BFB sketched solution relating it to its ideal, but computationally expensive, high-fidelity boosted counterpart. Empirical results on both manufactured and PDE data corroborate the theoretical analyses and illustrate the efficacy of the BFB solution in reducing the regression error, as compared to the non-boosted solution.
