nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2026, 01, v.15 31-41
基于混合Copula函数和聚类算法的极值风速预测
基金项目(Foundation): 湖北省重点实验室开放项目(Y202201)
邮箱(Email): hexiaoxia@wust.edu.cn;
DOI: 10.19943/j.2095-3070.jmmia.2026.01.04
摘要:

极值预测在自然灾害防御、化工安全生产、金融风险管控等许多安全场景中具有重要应用,然而,由于混合类型事件的影响,传统极值预测方法往往无法有效处理数据中不同分布所引起的问题.为此,本文提出了一种基于混合Copula函数结合聚类算法的极值风速预测方法.在此方法中,首先采用聚类算法自适应地将风速块最大数据集划分为不同簇,并获取每个簇的最佳边缘分布;然后,利用混合Copula函数连接各簇的边缘分布,采用粒子群优化算法最小化KL散度,从而确定混合Copula函数的权重,最终实现在特定重现期下的风速极值预测.为避免过拟合和欠拟合,采用两折交叉验证对模型进行验证与调优.实证结果表明,本文提出的模型在混合分布下能够显著提高极值预测的准确性和可靠性,在风速等气象数据的极值预测中,相较于传统方法具有更高的预测精度和稳定性.本文方法不仅为风速数据等领域的极值分析提供了有力工具,也为其他混合分布数据的极值预测提供了新的思路和方法.

Abstract:

Extreme value prediction plays a vital role in various safety-critical scenarios, including natural disaster defense, chemical production safety, and financial risk management. However, due to the presence of mixed-type events, traditional extreme value prediction methods often struggle to effectively capture the mixed distribution characteristics inherent in the data. To address this, this paper proposes an extreme wind speed prediction method based on a mixed copula function combined with a clustering algorithm. In this method, a clustering algorithm is first employed to adaptively group the block maxima dataset of wind speed into distinct clusters, and the optimal marginal distribution for each cluster is subsequently obtained. A mixed copula function is then constructed to link the marginal distributions of each cluster, and the particle swarm optimization algorithm is utilized to minimize the KL divergence, thereby determining the weights of the mixed copula function, which ultimately enables the prediction of wind speed extremes for specific return periods. To avoid overfitting and underfitting, 2-fold cross-validation is used for model validation and tuning. The empirical results demonstrate that the proposed model significantly improves the accuracy and reliability of extreme value prediction under mixed distributions, exhibiting higher predictive accuracy and greater stability in forecasting meteorological extremes, such as wind speed, compared to traditional methods. This study not only provides a powerful tool for extreme value analysis in fields involving wind speed data but also offers new insights and methodologies for predicting extremes in other datasets characterized by mixed distributions.

参考文献

[1]郭顺宁,马雪,杨帆,等.基于最优Copula相关性分析的短期风速预测方法[J].电网与清洁能源,2024,40(2):63-72.

[2]Lin D,Qingshan Y,Min L,et al.Estimation of extreme wind speed based on upcrossing rate of mean wind speeds with Weibull distribution[J]Journal of Wind Engineering&Industrial Aerodynamics,2023,230(5):105-115.

[3]Rodríguez C,Hernández E,Aracil J,et al.Interpretable extreme wind speed prediction with concept bottleneck models[J]Renewable Energy,2024,231:120935.

[4]Smith C.Representing external hazard initiating events using a Bayesian approach and a generalized extreme value model[J]Reliability Engineering and System Safety,2020,193:106650.

[5]Wang Y,Ma H,Wang D,et al.new method for wind speed forecasting based on copula theory[J]Environmental Research,2018,160:365-371.

[6]Yesilbudak M,Sagiroglu S,Colak I.new approach to very short term wind speed prediction using k-nearest neighbor classification[J]Energy Conversion and Management,2013,69:77-86.

[7]滕寰宇,叶仁传,田阿利,等基于平均条件超越率和插值法的风速极值预测[J]海岸工程,2018,37(4):29-34.

[8]孙雅,李志华基于区域极值点的时间序列聚类算法[J]计算机工程,2015,41(5):33-37.

[9]Wambura A,Smith B,Jones C.Fast memory-efficient extreme events prediction in complex time series[J]Journal of Time Series Analysis,2021,42(3):567-580.

[10]Baorui D,Ye X,Qi L.An extreme value prediction method based on clustering algorithm[J]Reliability Engineering System Safety,2022,217:108062.

[11]黄宇,张冰哲,庞慧珍,等基于混合Copula优化算法的风速预测方法研究[J]太阳能学报,2022,43(10):192-201.

[12]勾红叶,冷丹,王涵玉,等基于混合Copula函数的风雨联合概率分布模型[J]中国公路学报,2021,34(2):309-316.

[13]张文强,刘继文,陈静基于对称KL散度的电力用户负荷聚类方法研究[J]电工技术,2024,14:103-107.

[14]周丹,南敬昌,高明明改进的简化粒子群算法优化模糊神经网络建模[J]计算机应用研究,2015,32(4):1000-1003.

[15]Wang L,Zhang Y,Li X,et al.Power curve estimation with multivariate environmental factors for inland and offshore wind farms[J]Renewable Energy Journal,2022,150(5):1123-1135.

[16]何帅达基于广义极值分布的气象数据分类与聚类算法研究[D]兰州:兰州大学,2020.

[17]Rousseeuw J.Silhouettes:graphical aid to the interpretation and validation of cluster analysis[J]Journal of Computational and Applied Mathematics,1987,20(2):53-65.

[18]Jain K.Data clustering:50 years beyond K-means[J]Pattern Recognition Letters,2009,31(8):651-666.

[19]Stone A.Cross-validatory choice and assessment of statistical predictions[J]Journal of the Royal Statistical Society:Series (Methodological),1974,36(2):111-147.

[20]张金环,王超群,张彤,等基于高斯混合分布模型的风电功率预测误差统计分析研究[J]智慧电力,2020,48(7):59-64+72.

基本信息:

DOI:10.19943/j.2095-3070.jmmia.2026.01.04

中图分类号:P412.16;P457.5

引用信息:

[1]董胜男,刘云冰,何晓霞,等.基于混合Copula函数和聚类算法的极值风速预测[J].数学建模及其应用,2026,15(01):31-41.DOI:10.19943/j.2095-3070.jmmia.2026.01.04.

基金信息:

湖北省重点实验室开放项目(Y202201)

发布时间:

2026-03-15

出版时间:

2026-03-15

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文