基于双模型集成的太阳黑子磁类型分类
Classification of Sunspot Magnetic Types Based on Dual Model Integration
-
摘要: 太阳黑子是发生在太阳光球层的现象,对太阳耀斑预测具有重要意义。针对类别样本数量不均衡的三分类太阳黑子数据集,提出了双模型集成分类算法。该方法通过一轻一重两个模型,分别承担两个类别的分类任务,再将两者的分类结果进行集成,夹逼出第3个类别的分类结果。实验表明,该方法能够减小单个模型在不均衡数据集上过拟合和欠拟合带来的不利影响,从新的角度解决了太阳黑子数据集的类别不均衡问题,平均F1分数达到0.931。Abstract: Sunspots occur in the solar photosphere and can make the prediction of solar flares. Aiming at the three classification sunspot data set with unbalanced number of category samples, a dual model integrated algorithm is proposed. This method uses two models, one light and one heavy, to undertake the classification tasks of two categories respectively, and then integrate the classification results of the two to squeeze out the classification results of the third category. Experiments show that this method can reduce the adverse effects of over-fitting and under-fitting of a single model on unbalanced data sets, and solve the problem of class imbalance in sunspot data sets from a new perspective, with an average F1 score of 0.931.
下载: