数据集规模是影响深度学习模型性能的关键因素之一。由于深度学习模型性能高度依赖于数据集规模,实现特定精度所需的数据量通常难以预估。该问题在超材料智能设计中同样存在,成为制约建模精度和效率的重要因素。为此,提出一种动态数据生成与模型性能评估框架,以实现数据集规模与模型性能的动态监控。为提升模型动态评估效率并有效缓解灾难性遗忘现象,设计了一种持续学习策略,使模型在动态评估过程中仅需针对新数据进行学习,同时保持对已有知识的记忆。实验结果表明,基于该持续学习策略训练的模型预测平均准确率可达到93.28%,平均遗忘率为3.68%,充分验证了该模型在缓解灾难性遗忘问题方面的有效性。
The size of the dataset is one of the key factors affecting the performance of deep learning models. Since the performance of deep learning models is highly dependent on the size of the dataset, the amount of data required to achieve a specific accuracy is usually difficult to estimate. This problem also exists in the intelligent design of metamaterials, and has become an important factor restricting the accuracy and efficiency of modeling. To this end, a dynamic data generation and model performance evaluation framework is proposed to achieve dynamic monitoring of the size of the dataset and model performance. In order to improve the efficiency of dynamic evaluation of the model and effectively alleviate the catastrophic forgetting phenomenon, a continuous learning strategy is designed so that the model only needs to learn new data during the dynamic evaluation process while maintaining the memory of existing knowledge. Experimental results show that the average prediction accuracy of the model trained based on this continuous learning strategy can reach 93.28%, and the average forgetting rate is 3.68%, which fully verifies the effectiveness of the model in alleviating the problem of catastrophic forgetting.