返回届次CSCP-ICC-2024-260

Knowledge and Data-Driven Framework for Multi-Principal Element Alloys: Automated Knowledge Acquisition, Representation, and Rediscovery

作者

Guangxuan SongDongmei Fu1Dawei Zhang2Lingwei Ma

单位

1Key Laboratory of Knowledge Automation for Industrial Processes of Ministry of Education、School of Automation and Electrical Engineering、University of Science and Technology Beijing、Beijing 100083、China 2National Materials Corrosion and Protection Data Center、University of Science and Technology Beijing、Beijing 100083、China

关键词

Multi-Principal Element AlloysMachine LearningKnowledge GraphMaterial Performance PredictionAutomated Knowledge Engineering

收录来源

International Corrosion Congress · 第22届国际腐蚀大会

摘要

Machine learning algorithms are advancing material research into a fourth paradigm. However, their application in material science faces significant hurdles, including the expensive acquisition of high-quality data and data scarcity. The introduction of prior knowledge can mitigate these limitations, but the scattered and unstructured nature of material science publications complicates knowledge aggregation. This issue is especially acute in the study of multi-principal element alloys (MPEA), where the complexity of their compositions and structures complicates performance predictions. Efficiently collecting material knowledge and data and incorporating complex interactions within machine learning models to enable new material discoveries presents a significant scientific challenge. Fig 1 Knowledge and Data-driven Framework for Multi-Principal Element Alloys To address the aforementioned issues, w e propose a knowledge and data -driven framework (Fig 1 ) to explore MPEA, integrating automated know ledge acquisition, representation, and rediscovery. Initially, we established an automated extraction pipeline based on ChatGPT, guided by material experts, to extract component, property data, and information on processes and structures from MPEA literature, addressing data dispersion and building a diverse dataset of alloy hardness and corrosion. Subsequently, we proposed a cross -modal material property prediction model, Mat -NRKG, that utilizes a cross -modal knowledge graph to integrate experimental data and process information, achieving high -precision performance predictions of MPEA, exceeding the baseline by 18.5% and 13.7% on MSE metrics in hardness and corrosion resistance predictions, respectively. This method enables precise analysis of complex material property using small, real-world datasets. Finally, using the Mat -NRKG model, we performed high -throughput predictions, expediting knowledge discovery via SHAP, spatial mapping, and visualization techniques. Through the "acquisition -integration-utilization-discovery" process, we seamlessly integrate knowledge and data, fully utilizing existing experimental results to deepen our understanding of material performance. This framework not only improves our comprehension of material properties but also exp ands opportunities for knowledge - driven research in material science, showing significant application potential.

生成收录证明查看摘要文件