ZHONG Yuan, Andy Hsitien Shen, ZHANG Zhiqing, YE Min, HAN Yu. Application of Machine Learning Algorithms in the Geographical Origin Determination of Peridot[J]. Journal of Gems & Gemmology, 2023, 25(6): 65-75. DOI: 10.15964/j.cnki.027jgg.2023.06.006
Citation:
ZHONG Yuan, Andy Hsitien Shen, ZHANG Zhiqing, YE Min, HAN Yu. Application of Machine Learning Algorithms in the Geographical Origin Determination of Peridot[J]. Journal of Gems & Gemmology, 2023, 25(6): 65-75. DOI: 10.15964/j.cnki.027jgg.2023.06.006
ZHONG Yuan, Andy Hsitien Shen, ZHANG Zhiqing, YE Min, HAN Yu. Application of Machine Learning Algorithms in the Geographical Origin Determination of Peridot[J]. Journal of Gems & Gemmology, 2023, 25(6): 65-75. DOI: 10.15964/j.cnki.027jgg.2023.06.006
Citation:
ZHONG Yuan, Andy Hsitien Shen, ZHANG Zhiqing, YE Min, HAN Yu. Application of Machine Learning Algorithms in the Geographical Origin Determination of Peridot[J]. Journal of Gems & Gemmology, 2023, 25(6): 65-75. DOI: 10.15964/j.cnki.027jgg.2023.06.006
The commonly used elemental mapping method in gemstone origin tracing exhibits inherent limitations, such as subjectivity in element selection, reliance on original samples, and overlapping distribution of multiple origins in two-dimensional mapping. Machine learning (ML) has been widely applied in classification scenarios, including medical diagnosis and crop traceability. While linear discriminant analysis (LDA) has been extensively studied for gemstone origin determination, other ML algorithms have received less attention. In this study, peridot samples from three origins (Damaping, Hebei; Yiqisong, Jilin; Changwon District, Democratic People's Republic of Korea) were analyzed using LA-ICP-MS and modeled with Python. The influence of element selection on LDA effectiveness was analyzed. Results showed that selecting elements with low correlation and significant origin distribution differences improved model accuracy. A linear discriminant model using 10 elements (Mn, Zn, Na, Al, Sc, V, Cr, P, Ti, REE) achieved 0.889 cross-validation accuracy, outperforming models with all detectable elements. Comparing different ML algorithms (LDA, SVM, Decision tree, Random forest, Back propagation neural network) based on these 10 elements, non-linear algorithms, especially SVM, showed better performance.
Zhang Y Y, Chen M H, Ye S, et al. Research of geographical origin of sapphire based on three-dimensional fluorescence spectroscopy: A case study in Sri Lanka and Laos sapphires[J]. Spectroscopy and Spectral Analysis, 2022, 42(5): 1 508-1 513. (in Chinese)
[3]
Abduriyim A. Geographic origin determination of colored gemstones[J]. Gems & Gemology, 2011, 47(2): 114-116.
[4]
Abduriyim A, Kitawaki H. Applications of laser ablation-inductively coupled plasma-mass spectrometry (LA-ICP-MS) to gemology[J]. Gems & Gemology, 2006, 42(2): 98-118.
Xiang F, Wang C S, Jiang Z D, et al. Rare-earth element characters of jadewares of Jinsha site in Chengdu and its significance for indicating material source[J]. Journal of Earth Sciences and Environment, 2008, 30(1): 54-56. (in Chinese)
[6]
Aggarwal R, Sounderajah V, Martin G, et al. Diagnostic accuracy of deep learning in medical imaging: A systematic review and meta-analysis[J]. NPJ Digital Medicine, 2021, 4(1): 1-23. doi: 10.1038/s41746-020-00373-5
[7]
Kabir M H, Guindo M L, Chen R, et al. Geographic origin discrimination of millet using Vis-NIR spectroscopy combined with machine learning techniques[J]. Foods, 2021, 10(11): 2 767-2 778. doi: 10.3390/foods10112767
[8]
Shen A H, Blodgett T E, Shigley J. Country-of-origin determination of modern gem peridots from LA-ICP-MS trace-element chemistry and linear discriminant analysis (LDA)[C]//Geological Society of America Abstracts. Denver: Geological Society of America, 2013: 525.
[9]
Giuliani G, Caumon G, Rakotosamizanany S, et al. Classification chimique descorindons par analyse factorielle discriminante: Application à La typologie des gisements de rubis et saphirs[J]. Revue De Gemmologie, 2014(188): 14-22.
[10]
Zhang Z, Ye M, Shen A H. Characterisation of peridot from China's Jilin Province and from North Korea[J]. The Journal of Gemmology, 2019, 36(5): 436-446. doi: 10.15506/JoG.2019.36.5.436
[11]
Kochelek K A, Mcmillan N J, Mcmanus C E, et al. Provenance determination of sapphires and rubies using laser-induced breakdown spectroscopy and multivariate analysis[J]. American Mineralogist, 2015, 100(8): 1 921-1 931.
[12]
Burges C J C. A tutorial on support vector machines for pattern recognition[J]. Data Mining and Knowledge Discovery, 1998, 2(2): 121-167.
[13]
Maimon O Z, Rokach L. Data mining with decision trees: Theory and applications[M]. Singapore: World Scientific, 2014.
[14]
周志华. 机器学习[M]. 北京: 清华大学出版社, 2016.
Zhou Z H. Machine learning[M]. Beijing: Tsinghua University Publishing House, 2016. (in Chinese)
[15]
Schmidhuber J. Deep learning in neural Networks: An overview[J]. Neural Networks, 2015, 61(1): 85-117.
[16]
Scott D W, Tapia R A, Thompson J R. Kernel density estimation revisited[J]. Nonlinear Analysis: Theory, Methods & Applications, 1977, 1(4): 339-372.
[17]
De Hoog J C M, Gall L, Cornell D H. Trace-element geochemistry of mantle olivine and application to mantle petrogenesis and geothermobarometry[J]. Chemical Geology, 2010, 270(1): 196-215.