XU Bo. Origin Determination of Precious Gemstones on the Application of Big Data[J]. Journal of Gems & Gemmology, 2024, 26(S1): 61-65.
Citation: XU Bo. Origin Determination of Precious Gemstones on the Application of Big Data[J]. Journal of Gems & Gemmology, 2024, 26(S1): 61-65.

Origin Determination of Precious Gemstones on the Application of Big Data

More Information
  • Received Date: July 14, 2024
  • In the gem trade, geographic provenance can significantly enhance the value of precious gemstone. The origin traceability of the prestigious gemstones such as ruby, sapphire, and emerald is not only the core competency of international gemmological laboratories, but also carries profound research significance including differences of origins, characteristics of deposits, mineralization types, ore-forming and prospecting patterns.

    Since the pioneering work of identifying the origins of coloured gemstones was first initiated by the Swiss Gübelin Gem Lab, Switzerland in the 1950s, the research on the origins of precious gemstones has been increasingly studied, which is the significant differences of conventional gemmological properties, inclusions, spectral characteristics, and chemical compositions of emeralds produced by deposits with different ore-forming backgrounds. Therefore, the current origin identification of gemstone is typically based on the comprehensive judgment of experienced gemologists regarding these four aspects. However, with the explosive growth of research on the origins and the comprehensive systematization of data on gemstones from different sources, the effectiveness of traditional identification methods has gradually declined. The phenomenon of similar inclusions, similar spectral patterns, and overlapping chemical composition ranges is commonplace, challenging the reliability of gemstone origin discrimination. In this study, the chemical fingerprint of gemstone is playing an important role in origin identification. With the advancement of micro-analysis techniques, the test precision of trace elements in gemstones has been improved significantly, and high-precision compositional data are strongly indicative significance for origins. Traditional identification techniques for chemical composition analysis mainly rely on low-dimensional binary/ternary discrimination diagrams, but the extensive overlap of data points cannot adapt to the increasingly large number of compositional data set. Therefore, the machine learning methods capable of efficiently mining high-dimensional compositional data are beneficial in addressing the current challenges faced in the origin identification of gemstone.

    In this study, 303 collected emerald samples from 12 producing areas(Fig. 1)including western emerald belt in Colombia (WEB), Kafubu in Zambia, Itabira in Brazil, Carnaíba in Brazil, Socotó in Brazil, Panjshir in Afghanistan, Swat in Pakistan, Shakiso in Ethiopia, Malysheva in Russia, Gwantu in Nigeria, Mananjary in Madagascar, and Dayakou in China.Firstly, the conventional gemmological parameters, inclusions, ultraviolet-visible-near infrared spectroscopy (UV-Vis-NIR), infrared spectroscopy (IR), Raman spectroscopy, and major and trace element contents of emerald samples from various origins were tested by traditional origin identification methods, and the chemical composition databases of the three prestigious gemstones were compiled and constructed. Additionally, three different machine learning methods were applied to mine gemstone composition data, and three efficient and accurate models for gemstone origin identification were constructed.

    Systematic research on emeralds from global originsindicate that there are three different UV-Vis-NIR spectrum absorption patterns (Fig. 2). The study on deuterium water in the emerald band also indicates that infrared absorption in the range of 2 600-2 850 cm-1 can also divide global emeralds into three groups (Fig. 3), which is a groundbreaking conclusion with strong implications for emerald provenance. The chemical composition database of emerald samples from 45 occurrences in 22 countries were compiled, totaling 2 753 data, including 425 test data in this study. Based on this database, three mature and efficient machine learning methods (random forest, support vector machine, and extreme gradient boosting) were used to mine emerald composition data and construct origin identification models. Among them, random forest (RF) and extreme gradient boosting (XGBoost) models have strong adaptability and superior performance for emerald data. The highest accuracy and F1 score of 22 RF models can reach 99.5% (Fig. 4). Model RF-EM1-1320 can distinguish 11 origins with 100% accuracy (Fig. 5), and model RF-EM12-135 can achieve the accuracy of 99.1% using only 5 elements. Simultaneously, the decoding of high-dimensional element information by the models indicates that V/Cr and alkali metal elements (Li, Na, Rb, Cs) are strongly related to origin, with their characteristic weights ranking at the top in all models, indicating the composition and source of the mother fluid in the ore-forming process.These findings underscore the significant advantages of big data and machine learning technologies in the field of gemstone origin determination, characterized by high accuracy, efficiency, and versatility. The application of these technologies for gemstone origin researche represents a groundbreaking expansion of origini dentification technology, and offers a novel perspective for gemmological research.

  • Figure  1.  Emerald samples from 12 origins
    Figure  2.  Three UV-Vis-NIR spectral patterns of emerald samples
    Figure  3.  Three IR spectral patterns of deuterium water in emeral samples
    Figure  4.  The relationship between the RF model score and the number of features
    Figure  5.  Confusion matrix of the prediction results (a) and feature importance ranking of the model RF-EM1-1320
  • [1]
    Aurisicchio C, Conte A M, Medeghini L, et al. Major and trace element geochemistry of emerald from several deposits: Implications for genetic models and classification schemes[J]. Ore Geology Reviews, 2018(94): 351-366.
    [2]
    Groat L A, Giuliani G, Stone sundberg J, et al. A review of analytical methods used in geographic origin determination of gemstones[J]. Gems & Gemology, 2019, 55(4): 512-535.
    [3]
    Palke A C, Saeseaw S, Renfro N D, et al. Geographic origin determination of blue sapphire[J]. Gems & Gemology, 2019, 55(4): 536-579.
    [4]
    Palke A C, Saeseaw S, Renfro N D, et al. Geographic origin determination of ruby[J]. Gems & Gemology, 2019, 55(4): 580-613.
    [5]
    Saeseaw S, Renfro N D, Palke A C, et al. Geographic origin determination of emerald[J]. Gems & Gemology, 2019, 55(4): 614-646.
    [6]
    Schwarz D, Giuliani G, Grundmann G. The origin of emerald—A controversial topic[J]. Extra Lapis Engl, 2002(2): 18-21.
    [7]
    Schwarz D, Giuliani G, Grundmann G. The geographic origin determination of emeralds[J]. InColor, 2015(30): 98-105.
    [8]
    郑育宇, 余晓艳, 许博, 等. 祖母绿产地鉴别研究综述[J]. 岩石矿物学杂志, 2024, 43(3): 525-561.

    Zheng Y Y, Yu X Y, Xu B, et al. A review of research on the emerald origin determination[J]. Acta Petrologica et Mineralogica, 2024, 43(3): 525 561. (in Chinese)
    [9]
    Zheng Y Y, Yu X Y, Xu B, et al. Characterizing malysheva emeralds (Urals, Russia) by microscopy, spectroscopy, trace element chemistry, and machine learning[J]. Crystals, 2024, 14(8): 683. doi: 10.3390/cryst14080683
  • Related Articles

    [1]YE Xu, YANG Jiong, QIU Zhili, ZHANG Yuefeng, ZHENG Xinyu, LAI Shuqi, YUE Zilong. Exploration on the Technical Method of Modern Serpentine Jade Origin Tracing by Laser Raman Spectroscopy[J]. Journal of Gems & Gemmology, 2024, 26(S1): 112-114.
    [2]LUO Heng, Shen Andy Hsitien, PAN Shaokui, ZHONG Yuan, LI Feng, Pia Tonna. Formation Mechanism of Peridot from Yiqisong, Jilin Province and the Application of Machine Learning to Its Origin Determination[J]. Journal of Gems & Gemmology, 2024, 26(S1): 91-93.
    [3]ZHONG Yuan, Andy Hsitien Shen, ZHANG Zhiqing, YE Min, HAN Yu. Application of Machine Learning Algorithms in the Geographical Origin Determination of Peridot[J]. Journal of Gems & Gemmology, 2023, 25(6): 65-75. DOI: 10.15964/j.cnki.027jgg.2023.06.006
    [4]WANG Jingtao. Innovation and Practice of Jewelry Talent Training Mode of Work-Learning Integration Based on New Apprenticeship System——Taking the School of Jewellery of Shenzhen Institute of Technology as An Example[J]. Journal of Gems & Gemmology, 2023, 25(5): 160-164. DOI: 10.15964/j.cnki.027jgg.2023.05.018
    [5]YANG Ping, QIU Zhili. Inheritance, Mutual Learning and Breakthrough in the Innovative Creation of Contemporary Jade Carving —From Fan Junmin's Works Broken Jade[J]. Journal of Gems & Gemmology, 2022, 24(6): 159-166. DOI: 10.15964/j.cnki.027jgg.2022.06.015
    [6]CHEN Yaoduo. Current Status on Application of Large Data in Jewelry Enterprise[J]. Journal of Gems & Gemmology, 2016, 18(4): 60-65.
    [7]GUO Rui, LIANG Yan, QI Mengjun, DENG Hongbin, WU Xiaogang. Band Construction of Jewellery in Age of Big Data[J]. Journal of Gems & Gemmology, 2015, 17(2): 42-46.
    [8]LI Jing, YUAN Xinqiang. Principle and Development of Automatic Punching Machine for Precious Metal Bead Jewelry[J]. Journal of Gems & Gemmology, 2014, 16(1): 70-76.
    [9]CAI Yi-tao, SUN Jing-yu, ZHU De-mao. Quantifying Data before Valuating Jadeite Jade[J]. Journal of Gems & Gemmology, 2009, 11(3): 34-37.
    [10]Liao Zongting, Zhou Zuyi, Yang Ruzeng. EDUCATION OF LEARNING COMBINED WITH PRODUCTION AND CONSTRUCTION OF GEMMOLOGY DISCIPLINE[J]. Journal of Gems & Gemmology, 2000, 2(1): 52-55.
  • Cited by

    Periodical cited type(1)

    1. 李婷婷,王兴弟,孙志旭. 基于珠宝玉石鉴定中的微量元素分析的溯源技术研究. 实验室检测. 2025(03): 17-19 .

    Other cited types(0)

Catalog

    Figures(5)

    Article Metrics

    Article views (51) PDF downloads (29) Cited by(1)
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return