Article ID Journal Published Year Pages File Type
1201391 Journal of Chromatography A 2013 7 Pages PDF
Abstract

•Compared performance of 5 literature-reported compound identification measures.•Estimated dependency of identification accuracy on the size of reference library.•Weight factors dependent on both reference library and spectral similarity measures.•The semi-partial correlation mixture measure performs the best.

Compound identification in gas chromatography–mass spectrometry (GC–MS) is usually achieved by matching query spectra to spectra present in a reference library. Although several spectral similarity measures have been developed and compared using a small reference library, it still remains unknown how the relationship between the spectral similarity measure and the size of reference library affects on the identification accuracy as well as the optimal weight factor. We used three reference libraries to investigate the dependency of the optimal weight factor, spectral similarity measure and the size of reference library. Our study demonstrated that the optimal weight factor depends on not only spectral similarity measure but also the size of reference library. The mixture semi-partial correlation measure outperforms all existing spectral similarity measures in all tested reference libraries, in spite of the computational expense. Furthermore, the accuracy of compound identification using a larger reference library in future is estimated by varying the size of reference library. Simulation study indicates that the mixture semi-partial correlation measure will have the best performance with the increase of reference library in future.

Related Topics
Physical Sciences and Engineering Chemistry Analytical Chemistry
Authors
, , ,