- Zhang, Chun-Yun;
- Li, Xueshu;
- Stietz, Kimberly P Keil;
- Sethi, Sunjay;
- Yang, Weizhu;
- Marek, Rachel F;
- Ding, Xinxin;
- Lein, Pamela J;
- Hornbuckle, Keri C;
- Lehmler, Hans-Joachim
Laboratory studies of the disposition and toxicity of hydroxylated polychlorinated biphenyl (OH-PCB) metabolites are challenging because authentic analytical standards for most unknown OH-PCBs are not available. To assist with the characterization of these OH-PCBs (as methylated derivatives), we developed machine learning-based models with multiple linear regression (MLR) or random forest regression (RFR) to predict the relative retention times (RRT) and MS/MS responses of methoxylated (MeO-)PCBs on a gas chromatograph-tandem mass spectrometry system. The final MLR model estimated the retention times of MeO-PCBs with a mean absolute error of 0.55 min (n = 121). The similarity coefficients cos θ between the predicted (by RFR model) and experimental MS/MS data of MeO-PCBs were >0.95 for 92% of observations (n = 96). The levels of MeO-PCBs quantified with the predicted MS/MS response factors approximated the experimental values within a 2-fold difference for 85% of observations and 3-fold differences for all observations (n = 89). Subsequently, these model predictions were used to assist with the identification of OH-PCB 95 or OH-PCB 28 metabolites in mouse feces or liver by suggesting candidate ranking information for identifying the metabolite isomers. Thus, predicted retention and MS/MS response data can assist in identifying unknown OH-PCBs.