کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
2121358 | 1546897 | 2014 | 11 صفحه PDF | دانلود رایگان |

• We collected 17 gene expression datasets of non-small cell lung cancer from public databases to generate a training cohort of 1073 and a testing cohort of 659 with institutional variations successfully eliminated.
• We identified seven gene signatures, and combined them with the clinical parameters age and stage, we could differentiate patients into three risk groups and predict patient survival probabilities at 10 and 15 years post-surgical resection, which were extensively verified using 6 other datasets from five different countries.
Lung cancer is a commonly diagnosed cancer. In this era of personalized medicine, genetic predictive models are becoming increasingly important. However, many current predictive models fail verification tests due to small sample sizes and institutional biases. We collected 17 gene expression datasets from public databases to generate our largest training and testing cohorts. After successfully eliminating institutional variations and merging multiple datasets, we generated a training cohort of 1073 and a testing cohort of 659. Using Siggenes, univariate and multivariate analyses, we identified seven gene signatures, and combined them with the clinical parameter age and stage to design the lung cancer prognostic index (LCPI). Using LCPI, we could differentiate lung cancer patients into three risk groups and predict patient survival probabilities at 10 and 15 year post-surgical resection. We extensively verified the predictive ability of LCPI for overall and recurrence free survival using 6 other datasets from five different countries.
Journal: EBioMedicine - Volume 1, Issues 2–3, December 2014, Pages 156–166