The accurate classification of non-small cell lung carcinoma (NSCLC) into lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) is essential for both clinical practice and lung cancer research. Although the standard WHO diagnosis of NSCLC on biopsy material is rapid and economical, more than 13% of NSCLC tumors in the USA are not further classified. The purpose of this study was to analyze genome-wide differences in copy number variation (CNV) patterns and to develop a CNV signature as an adjunct test for the routine histopathologic classification of NSCLCs. We investigated the genome-wide CNV differences between these two tumor types using three independent patient datasets. Approximately half of the genes examined exhibited significant differences between LUAD and LUSC tumors and the corresponding non-malignant tissues. A new classifier was developed to select signature genes from among 20 000 genes, and thirty-three genes were identified as a CNV signature of NSCLC. Using only the CNV values of these genes, the classification model separated LUADs from LUSCs with accuracies of 0.88 and 0.84 in the training and validation datasets, respectively. The same signature also distinguished NSCLC tumors from their corresponding non-malignant samples with accuracies of 0.96 and 0.98, respectively. We also compared the CNV patterns of NSCLC tumors with those of histologically similar tumors arising at other sites, such as the breast and the head and neck, and with four additional tumor types. Importantly, the significant differences between these tumors may make it possible to identify the site of origin of tumors whose origin is unknown.
ASJC Scopus subject areas
- Cancer Research