Abstract

Rationale

In chronic obstructive pulmonary disease (COPD) patients identified via community surveys, treatment is necessary not only for those with poor lung function and severe symptoms but also for individuals prone to rapid disease progression.

Objective

We aimed to develop a tool capable of identifying COPD patients who require necessary treatment at the initial assessment.

Method

We included patients with mild to moderate COPD who received a placebo for 2 years. The natural disease progression (experiencing clinical important deterioration [CID]) without intervention of these patients were observed. The placebo groups from the 2 RCTs were utilized as the training and validation sets, respectively, for the machine learning model to predict patients requiring necessary treatment based on their baseline conditions.

Result

In the placebo group of 644 patients, 248 (38.5%) with a relatively severe baseline. Despite having a better baseline, 113 patients (17.5%) developed CID during follow-up. Overall, 361 patients (56.1%) were identified as requiring necessary treatment. Among the three machine learning models, the Random Forest model had the highest Balanced Accuracy 0.852, Specificity 0.950, and PPV 0.963 in the validation set.

Conclusion

Our research offers a predictive tool for identifying which mild to moderate COPD patients require necessary treatment.

Table 1. Various model performances in the validation data sets

Statistics Naïve Bayes Quadratic Discriminant Random Forest
Accuracy (95% CI) 0.796 (0.748, 0.838) 0.780 (0.731, 0.824) 0.827 (0.781, 0.866)
Sensitivity 0.819 0.809 0.755
Specificity 0.756 0.731 0.950
Positive Predictive Value 0.852 0.838 0.963
Negative Predictive Value 0.756 0.691 0.693