Abstract
Background: Bone marrow examination (BME) is the gold standard for diagnosing myelodysplastic syndromes (MDS). Because it is invasive, many physicians and patients (pts) avoid BME, and the MDS Diagnosis is missed. Recently (Oster et al. Int MDS, Valencia, May 2017), we proposed a non-invasive MDS diagnostic model, using logistic regression (LoR) with a small group of patients (47 BME-proven MDS vs 63 controls with no MDS by BME). The variables gender, age, Hb, MCV, PLT and WBC were found to be the most important. Putting the variables in the formula (Fig 1) yielded a score Y, reflecting the probability of having MDS. The area under the curve (AUC) for the ROC curve was 0.748, indicating a good model, but one insufficient to provide a single Y, above which MDS could be established clearly. We therefore established two cut-off values: Pts with Y≥0.633 were found to have high likelihood for MDS, and those with Y≤0.288, a low likelihood of having MDS. Those pts (about 50%) where Y fell between the two cutoffs were indeterminate. In this study, the large European MDS Registry (EUMDS) database was used to validate and improve the model.
Methods: The EUMDS registry contains data of 2200 MDS pts. A random sample (age≥65) of 131 MDS pts from the registry, and 115 controls with no MDS were added to the original dataset, providing 178 in each group. Two models, LoR with covariates selected using the Lasso method, and gradient-boosted (GB) were assessed. AUC was used to evaluate the models. Model stability was assessed by repeating the fit of the models using different randomly chosen groups of 131 EUMDS pts that replaced the first 131 EUMDS pt group (random augmentation). Another analysis was performed using all the EUMDS pts.
Results: For the LoR model, MCV, PLT and WBC were selected, and the AUC was 0.75 (95% CI 0.70 - 0.80, Fig 2). For the GB model, the AUC was 0.87 (95% CI 0.84-0.91, Fig 2). The most influential variables were MCV and WBC. 50 repetitions of the random augmentation led to a mean AUC of 0.87 (IQR 0.86-0.88), attesting to the stability of the model. Using the entire 2200 pts EUMDS data set also provided consistent results.
Conclusions: A classical LoR model using the large EUMDS dataset, confirmed our original finding that using simple parameters, MDS can be diagnosed or excluded in a large portion of pts without BME. Using the complex GB model, that not only combines the chosen variables, but also takes into account the interactions among them, the AUC approaches 90%. We will continue to improve and validate the model. In our original model using two cutoff points, we were able to establish or rule out MDS with high accuracy in approximately 50% of the patients. However, about half of the patients fell within the middle 'indeterminate' zone. With this improved AUC, it is likely that the 'indeterminate' zone will shrink and we will be able to establish or rule out MDS in an even greater proportion of patients without an invasive BME. Moreover, adding more controls will allow increasing the number of variables (e.g. folic acid, B12) that can have an additional impact on the model and its diagnostic capability.
Fenaux: Celgene: Honoraria, Research Funding; Astex: Honoraria, Research Funding; Janssen: Honoraria, Research Funding; Amgen: Honoraria, Research Funding; Novartis: Honoraria, Research Funding; Amgen: Honoraria, Research Funding; Novartis: Honoraria, Research Funding; Janssen: Honoraria, Research Funding; Celgene: Honoraria, Research Funding; Astex: Honoraria, Research Funding. Sanz: Gamida Cell: Research Funding. Germing: Novartis: Honoraria, Research Funding; Janssen: Honoraria; Celgene: Honoraria, Research Funding. Almeida: Novartis: Consultancy; Celgene: Consultancy; Servier: Consultancy; Alexion: Honoraria; Bristol Meyer Squibb: Honoraria.
Author notes
Asterisk with author names denotes non-ASH members.

