This repo houses the code used for Deep Multi-Task Learning for Brain MR Series Classification by Sequence and Orientation a part of the 2020 MSDSE Summer Research Initiative Project, A practical DL-based tool for medical image specifications identification.
Increasingly complex MRI studies and variable series naming conventions reveal limitations of rule-based image routing, especially in health systems with multiple scanners and sites. Accurate methods to identify series based on image content would aid post-processing and PACS viewing. Recent deep/machine learning efforts classify 5–8 basic brain MR sequences. We present an ensemble model combining a convolutional neural network and a random forest classifier to differentiate 25 brain sequences and image orientation.
Series were grouped by descriptions into 25 sequences and 4 orientations. Dataset A, obtained from our institution, was divided into training (16,828 studies; 48,512 series; 112,028 images), validation (4746 studies; 16,612 series; 26,222 images) and test sets (6348 studies; 58,705 series; 3,314,018 images). Dataset B, obtained from a separate hospital, was used for out-of-domain external validation (1252 studies; 2150 series; 234,944 images). We developed an ensemble model combining a 2D convolutional neural network with a custom multi-task learning architecture and random forest classifier trained on DICOM metadata to classify sequence and orientation by series.
The data for this project is not publicly available. Please reach out to me at [email protected] to learn more about the data used here, and what steps we can take to create a similar dataset.
Model Architecture |
---|
The neural network, random forest, and ensemble achieved 95%, 97%, and 98% overall sequence accuracy on dataset A, and 98%, 99%, and 99% accuracy on dataset B, respectively. All models achieved > 99% orientation accuracy on both datasets.
For the latest / best trained models, please reach out to me at [email protected]
For the CNN especially, the design of the dataloader and model was built with the intention of eventually incorporating as many relevant MRI metadata attributes as possible into the prediction procedure, with minimal cost to inference time. With that in mind, one possible follow-up to this work is to extend the CNN's functionality to also predict slice thickness, whether or not this MRI uses a Fat Sat pulse, or extending to predict what body part is imaged.