Knowledge Distillation and Transformer Based Framework for Automatic Spine CT Report Generation
National University of Sciences and Technology, Islamabad, Pakistan.
National University of Sciences and Technology, Islamabad, Pakistan.
National University of Sciences and Technology, Islamabad, Pakistan; Abu Dhabi University, Abu Dhabi, United Arab Emirates. ORCID iD: 0000-0002-2409-3470
Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. ORCID iD: 0000-0001-6421-6001
2025 (English). In: IEEE Access, E-ISSN 2169-3536, p. 1-1. Article in journal (Refereed). Published
Abstract [en]

Spine Computed Tomography (SCT) is essential for identifying fractures, tumors, and degenerative spine diseases, and it assists medical practitioners in formulating an accurate diagnosis and treatment. Reporting is one of the core elements of SCT, but its effectiveness is often limited by inadequate infrastructure and a lack of experts. Automated SCT analysis has the potential to revolutionize spinal healthcare and improve patient outcomes. To this end, we propose a framework for spine report generation that uses a transformer architecture trained on textual reports alongside visual features extracted from the sagittal slices of the SCT volume. A foundation model is used to perform Knowledge Distillation (KD) alongside an encoder to ensure optimal performance. The proposed framework is evaluated on the public VerSe20 dataset. Incorporating KD improved both the BERT and BLEU-1 scores on the dataset, from 0.7486 to 0.7522 and from 0.6361 to 0.7291, respectively. Additionally, the framework is evaluated using four types of reports: original radiologist reports, reports without spine-level annotations, rephrased reports, and reports generated by ChatGPT-4o (ChatGPT). Reports without spine-level annotations perform best on most metrics, achieving the highest BLEU-1 (0.9293) and ROUGE-L (0.9297) scores. In contrast, the other report types achieved moderate scores across all metrics. Finally, experienced radiologists assessed the spine reports and rated the original reports higher than the generated reports on all three criteria (completeness, conciseness, and correctness). The study's findings suggest that omitting spine-level annotations can improve the quality of text generation.

Place, publisher, year, edition, pages
2025. p. 1-1
Keywords [en]
Spine Report Generation, Knowledge Distillation, Foundation Model, ChatGPT
National Category
Radiology and Medical Imaging
Identifiers
URN: urn:nbn:se:liu:diva-211967
DOI: 10.1109/access.2025.3546131
ISI: 001446493800034
Scopus ID: 2-s2.0-105001061916
OAI: oai:DiVA.org:liu-211967
DiVA, id: diva2:1941593
Note

Funding Agencies: Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia [PNURSP2025R40]

Available from: 2025-03-01 Created: 2025-03-01 Last updated: 2025-04-08

Open Access in DiVA

No full text in DiVA


Authority records

Akbar, Muhammad Usman; Eklund, Anders

Search in DiVA

By author/editor
Khawaja, Sajid Gul; Alghamdi, Norah Saleh; Akram, Muhammad Usman; Akbar, Muhammad Usman; Eklund, Anders
By organisation
Division of Biomedical Engineering; Faculty of Science & Engineering; Center for Medical Image Science and Visualization (CMIV); The Division of Statistics and Machine Learning