Conference item
TiBiX: leveraging temporal information for bidirectional X-ray and report generation
- Abstract:
- With the emergence of vision language models in the medical imaging domain, numerous studies have focused on two dominant research activities: (1) report generation from Chest X-rays (CXR), and (2) synthetic scan generation from text or reports. Despite some research incorporating multi-view CXRs into the generative process, prior patient scans and reports have been generally disregarded. This can inadvertently lead to the leaving out of important medical information, thus affecting generation quality. To address this, we propose TiBiX: Leveraging Temporal information for Bidirectional X-ray and Report Generation. Considering previous scans, our approach facilitates bidirectional generation, primarily addressing two challenging problems: (1) generating the current image from the previous image and current report and (2) generating the current report based on both the previous and current images. Moreover, we extract and release a curated temporal benchmark dataset derived from the MIMIC-CXR dataset, which focuses on temporal data. Our comprehensive experiments and ablation studies explore the merits of incorporating prior CXRs and achieve state-of-the-art (SOTA) results on the report generation task. Furthermore, we attain on-par performance with SOTA image generation efforts, thus serving as a new baseline in longitudinal bidirectional CXR-to-report generation. The code is available at https://github.com/BioMedIA-MBZUAI/TiBiX.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 744.1KB, Terms of use)
-
- Publisher copy:
- 10.1007/978-3-031-72744-3_17
Authors
- Publisher:
- Springer Nature
- Host title:
- 4th MICCAI Workshop, DGM4MICCAI 2024, Held in Conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, Proceedings
- Pages:
- 169-179
- Series:
- Lecture Notes in Computer Science
- Series number:
- 15224
- Place of publication:
- Cham, Switzerland
- Publication date:
- 2024-10-09
- Acceptance date:
- 2024-07-15
- Event title:
- DGM4MICCAI 2024
- Event website:
- https://caption-workshop.github.io/
- Event start date:
- 2024-10-06
- Event end date:
- 2024-10-06
- DOI:
- EISBN:
- 9783031727443
- ISBN:
- 9783031727436
- Language:
-
English
- Keywords:
- Pubs id:
-
2052412
- UUID:
-
uuid_8bbfb160-d870-4885-a573-98422a3efab5
- Local pid:
-
pubs:2052412
- Deposit date:
-
2025-12-14
Terms of use
- Copyright holder:
- Sanjeev et al
- Copyright date:
- 2024
- Rights statement:
- © 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
- Notes:
- The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record