DB-SAM: Delving into High Quality Universal Medical Image SegmentationShow others and affiliations
2024 (English)In: MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, SPRINGER INTERNATIONAL PUBLISHING AG , 2024, Vol. 15012, p. 498-508Conference paper, Published paper (Refereed)
Abstract [en]
Recently, the Segment Anything Model (SAM) has demonstrated promising segmentation capabilities in a variety of downstream segmentation tasks. However in the context of universal medical image segmentation there exists a notable performance discrepancy when directly applying SAM due to the domain gap between natural and 2D/3D medical data. In this work, we propose a dual-branch adapted SAM framework, named DB-SAM, that strives to effectively bridge this domain gap. Our dual-branch adapted SAM contains two branches in parallel: a ViT branch and a convolution branch. The ViT branch incorporates a learnable channel attention block after each frozen attention block, which captures domain-specific local features. On the other hand, the convolution branch employs a light-weight convolutional block to extract domain-specific shallow features from the input medical image. To perform cross-branch feature fusion, we design a bilateral cross-attention block and a ViT convolution fusion block, which dynamically combine diverse information of two branches for mask decoder. Extensive experiments on large-scale medical image dataset with various 3D and 2D medical segmentation tasks reveal the merits of our proposed contributions. On 21 3D medical image segmentation tasks, our proposed DB-SAM achieves an absolute gain of 8.8%, compared to a recent medical SAM adapter in the literature. The code and model are available at https://github.com/AlfredQin/DB-SAM.
Place, publisher, year, edition, pages
SPRINGER INTERNATIONAL PUBLISHING AG , 2024. Vol. 15012, p. 498-508
Series
Lecture Notes in Computer Science, ISSN 0302-9743
Keywords [en]
Medical Image Segmentation; Segmentation Foundation Model; Multi-modality
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:liu:diva-210357DOI: 10.1007/978-3-031-72390-2_47ISI: 001344002100047ISBN: 9783031723896 (print)ISBN: 9783031723902 (electronic)OAI: oai:DiVA.org:liu-210357DiVA, id: diva2:1920035
Conference
27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), Palmeraie Conf Ctr, Marrakesh, MOROCCO, oct 06-10, 2024
2024-12-102024-12-102025-02-07