Search | arXiv e-print repository

Language Guided Domain Generalized Medical Image Segmentation

Authors: Shahina Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

Abstract: Single source domain generalization (SDG) holds promise for more reliable and consistent image segmentation across real-world clinical settings particularly in the medical domain, where data privacy and acquisition cost constraints often limit the availability of diverse datasets. Depending solely on visual features hampers the model's capacity to adapt effectively to various domains, primarily be… ▽ More Single source domain generalization (SDG) holds promise for more reliable and consistent image segmentation across real-world clinical settings particularly in the medical domain, where data privacy and acquisition cost constraints often limit the availability of diverse datasets. Depending solely on visual features hampers the model's capacity to adapt effectively to various domains, primarily because of the presence of spurious correlations and domain-specific characteristics embedded within the image features. Incorporating text features alongside visual features is a potential solution to enhance the model's understanding of the data, as it goes beyond pixel-level information to provide valuable context. Textual cues describing the anatomical structures, their appearances, and variations across various imaging modalities can guide the model in domain adaptation, ultimately contributing to more robust and consistent segmentation. In this paper, we propose an approach that explicitly leverages textual information by incorporating a contrastive learning mechanism guided by the text encoder features to learn a more robust feature representation. We assess the effectiveness of our text-guided contrastive feature alignment technique in various scenarios, including cross-modality, cross-sequence, and cross-site settings for different segmentation tasks. Our approach achieves favorable performance against existing methods in literature. Our code and model weights are available at https://github.com/ShahinaKK/LG_SDG.git. △ Less

Submitted 3 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: Accepted at ISBI2024

arXiv:2306.09320 [pdf, other]

Learnable Weight Initialization for Volumetric Medical Image Segmentation

Authors: Shahina Kunhimon, Abdelrahman Shaker, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

Abstract: Hybrid volumetric medical image segmentation models, combining the advantages of local convolution and global attention, have recently received considerable attention. While mainly focusing on architectural modifications, most existing hybrid approaches still use conventional data-independent weight initialization schemes which restrict their performance due to ignoring the inherent volumetric nat… ▽ More Hybrid volumetric medical image segmentation models, combining the advantages of local convolution and global attention, have recently received considerable attention. While mainly focusing on architectural modifications, most existing hybrid approaches still use conventional data-independent weight initialization schemes which restrict their performance due to ignoring the inherent volumetric nature of the medical data. To address this issue, we propose a learnable weight initialization approach that utilizes the available medical training data to effectively learn the contextual and structural cues via the proposed self-supervised objectives. Our approach is easy to integrate into any hybrid model and requires no external training data. Experiments on multi-organ and lung cancer segmentation tasks demonstrate the effectiveness of our approach, leading to state-of-the-art segmentation performance. Our proposed data-dependent initialization approach performs favorably as compared to the Swin-UNETR model pretrained using large-scale datasets on multi-organ segmentation task. Our source code and models are available at: https://github.com/ShahinaKK/LWI-VMS. △ Less

Submitted 3 April, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: Accepted at Elsevier AI in Medicine Journal

arXiv:2305.05111 [pdf, other]

When a CBR in Hand is Better than Twins in the Bush

Authors: Mobyen Uddin Ahmed, Shaibal Barua, Shahina Begum, Mir Riyanul Islam, Rosina O Weber

Abstract: AI methods referred to as interpretable are often discredited as inaccurate by supporters of the existence of a trade-off between interpretability and accuracy. In many problem contexts however this trade-off does not hold. This paper discusses a regression problem context to predict flight take-off delays where the most accurate data regression model was trained via the XGBoost implementation of… ▽ More AI methods referred to as interpretable are often discredited as inaccurate by supporters of the existence of a trade-off between interpretability and accuracy. In many problem contexts however this trade-off does not hold. This paper discusses a regression problem context to predict flight take-off delays where the most accurate data regression model was trained via the XGBoost implementation of gradient boosted decision trees. While building an XGB-CBR Twin and converting the XGBoost feature importance into global weights in the CBR model, the resultant CBR model alone provides the most accurate local prediction, maintains the global importance to provide a global explanation of the model, and offers the most interpretable representation for local explanations. This resultant CBR model becomes a benchmark of accuracy and interpretability for this problem context, and hence it is used to evaluate the two additive feature attribute methods SHAP and LIME to explain the XGBoost regression model. The results with respect to local accuracy and feature attribution lead to potentially valuable future work. △ Less

Submitted 8 May, 2023; originally announced May 2023.

Comments: The version of this paper published in ICCBR XCBR '22 contained an erroneous sum in Equation 3 that we have corrected in this version

Journal ref: ICCBR XCBR '22: 4th Workshop on XCBR: Case-based Reasoning for the Explanation of Intelligent Systems at ICCBR-2022, September, 2022, Nancy, France

arXiv:2208.07885 [pdf, ps, other]

doi 10.1103/PhysRevC.106.025805

Direct measurement of the low energy resonances in $^{22}\rm{Ne}(αあるふぁ,γがんま)^{26}\rm{Mg}$ reaction

Authors: S. Shahina, J. Gorres, D. Robertson, M. Couder, O. Gomez, A. Gula, M. Hanhardt, T. Kadlecek, R. Kelmar, P. Scholz, A. Simon, E. Stech, F. Strieder, M. Wiescher

Abstract: The $^{22}\rm{Ne}(αあるふぁ,γがんま)^{26}\rm{Mg}$ is an important reaction in stellar helium burning environments as it competes directly with one of the main neutron sources for the s-process, the $^{22}\rm{Ne}(αあるふぁ,n)^{25}\rm{Mg}$ reaction. The reaction rate of the $^{22}\rm{Ne}(αあるふぁ,γがんま)^{26}\rm{Mg}$ is dominated by the low energy resonances at $E_αあるふぁ^{lab}$ = 650 and 830 keV respectively. The $E_αあるふぁ^{lab}$ = 830 keV re… ▽ More The $^{22}\rm{Ne}(αあるふぁ,γがんま)^{26}\rm{Mg}$ is an important reaction in stellar helium burning environments as it competes directly with one of the main neutron sources for the s-process, the $^{22}\rm{Ne}(αあるふぁ,n)^{25}\rm{Mg}$ reaction. The reaction rate of the $^{22}\rm{Ne}(αあるふぁ,γがんま)^{26}\rm{Mg}$ is dominated by the low energy resonances at $E_αあるふぁ^{lab}$ = 650 and 830 keV respectively. The $E_αあるふぁ^{lab}$ = 830 keV resonance has been measured previously, but there are some uncertainties in the previous measurements. We confirmed the measurement of the $E_αあるふぁ^{lab}$ = 830 keV resonance using implanted $^{22}$Ne targets. We obtained a resonance strength of $ωおめがγがんま$ = 35 $\pm$ 4 $μみゅーeV$, and provide a weighted average of the present and previous measurements of $ωおめがγがんま$ = 35 $\pm$ 2 $μみゅーeV$ with reduced uncertainties compared to previous studies. We also attempted to measure the strength of the predicted resonance at $E_αあるふぁ^{lab}$ = 650 keV directly for the first time and found an upper limit of $ωおめがγがんま$ $\mathrm{<0.15}$ $μみゅーeV$ for the strength of this resonance. In addition, we also studied the $E_{P}^{lab}$= 851 keV resonance in $^{22}\rm{Ne}(p,γがんま)^{23}\rm{Na}$, and obtained a resonance strength of $ωおめがγがんま$ = 9.2 $\pm$ 0.7 eV with significantly lower uncertainties compared to previous measurements. △ Less

Submitted 16 August, 2022; originally announced August 2022.

Comments: 10 pages, 6 figures

arXiv:2207.14386 [pdf, other]

Efficient NLP Model Finetuning via Multistage Data Filtering

Authors: Xu Ouyang, Shahina Mohd Azam Ansari, Felix Xiaozhu Lin, Yangfeng Ji

Abstract: As model finetuning is central to the modern NLP, we set to maximize its efficiency. Motivated by redundancy in training examples and the sheer sizes of pretrained models, we exploit a key opportunity: training only on important data. To this end, we set to filter training examples in a streaming fashion, in tandem with training the target model. Our key techniques are two: (1) automatically deter… ▽ More As model finetuning is central to the modern NLP, we set to maximize its efficiency. Motivated by redundancy in training examples and the sheer sizes of pretrained models, we exploit a key opportunity: training only on important data. To this end, we set to filter training examples in a streaming fashion, in tandem with training the target model. Our key techniques are two: (1) automatically determine a training loss threshold for skipping backward training passes; (2) run a meta predictor for further skipping forward training passes. We integrate the above techniques in a holistic, three-stage training process. On a diverse set of benchmarks, our method reduces the required training examples by up to 5.3$\times$ and training time by up to 6.8$\times$, while only seeing minor accuracy degradation. Our method is effective even when training one epoch, where each training example is encountered only once. It is simple to implement and is compatible with the existing finetuning techniques. Code is available at: https://github.com/xo28/efficient- NLP-multistage-training △ Less

Submitted 18 May, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

arXiv:2207.08803 [pdf, other]

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations

Authors: Hashmat Shadab Malik, Shahina K Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

Abstract: Transferable adversarial attacks optimize adversaries from a pretrained surrogate model and known label space to fool the unknown black-box models. Therefore, these attacks are restricted by the availability of an effective surrogate model. In this work, we relax this assumption and propose Adversarial Pixel Restoration as a self-supervised alternative to train an effective surrogate model from sc… ▽ More Transferable adversarial attacks optimize adversaries from a pretrained surrogate model and known label space to fool the unknown black-box models. Therefore, these attacks are restricted by the availability of an effective surrogate model. In this work, we relax this assumption and propose Adversarial Pixel Restoration as a self-supervised alternative to train an effective surrogate model from scratch under the condition of no labels and few data samples. Our training approach is based on a min-max scheme which reduces overfitting via an adversarial objective and thus optimizes for a more generalizable surrogate model. Our proposed attack is complimentary to the adversarial pixel restoration and is independent of any task specific objective as it can be launched in a self-supervised manner. We successfully demonstrate the adversarial transferability of our approach to Vision Transformers as well as Convolutional Neural Networks for the tasks of classification, object detection, and video segmentation. Our training approach improves the transferability of the baseline unsupervised training method by 16.4% on ImageNet val. set. Our codes & pre-trained surrogate models are available at: https://github.com/HashmatShadab/APR △ Less

Submitted 14 October, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

Comments: Accepted at BMVC'22 (Oral)

arXiv:2204.02866 [pdf, other]

Light Response of Poly(ethylene 2,6-napthalate) to Neutrons

Authors: Brennan Hackett, Richard deBoer, Yuri Efremenko, Michael Febbraro, Jason Nattress, Dan Bardayan, Chevelle Boomershine, Kristyn Brandenburg, Stefania Dede, Joseph Derkin, Ruoyu Fang, Adam Fritsch, August Gula, Gyurky Gyorgy, Gula Hamad, Yenuel Jones-Alberty, Beka Kelmar, Khachatur Manukyan, Miriam Matney, John McDonaugh, Shane Moylan, Patrick O'Malley, Shahina Shahina, Nisha Singh

Abstract: There is increasing necessity for low background active materials as ton-scale, rare-event and cryogenic detectors are developed. Poly(ethylene-2,6-naphthalate) (PEN) has been considered for these applications because of its robust structural characteristics, and its scintillation light in the blue wavelength region. Radioluminescent properties of PEN have been measured to aid in the evaluation of… ▽ More There is increasing necessity for low background active materials as ton-scale, rare-event and cryogenic detectors are developed. Poly(ethylene-2,6-naphthalate) (PEN) has been considered for these applications because of its robust structural characteristics, and its scintillation light in the blue wavelength region. Radioluminescent properties of PEN have been measured to aid in the evaluation of this material. In this article we present a measurement of PEN's quenching factor using three different neutron sources; neutrons emitted from spontaneous fission in \cf, neutrons generated from a DD generator, and neutrons emitted from the \Can and the \Lipn nuclear reactions. The fission source used time-of-flight to determine the neutron energy, and the neutron energy from the nuclear reactions was defined using thin targets and reaction kinematics. The Birk's factor and scintillation efficiency were found to be $kB = 0.12 \pm 0.01$ mm MeV$^{-1}$ and $S = 1.31\pm0.09$ MeV$_{ee}$ MeV$^{-1}$ from a simultaneous analysis of the data obtained from the three different sources. With these parameters, it is possible to evaluate PEN as a viable material for large-scale, low background physics experiments. △ Less

Submitted 6 April, 2022; originally announced April 2022.

Comments: 20 pages, 14 figures

arXiv:2202.08236 [pdf, other]

Using the left Gram matrix to cluster high dimensional data

Authors: Shahina Rahman, Valen E. Johnson, Suhasini Subba Rao

Abstract: For high dimensional data, where P features for N objects (P >> N) are represented in an NxP matrix X, we describe a clustering algorithm based on the normalized left Gram matrix, G = XX'/P. Under certain regularity conditions, the rows in G that correspond to objects in the same cluster converge to the same mean vector. By clustering on the row means, the algorithm does not require preprocessing… ▽ More For high dimensional data, where P features for N objects (P >> N) are represented in an NxP matrix X, we describe a clustering algorithm based on the normalized left Gram matrix, G = XX'/P. Under certain regularity conditions, the rows in G that correspond to objects in the same cluster converge to the same mean vector. By clustering on the row means, the algorithm does not require preprocessing by dimension reduction or feature selection techniques and does not require specification of tuning or hyperparameter values. Because it is based on the NxN matrix G, it has a lower computational cost than many methods based on clustering the feature matrix X. When compared to 14 other clustering algorithms applied to 32 benchmarked microarray datasets, the proposed algorithm provided the most accurate estimate of the underlying cluster configuration more than twice as often as its closest competitors. △ Less

Submitted 16 February, 2022; originally announced February 2022.

arXiv:1811.12756 [pdf, ps, other]

Determination of hexadecapole ($βべーた_{4}$) deformation of the light-mass nucleus $^{24}$Mg using quasi-elastic measurement

Authors: Y. K. Gupta, B. K. Nayak, U. Garg, N. Sensharma, Shahina, R. Gandhi, D. C. Biswas, M. Şenyiğit, K. B. Howard, W. Tan, P. D. O'Malley, K. Hagino, M. Smith, O. Hall, M. Hall, Richard J. deBoer, K. Ostdiek, Q. Liu, A. Long, J. Hu, T. Anderson, M. Skulski, W. Lu, E. Lamere, S. Lyons , et al. (4 additional authors not shown)

Abstract: Quasi-elastic scattering measurements have been performed using $^{16}$O and $^{24}$Mg projectiles off $^{90}$Zr at energies around the Coulomb barrier. Experimental data have been analyzed in the framework of coupled channels (CC) calculations using the code CCFULL. The quasi-elastic scattering excitation function and derived barrier distribution for $^{16}$O + $^{90}$Zr reaction are well reprodu… ▽ More Quasi-elastic scattering measurements have been performed using $^{16}$O and $^{24}$Mg projectiles off $^{90}$Zr at energies around the Coulomb barrier. Experimental data have been analyzed in the framework of coupled channels (CC) calculations using the code CCFULL. The quasi-elastic scattering excitation function and derived barrier distribution for $^{16}$O + $^{90}$Zr reaction are well reproduced by the CC calculations using the vibrational coupling strengths for $^{90}$Zr reported in the literature. Using these vibrational coupling strengths, a Bayesian analysis is carried out for $^{24}$Mg + $^{90}$Zr reaction. The $βべーた_{2}$ and $βべーた_{4}$ values for $^{24}$Mg are determined to be $+0.43 \pm 0.02$ and $ - 0.11 \pm 0.02$, respectively. The $βべーた_{2}$ parameter determined in the present work is in good agreement with results obtained using inelastic scattering probes. The hexadecapole deformation of $^{24}$Mg has been measured very precisely for the first time. Present results establish that quasi-elastic scattering could provide a useful probe to determine the ground state deformation of atomic nuclei. △ Less

Submitted 5 May, 2020; v1 submitted 30 November, 2018; originally announced November 2018.

Comments: This is a slightly modified version, accepted for publication in Phys. Lett. B

arXiv:1811.00956 [pdf, ps, other]

A Fast Algorithm for Clustering High Dimensional Feature Vectors

Authors: Shahina Rahman, Valen E. Johnson

Abstract: We propose an algorithm for clustering high dimensional data. If $P$ features for $N$ objects are represented in an $N\times P$ matrix ${\bf X}$, where $N\ll P$, the method is based on exploiting the cluster-dependent structure of the $N\times N$ matrix ${\bf XX}^T$. Computational burden thus depends primarily on $N$, the number of objects to be clustered, rather than $P$, the number of features t… ▽ More We propose an algorithm for clustering high dimensional data. If $P$ features for $N$ objects are represented in an $N\times P$ matrix ${\bf X}$, where $N\ll P$, the method is based on exploiting the cluster-dependent structure of the $N\times N$ matrix ${\bf XX}^T$. Computational burden thus depends primarily on $N$, the number of objects to be clustered, rather than $P$, the number of features that are measured. This makes the method particularly useful in high dimensional settings, where it is substantially faster than a number of other popular clustering algorithms. Aside from an upper bound on the number of potential clusters, the method is independent of tuning parameters. When compared to $16$ other clustering algorithms on $32$ genomic datasets with gold standards, we show that it provides the most accurate cluster configuration more than twice as often than its closest competitors. We illustrate the method on data taken from highly cited genomic studies. △ Less

Submitted 2 November, 2018; originally announced November 2018.

Showing 1–10 of 10 results for author: Shahina