-
Language Guided Domain Generalized Medical Image Segmentation
Authors:
Shahina Kunhimon,
Muzammal Naseer,
Salman Khan,
Fahad Shahbaz Khan
Abstract:
Single source domain generalization (SDG) holds promise for more reliable and consistent image segmentation across real-world clinical settings particularly in the medical domain, where data privacy and acquisition cost constraints often limit the availability of diverse datasets. Depending solely on visual features hampers the model's capacity to adapt effectively to various domains, primarily be…
▽ More
Single source domain generalization (SDG) holds promise for more reliable and consistent image segmentation across real-world clinical settings particularly in the medical domain, where data privacy and acquisition cost constraints often limit the availability of diverse datasets. Depending solely on visual features hampers the model's capacity to adapt effectively to various domains, primarily because of the presence of spurious correlations and domain-specific characteristics embedded within the image features. Incorporating text features alongside visual features is a potential solution to enhance the model's understanding of the data, as it goes beyond pixel-level information to provide valuable context. Textual cues describing the anatomical structures, their appearances, and variations across various imaging modalities can guide the model in domain adaptation, ultimately contributing to more robust and consistent segmentation. In this paper, we propose an approach that explicitly leverages textual information by incorporating a contrastive learning mechanism guided by the text encoder features to learn a more robust feature representation. We assess the effectiveness of our text-guided contrastive feature alignment technique in various scenarios, including cross-modality, cross-sequence, and cross-site settings for different segmentation tasks. Our approach achieves favorable performance against existing methods in literature. Our code and model weights are available at https://github.com/ShahinaKK/LG_SDG.git.
△ Less
Submitted 3 April, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Learnable Weight Initialization for Volumetric Medical Image Segmentation
Authors:
Shahina Kunhimon,
Abdelrahman Shaker,
Muzammal Naseer,
Salman Khan,
Fahad Shahbaz Khan
Abstract:
Hybrid volumetric medical image segmentation models, combining the advantages of local convolution and global attention, have recently received considerable attention. While mainly focusing on architectural modifications, most existing hybrid approaches still use conventional data-independent weight initialization schemes which restrict their performance due to ignoring the inherent volumetric nat…
▽ More
Hybrid volumetric medical image segmentation models, combining the advantages of local convolution and global attention, have recently received considerable attention. While mainly focusing on architectural modifications, most existing hybrid approaches still use conventional data-independent weight initialization schemes which restrict their performance due to ignoring the inherent volumetric nature of the medical data. To address this issue, we propose a learnable weight initialization approach that utilizes the available medical training data to effectively learn the contextual and structural cues via the proposed self-supervised objectives. Our approach is easy to integrate into any hybrid model and requires no external training data. Experiments on multi-organ and lung cancer segmentation tasks demonstrate the effectiveness of our approach, leading to state-of-the-art segmentation performance. Our proposed data-dependent initialization approach performs favorably as compared to the Swin-UNETR model pretrained using large-scale datasets on multi-organ segmentation task. Our source code and models are available at: https://github.com/ShahinaKK/LWI-VMS.
△ Less
Submitted 3 April, 2024; v1 submitted 15 June, 2023;
originally announced June 2023.
-
When a CBR in Hand is Better than Twins in the Bush
Authors:
Mobyen Uddin Ahmed,
Shaibal Barua,
Shahina Begum,
Mir Riyanul Islam,
Rosina O Weber
Abstract:
AI methods referred to as interpretable are often discredited as inaccurate by supporters of the existence of a trade-off between interpretability and accuracy. In many problem contexts however this trade-off does not hold. This paper discusses a regression problem context to predict flight take-off delays where the most accurate data regression model was trained via the XGBoost implementation of…
▽ More
AI methods referred to as interpretable are often discredited as inaccurate by supporters of the existence of a trade-off between interpretability and accuracy. In many problem contexts however this trade-off does not hold. This paper discusses a regression problem context to predict flight take-off delays where the most accurate data regression model was trained via the XGBoost implementation of gradient boosted decision trees. While building an XGB-CBR Twin and converting the XGBoost feature importance into global weights in the CBR model, the resultant CBR model alone provides the most accurate local prediction, maintains the global importance to provide a global explanation of the model, and offers the most interpretable representation for local explanations. This resultant CBR model becomes a benchmark of accuracy and interpretability for this problem context, and hence it is used to evaluate the two additive feature attribute methods SHAP and LIME to explain the XGBoost regression model. The results with respect to local accuracy and feature attribution lead to potentially valuable future work.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Direct measurement of the low energy resonances in $^{22}\rm{Ne}(α,γ)^{26}\rm{Mg}$ reaction
Authors:
S. Shahina,
J. Gorres,
D. Robertson,
M. Couder,
O. Gomez,
A. Gula,
M. Hanhardt,
T. Kadlecek,
R. Kelmar,
P. Scholz,
A. Simon,
E. Stech,
F. Strieder,
M. Wiescher
Abstract:
The $^{22}\rm{Ne}(α,γ)^{26}\rm{Mg}$ is an important reaction in stellar helium burning environments as it competes directly with one of the main neutron sources for the s-process, the $^{22}\rm{Ne}(α,n)^{25}\rm{Mg}$ reaction. The reaction rate of the $^{22}\rm{Ne}(α,γ)^{26}\rm{Mg}$ is dominated by the low energy resonances at $E_α^{lab}$ = 650 and 830 keV respectively. The $E_α^{lab}$ = 830 keV re…
▽ More
The $^{22}\rm{Ne}(α,γ)^{26}\rm{Mg}$ is an important reaction in stellar helium burning environments as it competes directly with one of the main neutron sources for the s-process, the $^{22}\rm{Ne}(α,n)^{25}\rm{Mg}$ reaction. The reaction rate of the $^{22}\rm{Ne}(α,γ)^{26}\rm{Mg}$ is dominated by the low energy resonances at $E_α^{lab}$ = 650 and 830 keV respectively. The $E_α^{lab}$ = 830 keV resonance has been measured previously, but there are some uncertainties in the previous measurements. We confirmed the measurement of the $E_α^{lab}$ = 830 keV resonance using implanted $^{22}$Ne targets. We obtained a resonance strength of $ωγ$ = 35 $\pm$ 4 $μeV$, and provide a weighted average of the present and previous measurements of $ωγ$ = 35 $\pm$ 2 $μeV$ with reduced uncertainties compared to previous studies. We also attempted to measure the strength of the predicted resonance at $E_α^{lab}$ = 650 keV directly for the first time and found an upper limit of $ωγ$ $\mathrm{<0.15}$ $μeV$ for the strength of this resonance. In addition, we also studied the $E_{P}^{lab}$= 851 keV resonance in $^{22}\rm{Ne}(p,γ)^{23}\rm{Na}$, and obtained a resonance strength of $ωγ$ = 9.2 $\pm$ 0.7 eV with significantly lower uncertainties compared to previous measurements.
△ Less
Submitted 16 August, 2022;
originally announced August 2022.
-
Efficient NLP Model Finetuning via Multistage Data Filtering
Authors:
Xu Ouyang,
Shahina Mohd Azam Ansari,
Felix Xiaozhu Lin,
Yangfeng Ji
Abstract:
As model finetuning is central to the modern NLP, we set to maximize its efficiency. Motivated by redundancy in training examples and the sheer sizes of pretrained models, we exploit a key opportunity: training only on important data. To this end, we set to filter training examples in a streaming fashion, in tandem with training the target model. Our key techniques are two: (1) automatically deter…
▽ More
As model finetuning is central to the modern NLP, we set to maximize its efficiency. Motivated by redundancy in training examples and the sheer sizes of pretrained models, we exploit a key opportunity: training only on important data. To this end, we set to filter training examples in a streaming fashion, in tandem with training the target model. Our key techniques are two: (1) automatically determine a training loss threshold for skipping backward training passes; (2) run a meta predictor for further skipping forward training passes. We integrate the above techniques in a holistic, three-stage training process. On a diverse set of benchmarks, our method reduces the required training examples by up to 5.3$\times$ and training time by up to 6.8$\times$, while only seeing minor accuracy degradation. Our method is effective even when training one epoch, where each training example is encountered only once. It is simple to implement and is compatible with the existing finetuning techniques. Code is available at: https://github.com/xo28/efficient- NLP-multistage-training
△ Less
Submitted 18 May, 2023; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations
Authors:
Hashmat Shadab Malik,
Shahina K Kunhimon,
Muzammal Naseer,
Salman Khan,
Fahad Shahbaz Khan
Abstract:
Transferable adversarial attacks optimize adversaries from a pretrained surrogate model and known label space to fool the unknown black-box models. Therefore, these attacks are restricted by the availability of an effective surrogate model. In this work, we relax this assumption and propose Adversarial Pixel Restoration as a self-supervised alternative to train an effective surrogate model from sc…
▽ More
Transferable adversarial attacks optimize adversaries from a pretrained surrogate model and known label space to fool the unknown black-box models. Therefore, these attacks are restricted by the availability of an effective surrogate model. In this work, we relax this assumption and propose Adversarial Pixel Restoration as a self-supervised alternative to train an effective surrogate model from scratch under the condition of no labels and few data samples. Our training approach is based on a min-max scheme which reduces overfitting via an adversarial objective and thus optimizes for a more generalizable surrogate model. Our proposed attack is complimentary to the adversarial pixel restoration and is independent of any task specific objective as it can be launched in a self-supervised manner. We successfully demonstrate the adversarial transferability of our approach to Vision Transformers as well as Convolutional Neural Networks for the tasks of classification, object detection, and video segmentation. Our training approach improves the transferability of the baseline unsupervised training method by 16.4% on ImageNet val. set. Our codes & pre-trained surrogate models are available at: https://github.com/HashmatShadab/APR
△ Less
Submitted 14 October, 2022; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Light Response of Poly(ethylene 2,6-napthalate) to Neutrons
Authors:
Brennan Hackett,
Richard deBoer,
Yuri Efremenko,
Michael Febbraro,
Jason Nattress,
Dan Bardayan,
Chevelle Boomershine,
Kristyn Brandenburg,
Stefania Dede,
Joseph Derkin,
Ruoyu Fang,
Adam Fritsch,
August Gula,
Gyurky Gyorgy,
Gula Hamad,
Yenuel Jones-Alberty,
Beka Kelmar,
Khachatur Manukyan,
Miriam Matney,
John McDonaugh,
Shane Moylan,
Patrick O'Malley,
Shahina Shahina,
Nisha Singh
Abstract:
There is increasing necessity for low background active materials as ton-scale, rare-event and cryogenic detectors are developed. Poly(ethylene-2,6-naphthalate) (PEN) has been considered for these applications because of its robust structural characteristics, and its scintillation light in the blue wavelength region. Radioluminescent properties of PEN have been measured to aid in the evaluation of…
▽ More
There is increasing necessity for low background active materials as ton-scale, rare-event and cryogenic detectors are developed. Poly(ethylene-2,6-naphthalate) (PEN) has been considered for these applications because of its robust structural characteristics, and its scintillation light in the blue wavelength region. Radioluminescent properties of PEN have been measured to aid in the evaluation of this material. In this article we present a measurement of PEN's quenching factor using three different neutron sources; neutrons emitted from spontaneous fission in \cf, neutrons generated from a DD generator, and neutrons emitted from the \Can and the \Lipn nuclear reactions. The fission source used time-of-flight to determine the neutron energy, and the neutron energy from the nuclear reactions was defined using thin targets and reaction kinematics. The Birk's factor and scintillation efficiency were found to be $kB = 0.12 \pm 0.01$ mm MeV$^{-1}$ and $S = 1.31\pm0.09$ MeV$_{ee}$ MeV$^{-1}$ from a simultaneous analysis of the data obtained from the three different sources. With these parameters, it is possible to evaluate PEN as a viable material for large-scale, low background physics experiments.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.
-
Using the left Gram matrix to cluster high dimensional data
Authors:
Shahina Rahman,
Valen E. Johnson,
Suhasini Subba Rao
Abstract:
For high dimensional data, where P features for N objects (P >> N) are represented in an NxP matrix X, we describe a clustering algorithm based on the normalized left Gram matrix, G = XX'/P. Under certain regularity conditions, the rows in G that correspond to objects in the same cluster converge to the same mean vector. By clustering on the row means, the algorithm does not require preprocessing…
▽ More
For high dimensional data, where P features for N objects (P >> N) are represented in an NxP matrix X, we describe a clustering algorithm based on the normalized left Gram matrix, G = XX'/P. Under certain regularity conditions, the rows in G that correspond to objects in the same cluster converge to the same mean vector. By clustering on the row means, the algorithm does not require preprocessing by dimension reduction or feature selection techniques and does not require specification of tuning or hyperparameter values. Because it is based on the NxN matrix G, it has a lower computational cost than many methods based on clustering the feature matrix X. When compared to 14 other clustering algorithms applied to 32 benchmarked microarray datasets, the proposed algorithm provided the most accurate estimate of the underlying cluster configuration more than twice as often as its closest competitors.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.
-
Determination of hexadecapole ($β_{4}$) deformation of the light-mass nucleus $^{24}$Mg using quasi-elastic measurement
Authors:
Y. K. Gupta,
B. K. Nayak,
U. Garg,
N. Sensharma,
Shahina,
R. Gandhi,
D. C. Biswas,
M. Şenyiğit,
K. B. Howard,
W. Tan,
P. D. O'Malley,
K. Hagino,
M. Smith,
O. Hall,
M. Hall,
Richard J. deBoer,
K. Ostdiek,
Q. Liu,
A. Long,
J. Hu,
T. Anderson,
M. Skulski,
W. Lu,
E. Lamere,
S. Lyons
, et al. (4 additional authors not shown)
Abstract:
Quasi-elastic scattering measurements have been performed using $^{16}$O and $^{24}$Mg projectiles off $^{90}$Zr at energies around the Coulomb barrier. Experimental data have been analyzed in the framework of coupled channels (CC) calculations using the code CCFULL. The quasi-elastic scattering excitation function and derived barrier distribution for $^{16}$O + $^{90}$Zr reaction are well reprodu…
▽ More
Quasi-elastic scattering measurements have been performed using $^{16}$O and $^{24}$Mg projectiles off $^{90}$Zr at energies around the Coulomb barrier. Experimental data have been analyzed in the framework of coupled channels (CC) calculations using the code CCFULL. The quasi-elastic scattering excitation function and derived barrier distribution for $^{16}$O + $^{90}$Zr reaction are well reproduced by the CC calculations using the vibrational coupling strengths for $^{90}$Zr reported in the literature. Using these vibrational coupling strengths, a Bayesian analysis is carried out for $^{24}$Mg + $^{90}$Zr reaction. The $β_{2}$ and $β_{4}$ values for $^{24}$Mg are determined to be $+0.43 \pm 0.02$ and $ - 0.11 \pm 0.02$, respectively. The $β_{2}$ parameter determined in the present work is in good agreement with results obtained using inelastic scattering probes. The hexadecapole deformation of $^{24}$Mg has been measured very precisely for the first time. Present results establish that quasi-elastic scattering could provide a useful probe to determine the ground state deformation of atomic nuclei.
△ Less
Submitted 5 May, 2020; v1 submitted 30 November, 2018;
originally announced November 2018.
-
A Fast Algorithm for Clustering High Dimensional Feature Vectors
Authors:
Shahina Rahman,
Valen E. Johnson
Abstract:
We propose an algorithm for clustering high dimensional data. If $P$ features for $N$ objects are represented in an $N\times P$ matrix ${\bf X}$, where $N\ll P$, the method is based on exploiting the cluster-dependent structure of the $N\times N$ matrix ${\bf XX}^T$. Computational burden thus depends primarily on $N$, the number of objects to be clustered, rather than $P$, the number of features t…
▽ More
We propose an algorithm for clustering high dimensional data. If $P$ features for $N$ objects are represented in an $N\times P$ matrix ${\bf X}$, where $N\ll P$, the method is based on exploiting the cluster-dependent structure of the $N\times N$ matrix ${\bf XX}^T$. Computational burden thus depends primarily on $N$, the number of objects to be clustered, rather than $P$, the number of features that are measured. This makes the method particularly useful in high dimensional settings, where it is substantially faster than a number of other popular clustering algorithms. Aside from an upper bound on the number of potential clusters, the method is independent of tuning parameters. When compared to $16$ other clustering algorithms on $32$ genomic datasets with gold standards, we show that it provides the most accurate cluster configuration more than twice as often than its closest competitors. We illustrate the method on data taken from highly cited genomic studies.
△ Less
Submitted 2 November, 2018;
originally announced November 2018.