Search | arXiv e-print repository

arXiv:2407.06022 [pdf]

Investigation of microstructural evolution of irradiation-induced defects in tungsten: an experimental-numerical approach

Authors: Salahudeen Mohamed, Qian Yuan, Dimitri Litvinov, Jie Gao, Ermile Gaganidze, Dmitry Terentyev, Hans-Christian Schneider, Jarir Aktaa

Abstract: The hostile condition in a fusion tokamak reactor poses the main challenge in the development and design of in-vessel components such as divertor and breeding blanket due to fusion relevant irradiation conditions (14 MeV) and large thermal loads. The current work describes the employment of an integrated experimental-numerical approach to assess the microstructure evolution of dislocation loops and voids in tungsten proposed for fusion application. A cluster dynamics (CD) model is implemented and simulations are performed on the irradiated tungsten Disk shape Compact Tension (DCT) specimen used in the experimental test. TEM characterisation is performed on the DCT specimen irradiated at 400 °C and 600 °C with around 1 dpa, respectively. The dpa rate and cascade overlap rate from the experiments and SPECTRA-PKA code, respectively, are implemented in the CD model. Based on the comparison between experimental and computational results, the dose and temperature dependence of irradiation-induced defects (dislocation loops, voids, c15 clusters) are clearly observed. Trap mediated diffusion is studied and the impact of cascades with the pre-existing defects is analysed through full cascade overlap mode and the consequent influence on the defect concentration is evaluated. The exchange of self-interstitial atoms (SIAs) and the change in the size of loops through reaction between <111> and <100> loops are studied in detail by means of the transfer rate of the SIAs.

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2407.00380 [pdf, other]

Particle acceleration at the bow shock of runaway star LS 2355: non-thermal radio emission but no $\gamma$-ray counterpart

Authors: J. van den Eijnden, S. Mohamed, F. Carotenuto, S. Motta, P. Saikia, D. R. A. Williams-Baldwin

Abstract: Massive stars that travel at supersonic speeds can create bow shocks as their stellar winds interact with the surrounding interstellar medium. These bow shocks - prominent sites for mechanical feedback of individual massive stars - are predominantly observed in the infrared band. Confirmed high-energy emission from stellar bow shocks has remained elusive and confirmed radio counterparts, while rising in recent years, remain rare. Here, we present an in-depth multi-wavelength exploration of the bow shock driven by LS 2355, focusing on its non-thermal properties. Using the most-recent Fermi source catalogue, we rule out its previously-proposed association with an unidentified $\gamma$-ray source. Furthermore, we use deep ASKAP observations from the Rapid ASKAP Continuum Survey and the Evolutionary Map of the Universe survey to identify a non-thermal radio counterpart: the third spectrally confirmed non-thermal bow shock counterpart after BD +43$^{\rm o}$ 3654 and BD +60$^{\rm o}$ 2522. We finally use WISE IR data and Gaia to study the surrounding ISM and update the motion of LS 2355. Specifically, we derive a substantially reduced stellar velocity, $v_* = 7.0\pm2.5$ km/s, compared to previous estimates. The observed non-thermal properties of the bow shock can be explained by an interaction between the wind of LS 2355 and a dense HII region, at a magnetic field close to the maximum magnetic field strength allowed by the compressibility of the ISM. Similar to earlier works, we find that the thermal radio emission of the shocked ISM is likely to be substantially suppressed for it to be consistent with the observed radio spectrum.

Submitted 29 June, 2024; originally announced July 2024.

Comments: Accepted for publication in MNRAS

arXiv:2405.20956 [pdf, other]

A Robot Walks into a Bar: Can Language Models Serve as Creativity Support Tools for Comedy? An Evaluation of LLMs' Humour Alignment with Comedians

Authors: Piotr Wojciech Mirowski, Juliette Love, Kory W. Mathewson, Shakir Mohamed

Abstract: We interviewed twenty professional comedians who perform live shows in front of audiences and who use artificial intelligence in their artistic process as part of 3-hour workshops on ``AI x Comedy'' conducted at the Edinburgh Festival Fringe in August 2023 and online. The workshop consisted of a comedy writing session with large language models (LLMs), a human-computer interaction questionnaire to assess the Creativity Support Index of AI as a writing tool, and a focus group interrogating the comedians' motivations for and processes of using AI, as well as their ethical concerns about bias, censorship and copyright. Participants noted that existing moderation strategies used in safety filtering and instruction-tuned LLMs reinforced hegemonic viewpoints by erasing minority groups and their perspectives, and qualified this as a form of censorship. At the same time, most participants felt the LLMs did not succeed as a creativity support tool, by producing bland and biased comedy tropes, akin to ``cruise ship comedy material from the 1950s, but a bit less racist''. Our work extends scholarship about the subtle difference between, on the one hand, harmful speech, and on the other hand, ``offensive'' language as a practice of resistance, satire and ``punching up''. We also interrogate the global value alignment behind such language models, and discuss the importance of community-based value alignment and data ownership to build AI tools that better suit artists' needs.

Submitted 3 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

Comments: 15 pages, 1 figure, published at ACM FAccT 2024

arXiv:2405.09545 [pdf, other]

Intrinsic Voltage Offsets in Memcapacitive Bio-Membranes Enable High-Performance Physical Reservoir Computing

Authors: Ahmed S. Mohamed, Anurag Dhungel, Md Sakib Hasan, Joseph S. Najem

Abstract: Reservoir computing is a brain-inspired machine learning framework for processing temporal data by mapping inputs into high-dimensional spaces. Physical reservoir computers (PRCs) leverage native fading memory and nonlinearity in physical substrates, including atomic switches, photonics, volatile memristors, and, recently, memcapacitors, to achieve efficient high-dimensional mapping. Traditional PRCs often consist of homogeneous device arrays, which rely on input encoding methods and large stochastic device-to-device variations for increased nonlinearity and high-dimensional mapping. These approaches incur high pre-processing costs and restrict real-time deployment. Here, we introduce a novel heterogeneous memcapacitor-based PRC that exploits internal voltage offsets to enable both monotonic and non-monotonic input-state correlations crucial for efficient high-dimensional transformations. We demonstrate our approach's efficacy by predicting a second-order nonlinear dynamical system with an extremely low prediction error (0.00018). Additionally, we predict a chaotic Hénon map, achieving a low normalized root mean square error (0.080). Unlike previous PRCs, such errors are achieved without input encoding methods, underscoring the power of distinct input-state correlations. Most importantly, we generalize our approach to other neuromorphic devices that lack inherent voltage offsets using externally applied offsets to realize various input-state correlations. Our approach and unprecedented performance are a major milestone towards high-performance full in-materia PRCs.
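For readers unfamiliar with the Hénon map benchmark named in this abstract, the following is a minimal sketch of how such a target series and its error metric could be computed. The parameter values `a=1.4`, `b=0.3`, the zero initial state, and the choice to normalize the RMSE by the standard deviation of the target are all assumptions for illustration; the listing does not state which values or which normalization the authors used.

```python
import math


def henon_series(n, a=1.4, b=0.3, x0=0.0, y0=0.0):
    """Return n samples of the x-coordinate of the Henon map.

    a, b, x0 and y0 are the commonly quoted textbook values (assumed here).
    """
    xs = []
    x, y = x0, y0
    for _ in range(n):
        # Simultaneous update: the new y uses the old x.
        x, y = 1.0 - a * x * x + y, b * x
        xs.append(x)
    return xs


def nrmse(target, predicted):
    """Root mean square error divided by the standard deviation of the target.

    This is one common normalization (an assumption); others divide by the
    range or the mean of the target instead.
    """
    n = len(target)
    mean = sum(target) / n
    variance = sum((t - mean) ** 2 for t in target) / n
    mse = sum((t - p) ** 2 for t, p in zip(target, predicted)) / n
    return math.sqrt(mse / variance)


series = henon_series(1000)
# A trivial baseline: always predict the mean of the series.
baseline = [sum(series) / len(series)] * len(series)
print(nrmse(series, baseline))
```

Under this normalization a predictor that always outputs the series mean scores about 1, so values well below 1 indicate that a model captures some of the dynamics.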

Submitted 27 April, 2024; originally announced May 2024.

Comments: Supplementary Information is included under the main text

arXiv:2404.14143 [pdf]

Access-Point to Access-Point Connectivity for PON-based OWC Spine and Leaf Data Centre Architecture

Authors: Abrar S. Alhazmi, Sanaa H. Mohamed, Ahmad Qidan, T. E. H. El-Gorashi, Jaafar M. H. Elmirghani

Abstract: In this paper, we propose incorporating Optical Wireless Communication (OWC) and Passive Optical Network (PON) technologies into next generation spine-and-leaf Data Centre Networks (DCNs). In this work, OWC systems are used to connect the Data Centre (DC) racks through Wavelength Division Multiplexing (WDM) Infrared (IR) transceivers. The transceivers are placed on top of the racks and at distributed Access Points (APs) in the ceiling. Each transceiver on a rack is connected to a leaf switch that connects the servers within the rack. We replace the spine switches by Optical Line Terminal (OLT) and Network Interface Cards (NIC) in the APs to achieve the desired connectivity. We benchmark the power consumption of the proposed OWC-PON-based spine-and-leaf DC against traditional spine-and-leaf DC and report a 46% reduction in the power consumption when considering eight racks.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.10837 [pdf, ps, other]

The CHEPA model: assessing the impact of HEPA filter units in classrooms using a fast-running coupled indoor air quality and dynamic thermal model

Authors: Henry C. Burridge, Sen Liu, Sara Mohamed, Samuel G. A. Wood, Cath J. Noakes

Abstract: The quality of the classroom environment, including ventilation, air quality and thermal conditions, has an important impact on children's health and academic achievements. The use of portable HEPA filter air cleaners is widely suggested as a strategy to mitigate exposure to particulate matter and airborne viruses. However, there is a need to quantify the relative benefits of such devices including the impacts on energy use. We present a simple coupled dynamic thermal and air quality model and apply it to naturally ventilated classrooms, representative of modern and Victorian era construction. We consider the addition of HEPA filters with, and without, reduced opening of windows, and explore concentrations of carbon dioxide (CO$_2$), PM, airborne viral RNA, classroom temperature and energy use. Results indicate the addition of HEPA filters was predicted to reduce PM by 40-60% and viral RNA by 30-50% depending on the classroom design and window opening behaviour. The energy cost of running HEPA filters is likely to be only 1%-2% of the classroom heating costs. In scenarios when HEPA filters were on and window opening was reduced (to account for the additional clean air delivery rate of the filters), the heating cost was predicted to be reduced by as much as 13%, and these maximum reductions grew to 46% in wintertime simulations. In these scenarios the HEPA filters result in a notable reduction in PM and viral RNA, but the CO$_2$ concentration is significantly higher. The model provides a mechanism for exploring the relative impact of ventilation and air cleaning strategies on both exposures and energy costs, enabling an understanding of where trade-offs lie.

Submitted 5 July, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Comments: 22 pages, 4 figures

arXiv:2403.01212 [pdf, other]

TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion

Authors: Salaheldin Mohamed

Abstract: In recent years, significant progress has been made in the development of text-to-image generation models. However, these models still face limitations when it comes to achieving full controllability during the generation process. Often, specific training or the use of limited models is required, and even then, they have certain restrictions. To address these challenges, a two-stage method that effectively combines controllability and high quality in the generation of images is proposed. This approach leverages the expertise of pre-trained models to achieve precise control over the generated images, while also harnessing the power of diffusion models to achieve state-of-the-art quality. By separating controllability from high quality, this method achieves outstanding results. It is compatible with both latent and image space diffusion models, ensuring versatility and flexibility. Moreover, this approach consistently produces comparable outcomes to the current state-of-the-art methods in the field. Overall, this proposed method represents a significant advancement in text-to-image generation, enabling improved controllability without compromising on the quality of the generated images.

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2403.00204 [pdf, other]

La$_4$Co$_4$X (X = Pb, Bi, Sb): a demonstration of antagonistic pairs as a route to quasi-low dimensional ternary compounds

Authors: Tyler J. Slade, Nao Furukawa, Matthew Dygert, Siham Mohamed, Atreyee Das, Weiyi Xia, Cai-Zhuang Wang, Sergey L. Budko, Paul C. Canfield

Abstract: We outline how pairs of strongly immiscible elements, referred to here as antagonistic pairs, can be used to synthesize ternary compounds with quasi-reduced dimensional motifs. By identifying third elements that are compatible with a given antagonistic pair, ternary compounds can be formed in which the third element segregates the immiscible atoms into spatially separated substructures. Quasi-low dimensional structural units are a natural consequence of the immiscible atoms seeking to avoid contact in the solid-state. As proof of principle, we present the discovery and physical properties of La$_4$Co$_4$X (X = Pb, Bi, Sb), a new family of intermetallics based on the antagonistic pairs Co-Pb and Co-Bi. La$_4$Co$_4$X adopts a new orthorhombic crystal structure (space group Pbam) containing quasi-2D Co slabs and La-X layers that stack along the a-axis. Consistent with our proposal, the La atoms separate the Co and X substructures, ensuring there are no direct contacts between immiscible atoms. Within the Co slabs, the atoms occupy the vertices of corner sharing tetrahedra and triangles, and this motif produces flat electronic bands near the Fermi level that favor magnetism. The Co is moment bearing in La$_4$Co$_4$X, and we show that whereas La$_4$Co$_4$Pb behaves as a three dimensional antiferromagnet with T$_N$ = 220 K, La$_4$Co$_4$Bi and La$_4$Co$_4$Sb have behavior consistent with low dimensional magnetic coupling and ordering, with T$_N$ = 153 K and 143 K respectively. In addition to the Pb, Bi, and Sb based La$_4$Co$_4$X compounds, we were likely able to produce an analogous La$_4$Co$_4$Sn in polycrystalline form, although we were unable to isolate single crystals. We anticipate that using mutually compatible third elements with an antagonistic pair represents a generalizable design principle for discovering new materials and structure types containing low-dimensional substructures.

Submitted 29 February, 2024; originally announced March 2024.

arXiv:2401.08572 [pdf, other]

doi 10.1145/3613904.3642703

The illusion of artificial inclusion

Authors: William Agnew, A. Stevie Bergman, Jennifer Chien, Mark Díaz, Seliem El-Sayed, Jaylen Pittman, Shakir Mohamed, Kevin R. McKee

Abstract: Human participants play a central role in the development of modern artificial intelligence (AI) technology, in psychological science, and in user research. Recent advances in generative AI have attracted growing interest to the possibility of replacing human participants in these domains with AI surrogates. We survey several such "substitution proposals" to better understand the arguments for and against substituting human participants with modern generative AI. Our scoping review indicates that the recent wave of these proposals is motivated by goals such as reducing the costs of research and development work and increasing the diversity of collected data. However, these proposals ignore and ultimately conflict with foundational values of work with human participants: representation, inclusion, and understanding. This paper critically examines the principles and goals underlying human participation to help chart out paths for future work that truly centers and empowers participants.

Submitted 5 February, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

Comments: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI 2024)

arXiv:2312.15796 [pdf, other]

GenCast: Diffusion-based ensemble forecasting for medium-range weather

Authors: Ilan Price, Alvaro Sanchez-Gonzalez, Ferran Alet, Tom R. Andersson, Andrew El-Kadi, Dominic Masters, Timo Ewalds, Jacklynn Stott, Shakir Mohamed, Peter Battaglia, Remi Lam, Matthew Willson

Abstract: Weather forecasts are fundamentally uncertain, so predicting the range of probable weather scenarios is crucial for important decisions, from warning the public about hazardous weather, to planning renewable energy use. Here, we introduce GenCast, a probabilistic weather model with greater skill and speed than the top operational medium-range weather forecast in the world, the European Centre for Medium-Range Weather Forecasts (ECMWF)'s ensemble forecast, ENS. Unlike traditional approaches, which are based on numerical weather prediction (NWP), GenCast is a machine learning weather prediction (MLWP) method, trained on decades of reanalysis data. GenCast generates an ensemble of stochastic 15-day global forecasts, at 12-hour steps and 0.25 degree latitude-longitude resolution, for over 80 surface and atmospheric variables, in 8 minutes. It has greater skill than ENS on 97.4% of 1320 targets we evaluated, and better predicts extreme weather, tropical cyclones, and wind power production. This work helps open the next chapter in operational weather forecasting, where critical weather-dependent decisions are made with greater accuracy and efficiency.

Submitted 1 May, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

Comments: Main text 11 pages, Appendices 76 pages

arXiv:2312.15364 [pdf, other]

WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

Authors: Kavisha Vidanapathirana, Joshua Knights, Stephen Hausler, Mark Cox, Milad Ramezani, Jason Jooste, Ethan Griffiths, Shaheer Mohamed, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

Abstract: Recent progress in semantic scene understanding has primarily been enabled by the availability of semantically annotated bi-modal (camera and lidar) datasets in urban environments. However, such annotated datasets are also needed for natural, unstructured environments to enable semantic perception for applications, including conservation, search and rescue, environment monitoring, and agricultural automation. Therefore, we introduce WildScenes, a bi-modal benchmark dataset consisting of multiple large-scale traversals in natural environments, including semantic annotations in high-resolution 2D images and dense 3D lidar point clouds, and accurate 6-DoF pose information. The data is (1) trajectory-centric with accurate localization and globally aligned point clouds, (2) calibrated and synchronized to support bi-modal inference, and (3) containing different natural environments over 6 months to support research on domain adaptation. Our 3D semantic labels are obtained via an efficient automated process that transfers the human-annotated 2D labels from multiple views into 3D point clouds, thus circumventing the need for expensive and time-consuming human annotation in 3D. We introduce benchmarks on 2D and 3D semantic segmentation and evaluate a variety of recent deep-learning techniques to demonstrate the challenges in semantic segmentation in natural environments. We propose train-val-test splits for standard benchmarks as well as domain adaptation benchmarks and utilize an automated split generation technique to ensure the balance of class label distributions. The data, evaluation scripts and pretrained models will be released upon acceptance at https://csiro-robotics.github.io/WildScenes.

Submitted 23 December, 2023; originally announced December 2023.

Comments: Under review. The first 3 authors contributed equally

arXiv:2311.09828 [pdf, other]

AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages

Authors: Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, Sofia Bourhim, Andiswa Bukula, Muhidin Mohamed, Temitayo Olatoye, Tosin Adewumi, Hamam Mokayed, Christine Mwase, Wangui Kimotho, Foutse Yuehgoh, Anuoluwapo Aremu, Jessica Ojo, Shamsuddeen Hassan Muhammad, Salomey Osei, Abdul-Hakeem Omotayo, Chiamaka Chukwuneke, Perez Ogayo, Oumaima Hourrane, et al. (33 additional authors not shown)

Abstract: Despite the recent progress on scaling multilingual machine translation (MT) to several under-resourced African languages, accurately measuring this progress remains challenging, since evaluation is often performed on n-gram matching metrics such as BLEU, which typically show a weaker correlation with human judgments. Learned metrics such as COMET have higher correlation; however, the lack of evaluation data with human ratings for under-resourced languages, complexity of annotation guidelines like Multidimensional Quality Metrics (MQM), and limited language coverage of multilingual encoders have hampered their applicability to African languages. In this paper, we address these challenges by creating high-quality human evaluation data with simplified MQM guidelines for error detection and direct assessment (DA) scoring for 13 typologically diverse African languages. Furthermore, we develop AfriCOMET: COMET evaluation metrics for African languages by leveraging DA data from well-resourced languages and an African-centric multilingual encoder (AfroXLM-R) to create the state-of-the-art MT evaluation metrics for African languages with respect to Spearman-rank correlation with human judgments (0.441).
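The abstract reports metric quality as a Spearman rank correlation with human judgments. As a minimal sketch of what that statistic measures, the code below ranks two score lists (averaging ranks over ties) and takes the Pearson correlation of the ranks. The function names and the toy scores are hypothetical illustrations, not data or code from the paper.

```python
def average_ranks(values):
    """Return 1-based ranks of values, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = sum((x - mean_x) ** 2 for x in xs) ** 0.5
    norm_y = sum((y - mean_y) ** 2 for y in ys) ** 0.5
    return cov / (norm_x * norm_y)


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Hypothetical metric scores and human ratings for five translations.
metric_scores = [0.62, 0.10, 0.85, 0.40, 0.55]
human_ratings = [70, 20, 90, 65, 50]
print(spearman(metric_scores, human_ratings))
```

Because only the ordering matters, a metric can reach a high Spearman correlation even if its raw scores sit on a different scale from the human ratings.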

Submitted 23 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

Comments: Accepted by NAACL 2024

arXiv:2311.01856 [pdf, ps, other]

The uniform companion for fields with free operators in characteristic zero

Authors: Shezad Mohamed

Abstract: Generalising the uniform companion for large fields with a single derivation, we construct a theory $\text{UC}_{\mathcal{D}}$ of fields of characteristic $0$ with free operators -- operators determined by a homomorphism from the field to its tensor product with $\mathcal{D}$, a finite-dimensional $\mathbb{Q}$-algebra -- which is the model companion of any theory of a field with free operators whose associated difference field is difference large and model complete. Under the assumption that $\mathcal{D}$ is a local ring, we show that simplicity is transferred from the theory of the underlying field to the theory of the field with operators, and we use this to study the model theory of bounded, PAC fields with free operators.

Submitted 4 January, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

Comments: 28 pages. The results in section 3 have been strengthened, and preliminaries in section 2 have changed accordingly

MSC Class: 03C60; 12H99

arXiv:2310.18737 [pdf, other]

Pre-training with Random Orthogonal Projection Image Modeling

Authors: Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed, Piotr Koniusz

Abstract: Masked Image Modeling (MIM) is a powerful self-supervised strategy for visual pre-training without the use of labels. MIM applies random crops to input images, processes them with an encoder, and then recovers the masked inputs with a decoder, which encourages the network to capture and learn structural information about objects and scenes. The intermediate feature representations obtained from MIM are suitable for fine-tuning on downstream tasks. In this paper, we propose an Image Modeling framework based on random orthogonal projection instead of binary masking as in MIM. Our proposed Random Orthogonal Projection Image Modeling (ROPIM) reduces spatially-wise token information under guaranteed bound on the noise variance and can be considered as masking entire spatial image area under locally varying masking degrees. Since ROPIM uses a random subspace for the projection that realizes the masking step, the readily available complement of the subspace can be used during unmasking to promote recovery of removed information. In this paper, we show that using random orthogonal projection leads to superior performance compared to crop-based masking. We demonstrate state-of-the-art results on several popular benchmarks.
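The idea of projecting onto a random subspace and keeping its complement for recovery can be illustrated with plain linear algebra. The sketch below builds a random orthonormal basis by Gram-Schmidt and splits a vector into its projected part and the removed residual. This is a generic orthogonal projection written for illustration only; the helper names are hypothetical, and the construction ROPIM actually uses is not given in this listing and may differ.

```python
import random


def random_orthonormal_basis(dim, k, seed=0):
    """Return k orthonormal vectors in R^dim via Gram-Schmidt on Gaussian draws."""
    rng = random.Random(seed)
    basis = []
    while len(basis) < k:
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        # Remove the components along the vectors already in the basis.
        for q in basis:
            dot = sum(a * b for a, b in zip(v, q))
            v = [a - dot * b for a, b in zip(v, q)]
        norm = sum(a * a for a in v) ** 0.5
        if norm > 1e-9:  # skip the (unlikely) degenerate draw
            basis.append([a / norm for a in v])
    return basis


def project(x, basis):
    """Orthogonal projection of x onto the span of an orthonormal basis."""
    out = [0.0] * len(x)
    for q in basis:
        coeff = sum(a * b for a, b in zip(x, q))
        out = [o + coeff * b for o, b in zip(out, q)]
    return out


def split(x, basis):
    """Return (kept, removed): the projection and the complement residual."""
    kept = project(x, basis)
    removed = [a - b for a, b in zip(x, kept)]
    return kept, removed


x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
basis = random_orthonormal_basis(dim=6, k=3)
kept, removed = split(x, basis)
restored = [a + b for a, b in zip(kept, removed)]
print(restored)
```

The two parts are orthogonal and sum back to the input, which is the property the abstract alludes to when it says the complement of the subspace is readily available during unmasking.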

Submitted 21 April, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

Comments: Published as a conference paper at the International Conference on Learning Representations (ICLR) 2024. 19 pages

arXiv:2310.16331 [pdf, other]

Brain-Inspired Reservoir Computing Using Memristors with Tunable Dynamics and Short-Term Plasticity

Authors: Nicholas X. Armendarez, Ahmed S. Mohamed, Anurag Dhungel, Md Razuan Hossain, Md Sakib Hasan, Joseph S. Najem

Abstract: Recent advancements in reservoir computing research have created a demand for analog devices with dynamics that can facilitate the physical implementation of reservoirs, promising faster information processing while consuming less energy and occupying a smaller area footprint. Studies have demonstrated that dynamic memristors, with nonlinear and short-term memory dynamics, are excellent candidates as information-processing devices or reservoirs for temporal classification and prediction tasks. Previous implementations relied on nominally identical memristors that applied the same nonlinear transformation to the input data, which is not enough to achieve a rich state space. To address this limitation, researchers either diversified the data encoding across multiple memristors or harnessed the stochastic device-to-device variability among the memristors. However, this approach requires additional pre-processing steps and leads to synchronization issues. Instead, it is preferable to encode the data once and pass it through a reservoir layer consisting of memristors with distinct dynamics. Here, we demonstrate that ion-channel-based memristors with voltage-dependent dynamics can be controllably and predictively tuned through voltage or adjustment of the ion channel concentration to exhibit diverse dynamic properties. We show, through experiments and simulations, that reservoir layers constructed with a small number of distinct memristors exhibit significantly higher predictive and classification accuracies with a single data encoding. We found that for a second-order nonlinear dynamical system prediction task, the varied memristor reservoir experimentally achieved a normalized mean square error of 0.0015 using only five distinct memristors. Moreover, in a neural activity classification task, a reservoir of just three distinct memristors experimentally attained an accuracy of 96.5%.

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2309.09431 [pdf, other]

FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pretraining

Authors: Shaheer Mohamed, Maryam Haghighat, Tharindu Fernando, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

Abstract: Hyperspectral images (HSIs) contain rich spectral and spatial information. Motivated by the success of transformers in the field of natural language processing and computer vision where they have shown the ability to learn long range dependencies within input data, recent research has focused on using transformers for HSIs. However, current state-of-the-art hyperspectral transformers only tokenize the input HSI sample along the spectral dimension, resulting in the under-utilization of spatial information. Moreover, transformers are known to be data-hungry and their performance relies heavily on large-scale pretraining, which is challenging due to limited annotated hyperspectral data. Therefore, the full potential of HSI transformers has not been fully realized. To overcome these limitations, we propose a novel factorized spectral-spatial transformer that incorporates factorized self-supervised pretraining procedures, leading to significant improvements in performance. The factorization of the inputs allows the spectral and spatial transformers to better capture the interactions within the hyperspectral data cubes. Inspired by masked image modeling pretraining, we also devise efficient masking strategies for pretraining each of the spectral and spatial transformers. We conduct experiments on six publicly available datasets for HSI classification task and demonstrate that our model achieves state-of-the-art performance in all the datasets. The code for our model will be made available at https://github.com/csiro-robotics/factoformer.

Submitted 3 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

Comments: Accepted to IEEE Transactions on Geoscience and Remote Sensing in December 2023

arXiv:2309.07177 [pdf]

Electromechanical Study of a Ring-Brush Sliding Contact

Authors: Eddy Chevallier, Tania Garcia, Sabrina Ait Mohamed

Abstract: We report a study about the electrical response from a sliding contact made of a silver-graphite brush and a brass ring. This study focuses specifically on the voltage variations due to the mechanical interactions across the contact according to the rotational speed. This study is part of the research and the development about the monitoring of dynamical interfaces.

Submitted 19 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

arXiv:2309.05814 [pdf, ps, other]

Reinforcement Learning for Supply Chain Attacks Against Frequency and Voltage Control

Authors: Amr S. Mohamed, Sumin Lee, Deepa Kundur

Abstract: The ongoing modernization of the power system, involving new equipment installations and upgrades, exposes the power system to the introduction of malware into its operation through supply chain attacks. Supply chain attacks present a significant threat to power systems, allowing cybercriminals to bypass network defenses and execute deliberate attacks at the physical layer. Given the exponential advancements in machine intelligence, cybercriminals will leverage this technology to create sophisticated and adaptable attacks that can be incorporated into supply chain attacks. We demonstrate the use of reinforcement learning for developing intelligent attacks incorporated into supply chain attacks against generation control devices. We simulate potential disturbances impacting frequency and voltage regulation. The presented method can provide valuable guidance for defending against supply chain attacks.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 7 pages, conference, IEEE International Conference on Machine Learning and Applications (ICMLA) 2023

arXiv:2309.05204 [pdf, other]

Accelerated Proximal Iterative re-Weighted $\ell_1$ Alternating Minimization for Image Deblurring

Authors: Tarmizi Adam, Alexander Malyshev, Mohd Fikree Hassan, Nur Syarafina Mohamed, Md Sah Hj Salam

Abstract: The quadratic penalty alternating minimization (AM) method is widely used for solving the convex $\ell_1$ total variation (TV) image deblurring problem. However, quadratic penalty AM for solving the nonconvex nonsmooth $\ell_p$, $0 < p < 1$ TV image deblurring problems is less studied. In this paper, we propose two algorithms, namely proximal iterative re-weighted $\ell_1$ AM (PIRL1-AM) and its accelerated version, accelerated proximal iterative re-weighted $\ell_1$ AM (APIRL1-AM) for solving the nonconvex nonsmooth $\ell_p$ TV image deblurring problem. The proposed algorithms are derived from the proximal iterative re-weighted $\ell_1$ (IRL1) algorithm and the proximal gradient algorithm. Numerical results show that PIRL1-AM is effective in retaining sharp edges in image deblurring while APIRL1-AM can further provide convergence speed up in terms of the number of algorithm iterations and computational time.

Submitted 10 September, 2023; originally announced September 2023.

arXiv:2309.04687 [pdf, other]

A Review on Robot Manipulation Methods in Human-Robot Interactions

Authors: Haoxu Zhang, Parham M. Kebria, Shady Mohamed, Samson Yu, Saeid Nahavandi

Abstract: Robot manipulation is an important part of human-robot interaction technology. However, traditional pre-programmed methods can only accomplish simple and repetitive tasks. To enable effective communication between robots and humans, and to predict and adapt to uncertain environments, this paper reviews recent autonomous and adaptive learning in robotic manipulation algorithms. It includes typical applications and challenges of human-robot interaction, fundamental tasks of robot manipulation and one of the most widely used formulations of robot manipulation, Markov Decision Process. Recent research focusing on robot manipulation is mainly based on Reinforcement Learning and Imitation Learning. This review paper shows the importance of Deep Reinforcement Learning, which plays an important role in manipulating robots to complete complex tasks in disturbed and unfamiliar environments. With the introduction of Imitation Learning, it is possible for robot manipulation to get rid of reward function design and achieve a simple, stable and supervised learning process. This paper reviews and compares the main features and popular algorithms for both Reinforcement Learning and Imitation Learning.

Submitted 9 September, 2023; originally announced September 2023.

arXiv:2308.14669 [pdf, other]

doi 10.1109/IMSA58542.2023.10217635

ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach

Authors: Abdelrahman "Boda" Sadallah, Omar Ahmed, Shimaa Mohamed, Omar Hatem, Doaa Hesham, Ahmed H. Yousef

Abstract: One of the main tasks of Natural Language Processing (NLP) is Named Entity Recognition (NER). It is used in many applications and also can be used as an intermediate step for other tasks. We present ANER, a web-based named entity recognizer for the Arabic and Arabizi languages. The model is built upon BERT, which is a transformer-based encoder. It can recognize 50 different entity classes, covering various fields. We trained our model on the WikiFANE_Gold dataset which consists of Wikipedia articles. We achieved an F1 score of 88.7%, which beats CAMeL Tools' F1 score of 83% on the ANERcorp dataset, which has only 4 classes. We also got an F1 score of 77.7% on the NewsFANE_Gold dataset which contains out-of-domain data from News articles. The system is deployed on a user-friendly web interface that accepts users' inputs in Arabic or Arabizi. It allows users to explore the entities in the text by highlighting them. It can also direct users to get information about entities through Wikipedia directly. We added the ability to do NER using our model or CAMeL Tools' model through our website. ANER is publicly accessible at http://www.aner.online. We also deployed our model on HuggingFace at https://huggingface.co/boda/ANER, to allow developers to test and use it.

Submitted 28 August, 2023; originally announced August 2023.

arXiv:2308.01785 [pdf, other]

Lexicon and Rule-based Word Lemmatization Approach for the Somali Language

Authors: Shafie Abdi Mohamed, Muhidin Abdullahi Mohamed

Abstract: Lemmatization is a Natural Language Processing (NLP) technique used to normalize text by changing morphological derivations of words to their root forms. It is used as a core pre-processing step in many NLP tasks including text indexing, information retrieval, and machine learning for NLP, among others. This paper pioneers the development of text lemmatization for the Somali language, a low-resource language with very limited or no prior effective adoption of NLP methods and datasets. We especially develop a lexicon and rule-based lemmatizer for Somali text, which is a starting point for a full-fledged Somali lemmatization system for various NLP tasks. With consideration of the language morphological rules, we have developed an initial lexicon of 1247 root words and 7173 derivationally related terms enriched with rules for lemmatizing words not present in the lexicon. We have tested the algorithm on 120 documents of various lengths including news articles, social media posts, and text messages. Our initial results demonstrate that the algorithm achieves an accuracy of 57% for relatively long documents (e.g. full news articles), 60.57% for news article extracts, and high accuracy of 95.87% for short texts such as social media messages.

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2307.13541 [pdf]

Group Activity Recognition in Computer Vision: A Comprehensive Review, Challenges, and Future Perspectives

Authors: Chuanchuan Wang, Ahmad Sufril Azlan Mohamed

Abstract: Group activity recognition is a hot topic in computer vision. Recognizing activities through group relationships plays a vital role in group activity recognition. It holds practical implications in various scenarios, such as video analysis, surveillance, automatic driving, and understanding social activities. The model's key capabilities encompass efficiently modeling hierarchical relationships within a scene and accurately extracting distinctive spatiotemporal features from groups. Given this technology's extensive applicability, identifying group activities has garnered significant research attention. This work examines the current progress in technology for recognizing group activities, with a specific focus on global interactivity and activities. Firstly, we comprehensively review the pertinent literature and various group activity recognition approaches, from traditional methodologies to the latest methods based on spatial structure, descriptors, non-deep learning, hierarchical recurrent neural networks (HRNN), relationship models, and attention mechanisms. Subsequently, we present the relational network and relational architectures for each module. Thirdly, we investigate methods for recognizing group activity and compare their performance with state-of-the-art technologies. We summarize the existing challenges and provide comprehensive guidance for newcomers to understand group activity recognition. Furthermore, we review emerging perspectives in group activity recognition to explore new directions and possibilities.

Submitted 25 July, 2023; originally announced July 2023.

arXiv:2307.12146 [pdf, other]

CloudScent: a model for code smell analysis in open-source cloud

Authors: Raj Narendra Shah, Sameer Ahmed Mohamed, Asif Imran, Tevfik Kosar

Abstract: The low cost and rapid provisioning capabilities have made open-source cloud a desirable platform to launch industrial applications. However, as open-source cloud moves towards maturity, it still suffers from quality issues like code smells. Although great emphasis has been placed on the economic benefits of deploying open-source cloud, little importance has been given to improving the quality of the source code of the cloud itself to ensure its maintainability in the industrial scenario. Code refactoring has been associated with improving the maintenance and understanding of software code by removing code smells. However, analyzing what smells are more prevalent in cloud environment and designing a tool to define and detect those smells require further attention. In this paper, we propose a model called CloudScent which is an open source mechanism to detect smells in open-source cloud. We test our experiments in a real-life cloud environment using OpenStack. Results show that CloudScent is capable of accurately detecting 8 code smells in cloud. This will provide cloud service providers with advanced knowledge about the smells prevalent in open-source cloud platform, thus allowing for timely code refactoring and improving code quality of the cloud platforms.

Submitted 22 July, 2023; originally announced July 2023.

arXiv:2307.04019 [pdf, other]

GP-guided MPPI for Efficient Navigation in Complex Unknown Cluttered Environments

Authors: Ihab S. Mohamed, Mahmoud Ali, Lantao Liu

Abstract: Robotic navigation in unknown, cluttered environments with limited sensing capabilities poses significant challenges in robotics. Local trajectory optimization methods, such as Model Predictive Path Integral (MPPI), are a promising solution to this challenge. However, global guidance is required to ensure effective navigation, especially when encountering challenging environmental conditions or navigating beyond the planning horizon. This study presents the GP-MPPI, an online learning-based control strategy that integrates MPPI with a local perception model based on Sparse Gaussian Process (SGP). The key idea is to leverage the learning capability of SGP to construct a variance (uncertainty) surface, which enables the robot to learn about the navigable space surrounding it, identify a set of suggested subgoals, and ultimately recommend the optimal subgoal that minimizes a predefined cost function to the local MPPI planner. Afterward, MPPI computes the optimal control sequence that satisfies the robot and collision avoidance constraints. Such an approach eliminates the necessity of a global map of the environment or an offline training process. We validate the efficiency and robustness of our proposed control strategy through both simulated and real-world experiments of 2D autonomous navigation tasks in complex unknown environments, demonstrating its superiority in guiding the robot safely towards its desired goal while avoiding obstacles and escaping entrapment in local minima.
The GPU implementation of GP-MPPI, including the supplementary video, is available at https://github.com/IhabMohamed/GP-MPPI.

Submitted 28 July, 2023; v1 submitted 8 July, 2023; originally announced July 2023.

Comments: This paper has 8 pages, 6 figures, 2 tables. It has been accepted for publication at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, Michigan, USA, 2023

arXiv:2306.12369 [pdf, other]

Towards Efficient MPPI Trajectory Generation with Unscented Guidance: U-MPPI Control Strategy

Authors: Ihab S. Mohamed, Junhong Xu, Gaurav S Sukhatme, Lantao Liu

Abstract: The classical Model Predictive Path Integral (MPPI) control framework lacks reliable safety guarantees since it relies on a risk-neutral trajectory evaluation technique, which can present challenges for safety-critical applications such as autonomous driving. Additionally, if the majority of MPPI sampled trajectories concentrate in high-cost regions, it may generate an infeasible control sequence. To address this challenge, we propose the U-MPPI control strategy, a novel methodology that can effectively manage system uncertainties while integrating a more efficient trajectory sampling strategy. The core concept is to leverage the Unscented Transform (UT) to propagate not only the mean but also the covariance of the system dynamics, going beyond the traditional MPPI method. As a result, it introduces a novel and more efficient trajectory sampling strategy, significantly enhancing state-space exploration and ultimately reducing the risk of being trapped in local minima. Furthermore, by leveraging the uncertainty information provided by UT, we incorporate a risk-sensitive cost function that explicitly accounts for risk or uncertainty throughout the trajectory evaluation process, resulting in a more resilient control system capable of handling uncertain conditions. By conducting extensive simulations of 2D aggressive autonomous navigation in both known and unknown cluttered environments, we verify the efficiency and robustness of our proposed U-MPPI control strategy compared to the baseline MPPI.
We further validate the practicality of U-MPPI through real-world demonstrations in unknown cluttered environments, showcasing its superior ability to incorporate both the UT and local costmap into the optimization problem without introducing additional complexity.

Submitted 9 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

Comments: This paper has 15 pages, 10 figures, 4 tables

arXiv:2306.09780 [pdf, other]

Understanding Deep Generative Models with Generalized Empirical Likelihoods

Authors: Suman Ravuri, Mélanie Rey, Shakir Mohamed, Marc Deisenroth

Abstract: Understanding how well a deep generative model captures a distribution of high-dimensional data remains an important open challenge. It is especially difficult for certain model classes, such as Generative Adversarial Networks and Diffusion Models, whose models do not admit exact likelihoods. In this work, we demonstrate that generalized empirical likelihood (GEL) methods offer a family of diagnostic tools that can identify many deficiencies of deep generative models (DGMs). We show, with appropriate specification of moment conditions, that the proposed method can identify which modes have been dropped, the degree to which DGMs are mode imbalanced, and whether DGMs sufficiently capture intra-class diversity. We show how to combine techniques from Maximum Mean Discrepancy and Generalized Empirical Likelihood to create not only distribution tests that retain per-sample interpretability, but also metrics that include label information. We find that such tests predict the degree of mode dropping and mode imbalance up to 60% better than metrics such as improved precision/recall. We provide an implementation at https://github.com/deepmind/understanding_deep_generative_models_with_generalized_empirical_likelihood/.

Submitted 7 August, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

Comments: Computer Vision and Pattern Recognition 2023 (Highlight, top 2.6% of submissions)

arXiv:2305.18937 [pdf]

WDM/TDM over Passive Optical Networks with Cascaded-AWGRs for Data Centers

Authors: Mohammed Alharthi, Sanaa H. Mohamed, Taisir E. H. El-Gorashi, Jaafar M. H. Elmirghani

Abstract: Data centers based on Passive Optical Networks (PONs) can provide high capacity, low cost, scalability, elasticity and high energy-efficiency. This paper introduces the use of WDM-TDM multiple access in a PON-based data center that offers multipath routing via two-tier cascaded Arrayed Waveguide Grating Routers (AWGRs) to improve the utilization of resources. A Mixed Integer Linear Programming (MILP) model is developed to optimize resource allocation while considering multipath routing. The results show that all-to-all connectivity is achieved in the architecture through the use of two different wavelengths within different time slots for the communication between racks in the same or different cells, as well as with the OLT switches.

Submitted 30 May, 2023; originally announced May 2023.

arXiv:2305.12025 [pdf, other]

doi 10.1002/aisy.202300346

Biomembrane-based Memcapacitive Reservoir Computing System for Energy Efficient Temporal Data Processing

Authors: Md Razuan Hossain, Ahmed Salah Mohamed, Nicholas Xavier Armendarez, Joseph S. Najem, Md Sakib Hasan

Abstract: Reservoir computing is a highly efficient machine learning framework for processing temporal data by extracting features from the input signal and mapping them into higher dimensional spaces. Physical reservoir layers have been realized using spintronic oscillators, atomic switch networks, silicon photonic modules, ferroelectric transistors, and volatile memristors. However, these devices are intrinsically energy-dissipative due to their resistive nature, which leads to increased power consumption. Therefore, capacitive memory devices can provide a more energy-efficient approach. Here, we leverage volatile biomembrane-based memcapacitors that closely mimic certain short-term synaptic plasticity functions as reservoirs to solve classification tasks and analyze time-series data in simulation and experimentally. Our system achieves a 99.6% accuracy rate for spoken digit classification and a normalized mean square error of 7.81*10^{-4} in a second-order non-linear regression task. Furthermore, to showcase the device's real-time temporal data processing capability, we achieve 100% accuracy for a real-time epilepsy detection problem from an inputted electroencephalography (EEG) signal. Most importantly, we demonstrate that each memcapacitor consumes an average of 41.5 fJ of energy per spike, regardless of the selected input voltage pulse width, while maintaining an average power of 415 fW for a pulse width of 100 ms. These values are orders of magnitude lower than those achieved by state-of-the-art memristors used as reservoirs.
Lastly, we believe the biocompatible, soft nature of our memcapacitor makes it highly suitable for computing and signal-processing applications in biological environments.

Submitted 15 November, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: Supplementary information is attached under the main text

arXiv:2304.14925 [pdf, other]

Uncertainty Aware Neural Network from Similarity and Sensitivity

Authors: H M Dipu Kabir, Subrota Kumar Mondal, Sadia Khanam, Abbas Khosravi, Shafin Rahman, Mohammad Reza Chalak Qazani, Roohallah Alizadehsani, Houshyar Asadi, Shady Mohamed, Saeid Nahavandi, U Rajendra Acharya

Abstract: Researchers have proposed several approaches for neural network (NN) based uncertainty quantification (UQ). However, most of the approaches are developed considering strong assumptions. Uncertainty quantification algorithms often perform poorly in an input domain and the reason for poor performance remains unknown. Therefore, we present a neural network training method that considers similar samples with sensitivity awareness in this paper. In the proposed NN training method for UQ, first, we train a shallow NN for the point prediction. Then, we compute the absolute differences between prediction and targets and train another NN for predicting those absolute differences or absolute errors. Domains with high average absolute errors represent a high uncertainty. In the next step, we select each sample in the training set one by one and compute both prediction and error sensitivities. Then we select similar samples with sensitivity consideration and save indexes of similar samples. The ranges of an input parameter become narrower when the output is highly sensitive to that parameter. After that, we construct initial uncertainty bounds (UB) by considering the distribution of sensitivity aware similar samples. Prediction intervals (PIs) from initial uncertainty bounds are larger and cover more samples than required. Therefore, we train bound correction NN. As following all the steps for finding UB for each sample requires a lot of computation and memory access, we train a UB computation NN.
The UB computation NN takes an input sample and provides an uncertainty bound. The UB computation NN is the final product of the proposed approach. Scripts of the proposed method are available in the following GitHub repository: github.com/dipuk0506/UQ

Submitted 26 April, 2023; originally announced April 2023.

arXiv:2304.09972 [pdf, other]

MasakhaNEWS: News Topic Classification for African languages

Authors: David Ifeoluwa Adelani, Marek Masiak, Israel Abebe Azime, Jesujoba Alabi, Atnafu Lambebo Tonja, Christine Mwase, Odunayo Ogundepo, Bonaventure F. P. Dossou, Akintunde Oladipo, Doreen Nixdorf, Chris Chinenye Emezue, sana al-azzawi, Blessing Sibanda, Davis David, Lolwethu Ndolela, Jonathan Mukiibi, Tunde Ajayi, Tatiana Moteu, Brian Odhiambo, Abraham Owodunni, Nnaemeka Obiefuna, Muhidin Mohamed, Shamsuddeen Hassan Muhammad, Teshome Mulugeta Ababu, Saheed Abdullahi Salahudeen, et al. (40 additional authors not shown)

Abstract: African languages are severely under-represented in NLP research due to lack of datasets covering several NLP tasks. While there are individual language specific datasets that are being expanded to different tasks, only a handful of NLP tasks (e.g. named entity recognition and machine translation) have standardized benchmark datasets covering several geographical and typologically-diverse African languages. In this paper, we develop MasakhaNEWS -- a new benchmark dataset for news topic classification covering 16 languages widely spoken in Africa. We provide an evaluation of baseline models by training classical machine learning models and fine-tuning several language models. Furthermore, we explore several alternatives to full fine-tuning of language models that are better suited for zero-shot and few-shot learning such as cross-lingual parameter-efficient fine-tuning (like MAD-X), pattern exploiting training (PET), prompting language models (like ChatGPT), and prompt-free sentence transformer fine-tuning (SetFit and Cohere Embedding API). Our evaluation in zero-shot setting shows the potential of prompting ChatGPT for news topic classification in low-resource African languages, achieving an average performance of 70 F1 points without leveraging additional supervision like MAD-X. In few-shot setting, we show that with as little as 10 examples per label, we achieved more than 90% (i.e. 86.0 F1 points) of the performance of full supervised training (92.6 F1 points) leveraging the PET approach.

Submitted 20 September, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

Comments: Accepted to IJCNLP-AACL 2023 (main conference)

arXiv:2304.07600 [pdf, other]

A novel approach of a deep reinforcement learning based motion cueing algorithm for vehicle driving simulation

Authors: Hendrik Scheidel, Houshyar Asadi, Tobias Bellmann, Andreas Seefried, Shady Mohamed, Saeid Nahavandi

Abstract: In the field of motion simulation, the level of immersion strongly depends on the motion cueing algorithm (MCA), as it transfers the reference motion of the simulated vehicle to a motion of the motion simulation platform (MSP). The challenge for the MCA is to reproduce the motion perception of a real vehicle driver as accurately as possible without exceeding the limits of the workspace of the MSP in order to provide a realistic virtual driving experience. In case of a large discrepancy between the perceived motion signals and the optical cues, motion sickness may occur with the typical symptoms of nausea, dizziness, headache and fatigue. Existing approaches either produce non-optimal results, e.g., due to filtering, linearization, or simplifications, or the required computational time exceeds the real-time requirements of a closed-loop application. In this work a new solution is presented, where not a human designer specifies the principles of the MCA but an artificial intelligence (AI) learns the optimal motion by trial and error in an interaction with the MSP. To achieve this, deep reinforcement learning (RL) is applied, where an agent interacts with an environment formulated as a Markov decision process (MDP). This allows the agent to directly control a simulated MSP to obtain feedback on its performance in terms of platform workspace usage and the motion acting on the simulator user.
The RL algorithm used is proximal policy optimization (PPO), where the value function and the policy corresponding to the control strategy are learned and both are mapped in artificial neural networks (ANN). This approach is implemented in Python and the functionality is demonstrated by the practical example of pre-recorded lateral maneuvers. The subsequent validation on a standardized double lane change shows that the RL algorithm is able to learn the control strategy and improve the quality of...

Submitted 15 April, 2023; originally announced April 2023.

arXiv:2304.04870 [pdf, other]

DASS Good: Explainable Data Mining of Spatial Cohort Data

Authors: Andrew Wentzel, Carla Floricel, Guadalupe Canahuate, Mohamed A. Naser, Abdallah S. Mohamed, Clifton David Fuller, Lisanne van Dijk, G. Elisabeta Marai

Abstract: Developing applicable clinical machine learning models is a difficult task when the data includes spatial information, for example, radiation dose distributions across adjacent organs at risk. We describe the co-design of a modeling system, DASS, to support the hybrid human-machine development and validation of predictive models for estimating long-term toxicities related to radiotherapy doses in head and neck cancer patients. Developed in collaboration with domain experts in oncology and data mining, DASS incorporates human-in-the-loop visual steering, spatial data, and explainable AI to augment domain knowledge with automatic data mining. We demonstrate DASS with the development of two practical clinical stratification models and report feedback from domain experts. Finally, we describe the design lessons learned from this collaborative experience.

Submitted 10 April, 2023; originally announced April 2023.

Comments: 10 pages, 9 figures

arXiv:2304.04502 [pdf]

Energy Efficient Resource Allocation for Demand Intensive Applications in a VLC Based Fog Architecture

Authors: Wafaa B. M. Fadlelmula, Sanaa H. Mohamed, Taisir E. H. El-Gorashi, Jaafar M. H. Elmirghani

Abstract: In this paper, we propose an energy efficient passive optical network (PON) architecture for backhaul connectivity in indoor visible light communication (VLC) systems. The proposed network is used to support a fog computing architecture designed to allow users with processing demands to access dedicated fog nodes and idle processing resources in other user devices (UDs) within the same building. The fog resources within a building complement fog nodes at the access and metro networks and the central cloud data center. A mixed integer linear programming (MILP) model is developed to minimize the total power consumption associated with serving demands over the proposed architecture. A scenario that considers applications with intensive demands is examined to evaluate the energy efficiency of the proposed architecture. A comparison is conducted between allocating the demands in the fog nodes and serving the demands in the conventional cloud data center. Additionally, the proposed architecture is compared with an architecture based on state-of-art Spine-and-Leaf (SL) connectivity. Relative to the SL architecture and serving all the demands in the cloud, the adoption of the PON-based architecture achieves 84% and 86% reductions, respectively.

Submitted 10 April, 2023; originally announced April 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2203.11380

arXiv:2304.04493 [pdf]

Multiuser beam steering OWC system based on NOMA

Authors: Y. Zeng, Sanaa H. Mohamed, Ahmad Qidan, Taisir E. H. El-Gorashi, Jaafar M. H. Elmirghani

Abstract: In this paper, we propose applying Non-Orthogonal Multiple Access (NOMA) technology in a multiuser beam steering OWC system. We study the performance of the NOMA-based multiuser beam steering system in terms of the achievable rate and Bit Error Rate (BER). We investigate the impact of the power allocation factor of NOMA and the number of users in the room. The results show that the power allocation factor is a vital parameter in NOMA-based transmission that affects the performance of the network in terms of data rate and BER.

Submitted 10 April, 2023; originally announced April 2023.

Comments: ICTON 2023

arXiv:2304.04492 [pdf]

Relay Assisted Multiuser OWC Systems under Human Blockage

Authors: Y. Zeng, Sanaa H. Mohamed, Ahmad Qidan, Taisir E. H. El-Gorashi, Jaafar M. H. Elmirghani

Abstract: This paper proposes using cooperative communication based on optoelectronic (O-E-O) amplify-and-forward relay terminals to reduce the influence of the blockage and shadowing resulting from human movement in a beam steering Optical Wireless Communication (OWC) system. The simulation results indicate that on average, the outage probability of the cooperative communication mode with O-E-O relay terminals is two orders of magnitude lower than the outage probability of the system without relay terminals.

Submitted 10 April, 2023; originally announced April 2023.

Comments: ICTON 2023

arXiv:2303.15736 [pdf, ps, other]

doi 10.1109/TSG.2023.3343100

On the Use of Reinforcement Learning for Attacking and Defending Load Frequency Control

Authors: Amr S. Mohamed, Deepa Kundur

Abstract: The electric grid is an attractive target for cyberattackers given its critical nature in society. With the increasing sophistication of cyberattacks, effective grid defense will benefit from proactively identifying vulnerabilities and attack strategies. We develop a deep reinforcement learning-based method that recognizes vulnerabilities in load frequency control, an essential process that maintains grid security and reliability. We demonstrate how our method can synthesize a variety of attacks involving false data injection and load switching, while specifying the attack and threat models - providing insight into potential attack strategies and impact. We discuss how our approach can be employed for testing electric grid vulnerabilities. Moreover our method can be employed to generate data to inform the design of defense strategies and develop attack detection methods. For this, we design and compare a (deep learning-based) supervised attack detector with an unsupervised anomaly detector to highlight the benefits of developing defense strategies based on identified attack strategies.

Submitted 28 March, 2023; originally announced March 2023.

arXiv:2303.04028 [pdf]

Response to "On the giant deformation and ferroelectricity of guanidinium nitrate" by Marek Szafrański and Andrzej Katrusiak

Authors: Durga Prasad Karothu, Rodrigo Ferreira, Ghada Dushaq, Ejaz Ahmed, Luca Catalano, Jad Mahmoud Halabi, Zainab Alhaddad, Ibrahim Tahir, Liang Li, Sharmarke Mohamed, Mahmoud Rasras, Panče Naumov

Abstract: Following a well-established practice of publishing commentaries to articles of other authors who work on materials that were earlier studied by them (n.b. six published comments[1-6]), Marek Szafrański (MS) and Andrzej Katrusiak (AK) have filed on the preprint server arXiv a manuscript entitled "On the giant deformation and ferroelectricity of guanidinium nitrate"[7] with comments on our article "Exceptionally high work density of a ferroelectric dynamic organic crystal around room temperature" published in Nature Communications (2022, 13, 2823).[8] Both in the submitted comment as well as in the required (by the journal) direct communication with us preceding its posting, MS and AK have expressed dissatisfaction with the choice of literature references in our article, for which they felt that their previous work on this material has not been cited to a sufficient extent. In their comment, they summarize their other remarks on our article as "the structural determinations of GN [guanidinium nitrate] crystals, their phase transitions and associated giant deformation, as well as its detailed structural mechanism, the molecular dynamics and dielectric properties were reported before, while the semiconductivity, ferroelectricity, and fatigue resistance of the GN [guanidinium nitrate] crystals cannot be confirmed."[7] Apart from the sentiments of MS and AK on our choice of cited literature, we find their comments on the scientific content of our article to be strongly biased towards their own results and unfounded.
Below, we provide a detailed response to their comments.

Submitted 7 September, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: 13 pages, 1 figure

arXiv:2303.02197 [pdf, ps, other]

On the Use of Safety Critical Control for Cyber-Physical Security in the Smart Grid

Authors: Amr S. Mohamed, Mohsen Khalaf, Deepa Kundur

Abstract: The tight coupling between communication and control in cyber-physical systems is necessary to enable the complex regulation required to operate these systems. Unfortunately, cyberattackers can exploit network vulnerabilities to compromise communication and force unsafe decision-making and dynamics. If a cyberattack is not detected and isolated in a timely manner, the control process must balance adhering to the received measurement signals to maintain system operation and ensuring that temporary compromise of the signals does not force unsafe dynamics. For this purpose, we present and employ a safety critical controller based on control barrier functions to mitigate attacks against load frequency control in smart power grids. We validate the paper's findings using simulation on a high-fidelity testbed.

Submitted 3 March, 2023; originally announced March 2023.

Comments: 9 pages, 7 figures, conference. Accepted for publishing at the 2023 IEEE Power & Energy Society General Meeting (GM)

arXiv:2302.14126 [pdf, other]

A Probabilistic Approach to Adaptive Protection in the Smart Grid

Authors: Amr S. Mohamed, Deepa Kundur, Mohsen Khalaf

Abstract: Smart grids are critical cyber-physical systems that are vital to our energy future. Smart grids' fault resilience is dependent on the use of advanced protection systems that can reliably adapt to changing conditions within the grid. The vast amount of operational data generated and collected in smart grids can be used to develop these protection systems. However, given the safety-criticality of protection, the algorithms used to analyze this data must be stable, transparent, and easily interpretable to ensure the reliability of the protection decisions. Additionally, the protection decisions must be fast, selective, simple, and reliable. To address these challenges, this paper proposes a data-driven protection strategy, based on Gaussian Discriminant Analysis, for fault detection and isolation. This strategy minimizes the communication requirements for time-inverse relays, facilitates their coordination, and optimizes their settings. The interpretability of the protection decisions is a key focus of this paper. The method is demonstrated by showing how it can protect the medium-voltage CIGRE network as it transitions between islanded and grid-connected modes, and radial and mesh topologies.

Submitted 27 February, 2023; originally announced February 2023.

Comments: journal, 21 pages

arXiv:2302.11219 [pdf, other]

doi 10.1088/2057-1976/acba9f

Deformable registration with intensity correction for CESM monitoring response to Neoadjuvant Chemotherapy

Authors: Clément Jailin, Pablo Milioni De Carvalho, Sara Mohamed, Laurence Vancamberg, Amr Farouk Ibrahim Moustafa, Mohammed Gomaa, Rasha Mohammed Kamal, Serge Muller

Abstract: This paper proposes a robust longitudinal registration method for Contrast Enhanced Spectral Mammography in monitoring neoadjuvant chemotherapy. Because breast texture intensity changes with the treatment, a non-rigid registration procedure with local intensity compensations is developed. The approach allows registering the low energy images of the exams acquired before and after the chemotherapy. The measured motion is then applied to the corresponding recombined images. The difference of registered images, called residual, removes the breast texture that did not change between the two exams. Consequently, this registered residual allows identifying local density and iodine changes, especially in the lesion area. The method is validated with a synthetic NAC case where ground truths are available. Then the procedure is applied to 51 patients with 208 CESM image pairs acquired before and after the chemotherapy treatment. The proposed registration converged in all 208 cases. The intensity-compensated registration approach is evaluated with different mathematical metrics and through the repositioning of clinical landmarks (RMSE: 5.9 mm) and outperforms state-of-the-art registration techniques.

Submitted 22 February, 2023; originally announced February 2023.

Journal ref: Biomedical Physics & Engineering Express (2023)

arXiv:2301.02775 [pdf, other]

The messy death of a multiple star system and the resulting planetary nebula as observed by JWST

Authors: Orsola De Marco, Muhammad Akashi, Stavros Akras, Javier Alcolea, Isabel Aleman, Philippe Amram, Bruce Balick, Elvire De Beck, Eric G. Blackman, Henri M. J. Boffin, Panos Boumis, Jesse Bublitz, Beatrice Bucciarelli, Valentin Bujarrabal, Jan Cami, Nicholas Chornay, You-Hua Chu, Romano L. M. Corradi, Adam Frank, Guillermo Garcia-Segura, D. A. Garcia-Hernandez, Jorge Garcia-Rojas, Veronica Gomez-Llanos, Denise R. Goncalves, Martin A. Guerrero, et al. (44 additional authors not shown)

Abstract: Planetary nebulae (PNe), the ejected envelopes of red giant stars, provide us with a history of the last, mass-losing phases of 90 percent of stars initially more massive than the Sun. Here, we analyse James Webb Space Telescope (JWST) Early Release Observation (ERO) images of the PN NGC3132. A structured, extended H2 halo surrounding an ionised central bubble is imprinted with spiral structures, likely shaped by a low-mass companion orbiting the central star at 40-60 AU. The images also reveal a mid-IR excess at the central star interpreted as a dusty disk, indicative of an interaction with another, closer companion. Including the previously known, A-type visual companion, the progenitor of the NGC3132 PN must have been at least a stellar quartet. The JWST images allow us to generate a model of the illumination, ionisation and hydrodynamics of the molecular halo, demonstrating the power of JWST to investigate complex stellar outflows. Further, new measurements of the A-type visual companion allow us to derive the value for the mass of the progenitor of a central star to date with excellent precision: 2.86+/-0.06 Mo. These results serve as path finders for future JWST observations of PNe providing unique insight into fundamental astrophysical processes including colliding winds, and binary star interactions, with implications for supernovae and gravitational wave systems.

Submitted 6 January, 2023; originally announced January 2023.

Comments: 32 pages, 5 figures for the main article. 12 pages 8 figures for the supplementary material

Journal ref: Nature Astronomy, 2022, Vol. 6, p. 1421

arXiv:2212.12808 [pdf, ps, other]

A Comprehensive Review on Autonomous Navigation

Authors: Saeid Nahavandi, Roohallah Alizadehsani, Darius Nahavandi, Shady Mohamed, Navid Mohajer, Mohammad Rokonuzzaman, Ibrahim Hossain

Abstract: The field of autonomous mobile robots has undergone dramatic advancements over the past decades. Despite achieving important milestones, several challenges are yet to be addressed. Aggregating the achievements of the robotic community as survey papers is vital to keep track of the current state of the art and the challenges that must be tackled in the future. This paper tries to provide a comprehensive review of autonomous mobile robots covering topics such as sensor types, mobile robot platforms, simulation tools, path planning and following, sensor fusion methods, obstacle avoidance, and SLAM. The motivation for presenting a survey paper is twofold. First, the autonomous navigation field evolves fast, so writing survey papers regularly is crucial to keep the research community well aware of the current status of this field. Second, deep learning methods have revolutionized many fields including autonomous navigation. Therefore, it is necessary to give an appropriate treatment of the role of deep learning in autonomous navigation as well, which is covered in this paper. Future works and research gaps will also be discussed.

Submitted 24 December, 2022; originally announced December 2022.

arXiv:2212.12794 [pdf, other]

GraphCast: Learning skillful medium-range global weather forecasting

Authors: Remi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, Peter Wirnsberger, Meire Fortunato, Ferran Alet, Suman Ravuri, Timo Ewalds, Zach Eaton-Rosen, Weihua Hu, Alexander Merose, Stephan Hoyer, George Holland, Oriol Vinyals, Jacklynn Stott, Alexander Pritzel, Shakir Mohamed, Peter Battaglia

Abstract: Global medium-range weather forecasting is critical to decision-making across many social and economic domains. Traditional numerical weather prediction uses increased compute resources to improve forecast accuracy, but cannot directly use historical weather data to improve the underlying model. We introduce a machine learning-based method called "GraphCast", which can be trained directly from reanalysis data. It predicts hundreds of weather variables, over 10 days at 0.25 degree resolution globally, in under one minute. We show that GraphCast significantly outperforms the most accurate operational deterministic systems on 90% of 1380 verification targets, and its forecasts support better severe event prediction, including tropical cyclones, atmospheric rivers, and extreme temperatures. GraphCast is a key advance in accurate and efficient weather forecasting, and helps realize the promise of machine learning for modeling complex dynamical systems.

Submitted 4 August, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

Comments: GraphCast code and trained weights are available at: https://github.com/deepmind/graphcast

arXiv:2211.09856 [pdf, other]

Machine Learning-Assisted Recurrence Prediction for Early-Stage Non-Small-Cell Lung Cancer Patients

Authors: Adrianna Janik, Maria Torrente, Luca Costabello, Virginia Calvo, Brian Walsh, Carlos Camps, Sameh K. Mohamed, Ana L. Ortega, Vít Nováček, Bartomeu Massutí, Pasquale Minervini, M. Rosario Garcia Campelo, Edel del Barco, Joaquim Bosch-Barrera, Ernestina Menasalvas, Mohan Timilsina, Mariano Provencio

Abstract: Background: Stratifying cancer patients according to risk of relapse can personalize their care. In this work, we provide an answer to the following research question: How to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients? Methods: For predicting relapse in 1,387 early-stage (I-II), non-small-cell lung cancer (NSCLC) patients from the Spanish Lung Cancer Group data (65.7 average age, 24.8% females, 75.2% males) we train tabular and graph machine learning models. We generate automatic explanations for the predictions of such models. For models trained on tabular data, we adopt SHAP local explanations to gauge how each patient feature contributes to the predicted outcome. We explain graph machine learning predictions with an example-based method that highlights influential past patients. Results: Machine learning models trained on tabular data exhibit a 76% accuracy for the Random Forest model at predicting relapse evaluated with a 10-fold cross-validation (model was trained 10 times with different independent sets of patients in test, train and validation sets, the reported metrics are averaged over these 10 test sets). Graph machine learning reaches 68% accuracy over a 200-patient, held-out test set, calibrated on a held-out set of 100 patients. Conclusions: Our results show that machine learning models trained on tabular and graph data can enable objective, personalised and reproducible prediction of relapse and therefore, disease outcome in patients with early-stage NSCLC.
With further prospective and multisite validation, and additional radiological and molecular data, this prognostic model could potentially serve as a predictive decision support tool for deciding the use of adjuvant treatments in early-stage lung cancer. Keywords: Non-Small-Cell Lung Cancer, Tumor Recurrence Prediction, Machine Learning

Submitted 17 November, 2022; originally announced November 2022.

arXiv:2209.09556 [pdf, other]

CoV-TI-Net: Transferred Initialization with Modified End Layer for COVID-19 Diagnosis

Authors: Sadia Khanam, Mohammad Reza Chalak Qazani, Subrota Kumar Mondal, H M Dipu Kabir, Abadhan S. Sabyasachi, Houshyar Asadi, Keshav Kumar, Farzin Tabarsinezhad, Shady Mohamed, Abbas Khorsavi, Saeid Nahavandi

Abstract: This paper proposes transferred initialization with modified fully connected layers for COVID-19 diagnosis. Convolutional neural networks (CNN) achieved a remarkable result in image classification. However, training a high-performing model is a very complicated and time-consuming process because of the complexity of image recognition applications. On the other hand, transfer learning is a relatively new learning method that has been employed in many sectors to achieve good performance with fewer computations. In this research, the PyTorch pre-trained models (VGG19_bn and WideResNet-101) are applied in the MNIST dataset for the first time as initialization and with modified fully connected layers. The employed PyTorch pre-trained models were previously trained in ImageNet. The proposed model is developed and verified in the Kaggle notebook, and it reached the outstanding accuracy of 99.77% without taking a huge computational time during the training process of the network. We also applied the same methodology to the SIIM-FISABIO-RSNA COVID-19 Detection dataset and achieved 80.01% accuracy. In contrast, the previous methods need a huge computational time during the training process to reach a high-performing model. Codes are available at the following link: github.com/dipuk0506/SpinalNet

Submitted 20 September, 2022; originally announced September 2022.

arXiv:2209.07572 [pdf, ps, other]

doi 10.1145/3551624.3555290

Power to the People? Opportunities and Challenges for Participatory AI

Authors: Abeba Birhane, William Isaac, Vinodkumar Prabhakaran, Mark Díaz, Madeleine Clare Elish, Iason Gabriel, Shakir Mohamed

Abstract: Participatory approaches to artificial intelligence (AI) and machine learning (ML) are gaining momentum: the increased attention comes partly with the view that participation opens the gateway to an inclusive, equitable, robust, responsible and trustworthy AI. Among other benefits, participatory approaches are essential to understanding and adequately representing the needs, desires and perspectives of historically marginalized communities. However, there currently exists a lack of clarity on what meaningful participation entails and what it is expected to do. In this paper we first review participatory approaches as situated in historical contexts as well as participatory methods and practices within the AI and ML pipeline. We then introduce three case studies in participatory AI. Participation holds the potential for beneficial, emancipatory and empowering technology design, development and deployment while also being at risk for concerns such as cooptation and conflation with other activities. We lay out these limitations and concerns and argue that as participatory AI/ML becomes in vogue, a contextual and nuanced understanding of the term as well as consideration of who the primary beneficiaries of participatory activities ought to be constitute crucial factors to realizing the benefits and opportunities that participation brings.

Submitted 15 September, 2022; originally announced September 2022.

Comments: To appear in the proceedings of EAAMO 2022

arXiv:2207.09546 [pdf, ps, other]

The Weil descent functor in the category of algebras with free operators

Authors: Shezad Mohamed

Abstract: We prove that there exists a version of Weil descent, or Weil restriction, in the category of $\mathcal{D}$-algebras. The objects of this category are $k$-algebras $R$ equipped with a homomorphism $e \colon R \to R \otimes_k \mathcal{D}$ for some fixed field $k$ and finite-dimensional $k$-algebra $\mathcal{D}$. We do this under a mild assumption on the so-called associated endomorphisms. In particular, this yields the existence of the Weil descent functor in the category of difference algebras, which, to our knowledge, does not appear elsewhere.

Submitted 19 July, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

Comments: 32 pages. This version contains a different proof of the main theorem (section 5). A sketch of the original proof now appears in the appendix

MSC Class: 12H05; 12H10; 14A99

arXiv:2206.10532 [pdf]

Terabit Indoor Laser-Based Wireless Communications: LiFi 2.0 for 6G

Authors: Mohammad Dehghani Soltani, Hossein Kazemi, Elham Sarbazi, Ahmad Adnan Qidan, Barzan Yosuf, Sanaa Mohamed, Ravinder Singh, Bela Berde, Dominique Chiaroni, Bastien Béchadergue, Fathi Abdeldayem, Hardik Soni, Jose Tabu, Micheline Perrufel, Nikola Serafimovski, Taisir E. H. El-Gorashi, Jaafar Elmirghani, Richard Penty, Ian H. White, Harald Haas, Majid Safari

Abstract: This paper provides a summary of available technologies required for implementing indoor laser-based wireless networks capable of achieving aggregate data-rates of terabits per second as widely accepted as a sixth generation (6G) key performance indicator. The main focus of this paper is on the technologies supporting the near infrared region of the optical spectrum. The main challenges in the design of the transmitter and receiver systems and communication/networking schemes are identified and new insights are provided. This paper also covers the previous and recent standards as well as industrial applications for optical wireless communications (OWC) and LiFi.

Submitted 21 June, 2022; originally announced June 2022.

Comments: 7 pages, 7 figures

arXiv:2206.05011 [pdf, other]

Finite size mediated radiative coupling of lasing plasmonic bound state in continuum

Authors: Benjamin O. Asamoah, Marek Nečada, Wenzhe Liu, Janne Heikkinen, Sughra Mohamed, Atri Halder, Heikki Rekola, Matias Koivurova, Aaro I. Väkeväinen, Päivi Törmä, Jari Turunen, Tero Setälä, Ari T. Friberg, Lei Shi, Tommi K. Hakala

Abstract: Radiative properties of lasing plasmonic bound state in continuum are analyzed. The topological charge of the lasing signal is analyzed in the far field as well as in the source plane of the finite sized plasmonic lattice. The physical mechanism enabling the coupling of the BIC to radiation continuum is identified. We show that while the BICs have their origin in multipolar resonances, their far-field radiation properties are governed by the position dependent dipole moment distribution induced by the symmetry breaking in a finite plasmonic lattice. Remarkably, this dipole-moment enabled coupling to radiation continuum maintains the essential topological features of the infinite lattice BICs.

Submitted 10 June, 2022; originally announced June 2022.

Showing 1–50 of 200 results for author: Mohamed, S