Search | arXiv e-print repository

Correlation of Magnetic State Configurations in Nanotubes with FMR spectrum

Authors: Abhishek Kumar, Chirag Kalouni, Raghvendra Posti, Vivek K Malik, Dhananjay Tiwari, Debangsu Roy

Abstract: Magnetic nanotubes have garnered immense attention for their potential in high-density magnetic memory, owing to their stable flux closure configuration and fast, reproducible reversal processes. However, characterizing their magnetic configuration through straightforward methodologies remains a challenge in both scope and detail. Here, we elucidate the magnetic state details using Remanence Field Ferromagnetic Resonance Spectroscopy (RFMR) for arrays of electrodeposited nanotubes. Micromagnetic simulations revealed distinct spin configurations while coming from saturation, including the edge vortex, onion, uniform and curling states, with chirality variations depending on the preparation field direction. Dynamic measurements, coupled with RFMR spectra analysis, unveiled multiple FMR modes corresponding to these spin configurations. The evolution of spin configurations under bias fields was studied, indicating nucleation within the curling state. Observations revealed opposite RFMR spectra, denoting opposite magnetic spin configurations after removing the positive and negative saturating fields when the magnetic field was applied along ($\theta_H = 0^\circ$) and perpendicular ($\theta_H = 90^\circ$) to the nanotube axis. We observed a mixture of the non-uniform curling states with the end vortex state (onion-like curling state) at the ends of the nanotubes for $\theta_H = 0^\circ$ ($90^\circ$), and uniform magnetization states in the middle of the nanotubes for the $\theta_H = 0^\circ$ configuration.
Building on RFMR information, frequency-swept FMR absorption spectra obtained at different bias fields allowed the characterization of magnetization states. This picture was supported by micromagnetic simulations. These findings were further substantiated with First Order Reversal Curve (FORC) measurements.

Submitted 18 August, 2024; originally announced August 2024.

arXiv:2407.21783 [pdf, other]

The Llama 3 Herd of Models

Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, et al. (510 additional authors not shown)

Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.

Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

arXiv:2405.19271 [pdf, other]

Detecting the Stochastic Gravitational Wave Background from Primordial Black Holes in Slow-reheating Scenarios

Authors: Luis E. Padilla, Juan Carlos Hidalgo, Karim A. Malik, David Mulryne

Abstract: After primordial inflation, the universe may have experienced a prolonged reheating epoch, potentially leading to a phase of matter domination supported by the oscillating inflaton field. During such an epoch, perturbations in the inflaton virialize upon reentering the cosmological horizon, forming inflaton structures. If the primordial overdensities are sufficiently large, these structures collapse to form primordial black holes (PBHs). To occur at a significant rate, this process requires an enhanced primordial power spectrum (PPS) at small scales. The enhancement of the PPS, as well as the formation and tidal interaction of the primordial structures, will in turn source a stochastic gravitational wave background (SGWB) that could be detected by current and/or future gravitational wave detectors. In this paper, we study the SGWB arising from these different sources during slow-reheating, focusing on a PPS that satisfies the requirements necessary for the formation of PBHs with a mass of $M_{\rm PBH}\simeq 10^{21}$ and that constitute the entirety of dark matter in the universe.

Submitted 29 May, 2024; originally announced May 2024.

Comments: 15 pages, 6 figures. Comments are welcome!

arXiv:2405.13619 [pdf]

Drastic modification in thermal conductivity of TiCoSb Half-Heusler alloy: Phonon engineering by lattice softening and ionic polarization

Authors: S. Mahakal, Avijit Jana, Diptasikha Das, Nabakumar Rana, Pallabi Sardar, Aritra Banerjee, Shamima Hussain, Santanu K. Maiti, K. Malik

Abstract: A drastic variation in thermal conductivity ($\kappa$) for synthesized samples (TiCoSb$_{1+x}$, x = 0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed, and a ~47% reduction in $\kappa$ is reported for the TiCoSb$_{1.02}$ sample. In-depth structural analysis is performed, employing the mixed-phase Rietveld refinement technique. Embedded phases and vacancies are analyzed from X-ray diffraction (XRD) and scanning electron microscopy data. Local structures of the synthesized samples are explored for the first time for the TiCoSb system by X-ray absorption spectroscopy measurements and corroborated with Rietveld refinement data. Lattice dynamics are revealed using Raman spectroscopy (RS) measurements, an unprecedented attempt for the TiCoSb system. XRD and RS data establish that the variation in $\kappa$ as a function of Sb concentration arises from an alteration in phonon group velocity related to lattice softening. The polar nature of the TiCoSb HH sample is revealed. LO-TO splitting (related to polar optical phonon scattering) in phonon vibration is observed due to the polar nature of the synthesized TiCoSb samples. Tailoring of the LO-TO splitting due to a screening effect, correlated with Co vacancies, is reported for the synthesized TiCoSb$_{1+x}$ samples. Lattice softening and LO-TO splitting lead to a ~47% decrease in $\kappa$ for the synthesized TiCoSb$_{1.02}$ sample.

Submitted 22 May, 2024; originally announced May 2024.

Comments: Main article (17 pages, 10 figures), Supplemental article (5 pages, 7 figures), Comments are welcome

arXiv:2402.03542 [pdf, other]

Primordial black hole formation during slow-reheating: A review

Authors: Luis E. Padilla, Juan Carlos Hidalgo, Tadeo D. Gomez-Aguilar, Karim A. Malik, Gabriel German

Abstract: In this paper we review the possible mechanisms for the production of primordial black holes (PBHs) during a slow-reheating period (in which the energy transfer of the inflaton field to standard model particles becomes effective at low temperatures), offering a comprehensive examination of the theoretical foundations and conditions required for each formation channel. In particular, we focus on post-inflationary scenarios where there are no self-resonances and the reheating epoch can be described by the inflaton evolving in a quadratic-like potential. In the hydrodynamical interpretation of this field during the slow-reheating epoch, the gravitational collapse of primordial fluctuations is subject to conditions on their sphericity, limits on their spin, as well as a maximum velocity dispersion. We show how to account for all conditions and show that PBHs form with different masses depending on the collapse mechanism. Finally, we show, through an example, how PBH production serves to probe both the physics after primordial inflation and the primordial power spectrum at the smallest scales.

Submitted 5 February, 2024; originally announced February 2024.

Comments: 8 figures. Submitted to Frontiers in Astronomy and Space Sciences. Comments are welcome!

arXiv:2401.15675 [pdf]

doi 10.1201/9781003433958-11

Detection of a facemask in real-time using deep learning methods: Prevention of Covid 19

Authors: Gautam Siddharth Kashyap, Jatin Sohlot, Ayesha Siddiqui, Ramsha Siddiqui, Karan Malik, Samar Wazir, Alexander E. I. Brownlee

Abstract: A health crisis is raging all over the world with the rapid transmission of the novel coronavirus disease (Covid-19). Of the guidelines issued by the World Health Organisation (WHO) to protect us against Covid-19, wearing a facemask is the most effective. Many countries have mandated the wearing of face masks, but monitoring a large number of people to ensure that they are wearing masks in a crowded place is a challenging task in itself. Covid-19 has already affected our day-to-day life as well as world trade movements. By the end of April 2021, the world had recorded 144,358,956 confirmed cases of Covid-19, including 3,066,113 deaths, according to the WHO. These increasing numbers motivate automated techniques for the detection of a facemask in real-time scenarios for the prevention of Covid-19. We propose a technique using deep learning that works for single and multiple people in a frame recorded via webcam, whether still or in motion. We have also experimented with our approach in night light. The accuracy of our model is good compared to the other approaches in the literature, ranging from 74% for multiple people in night light to 99% for a single person in daylight.

Submitted 28 January, 2024; originally announced January 2024.

Comments: Research Advances in Network Technologies (Volume 2) (CRC Press Taylor and Francis), 2023 (Accepted)

arXiv:2401.15493 [pdf]

The WTP-WTA Gap for Public Goods: New Insights from Compensating and Equivalent Variation Closed-Form Solutions

Authors: Daniel H. Karney, Khyati Malik

Abstract: This study finds exact closed-form solutions for compensating variation (CV) and equivalent variation (EV) for both marginal and non-marginal changes in public goods given homothetic utility. The parameters for these solutions are recoverable from observable data in empirical applications as a single sufficient statistic summarizes consumer preferences. The closed-form CV and EV expressions identify three economic mechanisms that determine the magnitudes of CV and EV. One of these mechanisms, the relative preference effect, helps explain the disparity between willingness to pay (WTP) and willingness to accept (WTA) for public goods.

Submitted 18 July, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

Comments: New title; New results related to EV (and associated theorems) and certainty equivalence; Edits to Introduction with minor edits elsewhere; Additional references

arXiv:2312.15563 [pdf, ps, other]

Dynamics of Global Emission Permit Prices and Regional Social Cost of Carbon under Noncooperation

Authors: Yongyang Cai, Khyati Malik, Hyeseon Shin

Abstract: We build a dynamic multi-region model of climate and economy with emission permit trading among 12 aggregated regions in the world. We solve for the dynamic Nash equilibrium under noncooperation, wherein each region adheres to the emission cap constraints following commitments that were first outlined in the 2015 Paris Agreement and updated in subsequent years. Our model shows that the emission permit price reaches $811 per ton of carbon by 2050. We demonstrate that a regional carbon tax is complementary to the global cap-and-trade system, and the optimal regional carbon tax is equal to the difference between the regional marginal abatement cost and the permit price.

Submitted 13 April, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

arXiv:2311.14513 [pdf, other]

Induced gravitational waves: the effect of first order tensor perturbations

Authors: Raphael Picard, Karim A. Malik

Abstract: Scalar induced gravitational waves contribute to the cosmological gravitational wave background. They can be related to the primordial density power spectrum produced towards the end of inflation and therefore are a convenient new tool to constrain models of inflation. These waves are sourced by terms quadratic in perturbations and hence appear at second order in cosmological perturbation theory. While the focus of research so far has been on purely scalar source terms, we also study the effect of including first order tensor perturbations as an additional source. This gives rise to two additional source terms: a term quadratic in the tensor perturbations and a cross term involving mixed scalar and tensor perturbations. We present full analytical expressions for the spectral density of these new source terms and discuss their general behaviour. To illustrate the generation mechanism, we study two toy models containing a peak on small scales. For these models we show that the scalar-tensor contribution becomes non-negligible compared to the scalar-scalar contribution on smaller scales. We also consider implications for future gravitational wave surveys.

Submitted 13 December, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

Comments: 31 pages, 7 figures

arXiv:2311.09496 [pdf, ps, other]

Posterior-Mean Separable Costs of Information Acquisition

Authors: Jeffrey Mensch, Komal Malik

Abstract: We analyze a problem of revealed preference given state-dependent stochastic choice data in which the payoff to a decision maker (DM) only depends on their beliefs about posterior means. Often, the DM must also learn about or pay attention to the state; in applied work on this subject, a convenient assumption is that the costs of such learning are linearly dependent in the distribution over posterior means. We provide testable conditions to identify whether this assumption holds. This allows for the use of information design techniques to solve the DM's problem.

Submitted 11 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: As currently written, Lemma 6, which is central for the main result (Theorem 1), is incorrect. We need to try to replace it with a correct claim before this paper can be considered

arXiv:2311.00326 [pdf]

Transport and electrical properties of cryogenic thermoelectric FeSb2: the effect of isoelectronic and hole doping

Authors: Deepak Gujjar, Sunidhi Gujjar, V. K. Malik, Hem C. Kandpal

Abstract: Thermoelectric materials operating at cryogenic temperatures are in high demand for efficient cooling and power generation in applications ranging from superconductors to quantum computing. The narrow band-gap semiconductor FeSb2, known for its colossal Seebeck coefficient, holds promise for such applications, provided its thermal conductivity can be reduced. This study investigates the impact of isoelectronic substitution (Bi) and hole doping (Pb) at the Sb site on the transport properties of FeSb2, with a particular focus on thermal conductivity ($\kappa$). Polycrystalline FeSb2 powder, along with Bi- and Pb-doped samples, was synthesized using a simple co-precipitation approach, followed by thermal treatment in an H2 atmosphere. XRD and SEM analyses confirm the formation of the desired phase pre- and post-consolidation using spark plasma sintering (SPS). The consolidation process resulted in a high compaction density and the formation of submicrometer-sized grains, as substantiated by electron backscattered diffraction (EBSD) analysis. Substituting 1% of Bi and Pb at the Sb site successfully suppressed $\kappa$ from ~15 W/m-K in pure FeSb2 to ~10 and ~8.7 W/m-K, respectively. Importantly, resistivity measurements revealed a metal-to-insulator transition at around 6.5 K in undoped FeSb2 and isoelectronically Bi-substituted FeSb2, suggesting the existence of metallic surface states and providing valuable evidence for the perplexing topological behavior exhibited by FeSb2.

Submitted 1 November, 2023; originally announced November 2023.

Comments: 19 pages, 5 figures

arXiv:2310.03856 [pdf, other]

Securing Voice Biometrics: One-Shot Learning Approach for Audio Deepfake Detection

Authors: Awais Khan, Khalid Mahmood Malik

Abstract: The Automatic Speaker Verification (ASV) system is vulnerable to fraudulent activities using audio deepfakes, also known as logical-access voice spoofing attacks. These deepfakes pose a concerning threat to voice biometrics due to recent advancements in generative AI and speech synthesis technologies. While several deep learning models for speech synthesis detection have been developed, most of them show poor generalizability, especially when the attacks have different statistical distributions from the ones seen. Therefore, this paper presents Quick-SpoofNet, an approach for detecting both seen and unseen synthetic attacks in the ASV system using one-shot learning and metric learning techniques. By using the effective spectral feature set, the proposed method extracts compact and representative temporal embeddings from the voice samples and utilizes metric learning and triplet loss to assess the similarity index and distinguish different embeddings. The system effectively clusters similar speech embeddings, classifying bona fide speeches as the target class and identifying other clusters as spoofing attacks. The proposed system is evaluated using the ASVspoof 2019 logical access (LA) dataset and tested against unseen deepfake attacks from the ASVspoof 2021 dataset. Additionally, its generalization ability towards unseen bona fide speech is assessed using speech data from the VSDC dataset.

Submitted 5 October, 2023; originally announced October 2023.

arXiv:2309.16039 [pdf, other]

Effective Long-Context Scaling of Foundation Models

Authors: Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

Abstract: We present a series of long-context LLMs that support effective context windows of up to 32,768 tokens. Our model series are built through continual pretraining from Llama 2 with longer training sequences and on a dataset where long texts are upsampled. We perform extensive evaluation on language modeling, synthetic context probing tasks, and a wide range of research benchmarks. On research benchmarks, our models achieve consistent improvements on most regular tasks and significant improvements on long-context tasks over Llama 2. Notably, with a cost-effective instruction tuning procedure that does not require human-annotated long instruction data, the 70B variant can already surpass gpt-3.5-turbo-16k's overall performance on a suite of long-context tasks. Alongside these results, we provide an in-depth analysis on the individual components of our method. We delve into Llama's position encodings and discuss its limitation in modeling long dependencies. We also examine the impact of various design choices in the pretraining process, including the data mix and the training curriculum of sequence lengths -- our ablation experiments suggest that having abundant long texts in the pretrain dataset is not the key to achieving strong performance, and we empirically verify that long context continual pretraining is more efficient and similarly effective compared to pretraining from scratch with long sequences.

Submitted 13 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.10560 [pdf, other]

Bridging the Spoof Gap: A Unified Parallel Aggregation Network for Voice Presentation Attacks

Authors: Awais Khan, Khalid Mahmood Malik

Abstract: Automatic Speaker Verification (ASV) systems are increasingly used in voice biometrics for user authentication but are susceptible to logical and physical spoofing attacks, posing security risks. Existing research mainly tackles logical or physical attacks separately, leading to a gap in unified spoofing detection. Moreover, when existing systems attempt to handle both types of attacks, they often exhibit significant disparities in the Equal Error Rate (EER). To bridge this gap, we present a Parallel Stacked Aggregation Network that processes raw audio. Our approach employs a split-transform-aggregation technique, dividing utterances into convolved representations, applying transformations, and aggregating the results to identify logical (LA) and physical (PA) spoofing attacks. Evaluation on the ASVspoof-2019 and VSDC datasets shows the effectiveness of the proposed system. It outperforms state-of-the-art solutions, displaying reduced EER disparities and superior performance in detecting spoofing attacks. This highlights the proposed method's generalizability and superiority. In a world increasingly reliant on voice-based security, our unified spoofing detection system provides a robust defense against a spectrum of voice spoofing attacks, safeguarding ASVs and user data effectively.

Submitted 19 September, 2023; originally announced September 2023.

arXiv:2309.09837 [pdf, other]

Frame-to-Utterance Convergence: A Spectra-Temporal Approach for Unified Spoofing Detection

Authors: Awais Khan, Khalid Mahmood Malik, Shah Nawaz

Abstract: Voice spoofing attacks pose a significant threat to automated speaker verification systems. Existing anti-spoofing methods often simulate specific attack types, such as synthetic or replay attacks. However, in real-world scenarios, the countermeasures are unaware of the generation schema of the attack, necessitating a unified solution. Current unified solutions struggle to detect spoofing artifacts, especially with recent spoofing mechanisms. For instance, the spoofing algorithms inject spectral or temporal anomalies, which are challenging to identify. To this end, we present a spectra-temporal fusion leveraging frame-level and utterance-level coefficients. We introduce a novel local spectral deviation coefficient (SDC) for frame-level inconsistencies and employ a bi-LSTM-based network for sequential temporal coefficients (STC), which capture utterance-level artifacts. Our spectra-temporal fusion strategy combines these coefficients, and an auto-encoder generates spectra-temporal deviated coefficients (STDC) to enhance robustness. Our proposed approach addresses multiple spoofing categories, including synthetic, replay, and partial deepfake attacks. Extensive evaluation on diverse datasets (ASVspoof2019, ASVspoof2021, VSDC, partial spoofs, and in-the-wild deepfakes) demonstrated its robustness for a wide range of voice applications.

Submitted 18 September, 2023; originally announced September 2023.

arXiv:2309.07712 [pdf, other]

Normalized factorial moments of spatial distributions of particles in high multiplicity events: A Toy model study

Authors: Sheetal Sharma, Salman Khurshid Malik, Zarina Banoo, Ramni Gupta

Abstract: In ultra-relativistic heavy-ion collisions a strongly interacting complex system of quarks and gluons is formed. The nature of the system so created and the mechanism of multi-particle production in these collisions may be revealed by studying the normalized factorial moments ($F_{\rm{q}}$) as a function of various parameters. The resilience of $F_{\rm{q}}$ moments studied using Toy model events shows that these are sensitive to the presence of dynamical fluctuations in the system and are robust against the uniform efficiencies in the data measurements. Results of this study serve as a suitable reference baseline for the experimental and simulation studies.

Submitted 16 October, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

Comments: 6 pages, 9 figures

arXiv:2307.08713 [pdf, other]

doi 10.1109/TFUZZ.2024.3400898

Intuitionistic Fuzzy Broad Learning System: Enhancing Robustness Against Noise and Outliers

Authors: M. Sajid, A. K. Malik, M. Tanveer

Abstract: In the realm of data classification, the broad learning system (BLS) has proven to be a potent tool that utilizes a layer-by-layer feed-forward neural network. However, the traditional BLS treats all samples as equally significant, which makes it less robust and less effective for real-world datasets with noise and outliers. To address this issue, we propose the fuzzy broad learning system (F-BLS) and the intuitionistic fuzzy broad learning system (IF-BLS) models that confront challenges posed by the noise and outliers present in the dataset and enhance overall robustness. Employing a fuzzy membership technique, the proposed F-BLS model embeds sample neighborhood information based on the proximity of each class center within the inherent feature space of the BLS framework. Furthermore, the proposed IF-BLS model introduces intuitionistic fuzzy concepts encompassing membership, non-membership, and score value functions. IF-BLS strategically considers homogeneity and heterogeneity in sample neighborhoods in the kernel space. We evaluate the performance of the proposed F-BLS and IF-BLS models on UCI benchmark datasets with and without Gaussian noise. As an application, we implement the proposed F-BLS and IF-BLS models to diagnose Alzheimer's disease (AD). Experimental findings and statistical analyses consistently highlight the superior generalization capabilities of the proposed F-BLS and IF-BLS models over baseline models across all scenarios. The proposed models offer a promising solution to enhance the BLS framework's ability to handle noise and outliers.
The source code of the proposed model is available at https://github.com/mtanveer1/IF-BLS.

Submitted 11 May, 2024; v1 submitted 15 July, 2023; originally announced July 2023.

Journal ref: IEEE Transactions on Fuzzy Systems, 2024

arXiv:2307.07881 [pdf, ps, other]

doi 10.1109/TNNLS.2024.3353531

Graph Embedded Intuitionistic Fuzzy Random Vector Functional Link Neural Network for Class Imbalance Learning

Authors: M. A. Ganaie, M. Sajid, A. K. Malik, M. Tanveer

Abstract: The domain of machine learning is confronted with a crucial research area known as class imbalance learning, which presents considerable hurdles in the precise classification of minority classes. This issue can result in biased models where the majority class takes precedence in the training process, leading to the underrepresentation of the minority class. The random vector functional link (RVFL) network is a widely used and effective learning model for classification due to its good generalization performance and efficiency. However, it suffers when dealing with imbalanced datasets. To overcome this limitation, we propose a novel graph embedded intuitionistic fuzzy RVFL for class imbalance learning (GE-IFRVFL-CIL) model incorporating a weighting mechanism to handle imbalanced datasets. The proposed GE-IFRVFL-CIL model offers a plethora of benefits: $(i)$ leveraging graph embedding to preserve the inherent topological structure of the datasets, $(ii)$ employing intuitionistic fuzzy theory to handle uncertainty and imprecision in the data, and $(iii)$, most importantly, tackling class imbalance learning. The amalgamation of a weighting scheme, graph embedding, and intuitionistic fuzzy sets leads to the superior performance of the proposed models on KEEL benchmark imbalanced datasets with and without Gaussian noise. Furthermore, we implemented the proposed GE-IFRVFL-CIL on the ADNI dataset and achieved promising results, demonstrating the model's effectiveness in real-world applications.
The proposed GE-IFRVFL-CIL model offers a promising solution to address the class imbalance issue, mitigates the detrimental effect of noise and outliers, and preserves the inherent geometrical structures of the dataset.

Submitted 16 February, 2024; v1 submitted 15 July, 2023; originally announced July 2023.

Comments: IEEE Transactions on Neural Networks and Learning Systems

Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2024

arXiv:2306.02308 [pdf]

Roulette-Wheel Selection-Based PSO Algorithm for Solving the Vehicle Routing Problem with Time Windows

Authors: Gautam Siddharth Kashyap, Alexander E. I. Brownlee, Orchid Chetia Phukan, Karan Malik, Samar Wazir

Abstract: The well-known Vehicle Routing Problem with Time Windows (VRPTW) aims to reduce the cost of moving goods between several destinations while accommodating constraints like set time windows for certain locations and vehicle capacity. Applications of the VRPTW problem in the real world include Supply Chain Management (SCM) and logistic dispatching, both of which are crucial to the economy and are expanding quickly as work habits change. Metaheuristic algorithms such as Particle Swarm Optimization (PSO) have been found to solve the VRPTW effectively; however, they can experience premature convergence. To lower the risk of PSO's premature convergence, the authors have solved VRPTW in this paper utilising a novel form of the PSO methodology that uses the Roulette Wheel Method (RWPSO). Computing experiments using the Solomon VRPTW benchmark datasets on the RWPSO demonstrate that RWPSO is competitive with other state-of-the-art algorithms from the literature. Also, comparisons with two cutting-edge algorithms from the literature show how competitive the suggested algorithm is.

Submitted 4 June, 2023; originally announced June 2023.

arXiv:2305.15303 [pdf]

Transport phenomena of TiCoSb: Defects induced modification in structure and density of states

Authors: S. Mahakal, Diptasikha Das, Pintu Singha, Aritra Banerjee, S. Chatterjee, Santanu K. Maiti, S. Assa Aravindh, K. Malik

Abstract: TiCoSb$_{1+x}$ (x=0.0, 0.01, 0.02, 0.03, 0.04, 0.06) samples have been synthesized, employing solid state reaction method followed by arc melting. Theoretical calculations, using Density Functional Theory (DFT), have been performed to estimate band structure and density of states (DOS). Further, energetic calculations, using first principles, have been carried out to reveal the formation energy for vacancy, interstitial, and anti-site defects. Detailed structural calculation, employing Rietveld refinement, reveals the presence of embedded phases, vacancies and interstitial atoms, which is also supported by the theoretical calculations. Lattice strain, crystalline size and dislocation density have been estimated by Williamson-Hall and modified Williamson-Hall methods. Thermal variation of resistivity [$\rho(T)$] and thermopower [$S(T)$] have been explained using the Mott equation and density of states (DOS) modification near the Fermi surface due to Co vacancy and embedded phases. Figure of merit (ZT) has been calculated, and a ZT for TiCoSb 4 to 5 times higher than the earlier reported value is obtained at room temperature.

Submitted 24 May, 2023; originally announced May 2023.

Comments: 12 pages, 12 figures (comments are welcome)

arXiv:2301.04844 [pdf, other]

SACDNet: Towards Early Type 2 Diabetes Prediction with Uncertainty for Electronic Health Records

Authors: Tayyab Nasir, Muhammad Kamran Malik

Abstract: Type 2 diabetes mellitus (T2DM) is one of the most common diseases and a leading cause of death. The problem of early diagnosis of T2DM is challenging and necessary to prevent serious complications. This study proposes a novel neural network architecture for early T2DM prediction using multi-headed self-attention and dense layers to extract features from historic diagnoses, patient vitals, and demographics. The proposed technique is called the Self-Attention for Comorbid Disease Net (SACDNet), achieving an accuracy of 89.3% and an F1-score of 89.1%, a 1.6% increase in accuracy and a 1.3% increase in F1-score compared to the baseline techniques. Monte Carlo (MC) Dropout is applied to the SACDNet to get a Bayesian approximation. A T2DM prediction framework based on the MC Dropout SACDNet is proposed to quantify the uncertainty associated with the predictions. A T2DM prediction dataset is also built as part of this study which is based on real-world routine Electronic Health Record (EHR) data comprising 4,124 diabetic and 181,767 non-diabetic examples, collected from 295 different EHR systems running in different parts of the United States of America. This dataset is further used to evaluate 7 different machine learning and 3 deep learning-based models. Finally, a detailed analysis of the fairness of every technique against different patient demographic groups is performed to validate the unbiased generalization of the techniques and the diversity of the data.

Submitted 18 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

Comments: Misspelled SACEDNet changed to SACDNet in Abstract. Related Work corrected tehcniques to techniques. Dataset rh replaced with Rh. Methodology, mod replaced with models, andrepresentation with representations. Removed bold formatting from Table 3 in Methodology. Results replaced Hence the with Hence, this

arXiv:2212.14367 [pdf, ps, other]

Optimal Robust Mechanism in Bilateral Trading

Authors: Komal Malik

Abstract: We consider a model of bilateral trade with private values. The value of the buyer and the cost of the seller are jointly distributed. The true joint distribution is unknown to the designer; however, the marginal distributions of the value and the cost are known to the designer. The designer wants to find a trading mechanism that is robustly Bayesian incentive compatible, robustly individually rational, budget-balanced and maximizes the expected gains from trade over all such mechanisms. We refer to such a mechanism as an optimal robust mechanism. We establish equivalence between Bayesian incentive compatible mechanisms (BIC) and dominant strategy mechanisms (DSIC). We characterise the worst distribution for a given mechanism and use this characterisation to find an optimal robust mechanism. We show that there is an optimal robust mechanism that is deterministic (posted-price), dominant strategy incentive compatible, and ex-post individually rational. We also derive an explicit expression of the posted-price of such an optimal robust mechanism. We also show the equivalence between the efficiency gains from the optimal robust mechanism (max-min problem) and guaranteed efficiency gains if the designer could choose the mechanism after observing the true joint distribution (min-max problem).

Submitted 29 December, 2022; originally announced December 2022.

arXiv:2211.08670 [pdf, ps, other]

doi 10.1017/jfm.2022.926

Experimental observation of a confined bubble moving in shear-thinning fluids

Authors: SungGyu Chun, Bingqiang Ji, Zhengyu Yang, Vinit Kumar Malik, Jie Feng

Abstract: The motion of a long gas bubble in a confined capillary tube is ubiquitous in a wide range of engineering and biological applications. While the understanding of the deposited thin viscous film near the tube wall in Newtonian fluids is well developed, the deposition dynamics in commonly encountered non-Newtonian fluids remains much less studied. Here, we investigate the dynamics of a confined bubble moving in shear-thinning fluids with systematic experiments, varying the zero-shear-rate capillary number $Ca_0$ in the range of $O(10^{-3}-10^2)$ considering the zero-shear-rate viscosity. The thickness of the deposited liquid film, the bubble speed and the bubble front/rear menisci are measured, which are further rationalized with the recent theoretical studies based on appropriate rheological models. Compared with Newtonian fluids, the film thickness decreases for both the carboxymethyl cellulose and Carbopol solutions when the shear-thinning effect dominates. We show that the film thickness follows the scaling law from Aussillous and Quéré (2000) with an effective capillary number $Ca_e$, considering the characteristic shear rate in the film as proposed by Picchi et al. (2021). $Ca_e$ is calculated by the Carreau number and the power-law index from the Carreau-Yasuda rheological model. The shear-thinning effect also influences the bubble speed and delays the transition to the parabolic region in the bubble front and rear menisci.
In particular, a high degree of undulations on the bubble surface results in an intricate viscosity distribution for the rear meniscus, and the deviation between the experiments and theory may require further investigation to resolve the axial velocity field. Our study may advance the fundamental understanding of, and engineering guidelines for, coating processes involving thin-film flows and non-Newtonian fluids.

Submitted 15 November, 2022; originally announced November 2022.

arXiv:2210.08090 [pdf, other]

Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning

Authors: John Nguyen, Jianyu Wang, Kshitiz Malik, Maziar Sanjabi, Michael Rabbat

Abstract: An oft-cited challenge of federated learning is the presence of heterogeneity. Data heterogeneity refers to the fact that data from different clients may follow very different distributions. System heterogeneity refers to the fact that client devices have different system capabilities. A considerable number of federated optimization methods address this challenge. In the literature, empirical evaluations usually start federated training from random initialization. However, in many practical applications of federated learning, the server has access to proxy data for the training task that can be used to pre-train a model before starting federated training. We empirically study the impact of starting from a pre-trained model in federated learning using four standard federated learning benchmark datasets. Unsurprisingly, starting from a pre-trained model reduces the training time required to reach a target error rate and enables the training of more accurate models (up to 40%) than is possible when starting from random initialization. Surprisingly, we also find that starting federated learning from a pre-trained initialization reduces the effect of both data and system heterogeneity. We recommend that future work proposing and evaluating federated optimization methods evaluate the performance when starting from random and pre-trained initializations. We also believe this study raises several questions for further work on understanding the role of heterogeneity in federated optimization.

Submitted 14 October, 2022; originally announced October 2022.

Comments: v2. arXiv admin note: substantial text overlap with arXiv:2206.15387

arXiv:2210.07942 [pdf, other]

Intermittency analysis of charged hadrons generated in Pb-Pb collisions at $\sqrt{s_{NN}}$= 2.76 TeV and 5.02 TeV using PYTHIA8/Angantyr

Authors: Salman Khurshid Malik, Ramni Gupta

Abstract: Local density fluctuations are expected to scale as a universal power-law when the system approaches the critical point. Such power-law fluctuations are studied within the framework of intermittency through the measurement of normalized factorial moments in ($\eta$, $\phi$) phase space. Observations and results from the intermittency analysis performed for charged particles in Pb-Pb collisions using PYTHIA8/Angantyr at 2.76 TeV and 5.02 TeV are reported. We observe no scaling behaviour in the particle generation for any of the centralities studied in narrow p$_T$ bins. The scaling exponent $\nu$ shows no dependence on the centrality ranges.

Submitted 27 November, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

arXiv:2210.00417 [pdf, other]

doi 10.48550/arXiv.2210.00417

Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward

Authors: Awais Khan, Khalid Mahmood Malik, James Ryan, Mikul Saravanan

Abstract: Malicious actors may seek to use different voice-spoofing attacks to fool ASV systems and even use them for spreading misinformation. Various countermeasures have been proposed to detect these spoofing attacks. Due to the extensive work done on spoofing detection in automated speaker verification (ASV) systems in the last 6-7 years, there is a need to classify the research and perform qualitative and quantitative comparisons on state-of-the-art countermeasures. Additionally, no existing survey paper has reviewed integrated solutions to voice spoofing evaluation and speaker verification, adversarial/antiforensics attacks on spoofing countermeasures, and ASV itself, or unified solutions to detect multiple attacks using a single model. Further, no work has been done to provide an apples-to-apples comparison of published countermeasures in order to assess their generalizability by evaluating them across corpora. In this work, we conduct a review of the literature on spoofing detection using hand-crafted features, deep learning, end-to-end, and universal spoofing countermeasure solutions to detect speech synthesis (SS), voice conversion (VC), and replay attacks. Additionally, we also review integrated solutions to voice spoofing evaluation and speaker verification, adversarial and anti-forensics attacks on voice countermeasures, and ASV. The limitations and challenges of the existing spoofing countermeasures are also presented. We report the performance of these countermeasures on several datasets and evaluate them across corpora.
For the experiments, we employ the ASVspoof2019 and VSDC datasets along with GMM, SVM, CNN, and CNN-GRU classifiers. For reproducibility of the results, the code of the test bed can be found in our GitHub Repository.

Submitted 21 November, 2022; v1 submitted 1 October, 2022; originally announced October 2022.

arXiv:2208.02060 [pdf, other]

doi 10.1103/PhysRevB.106.075105

Large nonsaturating magnetoresistance, weak anti-localization and non-trivial topological states in SrAl$_2$Si$_2$

Authors: Sudip Malick, A. B. Sarkar, Antu Laha, M. Anas, V. K. Malik, Amit Agarwal, Z. Hossain, J. Nayak

Abstract: We explore the electronic and topological properties of single crystal SrAl$_2$Si$_2$ using magnetotransport experiments in conjunction with first-principles calculations. We find that the temperature-dependent resistivity shows a pronounced peak near 50 K. We observe several remarkable features at low temperatures, such as large non-saturating magnetoresistance, Shubnikov-de Haas oscillations and cusp-like magneto-conductivity. The maximum value of magnetoresistance turns out to be 459% at 2 K and 12 T. The analysis of the cusp-like feature in magneto-conductivity indicates a clear signature of weak anti-localization. Our Hall resistivity measurements confirm the presence of two types of charge carriers in SrAl$_2$Si$_2$, with low carrier density.

Submitted 3 August, 2022; originally announced August 2022.

Journal ref: Phys. Rev. B 106, 075105 (2022)

arXiv:2206.15387 [pdf, other]

Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning

Authors: John Nguyen, Jianyu Wang, Kshitiz Malik, Maziar Sanjabi, Michael Rabbat

Abstract: An oft-cited challenge of federated learning is the presence of heterogeneity. Data heterogeneity refers to the fact that data from different clients may follow very different distributions. System heterogeneity refers to client devices having different system capabilities. A considerable number of federated optimization methods address this challenge. In the literature, empirical evaluations usually start federated training from random initialization. However, in many practical applications of federated learning, the server has access to proxy data for the training task that can be used to pre-train a model before starting federated training. Using four standard federated learning benchmark datasets, we empirically study the impact of starting from a pre-trained model in federated learning. Unsurprisingly, starting from a pre-trained model reduces the training time required to reach a target error rate and enables the training of more accurate models (up to 40%) than is possible when starting from random initialization. Surprisingly, we also find that starting federated learning from a pre-trained initialization reduces the effect of both data and system heterogeneity. We recommend future work proposing and evaluating federated optimization methods to evaluate the performance when starting from random and pre-trained initializations. This study raises several questions for further work on understanding the role of heterogeneity in federated optimization.
Our code is available at: https://github.com/facebookresearch/where_to_begin

Submitted 24 March, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

Comments: Accepted at ICLR

Journal ref: International Conference on Learning Representations 2023

arXiv:2205.09793 [pdf, other]

doi 10.1088/1361-648X/ac8a35

The effect of antisite disorder on magnetic and exchange bias properties of Gd-substituted Y$_2$CoMnO$_6$ double perovskite

Authors: Anasua Khan, Sarita Rajput, M. Anas, V. K. Malik, T. Maitra, T. K Nath, A. Taraphder

Abstract: Combining experimental investigations and first-principles DFT calculations, we report physical and magnetic properties of Gd-substituted Y$_2$CoMnO$_6$ double perovskite, which are strongly influenced by antisite-disorder-driven spin configurations. On Gd doping, Co and Mn ions are present in mixed-valence (Co$^{3+}$, Co$^{2+}$, Mn$^{3+}$ and Mn$^{4+}$) states. Multiple magnetic transitions have been observed: i) a paramagnetic to ferromagnetic transition at $T_C$ = 95.5 K, ii) an antiferromagnetic transition at $T_N$ = 47 K, driven by $3d-4f$ polarisation and antisite disorder present in the sample, iii) a change in magnetization for $T \leq 20$ K, primarily originating from Gd ordering, as revealed from our DFT calculations. AC susceptibility measurement confirms the absence of any spin-glass or cluster-glass phases in this material. A significantly large exchange bias effect ($H_{EB}$ = 1.07 kOe) is found to occur below 47 K due to interfaces of FM and AFM clusters created by antisite disorder.

Submitted 17 August, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

Comments: 19 pages, 12 figures

Journal ref: Journal of Physics: Condensed Matter, 34, (2022), 435801

arXiv:2204.03809 [pdf, other]

Federated Learning with Partial Model Personalization

Authors: Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael Rabbat, Maziar Sanjabi, Lin Xiao

Abstract: We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the literature, but their convergence properties are not fully understood, especially for the alternating variant. We provide convergence analyses of both algorithms in the general nonconvex setting with partial participation and delineate the regime where one dominates the other. Our experiments on real-world image, text, and speech datasets demonstrate that (a) partial personalization can obtain most of the benefits of full model personalization with a small fraction of personal parameters, and (b) the alternating update algorithm often outperforms the simultaneous update algorithm by a small but consistent margin.

Submitted 15 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

Journal ref: ICML 2022: 17716-17758

arXiv:2204.01273 [pdf, ps, other]

FedSynth: Gradient Compression via Synthetic Data in Federated Learning

Authors: Shengyuan Hu, Jack Goetz, Kshitiz Malik, Hongyuan Zhan, Zhe Liu, Yue Liu

Abstract: Model compression is important in federated learning (FL) with large models to reduce communication cost. Prior works have focused on sparsification-based compression that could desperately affect the global model accuracy. In this work, we propose a new scheme for upstream communication where instead of transmitting the model update, each client learns and transmits a light-weight synthetic dataset such that, using it as the training data, the model performs similarly well on the real training data. The server will recover the local model update via the synthetic data and apply standard aggregation. We then provide a new algorithm, FedSynth, to learn the synthetic data locally. Empirically, we find our method is comparable to or better than random masking baselines on all three common federated learning benchmark datasets.

Submitted 4 April, 2022; originally announced April 2022.

Comments: 8 pages

arXiv:2203.11316 [pdf, other]

doi 10.1016/j.asoc.2023.110377

Random vector functional link network: recent developments, applications, and future directions

Authors: A. K. Malik, Ruobin Gao, M. A. Ganaie, M. Tanveer, P. N. Suganthan

Abstract: Neural networks have been successfully employed in various domains such as classification, regression and clustering. Generally, back propagation (BP) based iterative approaches are used to train neural networks; however, they suffer from local minima, sensitivity to the learning rate and slow convergence. To overcome these issues, randomization based neural networks such as the random vector functional link (RVFL) network have been proposed. The RVFL model has several characteristics, such as fast training speed, direct links, simple architecture, and universal approximation capability, that make it a viable randomized neural network. This article presents the first comprehensive review of the evolution of the RVFL model, which can serve as an extensive summary for beginners as well as practitioners. We discuss the shallow RVFLs, ensemble RVFLs, deep RVFLs and ensemble deep RVFL models. The variations, improvements and applications of RVFL models are discussed in detail. Moreover, we discuss the different hyperparameter optimization techniques followed in the literature to improve the generalization performance of the RVFL model. Finally, we give potential future research directions/opportunities that can inspire researchers to improve the RVFL's architecture and learning algorithm further.

Submitted 23 April, 2023; v1 submitted 13 February, 2022; originally announced March 2022.

arXiv:2111.04877 [pdf, other]

Papaya: Practical, Private, and Scalable Federated Learning

Authors: Dzmitry Huba, John Nguyen, Kshitiz Malik, Ruiyu Zhu, Mike Rabbat, Ashkan Yousefpour, Carole-Jean Wu, Hongyuan Zhan, Pavel Ustinov, Harish Srinivas, Kaikai Wang, Anthony Shoumikhin, Jesik Min, Mani Malek

Abstract: Cross-device Federated Learning (FL) is a distributed learning paradigm with several challenges that differentiate it from traditional distributed learning, the primary ones being variability in the system characteristics on each device and millions of clients coordinating with a central server. Most FL systems described in the literature are synchronous - they perform a synchronized aggregation of model updates from individual clients. Scaling synchronous FL is challenging since increasing the number of clients training in parallel leads to diminishing returns in training speed, analogous to large-batch training. Moreover, stragglers hinder synchronous FL training. In this work, we outline a production asynchronous FL system design. Our work tackles the aforementioned issues, sketches some of the system design challenges and their solutions, and touches upon principles that emerged from building a production FL system for millions of clients. Empirically, we demonstrate that asynchronous FL converges faster than synchronous FL when training across nearly one hundred million devices. In particular, in high concurrency settings, asynchronous FL is 5x faster and has nearly 8x less communication overhead than synchronous FL.

Submitted 25 April, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

arXiv:2110.14584 [pdf, other]

doi 10.1103/PhysRevD.106.023519

A new mechanism for primordial black hole formation during reheating

Authors: Luis E. Padilla, Juan Carlos Hidalgo, Karim A. Malik

Abstract: The reheating process at the end of inflation is often modeled by an oscillating scalar field which shows a background dust-like behaviour, prompting the analysis of gravitational collapse and black hole formation in this era to be approached by the spherical collapse of standard structure formation. In the scalar field dark matter structure formation process, virialized halos halt the direct collapse, resulting in halos with condensed central cores at the de Broglie scale of the dominant scalar field. We show that a similar process can take place during reheating, leading to the formation of primordial black holes (PBHs). We study the formation of PBHs through the further gravitational collapse of structures virialized during reheating, looking at the collapse of either the whole structure, or that of the central core within these configurations. We compute the threshold amplitude for the density contrast to undergo this process, for both free and self-interacting scalar fields. We discuss the relevance of our results for the abundance of PBHs at the lower end of the mass spectrum.

Submitted 7 July, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

Comments: 6 pages, 1 figure. Minor updates to accord with the accepted version in Phys. Rev. D

arXiv:2110.09294 [pdf]

Comparative Analysis of Deep Learning Algorithms for Classification of COVID-19 X-Ray Images

Authors: Unsa Maheen, Khawar Iqbal Malik, Gohar Ali

Abstract: The coronavirus first emerged in December 2019 in the city of Wuhan, China, and spread quickly all over the world. It has had very harmful effects on the global economy, education, social life, daily living and the general health of humans. To restrict the quick expansion of the disease initially, the main difficulty is to identify corona-positive patients as quickly as possible. As no automatic tool kits are accessible, the requirement for supplementary diagnostic tools has risen. Findings of previous studies based on radiological techniques suggest that such images contain important details related to the coronavirus. The use of a modified Artificial Intelligence (AI) system in combination with radiographic images can be fruitful for the precise detection of this virus and can also help to overcome the shortage of professional physicians in distant villages. In our research, we analyze different techniques for the detection of COVID-19 using chest X-ray radiographic images; we examined the pre-trained CNN models AlexNet, VGG-16, MobileNet-V2, SqueezeNet, ResNet-34, ResNet-50 and COVIDX-Net to obtain correct analytics for the COVID-19 classification system. Our study shows that the pre-trained CNN model with ResNet-34 gives the highest accuracy rate of 98.33, 96.77% precision, and 98.36 F1-score, which is better than the other CNN techniques. Our model may be helpful for researchers to fine-tune the CNN model for the quick screening of COVID patients.

Submitted 14 October, 2021; originally announced October 2021.

arXiv:2110.09283 [pdf, ps, other]

Complex interplay of magnetic ordering and spin-lattice coupling in orthochromite Nd$_{0.5}$Dy$_{0.5}$CrO$_{3}$

Authors: M. Anas, Padmanabhan Balasubramanian, K. Vikram, Ankita Singh, C. M. N. Kumar, Andreas Hoser, Dariusz Rusinek, A. K. Sinha, V. Srihari, Ranjan K. Singh, Rinku Kumar, Mukul Gupta, T. Maitra, V. K. Malik

Abstract: The mixed rare-earth orthochromite Nd$_{0.5}$Dy$_{0.5}$CrO$_{3}$ has a Néel temperature ($T_\mathrm{N}$) of ${\sim}$ 175\,K, resulting in the G-type antiferromagnetic ordering of Cr$^{3+}$ spins. The inverse susceptibility shows a deviation from the Curie-Weiss law at 230\,K, with a large effective paramagnetic moment of 8.8\,$\mu_{\mathrm{B}}$. The ZFC-FC magnetization curves bifurcate just above $T_\mathrm{N}$ and show a distinct signature of spin reorientation near 60\,K. Neutron diffraction shows that below $T_\mathrm{N}$, the Cr$^{3+}$ spins align in the $\Gamma_{2}$ representation as ($F_{x}$, $G_{z}$). Below 60\,K, due to spin reorientation, the magnetic structure is in the $\Gamma_{1}$ ($G_{y}$) configuration. The neutron diffraction does not show any signature of rare-earth ordering even at 1.5\,K. First principles density functional theory calculations within GGA+U and GGA+U+SO approximations reveal that the G-type antiferromagnetic order is the ground state magnetic structure of the Cr sublattice and that the spin-reorientation of Cr$^{3+}$ spins can happen in the absence of 3d-4f interactions, unlike in the case of orthoferrites. The specific heat shows a `$\lambda$' anomaly at $T_\mathrm{N}$, while at low temperature two distinct Schottky anomalies are observed: a Schottky peak at 2\,K and an additional step-like feature above 10\,K. Above $T_\mathrm{N}$, the magnetic transition is preceded by structural anomalies as seen in our x-ray diffraction and Raman measurements. The deviation of structural parameters near the Néel temperature is smaller. The phonon frequencies show deviations from the standard anharmonic behaviour: the first near 250\,K, due to magneto-volume effects, and the second near 200\,K, due to spin-phonon coupling.

Submitted 18 October, 2021; originally announced October 2021.

Comments: 12 pages, 13 figures

arXiv:2108.09975 [pdf, other]

doi 10.1103/PhysRevB.105.214436

Coexisting magnetic structures and spin-reorientation in Er$_{0.5}$Dy$_{0.5}$FeO$_{3}$: Bulk magnetization, neutron scattering, specific heat, and \emph{Ab-initio} studies

Authors: Sarita Rajput, Padmanabhan Balasubramanian, Ankita Singh, Francoise Damay, C. M. N. Kumar, W. Tabis, T. Maitra, V. K. Malik

Abstract: The complex magnetic structures, spin-reorientation and associated exchange interactions have been investigated in Er$_{0.5}$Dy$_{0.5}$FeO$_3$ using bulk magnetization, neutron diffraction, specific heat measurements and density functional theory calculations. The Fe$^{3+}$ spins order in a G-type antiferromagnetic structure described by the $\Gamma_{4}$($G_{x}$,$A_{y}$,$F_{z}$) irreducible representation below 700 K, similar to its end compounds. The bulk magnetization data indicate the occurrence of the spin-reorientation and rare-earth magnetic ordering below $\sim$75 K and 10 K, respectively. The neutron diffraction studies confirm an "incomplete" $\Gamma_{4}$${\rightarrow}$ $\Gamma_{2}$($F_{x}$,$C_{y}$,$G_{z}$) spin-reorientation initiated $\leq$75 K. Although the relative volume fraction of the two magnetic structures varies with decreasing temperature, both co-exist even at 1.5 K. At 8 K, ordering of the Er$^{3+}$/Dy$^{3+}$ moments in a $c_{y}^R$ arrangement develops, which gradually increases in intensity with decreasing temperature. At 2 K, a magnetic structure associated with the $c_{z}^R$ arrangement of Er$^{3+}$/Dy$^{3+}$ moments also appears. At 1.5 K the magnetic structure of Fe$^{3+}$ spins is represented by a combination of $\Gamma_{2}$+$\Gamma_{4}$+$\Gamma_{1}$, while the rare earth moments coexist as $c_{y}^R$ and $c_{z}^R$, corresponding to the $\Gamma_{2}$ and $\Gamma_{1}$ representations, respectively. The observed Schottky anomaly at 2.5 K suggests that the "rare-earth ordering" is induced by polarization due to Fe$^{3+}$ spins. The Er$^{3+}$-Fe$^{3+}$ and Er$^{3+}$-Dy$^{3+}$ exchange interactions, obtained from first principles calculations, primarily cause the complicated spin-reorientation and $c_{y}^R$ rare-earth ordering, respectively, while the dipolar interactions between rare-earth moments result in the $c_{z}^R$ type rare-earth ordering at 2 K.

Submitted 4 October, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

Comments: 15 pages, 11 figures

arXiv:2108.02830 [pdf]

doi 10.1145/3414524

Hate Speech Detection in Roman Urdu

Authors: Moin Khan, Khurram Shahzad, Kamran Malik

Abstract: Hate speech is a specific type of controversial content that is widely legislated as a crime that must be identified and blocked. However, due to the sheer volume and velocity of the Twitter data stream, hate speech detection cannot be performed manually. To address this issue, several studies have been conducted for hate speech detection in European languages, whereas little attention has been paid to low-resource South Asian languages, making the social media vulnerable for millions of users. In particular, to the best of our knowledge, no study has been conducted for hate speech detection in Roman Urdu text, which is widely used in the sub-continent. In this study, we have scraped more than 90,000 tweets and manually parsed them to identify 5,000 Roman Urdu tweets. Subsequently, we have employed an iterative approach to develop guidelines and used them for generating the Hate Speech Roman Urdu 2020 corpus. The tweets in this corpus are classified at three levels: Neutral-Hostile, Simple-Complex, and Offensive-Hate speech. As another contribution, we have used five supervised learning techniques, including a deep learning technique, to evaluate and compare their effectiveness for hate speech detection. The results show that Logistic Regression outperformed all other techniques, including deep learning techniques, for the two levels of classification, achieving an F1 score of 0.906 for distinguishing between Neutral-Hostile tweets, and 0.756 for distinguishing between Offensive-Hate speech tweets.

Submitted 5 August, 2021; originally announced August 2021.

Comments: This is a pre-print of a contribution published in ACM Transactions on Asian and Low-Resource Language Information Processing. The final version is available online at the given journal link

Journal ref: ACM Transactions on Asian and Low Resource Language Information Processing; Volume 20; Issue 1; April 2021; Article Number 9; pp 1 to 19

arXiv:2108.00525 [pdf]

Sb concentration dependent Structural and Transport properties of Polycrystalline (Bi1-xSbx)2Te3 Mixed crystal

Authors: K. Malik, S. Mahakal, Diptasikha Das, Aritra Banerjee, S. Chatterjee, Anusree Das

Abstract: (Bi1-xSbx)2Te3 (x=0.60, 0.65, 0.68, 0.70, 0.75 and 0.80) mixed crystals have been synthesized by solid state reaction. In-depth structural, thermal, transport and electronic properties are reported. Defect and disorder play a crucial role in structural and transport behaviour. A disorder-induced iso-structural phase transition is observed at x=0.70, which is supported by the structural and transport properties data. The Debye temperature has been estimated from the powder diffraction data. Differential scanning calorimetry (DSC) data confirm the glass transition in the material. Low temperature resistivity data show a variable range hopping mechanism, whereas high temperature data follow activated behaviour. The activation energy is calculated from the semiconducting region of the resistivity data. Both Hall measurement and temperature dependent thermopower data (S(T)) confirm that the samples are p-type in nature. The density of states effective mass has been estimated from the Pisarenko relation and corroborated with resistivity data. Thermal conductivity (k) is estimated using experimentally obtained data. The figure of merit (ZT) of the synthesized samples is calculated using resistivity, S(T) and k. Structural and transport properties are correlated, confirming the transition from a disordered to an ordered state. Defect and disorder are corroborated with structural and thermoelectric properties of the synthesized samples.

Submitted 1 August, 2021; originally announced August 2021.

arXiv:2107.10815 [pdf, other]

doi 10.1088/1475-7516/2021/12/025

Contributions from primordial non-Gaussianity and General Relativity to the galaxy power spectrum

Authors: Rebeca Martinez-Carrillo, Juan Carlos Hidalgo, Karim A. Malik, Alkistis Pourtsidou

Abstract: We compute the real space galaxy power spectrum, including the leading order effects of General Relativity and primordial non-Gaussianity from the $f_{\mathrm{NL}}$ and $g_{\mathrm{NL}}$ parameters. Such contributions come from the one-loop matter power spectrum terms dominant at large scales, and from the factors of the non-linear bias parameter $b_{\mathrm{NL}}$ (akin to the Newtonian $b_\phi$). We assess the detectability of these contributions in Stage-IV surveys. In particular, we note that specific values of the bias parameter may erase the primordial and relativistic contributions to the configuration space power spectrum.

Submitted 30 November, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

Comments: Version accepted for publication in JCAP

Journal ref: JCAP12(2021)025

arXiv:2107.05731 [pdf]

Detecting Ideal Instagram Influencer Using Social Network Analysis

Authors: M. M. H Dihyat, K Malik, M. A Khan, B Imran

Abstract: Social media is a key aspect of modern society where people share their thoughts, views, feelings and sentiments. Over the last few years, the growth in popularity of social media has resulted in a monumental increase in data. Users use this medium to express their thoughts, feelings, and opinions on a wide variety of subjects, including politics and celebrities. Social media has thus evolved into a lucrative platform for companies to expand their scope and improve their prospects. The paper focuses on social network analysis (SNA) for a real-world online marketing strategy. The study contributes by comparing various centrality measures to identify the most central nodes in the network and uses a linear threshold model to understand the spreading behaviour of individual users. In conclusion, the paper correlates different centrality measures and spreading behaviour to identify the most influential user in the network.

Submitted 12 July, 2021; originally announced July 2021.

arXiv:2107.05244 [pdf]

doi 10.1093/mnras/stab1959

Differential rotation of the solar transition region from STEREO/EUVI 30.4 nm images

Authors: Jaidev Sharma, Brajesh Kumar, Anil K Malik, Hari Om Vats

Abstract: The solar photosphere, chromosphere and corona are known to rotate differentially as a function of latitude. To date, it is unclear if the solar transition region also rotates differentially. In this paper, we investigate the differential rotation profile of the solar transition region as a function of latitude, using solar full disk (SFD) images at 30.4 nm wavelength recorded by the Extreme Ultraviolet Imager (EUVI) onboard the Solar Terrestrial Relations Observatory (STEREO) space mission for the period from 2008 to 2018 (Solar Cycle 24). Our investigations show that the solar transition region rotates differentially. The sidereal rotation rate obtained at the +/- 5 degree equatorial band is quite high (~ 14.7 degree/day), which drops to ~ 13.6 degree/day towards both polar regions. We also find that the rotational differentiality is low during the period of high solar activity (rotation rate varies from 14.86 to 14.27 degree/day) while it increases during the ascending and the descending phases of the 24th solar cycle (rotation rate varies from 14.56 to 13.56 degree/day in 2008 and 14.6 to 13.1 degree/day in 2018). The average sidereal rotation rate (over SFD) follows the trend of solar activity (maximum ~ 14.97 degree/day during the peak phase of the solar activity, which slowly decreases to minimum ~ 13.9 degree/day during the ascending and the descending phases of the 24th solar cycle). We also observe that the solar transition region rotates less differentially than the corona.

Submitted 12 July, 2021; originally announced July 2021.

Comments: 8 pages, 6 figures, 2 tables, Accepted for publication in MNRAS

arXiv:2106.06639 [pdf, other]

Federated Learning with Buffered Asynchronous Aggregation

Authors: John Nguyen, Kshitiz Malik, Hongyuan Zhan, Ashkan Yousefpour, Michael Rabbat, Mani Malek, Dzmitry Huba

Abstract: Scalability and privacy are two critical concerns for cross-device federated learning (FL) systems. In this work, we identify that synchronous FL - synchronized aggregation of client updates in FL - cannot scale efficiently beyond a few hundred clients training in parallel. It leads to diminishing returns in model performance and training speed, analogous to large-batch training. On the other hand, asynchronous aggregation of client updates in FL (i.e., asynchronous FL) alleviates the scalability issue. However, aggregating individual client updates is incompatible with Secure Aggregation, which could result in an undesirable level of privacy for the system. To address these concerns, we propose a novel buffered asynchronous aggregation method, FedBuff, that is agnostic to the choice of optimizer, and combines the best properties of synchronous and asynchronous FL. We empirically demonstrate that FedBuff is 3.3x more efficient than synchronous FL and up to 2.5x more efficient than asynchronous FL, while being compatible with privacy-preserving technologies such as Secure Aggregation and differential privacy. We provide theoretical convergence guarantees in a smooth non-convex setting. Finally, we show that under differentially private training, FedBuff can outperform FedAvgM at low privacy settings and achieve the same utility for higher privacy settings.

Submitted 7 March, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

Comments: Accepted at AISTATS 2022. Previously accepted at FL-ICML 2021

arXiv:2104.10204 [pdf, other]

doi 10.1088/1475-7516/2021/08/046

The intrinsic bispectrum of the CMB from isocurvature initial conditions

Authors: Pedro Carrilho, Karim A. Malik

Abstract: Non-linear effects in the early Universe generate non-zero bispectra of the cosmic microwave background (CMB) temperature and polarization, even in the absence of primordial non-Gaussianity. In this paper, we compute the contributions from isocurvature modes to the CMB bispectra using a modified version of the second-order Boltzmann solver SONG. We investigate the ability of current and future CMB experiments to constrain these modes with observations of the bispectrum. Our results show that the enhancement due to single isocurvature modes mixed with the adiabatic mode is negligible for the parameter ranges currently allowed by the most recent Planck results. However, we find that a large compensated isocurvature mode can produce a detectable bispectrum when its correlation with the adiabatic mode is appreciable. The non-observation of this contribution in searches for the lensing bispectrum from Planck allows us to place a new constraint on the relative amplitude of the correlated part of the compensated isocurvature mode of $f_{\rm CIP}=1\pm100$. We compute forecasts for future observations by COrE, SO, CMB-S4 and an ideal experiment and conclude that a dedicated search for the bispectrum from compensated modes could rule out a number of scenarios realised in the curvaton model. In addition, the CMB-S4 experiment could detect the most extreme of those scenarios ($f_{\rm CIP}=16.5$) at 2 to 3-$\sigma$ significance.

Submitted 8 September, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

Comments: 22 pages, 13 figures. V2: Additional clarifications and details. Matches version published in JCAP

Journal ref: JCAP 08 (2021) 046

arXiv:2104.02395 [pdf, other]

doi 10.1016/j.engappai.2022.105151

Ensemble deep learning: A review

Authors: M. A. Ganaie, Minghui Hu, A. K. Malik, M. Tanveer, P. N. Suganthan

Abstract: Ensemble learning combines several individual models to obtain better generalization performance. Currently, deep learning architectures are showing better performance compared to the shallow or traditional models. Deep ensemble learning models combine the advantages of both deep learning models and ensemble learning, such that the final model has better generalization performance. This paper reviews the state-of-the-art deep ensemble models and hence serves as an extensive summary for researchers. The ensemble models are broadly categorised into bagging, boosting, stacking, negative correlation based deep ensemble models, explicit/implicit ensembles, homogeneous/heterogeneous ensembles, and decision fusion strategies based deep ensemble models. Applications of deep ensemble models in different domains are also briefly discussed. Finally, we conclude this paper with some potential future research directions.

Submitted 8 August, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

Journal ref: Engineering Applications of Artificial Intelligence, 2022

arXiv:2103.00484 [pdf]

Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward

Authors: Momina Masood, Marriam Nawaz, Khalid Mahmood Malik, Ali Javed, Aun Irtaza

Abstract: Easy access to audio-visual content on social media, combined with the availability of modern tools such as Tensorflow or Keras, open-source trained models, and economical computing infrastructure, and the rapid evolution of deep-learning (DL) methods, especially Generative Adversarial Networks (GAN), have made it possible to generate deepfakes to disseminate disinformation, revenge porn, financial frauds, hoaxes, and to disrupt government functioning. The existing surveys have mainly focused on the detection of deepfake images and videos. This paper provides a comprehensive review and detailed analysis of existing tools and machine learning (ML) based approaches for deepfake generation and the methodologies used to detect such manipulations for both audio and visual deepfakes. For each category of deepfake, we discuss information related to manipulation approaches, current public datasets, and key standards for the performance evaluation of deepfake detection techniques along with their results. Additionally, we also discuss open challenges and enumerate future directions to guide future researchers on issues that need to be considered to improve the domains of both deepfake generation and detection. This work is expected to assist the readers in understanding the creation and detection mechanisms of deepfakes, along with their current limitations and future direction.

Submitted 22 November, 2021; v1 submitted 25 February, 2021; originally announced March 2021.

arXiv:2012.15326 [pdf, ps, other]

doi 10.1088/1361-6382/ac1be6

Galaxy number counts at second order in perturbation theory: a leading-order term comparison

Authors: Jorge L. Fuentes, Juan Carlos Hidalgo, Karim A. Malik

Abstract: The galaxy number density is a key quantity to compare theoretical predictions to the observational data from current and future Large Scale Structure surveys. The precision demanded by these Stage IV surveys requires the use of second order cosmological perturbation theory. Based on the independent calculation published previously, we present the result of the comparison with the results of three other groups at leading order. Overall we find that the differences between the different approaches lie mostly in the definition of certain quantities, where the ambiguity of signs results in the addition of extra terms at second order in perturbation theory.

Submitted 8 August, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

Comments: V2: 10 pages. Version accepted for publication in CQG

arXiv:2011.05840 [pdf, other]

Selling two complementary goods

Authors: Komal Malik, Kolagani Paramahamsa

Abstract: A seller is selling a pair of divisible complementary goods to an agent. The agent consumes the goods only in a specific ratio and freely disposes of excess of either good. The value of the bundle and the ratio are private information of the agent. In this two-dimensional type space model, we characterize the incentive constraints and show that the optimal (expected revenue-maximizing) mechanism is a ratio-dependent posted price or a posted price mechanism for a class of distributions. We also show that the optimal mechanism is a posted price mechanism when the value and the ratio are independently distributed.

Submitted 14 July, 2022; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: Minor revisions

arXiv:2010.16408 [pdf]

Sentiment Analysis for Roman Urdu Text over Social Media, a Comparative Study

Authors: Irfan Qutab, Khawar Iqbal Malik, Hira Arooj

Abstract: In the present century, data volume is increasing enormously. The data could be in the form of images, text, voice, and video. One factor in this huge growth of data is the usage of social media, where everyone posts data on a daily basis while chatting, exchanging information, and uploading their personal and official credentials. Sentiment research seeks to uncover abstract knowledge in published texts in which users communicate their emotions and thoughts about shared content, including blogs, news and social networks. Roman Urdu is one of the most dominant languages on social networks in Pakistan and India. Roman Urdu is among the varieties of Urdu, the world's third largest language, yet sufficient work has not been done in this language. In this article we address the prior concepts and strategies used to examine the sentiment of Roman Urdu text and report their results as well.

Submitted 5 October, 2020; originally announced October 2020.

Comments: 8 Pages, 12 Figures. International Journal of Computer Science and Network - 2020

arXiv:2009.14029 [pdf]

doi 10.1111/maps.13540

Mineralogy, chemistry and composition of organic compounds in the fresh carbonaceous chondrite Mukundpura: CM1 or CM2?

Authors: S. Potin, P. Beck, L. Bonal, B. Schmitt, A. Garenne, F. Moynier, A. Agranier, P. Schmitt-Kopplin, A. K. Malik, E. Quirico

Abstract: We present here several laboratory analyses performed on the freshly fallen Mukundpura CM chondrite. Results of infrared transmission spectroscopy, thermogravimetry analysis and reflectance spectroscopy show that Mukundpura is mainly composed of phyllosilicates. The rare earth trace elements composition and ultrahigh resolution mass spectrometry of the soluble organic matter (SOM) give results consistent with CM chondrites. Finally, Raman spectroscopy shows no signs of thermal alteration of the meteorite. All the results agree that Mukundpura has been strongly altered by water on its parent body. Comparison of the results obtained on the meteorite with those of other chondrites of known petrologic types leads to the conclusion that Mukundpura is similar to CM1 chondrites, which differs from its original classification as a CM2.

Submitted 29 September, 2020; originally announced September 2020.

Showing 1–50 of 211 results for author: Malik, K