-
Correlation of Magnetic State Configurations in Nanotubes with FMR spectrum
Authors:
Abhishek Kumar,
Chirag Kalouni,
Raghvendra Posti,
Vivek K Malik,
Dhananjay Tiwari,
Debangsu Roy
Abstract:
Magnetic nanotubes have garnered immense attention for their potential in high-density magnetic memory, owing to their stable flux closure configuration and fast, reproducible reversal processes. However, characterizing their magnetic configuration through straightforward methodologies remains a challenge in both scope and detail. Here, we elucidate the magnetic state details using Remanence Field…
▽ More
Magnetic nanotubes have garnered immense attention for their potential in high-density magnetic memory, owing to their stable flux closure configuration and fast, reproducible reversal processes. However, characterizing their magnetic configuration through straightforward methodologies remains a challenge in both scope and detail. Here, we elucidate the magnetic state details using Remanence Field Ferromagnetic Resonance Spectroscopy (RFMR) for arrays of electrodeposited nanotubes. Micromagnetic simulations revealed distinct spin configurations while coming from saturation, including the edge vortex, onion, uniform and curling states, with chirality variations depending on the preparation field direction. Dynamic measurements, coupled with RFMR spectra analysis, unveiled multiple FMR modes corresponding to these spin configurations. The evolution of spin configurations under bias fields were studied, indicating nucleation within the curling state. Observations revealed opposite RFMR spectra, denoting opposite magnetic spin configurations after removing the positive and negative saturating fields when the magnetic field was applied along {theta_H=0} and perpendicular {theta_H= 90} to the nanotube axis. We observed a mixture of the non-uniform curling states with the end vortex state (onion-like curling state) at the end of the nanotubes for the theta_H=0(90) and uniform magnetization states in the middle of the nanotubes for the theta_H=0 configuration. Building on RFMR information, frequency-swept FMR absorption spectra obtained at different bias fields allowed the characterization of magnetization states. This picture was supported by micromagnetic simulations. These findings were further substantiated with First Order Reversal Curve measurements (FORC).
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
The Llama 3 Herd of Models
Authors:
Abhimanyu Dubey,
Abhinav Jauhri,
Abhinav Pandey,
Abhishek Kadian,
Ahmad Al-Dahle,
Aiesha Letman,
Akhil Mathur,
Alan Schelten,
Amy Yang,
Angela Fan,
Anirudh Goyal,
Anthony Hartshorn,
Aobo Yang,
Archi Mitra,
Archie Sravankumar,
Artem Korenev,
Arthur Hinsvark,
Arun Rao,
Aston Zhang,
Aurelien Rodriguez,
Austen Gregerson,
Ava Spataru,
Baptiste Roziere,
Bethany Biron,
Binh Tang
, et al. (510 additional authors not shown)
Abstract:
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical…
▽ More
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.
△ Less
Submitted 15 August, 2024; v1 submitted 31 July, 2024;
originally announced July 2024.
-
Detecting the Stochastic Gravitational Wave Background from Primordial Black Holes in Slow-reheating Scenarios
Authors:
Luis E. Padilla,
Juan Carlos Hidalgo,
Karim A. Malik,
David Mulryne
Abstract:
After primordial inflation, the universe may have experienced a prolonged reheating epoch, potentially leading to a phase of matter domination supported by the oscillating inflaton field. During such an epoch, perturbations in the inflaton virialize upon reentering the cosmological horizon, forming inflaton structures. If the primordial overdensities are sufficiently large, these structures collap…
▽ More
After primordial inflation, the universe may have experienced a prolonged reheating epoch, potentially leading to a phase of matter domination supported by the oscillating inflaton field. During such an epoch, perturbations in the inflaton virialize upon reentering the cosmological horizon, forming inflaton structures. If the primordial overdensities are sufficiently large, these structures collapse to form primordial black holes (PBHs). To occur at a significant rate, this process requires an enhanced primordial power spectrum (PPS) at small scales. The enhancement of the PPS, as well as the formation and tidal interaction of the primordial structures, will in turn source a stochastic gravitational wave background(SGWB) that could be detected by current and/or future gravitational wave detectors. In this paper, we study the SGWB arising from these different sources during slow-reheating, focusing on a PPS that satisfies the requirements necessary for the formation of PBHs with a mass of $M_{\rm PBH}\simeq 10^{21}$ and that constitute the entirety of dark matter in the universe.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Drastic modification in thermal conductivity of TiCoSb Half-Heusler alloy: Phonon engineering by lattice softening and ionic polarization
Authors:
S. Mahakal,
Avijit Jana,
Diptasikha Das,
Nabakumar Rana,
Pallabi Sardar,
Aritra Banerjee,
Shamima Hussain,
Santanu K. Maiti,
K. Malik
Abstract:
A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron mi…
▽ More
A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron microscopy data. Local structures of the synthesized samples are explored for the first time by X-ray absorption spectroscopy measurements for TiCoSb system and corroborated with Rietveld refinement data. Lattice dynamics are revealed using Raman Spectroscopy (RS) measurements in unprecedented attempts for TiCoSb system. XRD and RS data accomplishes that variation in \k{appa} as a function of Sb concentration is observed owing to an alteration in phonon group velocity related to lattice softening. Polar nature of TiCoSb HH sample is revealed. LO-TO splitting (related to polar optical phonon scattering) in phonon vibration is observed due to polar nature of TiCoSb synthesized samples. Tailoring in LO-TO splitting due to screening effect, correlated with Co vacancies is reported for TiCoSb1+x synthesized samples. Lattice softening and LO-TO splitting lead to decreases in \k{appa}~47% for TiCoSb1.02 synthesized sample.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Primordial black hole formation during slow-reheating: A review
Authors:
Luis E. Padilla,
Juan Carlos Hidalgo,
Tadeo D. Gomez-Aguilar,
Karim A. Malik,
Gabriel German
Abstract:
In this paper we review the possible mechanisms for the production of primordial black holes (PBHs) during a slow-reheating period {in which the energy transfer of the inflaton field to standard model particles becomes effective at slow temperatures}, offering a comprehensive examination of the theoretical foundations and conditions required for each of formation channel. In particular, we focus o…
▽ More
In this paper we review the possible mechanisms for the production of primordial black holes (PBHs) during a slow-reheating period {in which the energy transfer of the inflaton field to standard model particles becomes effective at slow temperatures}, offering a comprehensive examination of the theoretical foundations and conditions required for each of formation channel. In particular, we focus on post-inflationary scenarios where there are no self-resonances and the reheating epoch can be described {by the inflaton evolving in} a quadratic-like potential. In the hydrodynamical interpretation of this field during the slow-reheating epoch, the gravitational collapse of primordial fluctuations is subject to conditions on their sphericity, limits on their spin, as well as a maximum velocity dispersion. We show how to account for all conditions and show that PBHs form with different masses depending on the collapse mechanism. Finally we show, through an example, how PBH production serves to probe both the physics after primordial inflation, as well as the primordial powerspectrum at the smallest scales.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Detection of a facemask in real-time using deep learning methods: Prevention of Covid 19
Authors:
Gautam Siddharth Kashyap,
Jatin Sohlot,
Ayesha Siddiqui,
Ramsha Siddiqui,
Karan Malik,
Samar Wazir,
Alexander E. I. Brownlee
Abstract:
A health crisis is raging all over the world with the rapid transmission of the novel-coronavirus disease (Covid-19). Out of the guidelines issued by the World Health Organisation (WHO) to protect us against Covid-19, wearing a facemask is the most effective. Many countries have necessitated the wearing of face masks, but monitoring a large number of people to ensure that they are wearing masks in…
▽ More
A health crisis is raging all over the world with the rapid transmission of the novel-coronavirus disease (Covid-19). Out of the guidelines issued by the World Health Organisation (WHO) to protect us against Covid-19, wearing a facemask is the most effective. Many countries have necessitated the wearing of face masks, but monitoring a large number of people to ensure that they are wearing masks in a crowded place is a challenging task in itself. The novel-coronavirus disease (Covid-19) has already affected our day-to-day life as well as world trade movements. By the end of April 2021, the world has recorded 144,358,956 confirmed cases of novel-coronavirus disease (Covid-19) including 3,066,113 deaths according to the world health organization (WHO). These increasing numbers motivate automated techniques for the detection of a facemask in real-time scenarios for the prevention of Covid-19. We propose a technique using deep learning that works for single and multiple people in a frame recorded via webcam in still or in motion. We have also experimented with our approach in night light. The accuracy of our model is good compared to the other approaches in the literature; ranging from 74% for multiple people in a nightlight to 99% for a single person in daylight.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
The WTP-WTA Gap for Public Goods: New Insights from Compensating and Equivalent Variation Closed-Form Solutions
Authors:
Daniel H. Karney,
Khyati Malik
Abstract:
This study finds exact closed-form solutions for compensating variation (CV) and equivalent variation (EV) for both marginal and non-marginal changes in public goods given homothetic utility. The parameters for these solutions are recoverable from observable data in empirical applications as a single sufficient statistic summarizes consumer preferences. The closed-form CV and EV expressions identi…
▽ More
This study finds exact closed-form solutions for compensating variation (CV) and equivalent variation (EV) for both marginal and non-marginal changes in public goods given homothetic utility. The parameters for these solutions are recoverable from observable data in empirical applications as a single sufficient statistic summarizes consumer preferences. The closed-form CV and EV expressions identify three economic mechanisms that determine the magnitudes of CV and EV. One of these mechanisms, the relative preference effect, helps explain the disparity between willingness to pay (WTP) and willingness to accept (WTA) for public goods.
△ Less
Submitted 18 July, 2024; v1 submitted 27 January, 2024;
originally announced January 2024.
-
Dynamics of Global Emission Permit Prices and Regional Social Cost of Carbon under Noncooperation
Authors:
Yongyang Cai,
Khyati Malik,
Hyeseon Shin
Abstract:
We build a dynamic multi-region model of climate and economy with emission permit trading among 12 aggregated regions in the world. We solve for the dynamic Nash equilibrium under noncooperation, wherein each region adheres to the emission cap constraints following commitments that were first outlined in the 2015 Paris Agreement and updated in subsequent years. Our model shows that the emission pe…
▽ More
We build a dynamic multi-region model of climate and economy with emission permit trading among 12 aggregated regions in the world. We solve for the dynamic Nash equilibrium under noncooperation, wherein each region adheres to the emission cap constraints following commitments that were first outlined in the 2015 Paris Agreement and updated in subsequent years. Our model shows that the emission permit price reaches $811 per ton of carbon by 2050. We demonstrate that a regional carbon tax is complementary to the global cap-and-trade system, and the optimal regional carbon tax is equal to the difference between the regional marginal abatement cost and the permit price.
△ Less
Submitted 13 April, 2024; v1 submitted 24 December, 2023;
originally announced December 2023.
-
Induced gravitational waves: the effect of first order tensor perturbations
Authors:
Raphael Picard,
Karim A. Malik
Abstract:
Scalar induced gravitational waves contribute to the cosmological gravitational wave background. They can be related to the primordial density power spectrum produced towards the end of inflation and therefore are a convenient new tool to constrain models of inflation. These waves are sourced by terms quadratic in perturbations and hence appear at second order in cosmological perturbation theory.…
▽ More
Scalar induced gravitational waves contribute to the cosmological gravitational wave background. They can be related to the primordial density power spectrum produced towards the end of inflation and therefore are a convenient new tool to constrain models of inflation. These waves are sourced by terms quadratic in perturbations and hence appear at second order in cosmological perturbation theory. While the focus of research so far was on purely scalar source terms we also study the effect of including first order tensor perturbations as an additional source. This gives rise to two additional source terms: a term quadratic in the tensor perturbations and a cross term involving mixed scalar and tensor perturbations. We present full analytical expressions for the spectral density of these new source terms and discuss their general behaviour. To illustrate the generation mechanism we study two toy models containing a peak on small scales. For these models we show that the scalar-tensor contribution becomes non-negligible compared to the scalar-scalar contribution on smaller scales. We also consider implications for future gravitational wave surveys.
△ Less
Submitted 13 December, 2023; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Posterior-Mean Separable Costs of Information Acquisition
Authors:
Jeffrey Mensch,
Komal Malik
Abstract:
We analyze a problem of revealed preference given state-dependent stochastic choice data in which the payoff to a decision maker (DM) only depends on their beliefs about posterior means. Often, the DM must also learn about or pay attention to the state; in applied work on this subject, a convenient assumption is that the costs of such learning are linearly dependent in the distribution over poster…
▽ More
We analyze a problem of revealed preference given state-dependent stochastic choice data in which the payoff to a decision maker (DM) only depends on their beliefs about posterior means. Often, the DM must also learn about or pay attention to the state; in applied work on this subject, a convenient assumption is that the costs of such learning are linearly dependent in the distribution over posterior means. We provide testable conditions to identify whether this assumption holds. This allows for the use of information design techniques to solve the DM's problem.
△ Less
Submitted 11 December, 2023; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Transport and electrical properties of cryogenic thermoelectric FeSb2: the effect of isoelectronic and hole doping
Authors:
Deepak Gujjar,
Sunidhi Gujjar,
V. K. Malik,
Hem C. Kandpal
Abstract:
Thermoelectric materials operating at cryogenic temperatures are in high demand for efficient cooling and power generation in applications ranging from superconductors to quantum computing. The narrow band-gap semiconductor FeSb2, known for its colossal Seebeck coefficient, holds promise for such applications, provided its thermal conductivity value can be reduced. This study investigates the impa…
▽ More
Thermoelectric materials operating at cryogenic temperatures are in high demand for efficient cooling and power generation in applications ranging from superconductors to quantum computing. The narrow band-gap semiconductor FeSb2, known for its colossal Seebeck coefficient, holds promise for such applications, provided its thermal conductivity value can be reduced. This study investigates the impact of isoelectronic substitution (Bi) and hole doping (Pb) at the Sb site on the transport properties of FeSb2, with a particular focus on thermal conductivity (\k{appa}). Polycrystalline FeSb2 powder, along with Bi- and Pb-doped samples, were synthesized using a simple co-precipitation approach, followed by thermal treatment in an H2 atmosphere. XRD and SEM analysis confirms the formation of the desired phase pre- and post-consolidation using spark plasma sintering (SPS). The consolidation process resulted in a high compaction density and the formation of submicrometer-sized grains, as substantiated by electron backscattered diffraction (EBSD) analysis. Substituting 1% of Bi and Pb at the Sb site successfully suppressed the thermal conductivity (\k{appa}) from ~15 W/m-K in pure FeSb2 to ~10 and ~8.7 W/m-K, respectively. Importantly, resistivity measurements revealed a metal-to-insulator transition at around 6.5 K in undoped FeSb2 and isoelectronically Bi-substituted FeSb2, suggesting the existence of metallic surface states and provides valuable evidence for the perplexing topological behavior exhibited by FeSb2.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Securing Voice Biometrics: One-Shot Learning Approach for Audio Deepfake Detection
Authors:
Awais Khan,
Khalid Mahmood Malik
Abstract:
The Automatic Speaker Verification (ASV) system is vulnerable to fraudulent activities using audio deepfakes, also known as logical-access voice spoofing attacks. These deepfakes pose a concerning threat to voice biometrics due to recent advancements in generative AI and speech synthesis technologies. While several deep learning models for speech synthesis detection have been developed, most of th…
▽ More
The Automatic Speaker Verification (ASV) system is vulnerable to fraudulent activities using audio deepfakes, also known as logical-access voice spoofing attacks. These deepfakes pose a concerning threat to voice biometrics due to recent advancements in generative AI and speech synthesis technologies. While several deep learning models for speech synthesis detection have been developed, most of them show poor generalizability, especially when the attacks have different statistical distributions from the ones seen. Therefore, this paper presents Quick-SpoofNet, an approach for detecting both seen and unseen synthetic attacks in the ASV system using one-shot learning and metric learning techniques. By using the effective spectral feature set, the proposed method extracts compact and representative temporal embeddings from the voice samples and utilizes metric learning and triplet loss to assess the similarity index and distinguish different embeddings. The system effectively clusters similar speech embeddings, classifying bona fide speeches as the target class and identifying other clusters as spoofing attacks. The proposed system is evaluated using the ASVspoof 2019 logical access (LA) dataset and tested against unseen deepfake attacks from the ASVspoof 2021 dataset. Additionally, its generalization ability towards unseen bona fide speech is assessed using speech data from the VSDC dataset.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Effective Long-Context Scaling of Foundation Models
Authors:
Wenhan Xiong,
Jingyu Liu,
Igor Molybog,
Hejia Zhang,
Prajjwal Bhargava,
Rui Hou,
Louis Martin,
Rashi Rungta,
Karthik Abinav Sankararaman,
Barlas Oguz,
Madian Khabsa,
Han Fang,
Yashar Mehdad,
Sharan Narang,
Kshitiz Malik,
Angela Fan,
Shruti Bhosale,
Sergey Edunov,
Mike Lewis,
Sinong Wang,
Hao Ma
Abstract:
We present a series of long-context LLMs that support effective context windows of up to 32,768 tokens. Our model series are built through continual pretraining from Llama 2 with longer training sequences and on a dataset where long texts are upsampled. We perform extensive evaluation on language modeling, synthetic context probing tasks, and a wide range of research benchmarks. On research benchm…
▽ More
We present a series of long-context LLMs that support effective context windows of up to 32,768 tokens. Our model series are built through continual pretraining from Llama 2 with longer training sequences and on a dataset where long texts are upsampled. We perform extensive evaluation on language modeling, synthetic context probing tasks, and a wide range of research benchmarks. On research benchmarks, our models achieve consistent improvements on most regular tasks and significant improvements on long-context tasks over Llama 2. Notably, with a cost-effective instruction tuning procedure that does not require human-annotated long instruction data, the 70B variant can already surpass gpt-3.5-turbo-16k's overall performance on a suite of long-context tasks. Alongside these results, we provide an in-depth analysis on the individual components of our method. We delve into Llama's position encodings and discuss its limitation in modeling long dependencies. We also examine the impact of various design choices in the pretraining process, including the data mix and the training curriculum of sequence lengths -- our ablation experiments suggest that having abundant long texts in the pretrain dataset is not the key to achieving strong performance, and we empirically verify that long context continual pretraining is more efficient and similarly effective compared to pretraining from scratch with long sequences.
△ Less
Submitted 13 November, 2023; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Bridging the Spoof Gap: A Unified Parallel Aggregation Network for Voice Presentation Attacks
Authors:
Awais Khan,
Khalid Mahmood Malik
Abstract:
Automatic Speaker Verification (ASV) systems are increasingly used in voice bio-metrics for user authentication but are susceptible to logical and physical spoofing attacks, posing security risks. Existing research mainly tackles logical or physical attacks separately, leading to a gap in unified spoofing detection. Moreover, when existing systems attempt to handle both types of attacks, they ofte…
▽ More
Automatic Speaker Verification (ASV) systems are increasingly used in voice bio-metrics for user authentication but are susceptible to logical and physical spoofing attacks, posing security risks. Existing research mainly tackles logical or physical attacks separately, leading to a gap in unified spoofing detection. Moreover, when existing systems attempt to handle both types of attacks, they often exhibit significant disparities in the Equal Error Rate (EER). To bridge this gap, we present a Parallel Stacked Aggregation Network that processes raw audio. Our approach employs a split-transform-aggregation technique, dividing utterances into convolved representations, applying transformations, and aggregating the results to identify logical (LA) and physical (PA) spoofing attacks. Evaluation of the ASVspoof-2019 and VSDC datasets shows the effectiveness of the proposed system. It outperforms state-of-the-art solutions, displaying reduced EER disparities and superior performance in detecting spoofing attacks. This highlights the proposed method's generalizability and superiority. In a world increasingly reliant on voice-based security, our unified spoofing detection system provides a robust defense against a spectrum of voice spoofing attacks, safeguarding ASVs and user data effectively.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Frame-to-Utterance Convergence: A Spectra-Temporal Approach for Unified Spoofing Detection
Authors:
Awais Khan,
Khalid Mahmood Malik,
Shah Nawaz
Abstract:
Voice spoofing attacks pose a significant threat to automated speaker verification systems. Existing anti-spoofing methods often simulate specific attack types, such as synthetic or replay attacks. However, in real-world scenarios, the countermeasures are unaware of the generation schema of the attack, necessitating a unified solution. Current unified solutions struggle to detect spoofing artifact…
▽ More
Voice spoofing attacks pose a significant threat to automated speaker verification systems. Existing anti-spoofing methods often simulate specific attack types, such as synthetic or replay attacks. However, in real-world scenarios, the countermeasures are unaware of the generation schema of the attack, necessitating a unified solution. Current unified solutions struggle to detect spoofing artifacts, especially with recent spoofing mechanisms. For instance, the spoofing algorithms inject spectral or temporal anomalies, which are challenging to identify. To this end, we present a spectra-temporal fusion leveraging frame-level and utterance-level coefficients. We introduce a novel local spectral deviation coefficient (SDC) for frame-level inconsistencies and employ a bi-LSTM-based network for sequential temporal coefficients (STC), which capture utterance-level artifacts. Our spectra-temporal fusion strategy combines these coefficients, and an auto-encoder generates spectra-temporal deviated coefficients (STDC) to enhance robustness. Our proposed approach addresses multiple spoofing categories, including synthetic, replay, and partial deepfake attacks. Extensive evaluation on diverse datasets (ASVspoof2019, ASVspoof2021, VSDC, partial spoofs, and in-the-wild deepfakes) demonstrated its robustness for a wide range of voice applications.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
Normalized factorial moments of spatial distributions of particles in high multiplicity events: A Toy model study
Authors:
Sheetal Sharma,
Salman Khurshid Malik,
Zarina Banoo,
Ramni Gupta
Abstract:
In ultra-relativistic heavy-ion collisions a strongly interacting complex system of quarks and gluons is formed. The nature of the system so created and the mechanism of multi-particle production in these collisions may be revealed by studying the normalized factorial moments ($F_{\rm{q}}$) as function of various parameters. The resilience of $F_{\rm{q}}$ moments studied using Toy model events sho…
▽ More
In ultra-relativistic heavy-ion collisions a strongly interacting complex system of quarks and gluons is formed. The nature of the system so created and the mechanism of multi-particle production in these collisions may be revealed by studying the normalized factorial moments ($F_{\rm{q}}$) as function of various parameters. The resilience of $F_{\rm{q}}$ moments studied using Toy model events shows that these are sensitive to the presence of dynamical fluctuations in the system and are robust against the uniform efficiencies in the data measurements. Results of this study serve as a suitable reference baseline for the experimental and simulation studies.
△ Less
Submitted 16 October, 2023; v1 submitted 14 September, 2023;
originally announced September 2023.
-
Intuitionistic Fuzzy Broad Learning System: Enhancing Robustness Against Noise and Outliers
Authors:
M. Sajid,
A. K. Malik,
M. Tanveer
Abstract:
In the realm of data classification, broad learning system (BLS) has proven to be a potent tool that utilizes a layer-by-layer feed-forward neural network. However, the traditional BLS treats all samples as equally significant, which makes it less robust and less effective for real-world datasets with noises and outliers. To address this issue, we propose fuzzy broad learning system (F-BLS) and th…
▽ More
In the realm of data classification, broad learning system (BLS) has proven to be a potent tool that utilizes a layer-by-layer feed-forward neural network. However, the traditional BLS treats all samples as equally significant, which makes it less robust and less effective for real-world datasets with noises and outliers. To address this issue, we propose fuzzy broad learning system (F-BLS) and the intuitionistic fuzzy broad learning system (IF-BLS) models that confront challenges posed by the noise and outliers present in the dataset and enhance overall robustness. Employing a fuzzy membership technique, the proposed F-BLS model embeds sample neighborhood information based on the proximity of each class center within the inherent feature space of the BLS framework. Furthermore, the proposed IF-BLS model introduces intuitionistic fuzzy concepts encompassing membership, non-membership, and score value functions. IF-BLS strategically considers homogeneity and heterogeneity in sample neighborhoods in the kernel space. We evaluate the performance of proposed F-BLS and IF-BLS models on UCI benchmark datasets with and without Gaussian noise. As an application, we implement the proposed F-BLS and IF-BLS models to diagnose Alzheimer's disease (AD). Experimental findings and statistical analyses consistently highlight the superior generalization capabilities of the proposed F-BLS and IF-BLS models over baseline models across all scenarios. The proposed models offer a promising solution to enhance the BLS framework's ability to handle noise and outliers. The source code link of the proposed model is available at https://github.com/mtanveer1/IF-BLS.
△ Less
Submitted 11 May, 2024; v1 submitted 15 July, 2023;
originally announced July 2023.
-
Graph Embedded Intuitionistic Fuzzy Random Vector Functional Link Neural Network for Class Imbalance Learning
Authors:
M. A. Ganaie,
M. Sajid,
A. K. Malik,
M. Tanveer
Abstract:
The domain of machine learning is confronted with a crucial research area known as class imbalance learning, which presents considerable hurdles in precise classification of minority classes. This issue can result in biased models where the majority class takes precedence in the training process, leading to the underrepresentation of the minority class. The random vector functional link (RVFL) net…
▽ More
The domain of machine learning is confronted with a crucial research area known as class imbalance learning, which presents considerable hurdles in precise classification of minority classes. This issue can result in biased models where the majority class takes precedence in the training process, leading to the underrepresentation of the minority class. The random vector functional link (RVFL) network is a widely used and effective learning model for classification due to its good generalization performance and efficiency. However, it suffers when dealing with imbalanced datasets. To overcome this limitation, we propose a novel graph embedded intuitionistic fuzzy RVFL for class imbalance learning (GE-IFRVFL-CIL) model incorporating a weighting mechanism to handle imbalanced datasets. The proposed GE-IFRVFL-CIL model offers plethora of benefits: $(i)$ leveraging graph embedding to preserve the inherent topological structure of the datasets, $(ii)$ employing intuitionistic fuzzy theory to handle uncertainty and imprecision in the data, $(iii)$ and the most important, it tackles class imbalance learning. The amalgamation of a weighting scheme, graph embedding, and intuitionistic fuzzy sets leads to the superior performance of the proposed models on KEEL benchmark imbalanced datasets with and without Gaussian noise. Furthermore, we implemented the proposed GE-IFRVFL-CIL on the ADNI dataset and achieved promising results, demonstrating the model's effectiveness in real-world applications. The proposed GE-IFRVFL-CIL model offers a promising solution to address the class imbalance issue, mitigates the detrimental effect of noise and outliers, and preserves the inherent geometrical structures of the dataset.
△ Less
Submitted 16 February, 2024; v1 submitted 15 July, 2023;
originally announced July 2023.
-
Roulette-Wheel Selection-Based PSO Algorithm for Solving the Vehicle Routing Problem with Time Windows
Authors:
Gautam Siddharth Kashyap,
Alexander E. I. Brownlee,
Orchid Chetia Phukan,
Karan Malik,
Samar Wazir
Abstract:
The well-known Vehicle Routing Problem with Time Windows (VRPTW) aims to reduce the cost of moving goods between several destinations while accommodating constraints like set time windows for certain locations and vehicle capacity. Applications of the VRPTW problem in the real world include Supply Chain Management (SCM) and logistic dispatching, both of which are crucial to the economy and are exp…
▽ More
The well-known Vehicle Routing Problem with Time Windows (VRPTW) aims to reduce the cost of moving goods between several destinations while accommodating constraints like set time windows for certain locations and vehicle capacity. Applications of the VRPTW problem in the real world include Supply Chain Management (SCM) and logistic dispatching, both of which are crucial to the economy and are expanding quickly as work habits change. Therefore, to solve the VRPTW problem, metaheuristic algorithms i.e. Particle Swarm Optimization (PSO) have been found to work effectively, however, they can experience premature convergence. To lower the risk of PSO's premature convergence, the authors have solved VRPTW in this paper utilising a novel form of the PSO methodology that uses the Roulette Wheel Method (RWPSO). Computing experiments using the Solomon VRPTW benchmark datasets on the RWPSO demonstrate that RWPSO is competitive with other state-of-the-art algorithms from the literature. Also, comparisons with two cutting-edge algorithms from the literature show how competitive the suggested algorithm is.
△ Less
Submitted 4 June, 2023;
originally announced June 2023.
-
Transport phenomena of TiCoSb: Defects induced modification in structure and density of states
Authors:
S. Mahakal,
Diptasikha Das,
Pintu Singha,
Aritra Banerjee,
S. Chatterjee,
Santanu K. Maiti,
S. Assa Aravindh,
K. Malik
Abstract:
TiCoSb1+x (x=0.0, 0.01, 0.02, 0.03, 0.04, 0.06) samples have been synthesized, employing solid state reaction method followed by arc menting. Theoretical calculations, using Density Functional Theory (DFT) have been performed to estimate band structure and density of states (DOS). Further, energitic calculations, using first principle have been carried out to reveal the formation energy for vacanc…
▽ More
TiCoSb1+x (x=0.0, 0.01, 0.02, 0.03, 0.04, 0.06) samples have been synthesized, employing solid state reaction method followed by arc menting. Theoretical calculations, using Density Functional Theory (DFT) have been performed to estimate band structure and density of states (DOS). Further, energitic calculations, using first principle have been carried out to reveal the formation energy for vacancy, interstitial, anti-site defects. Detail structural calculation, employing Rietveld refinement reveals the presence of embedded phases, vacancy and interstitial atom, which is also supported by the theoretical calculations. Lattice strain, crystalline size and dislocation density have been estimated by Williamson-Hall and modified Williamson-Hall methods. Thermal variation of resistivity [\r{ho}(T)] and thermopower [S(T)] have been explained using Mott equation and density of states (DOS) modification near the Fermi surface due to Co vancancy and embedded phases. Figure of merit (ZT) has been calculated and 4 to 5 times higher ZT for TiCoSb than earlier reported value is obtained at room temperature.
△ Less
Submitted 24 May, 2023;
originally announced May 2023.
-
SACDNet: Towards Early Type 2 Diabetes Prediction with Uncertainty for Electronic Health Records
Authors:
Tayyab Nasir,
Muhammad Kamran Malik
Abstract:
Type 2 diabetes mellitus (T2DM) is one of the most common diseases and a leading cause of death. The problem of early diagnosis of T2DM is challenging and necessary to prevent serious complications. This study proposes a novel neural network architecture for early T2DM prediction using multi-headed self-attention and dense layers to extract features from historic diagnoses, patient vitals, and dem…
▽ More
Type 2 diabetes mellitus (T2DM) is one of the most common diseases and a leading cause of death. The problem of early diagnosis of T2DM is challenging and necessary to prevent serious complications. This study proposes a novel neural network architecture for early T2DM prediction using multi-headed self-attention and dense layers to extract features from historic diagnoses, patient vitals, and demographics. The proposed technique is called the Self-Attention for Comorbid Disease Net (SACDNet), achieving an accuracy of 89.3% and an F1-Score of 89.1%, having a 1.6% increased accuracy and 1.3% increased f1-score compared to the baseline techniques. Monte Carlo (MC) Dropout is applied to the SACDNet to get a bayesian approximation. A T2DM prediction framework based on the MC Dropout SACDNet is proposed to quantize the uncertainty associated with the predictions. A T2DM prediction dataset is also built as part of this study which is based on real-world routine Electronic Health Record (EHR) data comprising 4,124 diabetic and 181,767 non-diabetic examples, collected from 295 different EHR systems running in different parts of the United States of America. This dataset is further used to evaluate 7 different machine learning and 3 deep learning-based models. Finally, a detailed analysis of the fairness of every technique against different patient demographic groups is performed to validate the unbiased generalization of the techniques and the diversity of the data.
△ Less
Submitted 18 January, 2023; v1 submitted 12 January, 2023;
originally announced January 2023.
-
Optimal Robust Mechanism in Bilateral Trading
Authors:
Komal Malik
Abstract:
We consider a model of bilateral trade with private values. The value of the buyer and the cost of the seller are jointly distributed. The true joint distribution is unknown to the designer, however, the marginal distributions of the value and the cost are known to the designer. The designer wants to find a trading mechanism that is robustly Bayesian incentive compatible, robustly individually rat…
▽ More
We consider a model of bilateral trade with private values. The value of the buyer and the cost of the seller are jointly distributed. The true joint distribution is unknown to the designer, however, the marginal distributions of the value and the cost are known to the designer. The designer wants to find a trading mechanism that is robustly Bayesian incentive compatible, robustly individually rational, budget-balanced and maximizes the expected gains from trade over all such mechanisms. We refer to such a mechanism as an optimal robust mechanism. We establish equivalence between Bayesian incentive compatible mechanisms (BIC) and dominant strategy mechanisms (DSIC).
We characterise the worst distribution for a given mechanism and use this characterisation to find an optimal robust mechanism. We show that there is an optimal robust mechanism that is deterministic (posted-price), dominant strategy incentive compatible, and ex-post individually rational. We also derive an explicit expression of the posted-price of such an optimal robust mechanism. We also show the equivalence between the efficiency gains from the optimal robust mechanism (max-min problem) and guaranteed efficiency gains if the designer could choose the mechanism after observing the true joint distribution (min-max problem).
△ Less
Submitted 29 December, 2022;
originally announced December 2022.
-
Experimental observation of a confined bubble moving in shear-thinning fluids
Authors:
SungGyu Chun,
Bingqiang Ji,
Zhengyu Yang,
Vinit Kumar Malik,
Jie Feng
Abstract:
The motion of a long gas bubble in a confined capillary tube is ubiquitous in a wide range of engineering and biological applications. While the understanding of the deposited thin viscous film near the tube wall in Newtonian fluids is well developed, the deposition dynamics in commonly encountered non-Newtonian fluids remains much less studied. Here, we investigate the dynamics of a confined bubb…
▽ More
The motion of a long gas bubble in a confined capillary tube is ubiquitous in a wide range of engineering and biological applications. While the understanding of the deposited thin viscous film near the tube wall in Newtonian fluids is well developed, the deposition dynamics in commonly encountered non-Newtonian fluids remains much less studied. Here, we investigate the dynamics of a confined bubble moving in shear-thinning fluids with systematic experiments, varying the zero-shear-rate capillary number $Ca_0$ in the range of $O(10^{-3}-10^2)$ considering the zero-shear-rate viscosity. The thickness of the deposited liquid film, the bubble speed and the bubble front/rear menisci are measured, which are further rationalized with the recent theoretical studies based on appropriate rheological models. Compared with Newtonian fluids, the film thickness decreases for both the carboxymethyl cellulose and Carbopol solutions when the shear-thinning effect dominates. We show that the film thickness follows the scaling law from \citet{aussillous2000quick} with an effective capillary number $Ca_e$, considering the characteristic shear rate in the film as proposed by \citet{picchi2021motion}. $Ca_e$ is calculated by the Carreau number and the power-law index from the Carreau-Yasuda rheological model. The shear-thinning effect also influences the bubble speed and delays the transition to the parabolic region in the bubble front and rear menisci. In particular, a high degree of undulations on the bubble surface results in intricate rear viscosity distribution for the rear meniscus and the deviation between the experiments and theory may require a further investigation to resolve the axial velocity field. Our study may advance the fundamental understandings and engineering guidelines for coating processes involving thin-film flows and non-Newtonian fluids.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning
Authors:
John Nguyen,
Jianyu Wang,
Kshitiz Malik,
Maziar Sanjabi,
Michael Rabbat
Abstract:
An oft-cited challenge of federated learning is the presence of heterogeneity. \emph{Data heterogeneity} refers to the fact that data from different clients may follow very different distributions. \emph{System heterogeneity} refers to the fact that client devices have different system capabilities. A considerable number of federated optimization methods address this challenge. In the literature,…
▽ More
An oft-cited challenge of federated learning is the presence of heterogeneity. \emph{Data heterogeneity} refers to the fact that data from different clients may follow very different distributions. \emph{System heterogeneity} refers to the fact that client devices have different system capabilities. A considerable number of federated optimization methods address this challenge. In the literature, empirical evaluations usually start federated training from random initialization. However, in many practical applications of federated learning, the server has access to proxy data for the training task that can be used to pre-train a model before starting federated training. We empirically study the impact of starting from a pre-trained model in federated learning using four standard federated learning benchmark datasets. Unsurprisingly, starting from a pre-trained model reduces the training time required to reach a target error rate and enables the training of more accurate models (up to 40\%) than is possible when starting from random initialization. Surprisingly, we also find that starting federated learning from a pre-trained initialization reduces the effect of both data and system heterogeneity. We recommend that future work proposing and evaluating federated optimization methods evaluate the performance when starting from random and pre-trained initializations. We also believe this study raises several questions for further work on understanding the role of heterogeneity in federated optimization.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
Intermittency analysis of charged hadrons generated in Pb-Pb collisions at $\sqrt{s_{NN}}$= 2.76 TeV and 5.02 TeV using PYTHIA8/Angantyr
Authors:
Salman Khurshid Malik,
Ramni Gupta
Abstract:
Local density fluctuations are expected to scale as a universal power-law when the system approaches critical point. Such power-law fluctuations are studied within the framework of intermittency through the measurement of normalized factorial moments in ($η$, $φ$) phase space. Observations and results from the intermittency analysis performed for charged particles in Pb-Pb collisions using PYTHIA8…
▽ More
Local density fluctuations are expected to scale as a universal power-law when the system approaches critical point. Such power-law fluctuations are studied within the framework of intermittency through the measurement of normalized factorial moments in ($η$, $φ$) phase space. Observations and results from the intermittency analysis performed for charged particles in Pb-Pb collisions using PYTHIA8/Angantyr at 2.76 TeV and 5.02 TeV are reported. We observe no scaling behaviour in the particle generation for any of the centrality studied in narrow p$_T$ bins. The scaling exponent $ν$ shows no dependence on the centrality ranges.
△ Less
Submitted 27 November, 2022; v1 submitted 14 October, 2022;
originally announced October 2022.
-
Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward
Authors:
Awais Khan,
Khalid Mahmood Malik,
James Ryan,
Mikul Saravanan
Abstract:
Malicious actors may seek to use different voice-spoofing attacks to fool ASV systems and even use them for spreading misinformation. Various countermeasures have been proposed to detect these spoofing attacks. Due to the extensive work done on spoofing detection in automated speaker verification (ASV) systems in the last 6-7 years, there is a need to classify the research and perform qualitative…
▽ More
Malicious actors may seek to use different voice-spoofing attacks to fool ASV systems and even use them for spreading misinformation. Various countermeasures have been proposed to detect these spoofing attacks. Due to the extensive work done on spoofing detection in automated speaker verification (ASV) systems in the last 6-7 years, there is a need to classify the research and perform qualitative and quantitative comparisons on state-of-the-art countermeasures. Additionally, no existing survey paper has reviewed integrated solutions to voice spoofing evaluation and speaker verification, adversarial/antiforensics attacks on spoofing countermeasures, and ASV itself, or unified solutions to detect multiple attacks using a single model. Further, no work has been done to provide an apples-to-apples comparison of published countermeasures in order to assess their generalizability by evaluating them across corpora. In this work, we conduct a review of the literature on spoofing detection using hand-crafted features, deep learning, end-to-end, and universal spoofing countermeasure solutions to detect speech synthesis (SS), voice conversion (VC), and replay attacks. Additionally, we also review integrated solutions to voice spoofing evaluation and speaker verification, adversarial and anti-forensics attacks on voice countermeasures, and ASV. The limitations and challenges of the existing spoofing countermeasures are also presented. We report the performance of these countermeasures on several datasets and evaluate them across corpora. For the experiments, we employ the ASVspoof2019 and VSDC datasets along with GMM, SVM, CNN, and CNN-GRU classifiers. (For reproduceability of the results, the code of the test bed can be found in our GitHub Repository.
△ Less
Submitted 21 November, 2022; v1 submitted 1 October, 2022;
originally announced October 2022.
-
Large nonsaturating magnetoresistance, weak anti-localization and non-trivial topological states in SrAl$_2$Si$_2$
Authors:
Sudip Malick,
A. B. Sarkar,
Antu Laha,
M. Anas,
V. K. Malik,
Amit Agarwal,
Z. Hossain,
J. Nayak
Abstract:
We explore the electronic and topological properties of single crystal SrAl$_2$Si$_2$ using magnetotransport experiments in conjunction with first-principle calculations. We find that the temperature-dependent resistivity shows a pronounced peak near 50 K. We observe several remarkable features at low temperatures, such as large non-saturating magnetoresistance, Shubnikov-de Haas oscillations and…
▽ More
We explore the electronic and topological properties of single crystal SrAl$_2$Si$_2$ using magnetotransport experiments in conjunction with first-principle calculations. We find that the temperature-dependent resistivity shows a pronounced peak near 50 K. We observe several remarkable features at low temperatures, such as large non-saturating magnetoresistance, Shubnikov-de Haas oscillations and cusp-like magneto-conductivity. The maximum value of magnetoresistance turns out to be 459\% at 2 K and 12 T. The analysis of the cusp-like feature in magneto-conductivity indicates a clear signature of weak anti-localization. Our Hall resistivity measurements confirm the presence of two types of charge carriers in SrAl$_2$Si$_2$, with low carrier density.
△ Less
Submitted 3 August, 2022;
originally announced August 2022.
-
Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning
Authors:
John Nguyen,
Jianyu Wang,
Kshitiz Malik,
Maziar Sanjabi,
Michael Rabbat
Abstract:
An oft-cited challenge of federated learning is the presence of heterogeneity. \emph{Data heterogeneity} refers to the fact that data from different clients may follow very different distributions. \emph{System heterogeneity} refers to client devices having different system capabilities. A considerable number of federated optimization methods address this challenge. In the literature, empirical ev…
▽ More
An oft-cited challenge of federated learning is the presence of heterogeneity. \emph{Data heterogeneity} refers to the fact that data from different clients may follow very different distributions. \emph{System heterogeneity} refers to client devices having different system capabilities. A considerable number of federated optimization methods address this challenge. In the literature, empirical evaluations usually start federated training from random initialization. However, in many practical applications of federated learning, the server has access to proxy data for the training task that can be used to pre-train a model before starting federated training. Using four standard federated learning benchmark datasets, we empirically study the impact of starting from a pre-trained model in federated learning. Unsurprisingly, starting from a pre-trained model reduces the training time required to reach a target error rate and enables the training of more accurate models (up to 40\%) than is possible when starting from random initialization. Surprisingly, we also find that starting federated learning from a pre-trained initialization reduces the effect of both data and system heterogeneity. We recommend future work proposing and evaluating federated optimization methods to evaluate the performance when starting from random and pre-trained initializations. This study raises several questions for further work on understanding the role of heterogeneity in federated optimization. \footnote{Our code is available at: \url{https://github.com/facebookresearch/where_to_begin}}
△ Less
Submitted 24 March, 2023; v1 submitted 30 June, 2022;
originally announced June 2022.
-
The effect of antisite disorder on magnetic and exchange bias properties of Gd-substituted Y$_2$CoMnO$_6$ double perovskite
Authors:
Anasua Khan,
Sarita Rajput,
M. Anas,
V. K. Malik,
T. Maitra,
T. K Nath,
A. Taraphder
Abstract:
Combining experimental investigations and first-principles DFT calculations, we report physical and magnetic properties of Gd-substituted Y$_2$CoMnO$_6$ double perovskite, which are strongly influenced by antisite-disorder-driven spin configurations. On Gd doping, Co and Mn ions are present in mixed-valence (Co$^{3+}$, Co$^{2+}$, Mn$^{3+}$ and Mn$^{4+}$) states. Multiple magnetic transitions have…
▽ More
Combining experimental investigations and first-principles DFT calculations, we report physical and magnetic properties of Gd-substituted Y$_2$CoMnO$_6$ double perovskite, which are strongly influenced by antisite-disorder-driven spin configurations. On Gd doping, Co and Mn ions are present in mixed-valence (Co$^{3+}$, Co$^{2+}$, Mn$^{3+}$ and Mn$^{4+}$) states. Multiple magnetic transitions have been observed: i) paramagnetic to ferromagnetic transition is found to occur at \textit{T}$_C$=95.5 K, ii) antiferromagnetic transition at \textit{T}$_N$=47 K is driven by $3d-4f$ polarisation and antisite disorder present in the sample, iii) change in magnetization below \textit{T}$\leq$20 K, primarily originating from Gd ordering, as revealed from our DFT calculations. AC susceptibility measurement confirms the absence of any spin-glass or cluster-glass phases in this material. A significantly large exchange bias effect (\textit{H}$_{EB}$=1.07 kOe) is found to occur below 47 K due to interfaces of FM and AFM clusters created by antisite-disorder.
△ Less
Submitted 17 August, 2022; v1 submitted 5 May, 2022;
originally announced May 2022.
-
Federated Learning with Partial Model Personalization
Authors:
Krishna Pillutla,
Kshitiz Malik,
Abdelrahman Mohamed,
Michael Rabbat,
Maziar Sanjabi,
Lin Xiao
Abstract:
We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the literature, but their convergence properties are not fully understood, especially for the alternating variant. We provide convergence analyses of both algorithms…
▽ More
We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the literature, but their convergence properties are not fully understood, especially for the alternating variant. We provide convergence analyses of both algorithms in the general nonconvex setting with partial participation and delineate the regime where one dominates the other. Our experiments on real-world image, text, and speech datasets demonstrate that (a) partial personalization can obtain most of the benefits of full model personalization with a small fraction of personal parameters, and, (b) the alternating update algorithm often outperforms the simultaneous update algorithm by a small but consistent margin.
△ Less
Submitted 15 August, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
FedSynth: Gradient Compression via Synthetic Data in Federated Learning
Authors:
Shengyuan Hu,
Jack Goetz,
Kshitiz Malik,
Hongyuan Zhan,
Zhe Liu,
Yue Liu
Abstract:
Model compression is important in federated learning (FL) with large models to reduce communication cost. Prior works have been focusing on sparsification based compression that could desparately affect the global model accuracy. In this work, we propose a new scheme for upstream communication where instead of transmitting the model update, each client learns and transmits a light-weight synthetic…
▽ More
Model compression is important in federated learning (FL) with large models to reduce communication cost. Prior works have been focusing on sparsification based compression that could desparately affect the global model accuracy. In this work, we propose a new scheme for upstream communication where instead of transmitting the model update, each client learns and transmits a light-weight synthetic dataset such that using it as the training data, the model performs similarly well on the real training data. The server will recover the local model update via the synthetic data and apply standard aggregation. We then provide a new algorithm FedSynth to learn the synthetic data locally. Empirically, we find our method is comparable/better than random masking baselines in all three common federated learning benchmark datasets.
△ Less
Submitted 4 April, 2022;
originally announced April 2022.
-
Random vector functional link network: recent developments, applications, and future directions
Authors:
A. K. Malik,
Ruobin Gao,
M. A. Ganaie,
M. Tanveer,
P. N. Suganthan
Abstract:
Neural networks have been successfully employed in various domains such as classification, regression and clustering, etc. Generally, the back propagation (BP) based iterative approaches are used to train the neural networks, however, it results in the issues of local minima, sensitivity to learning rate and slow convergence. To overcome these issues, randomization based neural networks such as ra…
▽ More
Neural networks have been successfully employed in various domains such as classification, regression and clustering, etc. Generally, the back propagation (BP) based iterative approaches are used to train the neural networks, however, it results in the issues of local minima, sensitivity to learning rate and slow convergence. To overcome these issues, randomization based neural networks such as random vector functional link (RVFL) network have been proposed. RVFL model has several characteristics such as fast training speed, direct links, simple architecture, and universal approximation capability, that make it a viable randomized neural network. This article presents the first comprehensive review of the evolution of RVFL model, which can serve as the extensive summary for the beginners as well as practitioners. We discuss the shallow RVFLs, ensemble RVFLs, deep RVFLs and ensemble deep RVFL models. The variations, improvements and applications of RVFL models are discussed in detail. Moreover, we discuss the different hyperparameter optimization techniques followed in the literature to improve the generalization performance of the RVFL model. Finally, we give potential future research directions/opportunities that can inspire the researchers to improve the RVFL's architecture and learning algorithm further.
△ Less
Submitted 23 April, 2023; v1 submitted 13 February, 2022;
originally announced March 2022.
-
Papaya: Practical, Private, and Scalable Federated Learning
Authors:
Dzmitry Huba,
John Nguyen,
Kshitiz Malik,
Ruiyu Zhu,
Mike Rabbat,
Ashkan Yousefpour,
Carole-Jean Wu,
Hongyuan Zhan,
Pavel Ustinov,
Harish Srinivas,
Kaikai Wang,
Anthony Shoumikhin,
Jesik Min,
Mani Malek
Abstract:
Cross-device Federated Learning (FL) is a distributed learning paradigm with several challenges that differentiate it from traditional distributed learning, variability in the system characteristics on each device, and millions of clients coordinating with a central server being primary ones. Most FL systems described in the literature are synchronous - they perform a synchronized aggregation of m…
▽ More
Cross-device Federated Learning (FL) is a distributed learning paradigm with several challenges that differentiate it from traditional distributed learning, variability in the system characteristics on each device, and millions of clients coordinating with a central server being primary ones. Most FL systems described in the literature are synchronous - they perform a synchronized aggregation of model updates from individual clients. Scaling synchronous FL is challenging since increasing the number of clients training in parallel leads to diminishing returns in training speed, analogous to large-batch training. Moreover, stragglers hinder synchronous FL training. In this work, we outline a production asynchronous FL system design. Our work tackles the aforementioned issues, sketches of some of the system design challenges and their solutions, and touches upon principles that emerged from building a production FL system for millions of clients. Empirically, we demonstrate that asynchronous FL converges faster than synchronous FL when training across nearly one hundred million devices. In particular, in high concurrency settings, asynchronous FL is 5x faster and has nearly 8x less communication overhead than synchronous FL.
△ Less
Submitted 25 April, 2022; v1 submitted 8 November, 2021;
originally announced November 2021.
-
A new mechanism for primordial black hole formation during reheating
Authors:
Luis E. Padilla,
Juan Carlos Hidalgo,
Karim A. Malik
Abstract:
The Reheating process at the end of inflation is often modeled by an oscillating scalar field which shows a background dust-like behaviour, prompting the analysis of gravitational collapse and black hole formation in this era to be approached by the spherical collapse of standard structure formation. In the scalar field dark matter structure formation process virialized halos halt the direct colla…
▽ More
The Reheating process at the end of inflation is often modeled by an oscillating scalar field which shows a background dust-like behaviour, prompting the analysis of gravitational collapse and black hole formation in this era to be approached by the spherical collapse of standard structure formation. In the scalar field dark matter structure formation process virialized halos halt the direct collapse, resulting in halos with condensed central cores at the de Broglie scale of the dominant scalar field. We show that a similar process can take place during reheating, leading to the formation of primordial black holes (PBHs). We study the formation of PBHs through the gravitational further collapse of structures virialized during reheating, looking at the collapse of either the whole structure, or that of the central core within these configurations. We compute the threshold amplitude for the density contrast to undergo this process, for both free and self-interacting scalar fields. We discuss the relevance of our results for the abundance of PBHs at the lower end of the mass spectrum.
△ Less
Submitted 7 July, 2022; v1 submitted 27 October, 2021;
originally announced October 2021.
-
Comparative Analysis of Deep Learning Algorithms for Classification of COVID-19 X-Ray Images
Authors:
Unsa Maheen,
Khawar Iqbal Malik,
Gohar Ali
Abstract:
The Coronavirus was first emerged in December, in the city of China named Wuhan in 2019 and spread quickly all over the world. It has very harmful effects all over the global economy, education, social, daily living and general health of humans. To restrict the quick expansion of the disease initially, main difficulty is to explore the positive corona patients as quickly as possible. As there are…
▽ More
The Coronavirus was first emerged in December, in the city of China named Wuhan in 2019 and spread quickly all over the world. It has very harmful effects all over the global economy, education, social, daily living and general health of humans. To restrict the quick expansion of the disease initially, main difficulty is to explore the positive corona patients as quickly as possible. As there are no automatic tool kits accessible the requirement for supplementary diagnostic tools has risen up. Previous studies have findings acquired from radiological techniques proposed that this kind of images have important details related to the coronavirus. The usage of modified Artificial Intelligence (AI) system in combination with radio-graphical images can be fruitful for the precise and exact solution of this virus and can also be helpful to conquer the issue of deficiency of professional physicians in distant villages. In our research, we analyze the different techniques for the detection of COVID-19 using X-Ray radiographic images of the chest, we examined the different pre-trained CNN models AlexNet, VGG-16, MobileNet-V2, SqeezeNet, ResNet-34, ResNet-50 and COVIDX-Net to correct analytics for classification system of COVID-19. Our study shows that the pre trained CNN Model with ResNet-34 technique gives the higher accuracy rate of 98.33, 96.77% precision, and 98.36 F1-score, which is better than other CNN techniques. Our model may be helpful for the researchers to fine train the CNN model for the the quick screening of COVID patients.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Complex interplay of magnetic ordering and spin-lattice coupling in orthochromite Nd$_{0.5}$Dy$_{0.5}$CrO$_{3}$
Authors:
M. Anas,
Padmanabhan Balasubramanian,
K. Vikram,
Ankita Singh,
C. M. N. Kumar,
Andreas Hoser,
Dariusz Rusinek,
A. K. Sinha,
V. Srihari,
Ranjan K. Singh,
Rinku Kumar,
Mukul Gupta,
T. Maitra,
V. K. Malik
Abstract:
The mixed rare-earth orthochromite Nd$_{0.5}$Dy$_{0.5}$CrO$_{3}$ has a Néel temperature ($T_\mathrm{N}$) of ${\sim}$ 175\,K, resulting in the G-type antiferromagnetic ordering of Cr$^{3+}$ spins. The inverse susceptibility shows a deviation from Curie-Weiss law at 230\,K, with a large effective paramagnetic moment of 8.8\,$μ_{\mathrm{B}}$. The ZFC-FC magnetization bifurcate just above…
▽ More
The mixed rare-earth orthochromite Nd$_{0.5}$Dy$_{0.5}$CrO$_{3}$ has a Néel temperature ($T_\mathrm{N}$) of ${\sim}$ 175\,K, resulting in the G-type antiferromagnetic ordering of Cr$^{3+}$ spins. The inverse susceptibility shows a deviation from Curie-Weiss law at 230\,K, with a large effective paramagnetic moment of 8.8\,$μ_{\mathrm{B}}$. The ZFC-FC magnetization bifurcate just above $T_\mathrm{N}$ and show a distinct signature of spin reorientation near 60\,K. Neutron diffraction show that below $T_\mathrm{N}$, the Cr$^{3+}$ spins align in $Γ_{2}$ representation as ($F_{x}$, $G_{z}$). Below 60\,K, due to spin reorientation, the magnetic structure is in $Γ_{1}$ ($G_{y}$) configuration. The neutron diffraction does not show any signature of rare-earth ordering even at 1.5\,K. First principles density functional theory calculations within GGA+U and GGA+U+SO approximations reveal that the G-type antiferromagnetic order is the ground state magnetic structure of Cr sublattice and the spin-reorientation of Cr$^{3+}$ spins can happen in the absence of 3d-4f interactions unlike in the case of orthoferrites. The specific heat shows a `$λ$' anomaly at $T_\mathrm{N}$, while at low temperature two distinct Schottky anomalies are observed; a Schottky peak at 2\,K and an additional step-like feature above 10\,K. Above $T_\mathrm{N}$, the magnetic transition is preceded by structural anomalies as seen in our x-ray diffraction and Raman measurements. The deviation of structural parameters near Néel temperature is smaller. The phonon frequencies show deviation from the standard anharmonic behaviour: first near 250\,K, due to magneto-volume effects while the second deviation occurs near 200\,K due to spin-phonon coupling.
△ Less
Submitted 18 October, 2021;
originally announced October 2021.
-
Coexisting magnetic structures and spin-reorientation in Er$_{0.5}$Dy$_{0.5}$FeO$_{3}$: Bulk magnetization, neutron scattering, specific heat, and \emph{Ab-initio} studies
Authors:
Sarita Rajput,
Padmanabhan Balasubramanian,
Ankita Singh,
Francoise Damay,
C. M. N. Kumar,
W. Tabis,
T. Maitra,
V. K. Malik
Abstract:
The complex magnetic structures, spin-reorientation and associated exchange interactions have been investigate in Er$_{0.5}$Dy$_{0.5}$FeO$_3$ using bulk magnetization, neutron diffraction, specific heat measurements and density functional theory calculations. The Fe$^{3+}$ spins order as G-type antiferromagnet structure depicted by $Γ_{4}$($G_{x}$,$A_{y}$,$F_{z}$) irreducible representation below…
▽ More
The complex magnetic structures, spin-reorientation and associated exchange interactions have been investigate in Er$_{0.5}$Dy$_{0.5}$FeO$_3$ using bulk magnetization, neutron diffraction, specific heat measurements and density functional theory calculations. The Fe$^{3+}$ spins order as G-type antiferromagnet structure depicted by $Γ_{4}$($G_{x}$,$A_{y}$,$F_{z}$) irreducible representation below 700K, similar to its end compounds. The bulk magnetization data indicate occurrence of the spin-reorientation and rare-earth magnetic ordering below $\sim$75 K and 10 K, respectively. The neutron diffraction studies confirm an "incomplete" $Γ_{4}$${\rightarrow}$ $Γ_{2}$($F_{x}$,$C_{y}$,$G_{z}$) spin-reorientation initiated $\leq$75 K. Although, the relative volume fraction of the two magnetic structures varies with decreasing temperature, both co-exist even at 1.5 K. At 8 K, Er$^{3+}$/Dy$^{3+}$ moments order as $c_{y}^R$ arrangement develop, which gradually increases in intensity with decreasing temperature. At 2 K, magnetic structure associated with $c_{z}^R$ arrangement of Er$^{3+}$/Dy$^{3+}$ moments also appears. At 1.5 K the magnetic structure of Fe$^{3+}$ spins is represented by a combination of $Γ_{2}$+$Γ_{4}$+$Γ_{1}$, while the rare earth moments coexists as $c_{y}^R$ and $c_{z}^R$ corresponding to $Γ_{2}$ and $Γ_{1}$ representation, respectively. The observed Schottky anomaly at 2.5 K suggests that the "rare-earth ordering" is induced by polarization due to Fe$^{3+}$ spins. The Er$^{3+}$-Fe$^{3+}$ and Er$^{3+}$-Dy$^{3+}$ exchange interactions, obtained from first principle calculations, primarily cause the complicated spin-reorientation and $c_{y}^R$ rare-earth ordering, respectively, while the dipolar interactions between rare-earth moments, result in the $c_{z}^R$ type rare-earth ordering at 2 K.
△ Less
Submitted 4 October, 2021; v1 submitted 23 August, 2021;
originally announced August 2021.
-
Hate Speech Detection in Roman Urdu
Authors:
Moin Khan,
Khurram Shahzad,
Kamran Malik
Abstract:
Hate speech is a specific type of controversial content that is widely legislated as a crime that must be identified and blocked. However, due to the sheer volume and velocity of the Twitter data stream, hate speech detection cannot be performed manually. To address this issue, several studies have been conducted for hate speech detection in European languages, whereas little attention has been pa…
▽ More
Hate speech is a specific type of controversial content that is widely legislated as a crime that must be identified and blocked. However, due to the sheer volume and velocity of the Twitter data stream, hate speech detection cannot be performed manually. To address this issue, several studies have been conducted for hate speech detection in European languages, whereas little attention has been paid to low-resource South Asian languages, making the social media vulnerable for millions of users. In particular, to the best of our knowledge, no study has been conducted for hate speech detection in Roman Urdu text, which is widely used in the sub-continent. In this study, we have scrapped more than 90,000 tweets and manually parsed them to identify 5,000 Roman Urdu tweets. Subsequently, we have employed an iterative approach to develop guidelines and used them for generating the Hate Speech Roman Urdu 2020 corpus. The tweets in the this corpus are classified at three levels: Neutral-Hostile, Simple-Complex, and Offensive-Hate speech. As another contribution, we have used five supervised learning techniques, including a deep learning technique, to evaluate and compare their effectiveness for hate speech detection. The results show that Logistic Regression outperformed all other techniques, including deep learning techniques for the two levels of classification, by achieved an F1 score of 0.906 for distinguishing between Neutral-Hostile tweets, and 0.756 for distinguishing between Offensive-Hate speech tweets.
△ Less
Submitted 5 August, 2021;
originally announced August 2021.
-
Sb concentration dependent Structural and Transport properties of Polycrystalline (Bi1-xSbx)2Te3 Mixed crystal
Authors:
K. Malik,
S. Mahakal,
Diptasikha Das,
Aritra Banerjee,
S. Chatterjee,
Anusree Das
Abstract:
(Bi1-xSbx)2Te3 (x=0.60, 0.65, 0.68, 0.70, 0.75 and 0.80) mixed crystals have been synthesized by solid state reaction. In depth structural, thermal, transport and electronic properties are reported. Defect and disorder play a crucial role in structural and transport behaviour. Disorder induced iso-structural phase transition is observed at x=0.70, which is supported by the structural and transport…
▽ More
(Bi1-xSbx)2Te3 (x=0.60, 0.65, 0.68, 0.70, 0.75 and 0.80) mixed crystals have been synthesized by solid state reaction. In depth structural, thermal, transport and electronic properties are reported. Defect and disorder play a crucial role in structural and transport behaviour. Disorder induced iso-structural phase transition is observed at x=0.70, which is supported by the structural and transport properties data. Debye temperature has been estimated from the powder diffraction data. Differential scanning calorimetry (DSC) data confirms the glass transition in the material. Low temperature resistivity data shows Variable range hopping mechanism whereas high temperature data follows activated behaviour. Activation energy is calculated from the semiconducting region of resistivity data. Both Hall measurement and temperature dependent thermopower data (S(T)) confirms that samples are p-type in nature. Density of state effective mass has been estimated from Pisarenko relation and corroborated with resistivity data. Thermal conductivity (k) is estimated using experimentally obtained data. Figure of Merit (ZT) of the synthesized samples are calculated using resistivity, S(T) and k. Structural and transport properties are correlated, confirms the transition from disorder to order state. Defect and disorder are corroborated with structural and Thermoelectric properties of the synthesized samples.
△ Less
Submitted 1 August, 2021;
originally announced August 2021.
-
Contributions from primordial non-Gaussianity and General Relativity to the galaxy power spectrum
Authors:
Rebeca Martinez-Carrillo,
Juan Carlos Hidalgo,
Karim A. Malik,
Alkistis Pourtsidou
Abstract:
We compute the real space galaxy power spectrum, including the leading order effects of General Relativity and primordial non-Gaussianity from the $f_{\mathrm{NL}}$ and $g_{\mathrm{NL}}$ parameters. Such contributions come from the one-loop matter power spectrum terms dominant at large scales, and from the factors of the non-linear bias parameter $b_{\mathrm{NL}}$ (akin to the Newtonian $b_φ$). We…
▽ More
We compute the real space galaxy power spectrum, including the leading order effects of General Relativity and primordial non-Gaussianity from the $f_{\mathrm{NL}}$ and $g_{\mathrm{NL}}$ parameters. Such contributions come from the one-loop matter power spectrum terms dominant at large scales, and from the factors of the non-linear bias parameter $b_{\mathrm{NL}}$ (akin to the Newtonian $b_φ$). We assess the detectability of these contributions in Stage-IV surveys. In particular, we note that specific values of the bias parameter may erase the primordial and relativistic contributions to the configuration space power spectrum.
△ Less
Submitted 30 November, 2021; v1 submitted 22 July, 2021;
originally announced July 2021.
-
Detecting Ideal Instagram Influencer Using Social Network Analysis
Authors:
M. M. H Dihyat,
K Malik,
M. A Khan,
B Imran
Abstract:
Social Media is a key aspect of modern society where people share their thoughts, views, feelings and sentiments. Over the last few years, the inflation in popularity of social media has resulted in a monumental increase in data. Users use this medium to express their thoughts, feelings, and opinions on a wide variety of subjects, including politics and celebrities. Social Media has thus evolved i…
▽ More
Social Media is a key aspect of modern society where people share their thoughts, views, feelings and sentiments. Over the last few years, the inflation in popularity of social media has resulted in a monumental increase in data. Users use this medium to express their thoughts, feelings, and opinions on a wide variety of subjects, including politics and celebrities. Social Media has thus evolved into a lucrative platform for companies to expand their scope and improve their prospects. The paper focuses on social network analysis (SNA) for a real-world online marketing strategy. The study contributes by comparing various centrality measures to identify the most central nodes in the network and uses a linear threshold model to understand the spreading behaviour of individual users. In conclusion, the paper correlates different centrality measures and spreading behaviour to identify the most influential user in the network
△ Less
Submitted 12 July, 2021;
originally announced July 2021.
-
Differential rotation of the solar transition region from STEREO/EUVI 30.4 nm images
Authors:
Jaidev Sharma,
Brajesh Kumar,
Anil K Malik,
Hari Om Vats
Abstract:
The solar photosphere, chromosphere and corona are known to rotate differentially as a function of latitude. To date, it is unclear if the solar transition region also rotates differentially. In this paper, we investigate differential rotational profile of solar transition region as a function of latitude, using solar full disk (SFD) images at 30.4 nm wavelength recorded by Extreme Ultraviolet Ima…
▽ More
The solar photosphere, chromosphere and corona are known to rotate differentially as a function of latitude. To date, it is unclear if the solar transition region also rotates differentially. In this paper, we investigate differential rotational profile of solar transition region as a function of latitude, using solar full disk (SFD) images at 30.4 nm wavelength recorded by Extreme Ultraviolet Imager (EUVI) onboard Solar Terrestrial Relations Observatory (STEREO) space mission for the period from 2008 to 2018 (Solar Cycle 24). Our investigations show that solar transition region rotates differentially. The sidereal rotation rate obtained at +/- 5 degree equatorial band is quite high (~ 14.7 degree/day), which drops to ~ 13.6 degree/day towards both polar regions. We also obtain that the rotational differentiality is low during the period of high solar activity (rotation rate varies from 14.86 to 14.27 degree/day) while it increases during the ascending and the descending phases of the 24th solar cycle (rotation rate varies from 14.56 to 13.56 degree/day in 2008 and 14.6 to 13.1 degree/day in 2018). Average sidereal rotation rate (over SFD) follows the trend of solar activity (maximum ~ 14.97 degree/day during the peak phase of the solar activity, which slowly decreases to minimum ~ 13.9 degree/day during ascending and the descending phases of the 24th solar cycle). We also observe that solar transition region rotates less differentially than the corona.
△ Less
Submitted 12 July, 2021;
originally announced July 2021.
-
Federated Learning with Buffered Asynchronous Aggregation
Authors:
John Nguyen,
Kshitiz Malik,
Hongyuan Zhan,
Ashkan Yousefpour,
Michael Rabbat,
Mani Malek,
Dzmitry Huba
Abstract:
Scalability and privacy are two critical concerns for cross-device federated learning (FL) systems. In this work, we identify that synchronous FL - synchronized aggregation of client updates in FL - cannot scale efficiently beyond a few hundred clients training in parallel. It leads to diminishing returns in model performance and training speed, analogous to large-batch training. On the other hand…
▽ More
Scalability and privacy are two critical concerns for cross-device federated learning (FL) systems. In this work, we identify that synchronous FL - synchronized aggregation of client updates in FL - cannot scale efficiently beyond a few hundred clients training in parallel. It leads to diminishing returns in model performance and training speed, analogous to large-batch training. On the other hand, asynchronous aggregation of client updates in FL (i.e., asynchronous FL) alleviates the scalability issue. However, aggregating individual client updates is incompatible with Secure Aggregation, which could result in an undesirable level of privacy for the system. To address these concerns, we propose a novel buffered asynchronous aggregation method, FedBuff, that is agnostic to the choice of optimizer, and combines the best properties of synchronous and asynchronous FL. We empirically demonstrate that FedBuff is 3.3x more efficient than synchronous FL and up to 2.5x more efficient than asynchronous FL, while being compatible with privacy-preserving technologies such as Secure Aggregation and differential privacy. We provide theoretical convergence guarantees in a smooth non-convex setting. Finally, we show that under differentially private training, FedBuff can outperform FedAvgM at low privacy settings and achieve the same utility for higher privacy settings.
△ Less
Submitted 7 March, 2022; v1 submitted 11 June, 2021;
originally announced June 2021.
-
The intrinsic bispectrum of the CMB from isocurvature initial conditions
Authors:
Pedro Carrilho,
Karim A. Malik
Abstract:
Non-linear effects in the early Universe generate non-zero bispectra of the cosmic microwave background (CMB) temperature and polarization, even in the absence of primordial non-Gaussianity. In this paper, we compute the contributions from isocurvature modes to the CMB bispectra using a modified version of the second-order Boltzmann solver SONG. We investigate the ability of current and future CMB…
▽ More
Non-linear effects in the early Universe generate non-zero bispectra of the cosmic microwave background (CMB) temperature and polarization, even in the absence of primordial non-Gaussianity. In this paper, we compute the contributions from isocurvature modes to the CMB bispectra using a modified version of the second-order Boltzmann solver SONG. We investigate the ability of current and future CMB experiments to constrain these modes with observations of the bispectrum. Our results show that the enhancement due to single isocurvature modes mixed with the adiabatic mode is negligible for the parameter ranges currently allowed by the most recent Planck results. However, we find that a large compensated isocurvature mode can produce a detectable bispectrum when its correlation with the adiabatic mode is appreciable. The non-observation of this contribution in searches for the lensing bispectrum from Planck allows us to place a new constraint on the relative amplitude of the correlated part of the compensated isocurvature mode of $f_{\rm CIP}=1\pm100$. We compute forecasts for future observations by COrE, SO, CMB-S4 and an ideal experiment and conclude that a dedicated search for the bispectrum from compensated modes could rule out a number of scenarios realised in the curvaton model. In addition, the CMB-S4 experiment could detect the most extreme of those scenarios ($f_{\rm CIP}=16.5$) at 2 to 3-$σ$ significance.
△ Less
Submitted 8 September, 2021; v1 submitted 20 April, 2021;
originally announced April 2021.
-
Ensemble deep learning: A review
Authors:
M. A. Ganaie,
Minghui Hu,
A. K. Malik,
M. Tanveer,
P. N. Suganthan
Abstract:
Ensemble learning combines several individual models to obtain better generalization performance. Currently, deep learning architectures are showing better performance compared to the shallow or traditional models. Deep ensemble learning models combine the advantages of both the deep learning models as well as the ensemble learning such that the final model has better generalization performance. T…
▽ More
Ensemble learning combines several individual models to obtain better generalization performance. Currently, deep learning architectures are showing better performance compared to the shallow or traditional models. Deep ensemble learning models combine the advantages of both the deep learning models as well as the ensemble learning such that the final model has better generalization performance. This paper reviews the state-of-art deep ensemble models and hence serves as an extensive summary for the researchers. The ensemble models are broadly categorised into bagging, boosting, stacking, negative correlation based deep ensemble models, explicit/implicit ensembles, homogeneous/heterogeneous ensemble, decision fusion strategies based deep ensemble models. Applications of deep ensemble models in different domains are also briefly discussed. Finally, we conclude this paper with some potential future research directions.
△ Less
Submitted 8 August, 2022; v1 submitted 6 April, 2021;
originally announced April 2021.
-
Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward
Authors:
Momina Masood,
Marriam Nawaz,
Khalid Mahmood Malik,
Ali Javed,
Aun Irtaza
Abstract:
Easy access to audio-visual content on social media, combined with the availability of modern tools such as Tensorflow or Keras, open-source trained models, and economical computing infrastructure, and the rapid evolution of deep-learning (DL) methods, especially Generative Adversarial Networks (GAN), have made it possible to generate deepfakes to disseminate disinformation, revenge porn, financia…
▽ More
Easy access to audio-visual content on social media, combined with the availability of modern tools such as Tensorflow or Keras, open-source trained models, and economical computing infrastructure, and the rapid evolution of deep-learning (DL) methods, especially Generative Adversarial Networks (GAN), have made it possible to generate deepfakes to disseminate disinformation, revenge porn, financial frauds, hoaxes, and to disrupt government functioning. The existing surveys have mainly focused on the detection of deepfake images and videos. This paper provides a comprehensive review and detailed analysis of existing tools and machine learning (ML) based approaches for deepfake generation and the methodologies used to detect such manipulations for both audio and visual deepfakes. For each category of deepfake, we discuss information related to manipulation approaches, current public datasets, and key standards for the performance evaluation of deepfake detection techniques along with their results. Additionally, we also discuss open challenges and enumerate future directions to guide future researchers on issues that need to be considered to improve the domains of both deepfake generation and detection. This work is expected to assist the readers in understanding the creation and detection mechanisms of deepfakes, along with their current limitations and future direction.
△ Less
Submitted 22 November, 2021; v1 submitted 25 February, 2021;
originally announced March 2021.
-
Galaxy number counts at second order in perturbation theory: a leading-order term comparison
Authors:
Jorge L. Fuentes,
Juan Carlos Hidalgo,
Karim A. Malik
Abstract:
The galaxy number density is a key quantity to compare theoretical predictions to the observational data from current and future Large Scale Structure surveys. The precision demanded by these Stage IV surveys requires the use of second order cosmological perturbation theory. Based on the independent calculation published previously, we present the result of the comparison with the results of three…
▽ More
The galaxy number density is a key quantity to compare theoretical predictions to the observational data from current and future Large Scale Structure surveys. The precision demanded by these Stage IV surveys requires the use of second order cosmological perturbation theory. Based on the independent calculation published previously, we present the result of the comparison with the results of three other groups at leading order. Overall we find that the differences between the different approaches lie mostly on the definition of certain quantities, where the ambiguity of signs results in the addition of extra terms at second order in perturbation theory.
△ Less
Submitted 8 August, 2021; v1 submitted 30 December, 2020;
originally announced December 2020.
-
Selling two complementary goods
Authors:
Komal Malik,
Kolagani Paramahamsa
Abstract:
A seller is selling a pair of divisible complementary goods to an agent. The agent consumes the goods only in a specific ratio and freely disposes of excess in either goods. The value of the bundle and the ratio are private information of the agent. In this two-dimensional type space model, we characterize the incentive constraints and show that the optimal (expected revenue-maximizing) mechanism…
▽ More
A seller is selling a pair of divisible complementary goods to an agent. The agent consumes the goods only in a specific ratio and freely disposes of excess in either goods. The value of the bundle and the ratio are private information of the agent. In this two-dimensional type space model, we characterize the incentive constraints and show that the optimal (expected revenue-maximizing) mechanism is a ratio-dependent posted price or a posted price mechanism for a class of distributions. We also show that the optimal mechanism is a posted price mechanism when the value and the ratio are independently distributed.
△ Less
Submitted 14 July, 2022; v1 submitted 11 November, 2020;
originally announced November 2020.
-
Sentiment Analysis for Roman Urdu Text over Social Media, a Comparative Study
Authors:
Irfan Qutab,
Khawar Iqbal Malik,
Hira Arooj
Abstract:
In present century, data volume is increasing enormously. The data could be in form for image, text, voice, and video. One factor in this huge growth of data is usage of social media where everyone is posting data on daily basis during chatting, exchanging information, and uploading their personal and official credential. Research of sentiments seeks to uncover abstract knowledge in Published text…
▽ More
In present century, data volume is increasing enormously. The data could be in form for image, text, voice, and video. One factor in this huge growth of data is usage of social media where everyone is posting data on daily basis during chatting, exchanging information, and uploading their personal and official credential. Research of sentiments seeks to uncover abstract knowledge in Published texts in which users communicate their emotions and thoughts about shared content, including blogs, news and social networks. Roman Urdu is the one of most dominant language on social networks in Pakistan and India. Roman Urdu is among the varieties of the world's third largest Urdu language but yet not sufficient work has been done in this language. In this article we addressed the prior concepts and strategies used to examine the sentiment of the roman Urdu text and reported their results as well.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Mineralogy, chemistry and composition of organic compounds in the fresh carbonaceous chondrite Mukundpura: CM1 or CM2?
Authors:
S. Potin,
P. Beck,
L. Bonal,
B. Schmitt,
A. Garenne,
F. Moynier,
A. Agranier,
P. Schmitt-Kopplin,
A. K. Malik,
E. Quirico
Abstract:
We present here several laboratory analyses performed on the freshly fallen Mukundpura CM chondrite. Results of infrared transmission spectroscopy, thermogravimetry analysis and reflectance spectroscopy show that Mukundpura is mainly composed of phyllosilicates. The rare earth trace elements composition and ultrahigh resolution mass spectrometry of the soluble organic matter (SOM) give results con…
▽ More
We present here several laboratory analyses performed on the freshly fallen Mukundpura CM chondrite. Results of infrared transmission spectroscopy, thermogravimetry analysis and reflectance spectroscopy show that Mukundpura is mainly composed of phyllosilicates. The rare earth trace elements composition and ultrahigh resolution mass spectrometry of the soluble organic matter (SOM) give results consistent with CM chondrites. Finally, Raman spectroscopy shows no signs of thermal alteration of the meteorite. All the results agree that Mukundpura has been strongly altered by water on its parent body. Comparison of the results obtained on the meteorite with those of other chondrites of known petrologic types lead to the conclusion that Mukundpura is similar to CM1 chondrites, which differs from its original classification as a CM2.
△ Less
Submitted 29 September, 2020;
originally announced September 2020.