-
Generalized inflation in the context of $κ$-deformed theories
Authors:
B W Ribeiro,
I M Macêdo,
F C Cabral
Abstract:
A new inflationary scenario driven by a slowly-rolling homogeneous scalar field whose potential $V\left(\varphi\right)$ is given by a generalized exponential function is investigated. Within the {\it slow-roll} approximation we obtain the main predictions of the model and compare them with current data from cosmic microwave background and large-scale structure observations. We show that this singl…
▽ More
A new inflationary scenario driven by a slowly-rolling homogeneous scalar field whose potential $V\left(\varphi\right)$ is given by a generalized exponential function is investigated. Within the {\it slow-roll} approximation we obtain the main predictions of the model and compare them with current data from cosmic microwave background and large-scale structure observations. We show that this single scalar field model admits a wider set of solutions than usual exponential scenarios and predicts acceptable values of the spectral index, running of the spectral index and tensor-to-scalar ratio for the remaining number of {\it e}-folds lying in the interval $N = 55 \pm 5$ and an energy scale on which $λ\geq \sqrt{2}$; in particular, we observe that the value of the model parameter $κ$ depends on the analysis. Finally, the primordial local non-Gaussianity is briefly discussed where we conclude that $k\gtrsim 0.02$ for $f_\text{NL}^\text{local} \ll 1$.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Results from ON-OFF analysis of the Neutrinos-Angra detector
Authors:
E. Kemp,
W. V. Santos,
J. C. Anjos,
P. Chimenti,
L. F. G. Gonzalez,
G. P. Guedes,
H. P. Lima Jr.,
R. A. Nóbrega,
I. M. Pepe,
D. B. S. Ribeiro
Abstract:
The Neutrinos Angra Experiment, a water-based Cherenkov detector, is located at the Angra dos Reis nuclear power plant in Brazil. Designed to detect electron antineutrinos produced in the nuclear reactor, the primary objective of the experiment is to demonstrate the feasibility of monitoring reactor activity using an antineutrino detector. This effort aligns with the International Atomic Energy Ag…
▽ More
The Neutrinos Angra Experiment, a water-based Cherenkov detector, is located at the Angra dos Reis nuclear power plant in Brazil. Designed to detect electron antineutrinos produced in the nuclear reactor, the primary objective of the experiment is to demonstrate the feasibility of monitoring reactor activity using an antineutrino detector. This effort aligns with the International Atomic Energy Agency (IAEA) program to identify potential and novel technologies applicable to nonproliferation safeguards. Operating on the surface presents challenges such as high noise rates, necessitating the development of very sensitive, yet small-scale detectors. These conditions make the Angra experiment an excellent platform for both developing the application and gaining expertise in new technologies and analysis methods. The detector employs a water-based target doped with gadolinium to enhance its sensitivity to antineutrinos. In this work, we describe the main features of the detector and the electronics chain, including front-end and data acquisition components. We detail the data acquisition strategies and the methodologies applied for signal processing and event selection. Preliminary physics results suggest that the detector can reliably monitor reactor operations by detecting the inverse beta decay induced by electron antineutrinos from the reactor.
△ Less
Submitted 7 August, 2024; v1 submitted 29 July, 2024;
originally announced July 2024.
-
DiGRAF: Diffeomorphic Graph-Adaptive Activation Function
Authors:
Krishna Sri Ipsit Mantri,
Xinzhi Wang,
Carola-Bibiane Schönlieb,
Bruno Ribeiro,
Beatrice Bevilacqua,
Moshe Eliasof
Abstract:
In this paper, we propose a novel activation function tailored specifically for graph data in Graph Neural Networks (GNNs). Motivated by the need for graph-adaptive and flexible activation functions, we introduce DiGRAF, leveraging Continuous Piecewise-Affine Based (CPAB) transformations, which we augment with an additional GNN to learn a graph-adaptive diffeomorphic activation function in an end-…
▽ More
In this paper, we propose a novel activation function tailored specifically for graph data in Graph Neural Networks (GNNs). Motivated by the need for graph-adaptive and flexible activation functions, we introduce DiGRAF, leveraging Continuous Piecewise-Affine Based (CPAB) transformations, which we augment with an additional GNN to learn a graph-adaptive diffeomorphic activation function in an end-to-end manner. In addition to its graph-adaptivity and flexibility, DiGRAF also possesses properties that are widely recognized as desirable for activation functions, such as differentiability, boundness within the domain and computational efficiency. We conduct an extensive set of experiments across diverse datasets and tasks, demonstrating a consistent and superior performance of DiGRAF compared to traditional and graph-specific activation functions, highlighting its effectiveness as an activation function for GNNs.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
The Significance of Latent Data Divergence in Predicting System Degradation
Authors:
Miguel Fernandes,
Catarina Silva,
Alberto Cardoso,
Bernardete Ribeiro
Abstract:
Condition-Based Maintenance is pivotal in enabling the early detection of potential failures in engineering systems, where precise prediction of the Remaining Useful Life is essential for effective maintenance and operation. However, a predominant focus in the field centers on predicting the Remaining Useful Life using unprocessed or minimally processed data, frequently neglecting the intricate dy…
▽ More
Condition-Based Maintenance is pivotal in enabling the early detection of potential failures in engineering systems, where precise prediction of the Remaining Useful Life is essential for effective maintenance and operation. However, a predominant focus in the field centers on predicting the Remaining Useful Life using unprocessed or minimally processed data, frequently neglecting the intricate dynamics inherent in the dataset. In this work we introduce a novel methodology grounded in the analysis of statistical similarity within latent data from system components. Leveraging a specifically designed architecture based on a Vector Quantized Variational Autoencoder, we create a sequence of discrete vectors which is used to estimate system-specific priors. We infer the similarity between systems by evaluating the divergence of these priors, offering a nuanced understanding of individual system behaviors. The efficacy of our approach is demonstrated through experiments on the NASA commercial modular aero-propulsion system simulation (C-MAPSS) dataset. Our validation not only underscores the potential of our method in advancing the study of latent statistical divergence but also demonstrates its superiority over existing techniques.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
Authors:
David Pissarra,
Isabel Curioso,
João Alveira,
Duarte Pereira,
Bruno Ribeiro,
Tomás Souper,
Vasco Gomes,
André V. Carreiro,
Vitor Rolla
Abstract:
Automated clinical text anonymization has the potential to unlock the widespread sharing of textual health data for secondary usage while assuring patient privacy and safety. Despite the proposal of many complex and theoretically successful anonymization solutions in literature, these techniques remain flawed. As such, clinical institutions are still reluctant to apply them for open access to thei…
▽ More
Automated clinical text anonymization has the potential to unlock the widespread sharing of textual health data for secondary usage while assuring patient privacy and safety. Despite the proposal of many complex and theoretically successful anonymization solutions in literature, these techniques remain flawed. As such, clinical institutions are still reluctant to apply them for open access to their data. Recent advances in developing Large Language Models (LLMs) pose a promising opportunity to further the field, given their capability to perform various tasks. This paper proposes six new evaluation metrics tailored to the challenges of generative anonymization with LLMs. Moreover, we present a comparative study of LLM-based methods, testing them against two baseline techniques. Our results establish LLM-based models as a reliable alternative to common approaches, paving the way toward trustworthy anonymization of clinical text.
△ Less
Submitted 29 May, 2024;
originally announced June 2024.
-
Prediction of soil fertility parameters using USB-microscope imagery and portable X-ray fluorescence spectrometry
Authors:
Shubhadip Dasgupta,
Satwik Pate,
Divya Rathore,
L. G. Divyanth,
Ayan Das,
Anshuman Nayak,
Subhadip Dey,
Asim Biswas,
David C. Weindorf,
Bin Li,
Sergio Henrique Godinho Silva,
Bruno Teixeira Ribeiro,
Sanjay Srivastava,
Somsubhra Chakraborty
Abstract:
This study investigated the use of portable X-ray fluorescence (PXRF) spectrometry and soil image analysis for rapid soil fertility assessment, with a focus on key indicators such as available boron (B), organic carbon (OC), available manganese (Mn), available sulfur (S), and the sulfur availability index (SAI). A total of 1,133 soil samples from diverse agro-climatic zones in Eastern India were a…
▽ More
This study investigated the use of portable X-ray fluorescence (PXRF) spectrometry and soil image analysis for rapid soil fertility assessment, with a focus on key indicators such as available boron (B), organic carbon (OC), available manganese (Mn), available sulfur (S), and the sulfur availability index (SAI). A total of 1,133 soil samples from diverse agro-climatic zones in Eastern India were analyzed. The research integrated color and texture features from microscopic soil images, PXRF data, and auxiliary soil variables (AVs) using a Random Forest model. Results showed that combining image features (IFs) with AVs significantly improved prediction accuracy for available B (R2 = 0.80) and OC (R2 = 0.88). A data fusion approach, incorporating IFs, AVs, and PXRF data, further enhanced predictions for available Mn and SAI, with R2 values of 0.72 and 0.70, respectively. The study highlights the potential of integrating these technologies to offer rapid, cost-effective soil testing methods, paving the way for more advanced predictive models and a deeper understanding of soil fertility. Future work should explore the application of deep learning models on a larger dataset, incorporating soils from a wider range of agro-climatic zones under field conditions.
△ Less
Submitted 5 September, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Zero-shot Logical Query Reasoning on any Knowledge Graph
Authors:
Mikhail Galkin,
Jincheng Zhou,
Bruno Ribeiro,
Jian Tang,
Zhaocheng Zhu
Abstract:
Complex logical query answering (CLQA) in knowledge graphs (KGs) goes beyond simple KG completion and aims at answering compositional queries comprised of multiple projections and logical operations. Existing CLQA methods that learn parameters bound to certain entity or relation vocabularies can only be applied to the graph they are trained on which requires substantial training time before being…
▽ More
Complex logical query answering (CLQA) in knowledge graphs (KGs) goes beyond simple KG completion and aims at answering compositional queries comprised of multiple projections and logical operations. Existing CLQA methods that learn parameters bound to certain entity or relation vocabularies can only be applied to the graph they are trained on which requires substantial training time before being deployed on a new graph. Here we present UltraQuery, an inductive reasoning model that can zero-shot answer logical queries on any KG. The core idea of UltraQuery is to derive both projections and logical operations as vocabulary-independent functions which generalize to new entities and relations in any KG. With the projection operation initialized from a pre-trained inductive KG reasoning model, UltraQuery can solve CLQA on any KG even if it is only finetuned on a single dataset. Experimenting on 23 datasets, UltraQuery in the zero-shot inference mode shows competitive or better query answering performance than best available baselines and sets a new state of the art on 14 of them.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Hybridization induced triplet superconductivity with $S^z=0$
Authors:
Edine Silva,
R. C. Bento Ribeiro,
Heron Caldas,
Mucio A. Continentino
Abstract:
The Kitaev superconducting chain is a model of spinless fermions with triplet-like superconductivity. It has raised interest since for some values of its parameters it presents a non-trivial topological phase that host Majorana fermions. The physical realization of a Kitaev chain is complicated by the scarcity of triplet superconductivity in real physical systems. Many proposals have been put forw…
▽ More
The Kitaev superconducting chain is a model of spinless fermions with triplet-like superconductivity. It has raised interest since for some values of its parameters it presents a non-trivial topological phase that host Majorana fermions. The physical realization of a Kitaev chain is complicated by the scarcity of triplet superconductivity in real physical systems. Many proposals have been put forward to overcome this difficulty and fabricate artificial triplet superconducting chains. In this work we study a superconducting chain of spinful fermions forming Cooper pairs, in a triplet $S=1$ state, but with $S^z=0$. The motivation is that such pairing can be induced in chains that couple through an antisymmetric hybridization to an s-wave superconducting substrate. We study the nature of edge states and the topological properties of these chains. In the presence of a magnetic field the chain can sustain gapless superconductivity with pairs of Fermi points. The momentum space topology of these Fermi points is non-trivial, in the sense that they can only disappear by annihilating each other. For small magnetic fields, we find well defined degenerate edge modes with finite Zeemann energy. These modes are not symmetry protected and decay abruptly in the bulk as their energy merges with the continuum of excitations.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
The Kormendy relation of cluster galaxies in PPS regions
Authors:
André L. B. Ribeiro,
Paulo A. A. Lopes,
Dailer F. Morell,
Christine C. Dantas,
Monyke H. S. Fonseca,
Beatriz G. Amarante,
Flávio R. Morais-Neto
Abstract:
We study a sample of 936 early-type galaxies located in 48 low-z regular galaxy clusters with $M_{200}\geq 10^{14}~ M_\odot$ at $z< 0.1$. We examine variations in the Kormendy relation (KR) according to their location in the projected phase space (PPS) of the clusters. We have used a combination of Bayesian statistical methods to identify possible differences between the fitted relations. Our resu…
▽ More
We study a sample of 936 early-type galaxies located in 48 low-z regular galaxy clusters with $M_{200}\geq 10^{14}~ M_\odot$ at $z< 0.1$. We examine variations in the Kormendy relation (KR) according to their location in the projected phase space (PPS) of the clusters. We have used a combination of Bayesian statistical methods to identify possible differences between the fitted relations. Our results indicate that the overall KR is better fitted when we take into account the information about PPS regions. We also find that objects with time since infall $\geq 6.5$ Gyr have a significant statistical difference of the KR coefficients relative to objects that are more recent in the cluster environment. We show that giant central ellipticals are responsible for tilting the KR relation towards smaller slopes. These galaxies present a late growth probably due to cumulative preprocessing during infall, plus cannibalism and accretion of smaller stripped objects near the center of the clusters.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts
Authors:
Shirley Wu,
Kaidi Cao,
Bruno Ribeiro,
James Zou,
Jure Leskovec
Abstract:
Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts. However, it remains unclear how to build machine learning architectures that generalize to complex non-synthetic distributional shifts naturally occurring in the real world. Here we develop GraphMETRO, a Graph Neural Network architecture, that reliably models natural diversity and cap…
▽ More
Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts. However, it remains unclear how to build machine learning architectures that generalize to complex non-synthetic distributional shifts naturally occurring in the real world. Here we develop GraphMETRO, a Graph Neural Network architecture, that reliably models natural diversity and captures complex distributional shifts. GraphMETRO employs a Mixture-of-Experts (MoE) architecture with a gating model and multiple expert models, where each expert model targets a specific distributional shift to produce a shift-invariant representation, and the gating model identifies shift components. Additionally, we design a novel objective that aligns the representations from different expert models to ensure smooth optimization. GraphMETRO achieves state-of-the-art results on four datasets from GOOD benchmark comprised of complex and natural real-world distribution shifts, improving by 67% and 4.2% on WebKB and Twitch datasets.
△ Less
Submitted 5 February, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
MIST: Defending Against Membership Inference Attacks Through Membership-Invariant Subspace Training
Authors:
Jiacheng Li,
Ninghui Li,
Bruno Ribeiro
Abstract:
In Member Inference (MI) attacks, the adversary try to determine whether an instance is used to train a machine learning (ML) model. MI attacks are a major privacy concern when using private data to train ML models. Most MI attacks in the literature take advantage of the fact that ML models are trained to fit the training data well, and thus have very low loss on training instances. Most defenses…
▽ More
In Member Inference (MI) attacks, the adversary try to determine whether an instance is used to train a machine learning (ML) model. MI attacks are a major privacy concern when using private data to train ML models. Most MI attacks in the literature take advantage of the fact that ML models are trained to fit the training data well, and thus have very low loss on training instances. Most defenses against MI attacks therefore try to make the model fit the training data less well. Doing so, however, generally results in lower accuracy. We observe that training instances have different degrees of vulnerability to MI attacks. Most instances will have low loss even when not included in training. For these instances, the model can fit them well without concerns of MI attacks. An effective defense only needs to (possibly implicitly) identify instances that are vulnerable to MI attacks and avoids overfitting them. A major challenge is how to achieve such an effect in an efficient training process. Leveraging two distinct recent advancements in representation learning: counterfactually-invariant representations and subspace learning methods, we introduce a novel Membership-Invariant Subspace Training (MIST) method to defend against MI attacks. MIST avoids overfitting the vulnerable instances without significant impact on other instances. We have conducted extensive experimental studies, comparing MIST with various other state-of-the-art (SOTA) MI defenses against several SOTA MI attacks. We find that MIST outperforms other defenses while resulting in minimal reduction in testing accuracy.
△ Less
Submitted 29 May, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Efficient Subgraph GNNs by Learning Effective Selection Policies
Authors:
Beatrice Bevilacqua,
Moshe Eliasof,
Eli Meirom,
Bruno Ribeiro,
Haggai Maron
Abstract:
Subgraph GNNs are provably expressive neural architectures that learn graph representations from sets of subgraphs. Unfortunately, their applicability is hampered by the computational complexity associated with performing message passing on many subgraphs. In this paper, we consider the problem of learning to select a small subset of the large set of possible subgraphs in a data-driven fashion. We…
▽ More
Subgraph GNNs are provably expressive neural architectures that learn graph representations from sets of subgraphs. Unfortunately, their applicability is hampered by the computational complexity associated with performing message passing on many subgraphs. In this paper, we consider the problem of learning to select a small subset of the large set of possible subgraphs in a data-driven fashion. We first motivate the problem by proving that there are families of WL-indistinguishable graphs for which there exist efficient subgraph selection policies: small subsets of subgraphs that can already identify all the graphs within the family. We then propose a new approach, called Policy-Learn, that learns how to select subgraphs in an iterative manner. We prove that, unlike popular random policies and prior work addressing the same problem, our architecture is able to learn the efficient policies mentioned above. Our experimental results demonstrate that Policy-Learn outperforms existing baselines across a wide range of datasets.
△ Less
Submitted 20 March, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
The Role of Groups in Galaxy Evolution: compelling evidence of pre-processing out to the turnaround radius of clusters
Authors:
P. A. A. Lopes,
A. L. B. Ribeiro,
D. Brambila
Abstract:
We present clear and direct evidence of the pre-processing effect of group galaxies falling into clusters in the local Universe ($z \lesssim 0.1$). We start with a sample of 238 clusters, from which we select 153 with N$_{200} \ge$ 20. We considered 1641 groups within the turnaround radius ($\sim$ 5$\times$R$_{200}$) of these 153 clusters. There are 6654 {\it individual cluster galaxies} and 4133…
▽ More
We present clear and direct evidence of the pre-processing effect of group galaxies falling into clusters in the local Universe ($z \lesssim 0.1$). We start with a sample of 238 clusters, from which we select 153 with N$_{200} \ge$ 20. We considered 1641 groups within the turnaround radius ($\sim$ 5$\times$R$_{200}$) of these 153 clusters. There are 6654 {\it individual cluster galaxies} and 4133 {\it group galaxies} within this radius. We considered two control samples of galaxies, in isolated groups and in the field. The first comprises 2601 galaxies within 1606 {\it isolated groups}, and the latter has 4273 field objects. The fraction of star forming galaxies in infalling groups has a distinct clustercentric behavior in comparison to the remaining cluster galaxies. Even at $5 \times $R$_{200}$ the {\it group galaxies} already show a reduced fraction of star forming objects. At this radius, the results for the {\it individual cluster galaxies} is actually compatible to the field. That is strong evidence that the group environment is effective to quench the star formation prior to the cluster arrival. The group star forming fraction remains roughly constant inwards, decreasing significantly only within the cluster R$_{200}$ radius. We have also found that the pre-processing effect depends on the group mass (indicated by the number of members). The effect is larger for more massive groups. However, it is significant even for pairs an triplets. Finally, we find evidence that the time scale required for morphological transformation is larger than the one for quenching.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
A Multi-Task Perspective for Link Prediction with New Relation Types and Nodes
Authors:
Jincheng Zhou,
Beatrice Bevilacqua,
Bruno Ribeiro
Abstract:
The task of inductive link prediction in (discrete) attributed multigraphs infers missing attributed links (relations) between nodes in new test multigraphs. Traditional relational learning methods face the challenge of limited generalization to test multigraphs containing both novel nodes and novel relation types not seen in training. Recently, under the only assumption that all relation types sh…
▽ More
The task of inductive link prediction in (discrete) attributed multigraphs infers missing attributed links (relations) between nodes in new test multigraphs. Traditional relational learning methods face the challenge of limited generalization to test multigraphs containing both novel nodes and novel relation types not seen in training. Recently, under the only assumption that all relation types share the same structural predictive patterns (single task), Gao et al. (2023) proposed a link prediction method using the theoretical concept of double equivariance (equivariance for nodes & relation types), in contrast to the (single) equivariance (only for nodes) used to design Graph Neural Networks (GNNs). In this work we further extend the double equivariance concept to multi-task double equivariance, where we define link prediction in attributed multigraphs that can have distinct and potentially conflicting predictive patterns for different sets of relation types (multiple tasks). Our empirical results on real-world datasets demonstrate that our approach can effectively generalize to test graphs with multi-task structures without access to additional information.
△ Less
Submitted 4 December, 2023; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Spin-Polarized Majorana Zero Modes in Proximitized Superconducting Penta-Silicene Nanoribbons
Authors:
R. C. Bento Ribeiro,
J. H. Correa,
L. S. Ricco,
I. A. Shelykh,
M. A. Continentino,
A. C. Seridonio,
M. Minissale,
G. L. Lay,
M. S. Figueira
Abstract:
We theoretically investigate the possibility of obtaining Majorana zero modes (MZMs) in penta-silicene nanoribbons (p-SiNRs) with induced \textit{p}-wave superconductivity. The model explicitly considers an external magnetic field perpendicularly applied to the nanoribbon plane, as well as an extrinsic Rashba spin-orbit coupling (RSOC), in addition to the first nearest neighbor hopping term and \t…
▽ More
We theoretically investigate the possibility of obtaining Majorana zero modes (MZMs) in penta-silicene nanoribbons (p-SiNRs) with induced \textit{p}-wave superconductivity. The model explicitly considers an external magnetic field perpendicularly applied to the nanoribbon plane, as well as an extrinsic Rashba spin-orbit coupling (RSOC), in addition to the first nearest neighbor hopping term and \textit{p}-wave superconducting pairing. By analyzing the dispersion relation profiles, we observe the successive closing and reopening of the induced superconducting gap with a single spin component, indicating a spin-polarized topological phase transition (TPT). Correspondingly, the plots of the energy spectrum versus the chemical potential reveal the existence of zero-energy states with a preferential spin orientation characterized by nonoverlapping wave functions localized at opposite ends of the superconducting p-SiNRs. These findings strongly suggest the emergence of topologically protected, spin-polarized MZMs at the ends of the p-SiNRs with induced \textit{p}-wave superconducting pairing, which can be realized by proximitizing the nanoribbon with an \textit{s}-wave superconductor, such as lead. The proposal paves the way for silicene-based Majorana devices hosting multiple MZMs with a well-defined spin orientation, with possible applications in fault-tolerant quantum computing platforms and Majorana spintronics.
△ Less
Submitted 6 July, 2023;
originally announced July 2023.
-
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents
Authors:
Sergio Pelaez,
Gaurav Verma,
Barbara Ribeiro,
Philip Shapira
Abstract:
Labeling data is essential for training text classifiers but is often difficult to accomplish accurately, especially for complex and abstract concepts. Seeking an improved method, this paper employs a novel approach using a generative language model (GPT-4) to produce labels and rationales for large-scale text analysis. We apply this approach to the task of discovering public value expressions in…
▽ More
Labeling data is essential for training text classifiers but is often difficult to accomplish accurately, especially for complex and abstract concepts. Seeking an improved method, this paper employs a novel approach using a generative language model (GPT-4) to produce labels and rationales for large-scale text analysis. We apply this approach to the task of discovering public value expressions in US AI patents. We collect a database comprising 154,934 patent documents using an advanced Boolean query submitted to InnovationQ+. The results are merged with full patent text from the USPTO, resulting in 5.4 million sentences. We design a framework for identifying and labeling public value expressions in these AI patent sentences. A prompt for GPT-4 is developed which includes definitions, guidelines, examples, and rationales for text classification. We evaluate the quality of the labels and rationales produced by GPT-4 using BLEU scores and topic modeling and find that they are accurate, diverse, and faithful. These rationales also serve as a chain-of-thought for the model, a transparent mechanism for human verification, and support for human annotators to overcome cognitive limitations. We conclude that GPT-4 achieved a high-level of recognition of public value theory from our framework, which it also uses to discover unseen public value expressions. We use the labels produced by GPT-4 to train BERT-based classifiers and predict sentences on the entire database, achieving high F1 scores for the 3-class (0.85) and 2-class classification (0.91) tasks. We discuss the implications of our approach for conducting large-scale text analyses with complex and abstract concepts and suggest that, with careful framework design and interactive human oversight, generative language models can offer significant advantages in quality and in reduced time and costs for producing labels and rationales.
△ Less
Submitted 18 May, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Examining transitional galaxies to understand the role of clusters and their dynamical status in galaxy quenching
Authors:
Douglas Brambila,
Paulo A. A. Lopes,
André L. B. Ribeiro,
Arianna Cortesi
Abstract:
In this work, we consider four different galaxy populations and two distinct global environments in the local Universe (z $\leq 0.11$) to investigate the evolution of transitional galaxies (such as star-forming spheroids and passive discs) across different environments. Our sample is composed of 3,899 galaxies within the R$_{200}$ radius of 231 clusters and 11,460 field galaxies. We also investiga…
▽ More
In this work, we consider four different galaxy populations and two distinct global environments in the local Universe (z $\leq 0.11$) to investigate the evolution of transitional galaxies (such as star-forming spheroids and passive discs) across different environments. Our sample is composed of 3,899 galaxies within the R$_{200}$ radius of 231 clusters and 11,460 field galaxies. We also investigate the impact of the cluster's dynamic state, as well as the galaxy's location in the projected phase space diagram (PPS). We found that although the cluster environment as a whole influences galaxy evolution, the cluster dynamical state does not. Furthermore, star-forming galaxies represent recent cluster arrivals in comparison to passive galaxies (especially in the case of early-types). Among the ETGs, we find that the D$_n(4000)$ and H$_δ$ parameters indicate a smooth transition between the subpopulations. In particular, for the SF-ETGs, we detect a significant difference between field and cluster galaxies, as a function of stellar mass, for objects with Log $M_*$/M$_{\odot} > 10.5$. Analyzing the color gradient, the results point toward a picture where field galaxies are more likely to follow the monolithic scenario, while the cluster galaxies the hierarchical scenario. In particular, if we split the ETGs into lenticulars and ellipticals, we find that the steeper color gradients are more common for the lenticulars. Finally, our results indicate the need for galaxy pre-processing in smaller groups, before entering clusters.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Cosmological constraints on $R^2$-corrected Appleby-Battye model
Authors:
Bruno Ribeiro,
Armando Bernui,
Marcela Campista
Abstract:
Nowadays, efforts are being devoted to the study of alternative cosmological scenarios in which modifications of General Relativity have been proposed to explain the late cosmic acceleration without assuming the existence of dark energy. In this scenario, we investigate the $R^2$-AB model, which consists of an $f(R)$ model with only one extra free parameter, $b$, in addition to the 6 of the flat-…
▽ More
Nowadays, efforts are being devoted to the study of alternative cosmological scenarios in which modifications of General Relativity have been proposed to explain the late cosmic acceleration without assuming the existence of dark energy. In this scenario, we investigate the $R^2$-AB model, which consists of an $f(R)$ model with only one extra free parameter, $b$, in addition to the 6 of the flat-$Λ$CDM. Regarding this model, it was already shown that a positive value for $b$ is required for the model to be consistent with Solar System tests, moreover, the condition for the existence of a de~Sitter state requires $b \ge 1.6$. To impose observational constraints on the $R^2$-AB model we consider three datasets: 31 $H(z)$ measurements from Cosmic Chronometers (CC), 20 $[{fσ}_{8}](z)$ measurements from Redshift-Space Distortion (RSD), and the most recent type Ia Supernovae (SNe Ia) sample from Pantheon+. Next, we perform two different analyses: we have considered only SNe Ia data and the combined likelihood SNe+CC+RSD. The first one has provided $b=2.28^{+6.52}_{-0.55}$, while the second one $b=2.18^{+5.41}_{-0.55}$. In the first case it was necessary to set the absolute magnitude $M_B = -19.253$ from SH0ES collaboration, while in the second we did a marginalization over the Hubble constant $H_0$ in the normalized growth function. We have also observed that the $H_0-M_B$ degeneracy was broken by adding CC data to the SNe data. Additionally, we perform illustrative analyses that compare this $f(R)$ model with the flat-$Λ$CDM model, considering several values of the parameter $b$, for diverse cosmological functions like the Hubble function $H(z)$, the equation of state $w_{\rm eff}(z)$, the parametrized growth rate of cosmic structures $[fσ_8](z)$, and $σ_8(z)$. We conclude that the model fits well the data, but the parameter $b$ was not unambiguously constrained.
△ Less
Submitted 12 September, 2024; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Proximal Curriculum for Reinforcement Learning Agents
Authors:
Georgios Tzannetos,
Bárbara Gomes Ribeiro,
Parameswaran Kamalaruban,
Adish Singla
Abstract:
We consider the problem of curriculum design for reinforcement learning (RL) agents in contextual multi-task settings. Existing techniques on automatic curriculum design typically require domain-specific hyperparameter tuning or have limited theoretical underpinnings. To tackle these limitations, we design our curriculum strategy, ProCuRL, inspired by the pedagogical concept of Zone of Proximal De…
▽ More
We consider the problem of curriculum design for reinforcement learning (RL) agents in contextual multi-task settings. Existing techniques on automatic curriculum design typically require domain-specific hyperparameter tuning or have limited theoretical underpinnings. To tackle these limitations, we design our curriculum strategy, ProCuRL, inspired by the pedagogical concept of Zone of Proximal Development (ZPD). ProCuRL captures the intuition that learning progress is maximized when picking tasks that are neither too hard nor too easy for the learner. We mathematically derive ProCuRL by analyzing two simple learning settings. We also present a practical variant of ProCuRL that can be directly integrated with deep RL frameworks with minimal hyperparameter tuning. Experimental results on a variety of domains demonstrate the effectiveness of our curriculum strategy over state-of-the-art baselines in accelerating the training process of deep RL agents.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
MetaPhysiCa: OOD Robustness in Physics-informed Machine Learning
Authors:
S Chandra Mouli,
Muhammad Ashraful Alam,
Bruno Ribeiro
Abstract:
A fundamental challenge in physics-informed machine learning (PIML) is the design of robust PIML methods for out-of-distribution (OOD) forecasting tasks. These OOD tasks require learning-to-learn from observations of the same (ODE) dynamical system with different unknown ODE parameters, and demand accurate forecasts even under out-of-support initial conditions and out-of-support ODE parameters. In…
▽ More
A fundamental challenge in physics-informed machine learning (PIML) is the design of robust PIML methods for out-of-distribution (OOD) forecasting tasks. These OOD tasks require learning-to-learn from observations of the same (ODE) dynamical system with different unknown ODE parameters, and demand accurate forecasts even under out-of-support initial conditions and out-of-support ODE parameters. In this work we propose a solution for such tasks, which we define as a meta-learning procedure for causal structure discovery (including invariant risk minimization). Using three different OOD tasks, we empirically observe that the proposed approach significantly outperforms existing state-of-the-art PIML and deep learning methods.
△ Less
Submitted 6 March, 2023;
originally announced March 2023.
-
Video Action Recognition Collaborative Learning with Dynamics via PSO-ConvNet Transformer
Authors:
Nguyen Huu Phong,
Bernardete Ribeiro
Abstract:
Recognizing human actions in video sequences, known as Human Action Recognition (HAR), is a challenging task in pattern recognition. While Convolutional Neural Networks (ConvNets) have shown remarkable success in image recognition, they are not always directly applicable to HAR, as temporal features are critical for accurate classification. In this paper, we propose a novel dynamic PSO-ConvNet mod…
▽ More
Recognizing human actions in video sequences, known as Human Action Recognition (HAR), is a challenging task in pattern recognition. While Convolutional Neural Networks (ConvNets) have shown remarkable success in image recognition, they are not always directly applicable to HAR, as temporal features are critical for accurate classification. In this paper, we propose a novel dynamic PSO-ConvNet model for learning actions in videos, building on our recent work in image recognition. Our approach leverages a framework where the weight vector of each neural network represents the position of a particle in phase space, and particles share their current weight vectors and gradient estimates of the Loss function. To extend our approach to video, we integrate ConvNets with state-of-the-art temporal methods such as Transformer and Recurrent Neural Networks. Our experimental results on the UCF-101 dataset demonstrate substantial improvements of up to 9% in accuracy, which confirms the effectiveness of our proposed method. In addition, we conducted experiments on larger and more variety of datasets including Kinetics-400 and HMDB-51 and obtained preference for Collaborative Learning in comparison with Non-Collaborative Learning (Individual Learning). Overall, our dynamic PSO-ConvNet model provides a promising direction for improving HAR by better capturing the spatio-temporal dynamics of human actions in videos. The code is available at https://github.com/leonlha/Video-Action-Recognition-Collaborative-Learning-with-Dynamics-via-PSO-ConvNet-Transformer.
△ Less
Submitted 21 September, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Late growth of early-type galaxies in low-z massive clusters
Authors:
A. L. B. Ribeiro,
R. S. Nascimento,
D. F. Morell,
P. A. A. Lopes,
C. C. Dantas,
M. H. S. Fonseca
Abstract:
We study a sample of 936 early-type galaxies (ETGs) located in 48 low-z regular galaxy clusters with $M_{200}\geq 10^{14}~ M_\odot$ at $z< 0.1$. We examine variations in the concentration index, radius, and color gradient of ETGs as a function of their stellar mass and loci in the projected phase space (PPS) of the clusters. We aim to understand the environmental influence on the growth of ETGs ac…
▽ More
We study a sample of 936 early-type galaxies (ETGs) located in 48 low-z regular galaxy clusters with $M_{200}\geq 10^{14}~ M_\odot$ at $z< 0.1$. We examine variations in the concentration index, radius, and color gradient of ETGs as a function of their stellar mass and loci in the projected phase space (PPS) of the clusters. We aim to understand the environmental influence on the growth of ETGs according to the time since infall into their host clusters. Our analysis indicates a significant change in the behavior of the concentration index $C$ and color gradient around $M_{\ast} \approx 2\times 10^{11} ~M_\odot \equiv \tilde{M}_{\ast}$. Objects less massive than $ \tilde{M}_{\ast}$ present a slight growth of $C$ with $M_{\ast}$ with negative and approximately constant color gradients in all regions of the PPS. Objects more massive than $ \tilde{M}_{\ast}$ present a slight decrease of $C$ with $M_{\ast}$ with color gradients becoming less negative and approaching zero. We also find that objects more massive than $ \tilde{M}_{\ast}$, in all PPS regions, have smaller $R_{90}$ for a given $R_{50}$, suggesting a smaller external growth in these objects or even a shrinkage possibly due to tidal stripping. Finally, we estimate different dark matter fractions for galaxies in different regions of the PPS, with the ancient satellites having the largest fractions, $f_{DM}\approx$ 65%. These results favor a scenario where cluster ETGs experience environmental influence the longer they remain and the deeper into the gravitational potential they lie, indicating a combination of tidal stripping + harassment, which predominate during infall, followed by mergers + feedback effects affecting the late growth of ancient satellites and BCGs.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Double Equivariance for Inductive Link Prediction for Both New Nodes and New Relation Types
Authors:
Jianfei Gao,
Yangze Zhou,
Jincheng Zhou,
Bruno Ribeiro
Abstract:
The task of inductive link prediction in knowledge graphs (KGs) generally focuses on test predictions with solely new nodes but not both new nodes and new relation types. In this work, we formally define the concept of double permutation-equivariant representations that are equivariant to permutations of both node identities and edge relation types. We then show how double-equivariant architecture…
▽ More
The task of inductive link prediction in knowledge graphs (KGs) generally focuses on test predictions with solely new nodes but not both new nodes and new relation types. In this work, we formally define the concept of double permutation-equivariant representations that are equivariant to permutations of both node identities and edge relation types. We then show how double-equivariant architectures are able to self-supervise pre-train on distinct KG domains and zero-shot predict links on a new KG domain (with completely new entities and new relation types). We also introduce the concept of distributionally double equivariant positional embeddings designed to perform the same task. Finally, we empirically demonstrate the capability of the proposed models against baselines on a set of novel real-world benchmarks. More interestingly, we show that self-supervised pre-training on more KG domains increases the zero-shot ability of our model to predict on new relation types over new entities on unseen KG domains.
△ Less
Submitted 14 December, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Causal Lifting and Link Prediction
Authors:
Leonardo Cotta,
Beatrice Bevilacqua,
Nesreen Ahmed,
Bruno Ribeiro
Abstract:
Existing causal models for link prediction assume an underlying set of inherent node factors -- an innate characteristic defined at the node's birth -- that governs the causal evolution of links in the graph. In some causal tasks, however, link formation is path-dependent: The outcome of link interventions depends on existing links. Unfortunately, these existing causal methods are not designed for…
▽ More
Existing causal models for link prediction assume an underlying set of inherent node factors -- an innate characteristic defined at the node's birth -- that governs the causal evolution of links in the graph. In some causal tasks, however, link formation is path-dependent: The outcome of link interventions depends on existing links. Unfortunately, these existing causal methods are not designed for path-dependent link formation, as the cascading functional dependencies between links (arising from path dependence) are either unidentifiable or require an impractical number of control variables. To overcome this, we develop the first causal model capable of dealing with path dependencies in link prediction. In this work we introduce the concept of causal lifting, an invariance in causal models of independent interest that, on graphs, allows the identification of causal link prediction queries using limited interventional data. Further, we show how structural pairwise embeddings exhibit lower bias and correctly represent the task's causal structure, as opposed to existing node embeddings, e.g., graph neural network node embeddings and matrix factorization. Finally, we validate our theoretical findings on three scenarios for causal link prediction tasks: knowledge base completion, covariance matrix estimation and consumer-product recommendations.
△ Less
Submitted 27 July, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Galaxy Distributions as Fractal Systems
Authors:
Sharon Teles,
Amanda R. Lopes,
Marcelo B. Ribeiro
Abstract:
This paper discusses if large scale galaxy distribution samples containing almost one million objects can be characterized as fractal systems. The analysis performed by Teles et al. (2021; arXiv:2012.07164) on the UltraVISTA DR1 survey is extended here to the SPLASH and COSMOS2015 catalogs, hence adding 750k new galaxies with measured redshifts to the studied samples. The standard $Λ$CDM cosmology…
▽ More
This paper discusses if large scale galaxy distribution samples containing almost one million objects can be characterized as fractal systems. The analysis performed by Teles et al. (2021; arXiv:2012.07164) on the UltraVISTA DR1 survey is extended here to the SPLASH and COSMOS2015 catalogs, hence adding 750k new galaxies with measured redshifts to the studied samples. The standard $Λ$CDM cosmology having $H_0=(70\pm5)$ km/s/Mpc and number density tools required for describing these galaxy distributions as single fractal systems with dimension $D$ are adopted. We use the luminosity distance $d_L$, redshift distance $d_z$ and galaxy area distance (transverse comoving distance) $d_G$ as relativistic distance definitions to derive galaxy number densities in the redshift interval $0.1\le z\le4$ at volume limited subsamples defined by absolute magnitudes in the K-band. Similar to the findings of Teles et al. (2021; arXiv:2012.07164), the results show two consecutive redshift scales where galaxy distribution data behave as single fractal structures. For $z<1$ we found $D=1.00\pm0.12$ for the SPLASH galaxies, and $D=1,39\pm0.19$ for the COSMOS2015. For $1\le z\le4$ we respectively found $D=0.83^{+0.36}_{-0.37}$ and $D=0.54^{+0.27}_{-0.26}$. These results were verified to be robust under the assumed Hubble constant uncertainty. Calculations considering blue and red galaxies subsamples in both surveys showed that the fractal dimensions of blue galaxies as basically unchanged, but the ones for the red galaxies changed mostly to smaller values, meaning that $D$ may be seen as a more intrinsic property of the distribution of objects in the Universe, therefore allowing for the fractal dimension to be used as a tool to study different populations of galaxies. All results confirm the decades old theoretical prediction of a decrease in the fractal dimension for $z>1$.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Bias Challenges in Counterfactual Data Augmentation
Authors:
S Chandra Mouli,
Yangze Zhou,
Bruno Ribeiro
Abstract:
Deep learning models tend not to be out-of-distribution robust primarily due to their reliance on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that are counterfactual-invariant to spurious features, a requirement for out-of-distribution (OOD) robustness. In this work, we show that counterfactual data augme…
▽ More
Deep learning models tend not to be out-of-distribution robust primarily due to their reliance on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that are counterfactual-invariant to spurious features, a requirement for out-of-distribution (OOD) robustness. In this work, we show that counterfactual data augmentations may not achieve the desired counterfactual-invariance if the augmentation is performed by a context-guessing machine, an abstract machine that guesses the most-likely context of a given input. We theoretically analyze the invariance imposed by such counterfactual data augmentations and describe an exemplar NLP task where counterfactual data augmentation by a context-guessing machine does not lead to robust OOD classifiers.
△ Less
Submitted 13 September, 2022; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Veritas: Answering Causal Queries from Video Streaming Traces
Authors:
Chandan Bothra,
Jianfei Gao,
Sanjay Rao,
Bruno Ribeiro
Abstract:
In this paper, we seek to answer what-if questions - i.e., given recorded data of an existing deployed networked system, what would be the performance impact if we changed the design of the system (a task also known as causal inference). We make three contributions. First, we expose the complexity of causal inference in the context of adaptive bit rate video streaming, a challenging domain where t…
▽ More
In this paper, we seek to answer what-if questions - i.e., given recorded data of an existing deployed networked system, what would be the performance impact if we changed the design of the system (a task also known as causal inference). We make three contributions. First, we expose the complexity of causal inference in the context of adaptive bit rate video streaming, a challenging domain where the network conditions during the session act as a sequence of latent and confounding variables, and a change at any point in the session has a cascading impact on the rest of the session. Second, we present Veritas, a novel framework that tackles causal reasoning for video streaming without resorting to randomised trials. Integral to Veritas is an easy to interpret domain-specific ML model (an embedded Hidden Markov Model) that relates the latent stochastic process (intrinsic bandwidth that the video session can achieve) to actual observations (download times) while exploiting control variables such as the TCP state (e.g., congestion window) observed at the start of the download of video chunks. We show through experiments on an emulation testbed that Veritas can answer both counterfactual queries (e.g., the performance of a completed video session had it used a different buffer size) and interventional queries (e.g., estimating the download time for every possible video quality choice for the next chunk in a session in progress). In doing so, Veritas achieves accuracy close to an ideal oracle, while significantly outperforming both a commonly used baseline approach, and Fugu (an off-the-shelf neural network) neither of which account for causal effects.
△ Less
Submitted 26 August, 2022;
originally announced August 2022.
-
Metal content of the circumgalactic medium around star-forming galaxies at z $\sim$ 2.6 as revealed by the VIMOS Ultra-Deep Survey
Authors:
H. Méndez-Hernández,
P. Cassata,
E. Ibar,
R Amorín,
M. Aravena,
S. Bardelli,
O. Cucciati,
B. Garilli,
M. Giavalisco,
L. Guaita,
N. Hathi,
A. Koekemoer,
V. Le Brun,
B. C. Lemaux,
D. Maccagni,
B. Ribeiro,
L. Tasca,
N. Tejos,
R. Thomas,
L. Tresse,
D. Vergani,
G. Zamorani,
E. Zucca
Abstract:
The circumgalactic medium (CGM) is the location where the interplay between large-scale outflows and accretion onto galaxies occurs. Metals in different ionization states flowing between the circumgalactic and intergalactic mediums are affected by large galactic outflows and low-ionization state inflowing gas. Observational studies on their spatial distribution and their relation with galaxy prope…
▽ More
The circumgalactic medium (CGM) is the location where the interplay between large-scale outflows and accretion onto galaxies occurs. Metals in different ionization states flowing between the circumgalactic and intergalactic mediums are affected by large galactic outflows and low-ionization state inflowing gas. Observational studies on their spatial distribution and their relation with galaxy properties may provide important constraints on models of galaxy formation and evolution. To provide new insights into the spatial distribution of the circumgalactic of star-forming galaxies, we select a sample of 238 close pairs at $1.5 < z <4.5$ ($\langle z\rangle\sim$2.6) from the VIMOS Ultra Deep Survey. We then generate composite spectra by co-adding spectra of $background$ galaxies that provide different sight-lines across the CGM to examine the spatial distribution of the gas located around these galaxies and investigate possible correlations between the strength of the low- and high-ionization absorption features with different galaxy properties. We detect C II, Si II, Si IV and C IV) up to separations $\langle b \rangle=$ 172 kpc and 146 kpc. Our $W_{0}$ radial profiles suggest a potential redshift evolution for the CGM gas content producing these absorptions. We find a correlation between C II and C IV with star formation rate, stellar mass and trends with galaxy size estimated by the effective radius and azimuthal angle. Galaxies with high star formation rate show stronger C IV absorptions compared with star-forming galaxies with low SFR and low stellar mass. These results could be explained by stronger outflows, softer radiation fields unable to ionize high-ionization state lines or by the galactic fountain scenario where metal-rich gas ejected from previous star-formation episodes fall back to the galaxy.
△ Less
Submitted 25 July, 2022; v1 submitted 17 June, 2022;
originally announced June 2022.
-
OOD Link Prediction Generalization Capabilities of Message-Passing GNNs in Larger Test Graphs
Authors:
Yangze Zhou,
Gitta Kutyniok,
Bruno Ribeiro
Abstract:
This work provides the first theoretical study on the ability of graph Message Passing Neural Networks (gMPNNs) -- such as Graph Neural Networks (GNNs) -- to perform inductive out-of-distribution (OOD) link prediction tasks, where deployment (test) graph sizes are larger than training graphs. We first prove non-asymptotic bounds showing that link predictors based on permutation-equivariant (struct…
▽ More
This work provides the first theoretical study on the ability of graph Message Passing Neural Networks (gMPNNs) -- such as Graph Neural Networks (GNNs) -- to perform inductive out-of-distribution (OOD) link prediction tasks, where deployment (test) graph sizes are larger than training graphs. We first prove non-asymptotic bounds showing that link predictors based on permutation-equivariant (structural) node embeddings obtained by gMPNNs can converge to a random guess as test graphs get larger. We then propose a theoretically-sound gMPNN that outputs structural pairwise (2-node) embeddings and prove non-asymptotic bounds showing that, as test graphs grow, these embeddings converge to embeddings of a continuous function that retains its ability to predict links OOD. Empirical results on random graphs show agreement with our theoretical results.
△ Less
Submitted 9 October, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
-
Leading edge vortex formation and wake trajectory: Synthesizing measurements, analysis, and machine learning
Authors:
Howon Lee,
Nicholas Simone,
Yunxing Su,
Yuanhang Zhu,
Bernardo Luiz R. Ribeiro,
Jennifer A. Franck,
Kenneth Breuer
Abstract:
The strength and trajectory of a leading edge vortex (LEV) formed by a pitching-heaving hydrofoil (chord $c$) is studied. The LEV is identified using the $Q$-criterion method, which is calculated from the 2D velocity field obtained from PIV measurements. The relative angle of attack at mid-stroke, ${α_{T/4}} $, proves to be an effective method of combining heave amplitude ($h_0/c$), pitch amplitud…
▽ More
The strength and trajectory of a leading edge vortex (LEV) formed by a pitching-heaving hydrofoil (chord $c$) is studied. The LEV is identified using the $Q$-criterion method, which is calculated from the 2D velocity field obtained from PIV measurements. The relative angle of attack at mid-stroke, ${α_{T/4}} $, proves to be an effective method of combining heave amplitude ($h_0/c$), pitch amplitude ($θ_0$), and reduced frequency ($f^*$) into a single variable that predicts the maximum value of $Q$ over a wide range of operating conditions. Once the LEV separates from the foil, it travels downstream and rapidly weakens and diffuses. The downstream trajectory of the LEV has two characteristic shapes. At low values of ${α_{T/4}}$, it travels straight downstream after separating from the foil, while at higher values of ${α_{T/4}} $, an accompanying Trailing Edge Vortex (TEV) forms and the induced velocity generates a cross-stream component to the vortex trajectories. This behavior is accurately predicted using a potential flow model for the LEV and TEV. Supervised machine learning algorithms, namely Support Vector Regression and Gaussian Process Regression, are used to create regression models that predicts the vortex strength, shape and trajectory during growth and after separation. The regression model successfully captures the features of two vortex regimes observed at different values of ${α_{T/4}} $. However, the predicted LEV trajectories are somewhat smoother than observed in the experiments. The strengths of the vortex is often under-predicted. Both of these shortcomings may be attributed to the relatively small size of the training data set.
△ Less
Submitted 25 May, 2022;
originally announced May 2022.
-
Action Recognition for American Sign Language
Authors:
Nguyen Huu Phong,
Bernardete Ribeiro
Abstract:
In this research, we present our findings to recognize American Sign Language from series of hand gestures. While most researches in literature focus only on static handshapes, our work target dynamic hand gestures. Since dynamic signs dataset are very few, we collect an initial dataset of 150 videos for 10 signs and an extension of 225 videos for 15 signs. We apply transfer learning models in com…
▽ More
In this research, we present our findings to recognize American Sign Language from series of hand gestures. While most researches in literature focus only on static handshapes, our work target dynamic hand gestures. Since dynamic signs dataset are very few, we collect an initial dataset of 150 videos for 10 signs and an extension of 225 videos for 15 signs. We apply transfer learning models in combination with deep neural networks and background subtraction for videos in different temporal settings. Our primarily results show that we can get an accuracy of $0.86$ and $0.71$ using DenseNet201, LSTM with video sequence of 12 frames accordingly.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
PSO-Convolutional Neural Networks with Heterogeneous Learning Rate
Authors:
Nguyen Huu Phong,
Augusto Santos,
Bernardete Ribeiro
Abstract:
Convolutional Neural Networks (ConvNets or CNNs) have been candidly deployed in the scope of computer vision and related fields. Nevertheless, the dynamics of training of these neural networks lie still elusive: it is hard and computationally expensive to train them. A myriad of architectures and training strategies have been proposed to overcome this challenge and address several problems in imag…
▽ More
Convolutional Neural Networks (ConvNets or CNNs) have been candidly deployed in the scope of computer vision and related fields. Nevertheless, the dynamics of training of these neural networks lie still elusive: it is hard and computationally expensive to train them. A myriad of architectures and training strategies have been proposed to overcome this challenge and address several problems in image processing such as speech, image and action recognition as well as object detection. In this article, we propose a novel Particle Swarm Optimization (PSO) based training for ConvNets. In such framework, the vector of weights of each ConvNet is typically cast as the position of a particle in phase space whereby PSO collaborative dynamics intertwines with Stochastic Gradient Descent (SGD) in order to boost training performance and generalization. Our approach goes as follows: i) [regular phase] each ConvNet is trained independently via SGD; ii) [collaborative phase] ConvNets share among themselves their current vector of weights (or particle-position) along with their gradient estimates of the Loss function. Distinct step sizes are coined by distinct ConvNets. By properly blending ConvNets with large (possibly random) step-sizes along with more conservative ones, we propose an algorithm with competitive performance with respect to other PSO-based approaches on Cifar-10 and Cifar-100 (accuracy of 98.31% and 87.48%). These accuracy levels are obtained by resorting to only four ConvNets -- such results are expected to scale with the number of collaborative ConvNets accordingly. We make our source codes available for download https://github.com/leonlha/PSO-ConvNet-Dynamics.
△ Less
Submitted 12 September, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
-
A Machine Learning Approach to Classify Vortex Wakes of Energy Harvesting Oscillating Foils
Authors:
Bernardo Luiz R. Ribeiro,
Jennifer A. Franck
Abstract:
A machine learning model is developed to establish wake patterns behind oscillating foils whose kinematics are within the energy harvesting regime. The role of wake structure is particularly important for array deployments of oscillating foils, since the unsteady wake highly influences performance of downstream foils. This work explores 46 oscillating foil kinematics, with the goal of parameterizi…
▽ More
A machine learning model is developed to establish wake patterns behind oscillating foils whose kinematics are within the energy harvesting regime. The role of wake structure is particularly important for array deployments of oscillating foils, since the unsteady wake highly influences performance of downstream foils. This work explores 46 oscillating foil kinematics, with the goal of parameterizing the wake based on the input kinematic variables and grouping vortex wakes through image analysis of vorticity fields. A combination of a convolutional neural network (CNN) with long short-term memory (LSTM) units is developed to classify the wakes into three groups. To fully verify the physical wake differences among foil kinematics, a convolutional autoencoder combined with k-means++ clustering is utilized and four different wake patterns are found. With the classification model, these patterns are associated with a range of foil kinematics. Future work can use these correlations to predict the performance of foils placed in the wake and build optimal foil arrangements for tidal energy harvesting.
△ Less
Submitted 5 November, 2022; v1 submitted 12 May, 2022;
originally announced May 2022.
-
z~2-9 Galaxies magnified by the Hubble Frontier Field Clusters I: Source Selection and Surface Density-Magnification Constraints from >2500 galaxies
Authors:
R. J. Bouwens,
G. Illingworth,
R. S. Ellis,
P. Oesch,
A. Paulino-Afonso,
B. Ribeiro,
M. Stefanon
Abstract:
We assemble a large comprehensive sample of 2534 z~2, 3, 4, 5, 6, 7, 8, and 9 galaxies lensed by the six clusters from the Hubble Frontier Fields (HFF) program. Making use of the availability of multiple independent magnification models for each of the HFF clusters and alternatively treating one of the models as the "truth," we show that the median magnification factors from the v4 parametric mode…
▽ More
We assemble a large comprehensive sample of 2534 z~2, 3, 4, 5, 6, 7, 8, and 9 galaxies lensed by the six clusters from the Hubble Frontier Fields (HFF) program. Making use of the availability of multiple independent magnification models for each of the HFF clusters and alternatively treating one of the models as the "truth," we show that the median magnification factors from the v4 parametric models are typically reliable to values of 30 to 50, and in one case to 100. Using the median magnification factor from the latest v4 models, we estimate the UV luminosities of the 2534 lensed z~2-9 galaxies, finding sources as faint as -12.4 mag at z~3 and -12.9 mag at z~7. We explicitly demonstrate the power of the surface density-magnification relations Sigma(z) vs. mu in the HFF clusters to constrain both distant galaxy properties and cluster lensing properties. Based on the Sigma(z) vs. mu relations, we show that the median magnification estimates from existing public models must be reliable predictors of the true magnification mu to mu<15 (95% confidence). We also use the observed Sigma(z) vs. mu relations to derive constraints on the evolution of the luminosity function faint-end slope from z~7 to z~2, showing that faint-end slope results can be consistent with blank-field studies if, and only if, the selection efficiency shows no strong dependence on the magnification factor mu. This can only be the case if very low luminosity galaxies are very small, being unresolved in deep lensing probes.
△ Less
Submitted 1 May, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Thermoelectric properties of topological chains coupled to a quantum dot
Authors:
A. C. P. Lima,
R. C. Bento Ribeiro,
J. H. Correa,
Fernanda Deus,
M. S. Figueira,
Mucio A. Continentino
Abstract:
Topological one-dimensional superconductors can sustain in their extremities zero energy modes that are protected by different kinds of symmetries. The observation of these excitations in the form of Majorana fermions is one of the most intensive quests in condensed matter physics. Their study is not only interesting in itself, but also because they have promising applications in the area of quant…
▽ More
Topological one-dimensional superconductors can sustain in their extremities zero energy modes that are protected by different kinds of symmetries. The observation of these excitations in the form of Majorana fermions is one of the most intensive quests in condensed matter physics. Their study is not only interesting in itself, but also because they have promising applications in the area of quantum computation. In this work we are interested in another class of one dimensional topological systems, namely topological insulators. These also present symmetry protected end modes with robust properties and do not require the low temperatures necessary for topological superconductivity. We consider the simplest kind of topological insulators, namely chains of atoms with hybridized $sp$ orbitals. We study the transport properties of these chains in the trivial, non-trivial topological phases and at the quantum topological transition. We use a simple device consisting of two semi-infinite hybridized $sp$-chains connected to a quantum dot and obtain the thermoelectric properties of this system as a function of temperature and distance to the topological transition. We show that the electrical conductance and the Wiedemann-Franz ratio of the device at the topological transition have universal values at very low temperatures. The thermopower gives direct evidence of fractional charges in these systems.
△ Less
Submitted 3 September, 2022; v1 submitted 20 December, 2021;
originally announced December 2021.
-
Set Twister for Single-hop Node Classification
Authors:
Yangze Zhou,
Vinayak Rao,
Bruno Ribeiro
Abstract:
Node classification is a central task in relational learning, with the current state-of-the-art hinging on two key principles: (i) predictions are permutation-invariant to the ordering of a node's neighbors, and (ii) predictions are a function of the node's $r$-hop neighborhood topology and attributes, $r \geq 2$. Both graph neural networks and collective inference methods (e.g., belief propagatio…
▽ More
Node classification is a central task in relational learning, with the current state-of-the-art hinging on two key principles: (i) predictions are permutation-invariant to the ordering of a node's neighbors, and (ii) predictions are a function of the node's $r$-hop neighborhood topology and attributes, $r \geq 2$. Both graph neural networks and collective inference methods (e.g., belief propagation) rely on information from up to $r$-hops away. In this work, we study if the use of more powerful permutation-invariant functions can sometimes avoid the need for classifiers to collect information beyond $1$-hop. Towards this, we introduce a new architecture, the Set Twister, which generalizes DeepSets (Zaheer et al., 2017), a simple and widely-used permutation-invariant representation. Set Twister theoretically increases expressiveness of DeepSets, allowing it to capture higher-order dependencies, while keeping its simplicity and low computational cost. Empirically, we see accuracy improvements of Set Twister over DeepSets as well as a variety of graph neural networks and collective inference schemes in several tasks, while showcasing its implementation simplicity and computational efficiency.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Sizes of Lensed Lower-luminosity z=4-8 Galaxies from the Hubble Frontier Field Program
Authors:
R. J. Bouwens,
G. D. Illingworth,
P. G. van Dokkum,
P. A. Oesch,
M. Stefanon,
B. Ribeiro
Abstract:
We constrain the rest-UV size-luminosity relation for star-forming galaxies at z~4 and z~6, 7, and 8 identified behind clusters from the Hubble Frontier Fields (HFF) program. The size-luminosity relation is key to deriving accurate luminosity functions (LF) for faint galaxies. Making use of the latest lensing models and full data set for these clusters, lensing-corrected sizes and luminosities are…
▽ More
We constrain the rest-UV size-luminosity relation for star-forming galaxies at z~4 and z~6, 7, and 8 identified behind clusters from the Hubble Frontier Fields (HFF) program. The size-luminosity relation is key to deriving accurate luminosity functions (LF) for faint galaxies. Making use of the latest lensing models and full data set for these clusters, lensing-corrected sizes and luminosities are derived for 68 z~4, 184 z~6, 93 z~7, and 53 z~8 galaxies. We show that size measurements can be reliably measured up to linear magnifications of 30x, where the lensing models are well calibrated. The sizes we measure span a >1-dex range, from <50 pc to >~500 pc. Uncertainties are based on both the formal fit errors and systematic differences between the public lensing models. These uncertainties range from ~20 pc for the smallest sources to 50 pc for the largest. Using a forward-modeling procedure to model the impact of incompleteness and magnification uncertainties, we characterize the size-luminosity relation at both z~4 and z~6-8. We find that the source sizes of star-forming galaxies at z~4 and z~6-8 scale with luminosity L as L^{0.54\pm0.08} and L^{0.40+/-0.04}, respectively, such that lower luminosity (>~-18 mag) galaxies are smaller than expected from extrapolating the size-luminosity relation at high luminosities (<~-18 mag). The new evidence for a steeper size-luminosity relation (3 sigma) adds to earlier evidence for small sizes based on the prevalence of highly magnified galaxies in high shear regions, theoretical arguments against upturns in the LFs, and other independent determinations of the size-luminosity relation from the HFF clusters.
△ Less
Submitted 6 December, 2021;
originally announced December 2021.
-
Contextual Unsupervised Outlier Detection in Sequences
Authors:
Mohamed A. Zahran,
Leonardo Teixeira,
Vinayak Rao,
Bruno Ribeiro
Abstract:
This work proposes an unsupervised learning framework for trajectory (sequence) outlier detection that combines ranking tests with user sequence models. The overall framework identifies sequence outliers at a desired false positive rate (FPR), in an otherwise parameter-free manner. We evaluate our methodology on a collection of real and simulated datasets based on user actions at the websites last…
▽ More
This work proposes an unsupervised learning framework for trajectory (sequence) outlier detection that combines ranking tests with user sequence models. The overall framework identifies sequence outliers at a desired false positive rate (FPR), in an otherwise parameter-free manner. We evaluate our methodology on a collection of real and simulated datasets based on user actions at the websites last.fm and msnbc.com, where we know ground truth, and demonstrate improved accuracy over existing approaches. We also apply our approach to a large real-world dataset of Pinterest and Facebook users, where we find that users tend to re-share Pinterest posts of Facebook friends significantly more than other types of users, pointing to a potential influence of Facebook friendship on sharing behavior on Pinterest.
△ Less
Submitted 6 November, 2021;
originally announced November 2021.
-
Unveiling the internal structure of Hercules supercluster
Authors:
R. Monteiro-Oliveira,
D. F. Morell,
V. M. Sampaio,
A. L. B. Ribeiro,
R. R. de Carvalho
Abstract:
We have investigated the structure of the Hercules supercluster (SCL160) based on data originally extracted from the Sloan Digital Sky Survey SDSS-DR7. We have traced the mass distribution in the field through the numerical density-weighted by the $r^\prime$-luminosity of the galaxies and classified them based on their spatial position and redshift. This has allowed us not only to address the kine…
▽ More
We have investigated the structure of the Hercules supercluster (SCL160) based on data originally extracted from the Sloan Digital Sky Survey SDSS-DR7. We have traced the mass distribution in the field through the numerical density-weighted by the $r^\prime$-luminosity of the galaxies and classified them based on their spatial position and redshift. This has allowed us not only to address the kinematics of the supercluster as a whole, but also the internal kinematic of each cluster, which was no further explored before. We have confirmed that the Hercules supercluster is composed of the galaxy clusters A2147, A2151, and A2152. A2151 consists of five subclusters, A2147 on two and A2152 on at least two. They form the heart of the Hercules supercluster. We also have found two other gravitationally bond clusters, increasing, therefore, the known members of the supercluster. We have estimated a total mass of $2.1\pm0.2 \times 10^{15}$ M$_\odot$ for the Hercules supercluster. To determine the dynamical masses in this work, we have resorted to the $M_{200}-σ$ scaling relation and the caustic technique. Comparing both methods with simulated data of bimodal merging clusters, we found the caustic, as well as the $σ$-based masses, are biased through the merger age, showing a boost just after the pericentric passage. This is not in line with the principle of the caustic method that affirms it is not depending on the cluster dynamical state.
△ Less
Submitted 4 November, 2021;
originally announced November 2021.
-
Warp drive dynamic solutions considering different fluid sources
Authors:
Osvaldo L. Santos-Pereira,
Everton M. C. Abreu,
Marcelo B. Ribeiro
Abstract:
Alcubierre proposed in 1994 that the well known special relativistic limitation that particles cannot travel with velocities bigger than the light speed can be bypassed when such trips are considered globally within specific general relativistic frameworks. Although initial results indicated this scenario as being unphysical, since it would seem to require negative mass-energy density, recent theo…
▽ More
Alcubierre proposed in 1994 that the well known special relativistic limitation that particles cannot travel with velocities bigger than the light speed can be bypassed when such trips are considered globally within specific general relativistic frameworks. Although initial results indicated this scenario as being unphysical, since it would seem to require negative mass-energy density, recent theoretical analyses suggest that such an unphysical situation may not always be necessarily true. In this paper we review some solutions of the Einstein equations using the original Alcubierre warp drive metric endowed with various matter-energy sources, namely dust, perfect fluid, anisotropic fluid, and perfect fluid with a cosmological constant. A connection of some of these solutions featuring shock waves described by the Burgers equation is also shown.
△ Less
Submitted 1 November, 2021;
originally announced November 2021.
-
Reconstruction for Powerful Graph Representations
Authors:
Leonardo Cotta,
Christopher Morris,
Bruno Ribeiro
Abstract:
Graph neural networks (GNNs) have limited expressive power, failing to represent many graph classes correctly. While more expressive graph representation learning (GRL) alternatives can distinguish some of these classes, they are significantly harder to implement, may not scale well, and have not been shown to outperform well-tuned GNNs in real-world tasks. Thus, devising simple, scalable, and exp…
▽ More
Graph neural networks (GNNs) have limited expressive power, failing to represent many graph classes correctly. While more expressive graph representation learning (GRL) alternatives can distinguish some of these classes, they are significantly harder to implement, may not scale well, and have not been shown to outperform well-tuned GNNs in real-world tasks. Thus, devising simple, scalable, and expressive GRL architectures that also achieve real-world improvements remains an open challenge. In this work, we show the extent to which graph reconstruction -- reconstructing a graph from its subgraphs -- can mitigate the theoretical and practical problems currently faced by GRL architectures. First, we leverage graph reconstruction to build two new classes of expressive graph representations. Secondly, we show how graph reconstruction boosts the expressive power of any GNN architecture while being a (provably) powerful inductive bias for invariances to vertex removals. Empirically, we show how reconstruction can boost GNN's expressive power -- while maintaining its invariance to permutations of the vertices -- by solving seven graph property tasks not solvable by the original GNN. Further, we demonstrate how it boosts state-of-the-art GNN's performance across nine real-world benchmark datasets.
△ Less
Submitted 6 December, 2021; v1 submitted 1 October, 2021;
originally announced October 2021.
-
Perfect fluid warp drive solutions with the cosmological constant
Authors:
Osvaldo L. Santos-Pereira,
Everton M. C. Abreu,
Marcelo B. Ribeiro
Abstract:
The Alcubierre metric describes a spacetime geometry that allows a massive particle inside a spacetime distortion, called warp bubble, to travel with superluminal global velocities. In this work we advance solutions of the Einstein equations with the cosmological constant for the Alcubierre warp drive metric having the perfect fluid as source. We also consider the particular dust case with the cos…
▽ More
The Alcubierre metric describes a spacetime geometry that allows a massive particle inside a spacetime distortion, called warp bubble, to travel with superluminal global velocities. In this work we advance solutions of the Einstein equations with the cosmological constant for the Alcubierre warp drive metric having the perfect fluid as source. We also consider the particular dust case with the cosmological constant, which generalizes our previous dust solution (arXiv:2008.06560) and led to vacuum solutions connecting the warp drive with shock waves via the Burgers equation, as well as our perfect fluid solution without the cosmological constant (arXiv:2101.11467). All energy conditions are also analyzed. The results show that the shift vector in the direction of the warp bubble motion creates a coupling in the Einstein equations that requires off-diagonal terms in the energy-momentum source. Therefore, it seems that to achieve superluminal speeds by means of the Alcubierre warp drive spacetime geometry one may require a complex configuration and distribution of energy, matter and momentum as source in order to produce a warp drive bubble. In addition, warp speeds seem to require more complex forms of matter than dust for stable solutions and that negative matter may not be a strict requirement to achieve global superluminal speeds.
△ Less
Submitted 24 August, 2021;
originally announced August 2021.
-
Spin-polarized Majorana zero-modes in double zigzag honeycomb nanoribbons
Authors:
R. C. Bento Ribeiro,
J. H. Correa,
L. S. Ricco,
A. C. Seridonio,
M. S. Figueira
Abstract:
We study the emergence of Majorana zero modes (MZMs) at the ends of a finite double zigzag honeycomb nanoribbon (zHNR). We show that a double zHNR geometry can host spin-polarized MZMs at its ends. We considered a minimal model composed by first nearest neighbor hopping, Rashba spin-orbit coupling (RSOC), p-wave superconducting pairing, and an applied external magnetic field (EMF). The energy spec…
▽ More
We study the emergence of Majorana zero modes (MZMs) at the ends of a finite double zigzag honeycomb nanoribbon (zHNR). We show that a double zHNR geometry can host spin-polarized MZMs at its ends. We considered a minimal model composed by first nearest neighbor hopping, Rashba spin-orbit coupling (RSOC), p-wave superconducting pairing, and an applied external magnetic field (EMF). The energy spectrum regions with either spin up or down MZMs belong to distinct topological phase transitions characterized by their corresponding winding numbers and can be accessed by tunning the chemical potential of the nanoribbons. Hybrid systems constituted by zHNRs deposited on conventional s-wave superconductors are potential candidates for experimentally realizing the proposal. The spin's discrimination of MZMs suggests a possible route for performing topological-conventional qubit operations using Majorana spintronics.
△ Less
Submitted 17 August, 2021;
originally announced August 2021.
-
A Machine Learning Approach to Classify Kinematics and Vortex Wake Modes of Oscillating Foils
Authors:
Bernardo Luiz R. Ribeiro,
Jennifer A. Franck
Abstract:
Machine learning techniques have received attention in fluid dynamics in terms of predicting, clustering and classifying complex flow physics. One application has been the classification or clustering of various wake structures that emanate from bluff bodies such as cylinders or flapping foils, creating a rich diversity of vortex formations specific to flow conditions, geometry, and/or kinematics…
▽ More
Machine learning techniques have received attention in fluid dynamics in terms of predicting, clustering and classifying complex flow physics. One application has been the classification or clustering of various wake structures that emanate from bluff bodies such as cylinders or flapping foils, creating a rich diversity of vortex formations specific to flow conditions, geometry, and/or kinematics of the body. When utilizing oscillating foils to harvest energy from tidal or river flows, it is critical to understand the intricate and nonlinear relationship between flapping kinematics and the downstream vortex wake structure for optimal siting and operation of arrays. This paper develops a classification model to obtain groups of kinematics that contain similar wake patterns within the energy harvesting regime. Data is obtained through simulations of 27 unique oscillating foil kinematics for a total of 13,650 samples of the wake vorticity field. Within these samples three groups are visually labeled based on the relative angle of attack. A machine learning approach combining a convolutional neural network (CNN) with long short-term memory (LSTM) units is utilized to automatically classify the wakes into the three groups. The average accuracy on five test data subsets is 80% when the three visually labeled groups are used for classification. After analyzing the test subset with lowest accuracy, an update on the group division boundaries is proposed. With this update, the algorithm achieves an average accuracy of 90%, demonstrating that the three groups are able to discern distinct wake structures within a range of energy harvesting kinematics.
△ Less
Submitted 4 August, 2021;
originally announced August 2021.
-
Moiré-localized interlayer exciton wavefunctions captured by imaging its electron and hole constituents
Authors:
Ouri Karni,
Elyse Barré,
Vivek Pareek,
Johnathan D. Georgaras,
Michael K. L. Man,
Chakradhar Sahoo,
David R. Bacon,
Xing Zhu,
Henrique B. Ribeiro,
Aidan L. O'Beirne,
Jenny Hu,
Abdullah Al-Mahboob,
Mohamed M. M. Abdelrasoul,
Nicholas S. Chan,
Arka Karmakar,
Andrew J. Winchester,
Bumho Kim,
Kenji Watanabe,
Takashi Taniguchi,
Katayun Barmak,
Julien Madéo,
Felipe H. da Jornada,
Tony F. Heinz,
Keshav M. Dani
Abstract:
Interlayer excitons (ILXs) - electron-hole pairs bound across two atomically thin layered semiconductors - have emerged as attractive platforms to study exciton condensation, single-photon emission and other quantum-information applications. Yet, despite extensive optical spectroscopic investigations, critical information about their size, valley configuration and the influence of the moiré potent…
▽ More
Interlayer excitons (ILXs) - electron-hole pairs bound across two atomically thin layered semiconductors - have emerged as attractive platforms to study exciton condensation, single-photon emission and other quantum-information applications. Yet, despite extensive optical spectroscopic investigations, critical information about their size, valley configuration and the influence of the moiré potential remains unknown. Here, we captured images of the time- and momentum-resolved distribution of both the electron and the hole that bind to form the ILX in a WSe2/MoS2 heterostructure. We thereby obtain a direct measurement of the interlayer exciton diameter of ~5.4 nm, comparable to the moiré unit-cell length of 6.1 nm. Surprisingly, this large ILX is well localized within the moiré cell to a region of only 1.8 nm - smaller than the size of the exciton itself. This high degree of localization of the interlayer exciton is backed by Bethe-Salpeter equation calculations and demonstrates that the ILX can be localized within small moiré unit cells. Unlike large moiré cells, these are uniform over large regions, thus allowing the formation of extended arrays of localized excitations for quantum technology.
△ Less
Submitted 4 August, 2021;
originally announced August 2021.
-
Long-Term Environmental Stability of Nitrogen-Healed Black Phosphorus
Authors:
Valeria S. Marangoni,
Alisson R. Cadore,
Henrique B. Ribeiro,
Leandro Hostert,
Christiano J. S. de Matos,
Cecilia C. C. Silva,
Leandro Seixas,
Camila M. Maroneze
Abstract:
The unique optoelectronic properties of black phosphorus (BP) have triggered great interest in its applications in areas not fulfilled by other layered materials (LMs). However, its poor stability (fast degradation, i.e. <<1 h for monolayers) under ambient conditions restricts its practical application. We demonstrate here, by an experimental-theoretical approach, that the incorporation of nitroge…
▽ More
The unique optoelectronic properties of black phosphorus (BP) have triggered great interest in its applications in areas not fulfilled by other layered materials (LMs). However, its poor stability (fast degradation, i.e. <<1 h for monolayers) under ambient conditions restricts its practical application. We demonstrate here, by an experimental-theoretical approach, that the incorporation of nitrogen molecules (N2) into the BP structure results in a relevant improvement of its stability in air, up to 8 days without optical degradation signs. Our strategy involves the generation of defects (phosphorus vacancies) by electron-beam irradiation, followed by their healing with N2 molecules. As an additional route, N2 plasma treatment is presented as an alternative for large area application. Our first principles calculations elucidate the mechanisms involved in the nitrogen incorporation as well as on the stabilization of the modified BP, which corroborates with our experimental observations. This stabilization approach can be applied in the processing of BP, allowing for its use in environmentally stable van der Waals heterostructures with other LMs as well as in optoelectronic and wearable devices.
△ Less
Submitted 25 June, 2021;
originally announced June 2021.
-
Low-luminosity galaxies in the early universe have observed sizes similar to star cluster complexes
Authors:
R. J. Bouwens,
G. D. Illingworth,
P. G. van Dokkum,
B. Ribeiro,
P. A. Oesch,
M. Stefanon
Abstract:
We compare the sizes and luminosities of faint $z=6$-8 galaxies magnified by the Hubble Frontier Fields (HFF) clusters with star-forming regions, as well as more evolved objects, in the nearby universe. Our high-redshift comparison sample includes 333 z=6-8 galaxies, for which size measurements were made as part of a companion study where lensing magnifications were estimated from various public m…
▽ More
We compare the sizes and luminosities of faint $z=6$-8 galaxies magnified by the Hubble Frontier Fields (HFF) clusters with star-forming regions, as well as more evolved objects, in the nearby universe. Our high-redshift comparison sample includes 333 z=6-8 galaxies, for which size measurements were made as part of a companion study where lensing magnifications were estimated from various public models. Accurate size measurements for these sources are complicated by the lens model uncertainties, but other results and arguments suggest that faint galaxies are small, as discussed in a companion study. The measured sizes for sources in our comparison sample range from <50 pc to ~500 pc. For many of the lowest luminosity sources, extremely small sizes are inferred, reaching individual sizes as small as 10-30 pc, with several sources in the 10-15 pc range with our conservative magnification limits. The sizes and luminosities are similar to those of single star cluster complexes like 30 Doradus in the lower-redshift universe and -- in a few cases -- super star clusters. The identification of these compact, faint star-forming sources in the z~6-8 universe also allows us to set upper limits on the proto-globular cluster LF at z~6. By comparisons of the counts and sizes with recent models, we rule out (with some caveats) proto-globular cluster formation scenarios favoring substantial (xi=10) post-formation mass loss and set useful upper limits on others. Our size results suggest we may be very close to discovering a bona-fide population of forming globular clusters at high redshift.
△ Less
Submitted 15 June, 2021;
originally announced June 2021.
-
The Power Spectrum of Cosmological Number Densities
Authors:
Amanda R. Lopes,
Marcelo B. Ribeiro,
William R. Stoeger
Abstract:
We study the cosmological power spectra (PS) of the differential and integral galaxy volume number densities $γ_i$ and $γ_i^{*}$, constructed with the cosmological distances $d_i$ $(i=A,G,L,Z)$, where $d_A$ is the angular diameter distance, $d_G$ is the galaxy area distance, $d_L$ is the luminosity distance and $d_z$ is the redshift distance. Theoretical and observational quantities were obtained…
▽ More
We study the cosmological power spectra (PS) of the differential and integral galaxy volume number densities $γ_i$ and $γ_i^{*}$, constructed with the cosmological distances $d_i$ $(i=A,G,L,Z)$, where $d_A$ is the angular diameter distance, $d_G$ is the galaxy area distance, $d_L$ is the luminosity distance and $d_z$ is the redshift distance. Theoretical and observational quantities were obtained in the FLRW spacetime with a non-vanishing $Λ$. The radial correlation $Ξ_i$, as defined in the context of these densities, is discussed in the wave number domain. All observational quantities were computed using luminosity function (LF) data obtained from the FORS Deep Field galaxy survey. The theoretical and observational PS of $γ_i$, $γ_i^{\ast}$, $Ξ_i$ and $γ_i / γ_i^\ast$ were calculated by performing Fourier transforms on these densities previously derived by Iribarrem et al. (2012) from the observed values $γ_{obs}$ and ${γ^\ast}_{obs}$ obtained using the galactic absolute magnitudes and galaxy LF Schechter's parameters presented in Gabasch et al. (2004, 2006) in the range $0.5 \le z \le5.0$. The results show similar behavior of the PS obtained from $γ$ and $γ^{\ast}$ using $d_L$, $d_z$ and $d_G$ as distance measures. The PS of the densities defined with $d_A$ have a different and inconclusive behavior, as this cosmological distance reaches a maximum at $z\approx 1.6$ in the adopted cosmology. For the other distances, our results suggest that the PS of ${γ_i}_{obs}$, ${γ^\ast_i}_{obs}$ and ${γ_i / γ^{\ast}_i}_{obs}$ have a general behavior approximately similar to the PS obtained with the galaxy two-point correlation function and, by being sample size independent, they may be considered as alternative analytical tools to study the galaxy distribution.
△ Less
Submitted 8 August, 2022; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Bianchi VI0 viscous fluid cosmology with magnetic field
Authors:
Marcelo Byrro Ribeiro,
Abhik Kumar Sanyal
Abstract:
A spatially homogeneous Bianchi type VI0 model containing a viscous fluid in the presence of an axial magnetic field has been studied. A barotropic equation of state together with a pair of linear relations among the square root of matter density, shear scalar, and expansion scalar have been assumed. Solutions are obtained in the presence of a magnetic field, only in two special cases, which are c…
▽ More
A spatially homogeneous Bianchi type VI0 model containing a viscous fluid in the presence of an axial magnetic field has been studied. A barotropic equation of state together with a pair of linear relations among the square root of matter density, shear scalar, and expansion scalar have been assumed. Solutions are obtained in the presence of a magnetic field, only in two special cases, which are comparatively simpler. The complete solutions for this model in the absence of a magnetic field are also obtained. The presence of a magnetic field in the former case however, does not in effect cause any major modification in the fundamental nature of the initial singularity of the expanding model.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
Analyzing the "Sleeping Giants" Activism Model in Brazil
Authors:
Bárbara Gomes Ribeiro,
Manoel Horta Ribeiro,
Virgílio Almeida,
Wagner Meira Jr
Abstract:
In 2020, amidst the COVID pandemic and a polarized political climate, the Sleeping Giants online activist movement gained traction in Brazil. Its rationale was simple: to curb the spread of misinformation by harming the advertising revenue of sources that produce this type of content. Like its international counterparts, Sleeping Giants Brasil (SGB) campaigned against media outlets using Twitter t…
▽ More
In 2020, amidst the COVID pandemic and a polarized political climate, the Sleeping Giants online activist movement gained traction in Brazil. Its rationale was simple: to curb the spread of misinformation by harming the advertising revenue of sources that produce this type of content. Like its international counterparts, Sleeping Giants Brasil (SGB) campaigned against media outlets using Twitter to ask companies to remove ads from the targeted outlets. This work presents a thorough quantitative characterization of this activism model, analyzing the three campaigns carried out by SGB between May and September 2020. To do so, we use digital traces from both Twitter and Google Trends, toxicity and sentiment classifiers trained for the Portuguese language, and an annotated corpus of SGB's tweets. Our key findings were threefold. First, we found that SGB's requests to companies were largely successful (with 83.85\% of all 192 targeted companies responding positively) and that user pressure was correlated to the speed of companies' responses. Second, there were no significant changes in the online attention and the user engagement going towards the targeted media outlets in the six months that followed SGB's campaign (as measured by Google Trends and Twitter engagement). Third, we observed that user interactions with companies changed only transiently, even if the companies did not respond to SGB's request. Overall, our results paint a nuanced portrait of internet activism. On the one hand, they suggest that SGB was successful in getting companies to boycott specific media outlets, which may have harmed their advertisement revenue stream. On the other hand, they also suggest that the activist movement did not impact the online attention these media outlets received nor the online image of companies that did not respond positively to their requests.
△ Less
Submitted 25 February, 2022; v1 submitted 16 May, 2021;
originally announced May 2021.