-
Direct laser acceleration in varying plasma density profiles
Authors:
Robert Babjak,
Bertrand Martinez,
Miroslav Krus,
Marija Vranic
Abstract:
Direct laser acceleration has proven to be an efficient source of high-charge electron bunches and high brilliance X-rays. However, an analytical description of the acceleration in the interaction with varying plasma density targets is still missing. Here, we provide an analytical estimate of the maximum energies that electrons can achieve in such a case. We demonstrate that the maximum energy dep…
▽ More
Direct laser acceleration has proven to be an efficient source of high-charge electron bunches and high brilliance X-rays. However, an analytical description of the acceleration in the interaction with varying plasma density targets is still missing. Here, we provide an analytical estimate of the maximum energies that electrons can achieve in such a case. We demonstrate that the maximum energy depends on the local electron properties at the moment when the electron fulfills the resonant condition at the beginning of the acceleration. This knowledge enables density shaping for various purposes. One application is to decrease the required acceleration distance which has important implications for multi-petawatt laser experiments, where strong laser depletion could play a crucial role. Another use for density tailoring is to achieve acceleration beyond the radiation reaction limit. We derive the energy scaling law that is valid for arbitrary density profile that varies slowly compared with the betatron period. Our results can be applied to electron heating in exponential preplasma of thin foils, ablating plasma plumes, or gas jets with long-scale ramp-up.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Neural network sampling of Bethe-Heitler process in particle-in-cell codes
Authors:
Óscar Amaro,
Chiara Badiali,
Bertrand Martinez
Abstract:
This study uses neural networks to improve Monte Carlo (MC) implementations of the Bethe-Heitler process in Particle-In-Cell (PIC) codes. We provide a neural network that is as accurate as pre-calculated tables, and requires a hundred times less memory to store. It is trained to predict Bethe-Heitler pair production cross-sections for atomic numbers 1-50 and photon energies between 1 MeV and 10 Ge…
▽ More
This study uses neural networks to improve Monte Carlo (MC) implementations of the Bethe-Heitler process in Particle-In-Cell (PIC) codes. We provide a neural network that is as accurate as pre-calculated tables, and requires a hundred times less memory to store. It is trained to predict Bethe-Heitler pair production cross-sections for atomic numbers 1-50 and photon energies between 1 MeV and 10 GeV in the PIC code OSIRIS. We first validate our approach against a theoretical estimate in a simplified context. We later prove that both approaches have similar performance in a typical relativistic laser-plasma interaction scenario. The large memory decrease accessible with neural networks will enable introducing more advanced cross-section models for Bethe-Heitler pair production and other QED mechanisms in the MC modules of PIC codes.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Direct Laser Acceleration of Bethe-Heitler positrons in laser-channel interactions
Authors:
Bertrand Martinez,
Robert Babjak,
Marija Vranic
Abstract:
Positron creation and acceleration is one of the major challenges for constructing future lepton colliders. On the one hand, conventional technology can provide a solution, but at a prohibitive cost and scale. On the other hand, alternative, reduced-scale ideas for positron beam generation could bring this dream closer to reality. Here we propose a novel plasma-based positron acceleration method u…
▽ More
Positron creation and acceleration is one of the major challenges for constructing future lepton colliders. On the one hand, conventional technology can provide a solution, but at a prohibitive cost and scale. On the other hand, alternative, reduced-scale ideas for positron beam generation could bring this dream closer to reality. Here we propose a novel plasma-based positron acceleration method using a powerful laser propagating through a dense and narrow plasma channel. A large amount of electrons is injected within the channel during laser propagation. This electron loading creates static fields in the plasma, enabling positrons to be guided transversely while they directly gain energy from the laser field itself. Within this context, we present a theoretical model to describe how the laser injects the electrons and estimate the beam-loaded effective electron density. We validate our theoretical predictions through Quasi-3D PIC simulations and demonstrate the robustness of this guiding and direct laser acceleration process for positrons. Our approach could pave the way for testing this new positron acceleration scheme at ELI-Beamlines, showcasing unprecedentedly high average energy gain rate of a few TeV/m. The fireball jet produced contains GeV-level electrons, positrons, and x-rays, opening the path towards potential laboratory astrophysics experiments using these beams.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
Authors:
Yunhao Ge,
Yihe Tang,
Jiashu Xu,
Cem Gokmen,
Chengshu Li,
Wensi Ai,
Benjamin Jose Martinez,
Arman Aydin,
Mona Anvari,
Ayush K Chakravarthy,
Hong-Xing Yu,
Josiah Wong,
Sanjana Srivastava,
Sharon Lee,
Shengxin Zha,
Laurent Itti,
Yunzhu Li,
Roberto Martín-Martín,
Miao Liu,
Pengchuan Zhang,
Ruohan Zhang,
Li Fei-Fei,
Jiajun Wu
Abstract:
The systematic evaluation and understanding of computer vision models under varying conditions require large amounts of data with comprehensive and customized labels, which real-world vision datasets rarely satisfy. While current synthetic data generators offer a promising alternative, particularly for embodied AI tasks, they often fall short for computer vision tasks due to low asset and renderin…
▽ More
The systematic evaluation and understanding of computer vision models under varying conditions require large amounts of data with comprehensive and customized labels, which real-world vision datasets rarely satisfy. While current synthetic data generators offer a promising alternative, particularly for embodied AI tasks, they often fall short for computer vision tasks due to low asset and rendering quality, limited diversity, and unrealistic physical properties. We introduce the BEHAVIOR Vision Suite (BVS), a set of tools and assets to generate fully customized synthetic data for systematic evaluation of computer vision models, based on the newly developed embodied AI benchmark, BEHAVIOR-1K. BVS supports a large number of adjustable parameters at the scene level (e.g., lighting, object placement), the object level (e.g., joint configuration, attributes such as "filled" and "folded"), and the camera level (e.g., field of view, focal length). Researchers can arbitrarily vary these parameters during data generation to perform controlled experiments. We showcase three example application scenarios: systematically evaluating the robustness of models across different continuous axes of domain shift, evaluating scene understanding models on the same set of images, and training and evaluating simulation-to-real transfer for a novel vision task: unary and binary state prediction. Project website: https://behavior-vision-suite.github.io/
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Dialogue Understandability: Why are we streaming movies with subtitles?
Authors:
Helard Becerra Martinez,
Alessandro Ragano,
Diptasree Debnath,
Asad Ullah,
Crisron Rudolf Lucas,
Martin Walsh,
Andrew Hines
Abstract:
Watching movies and TV shows with subtitles enabled is not simply down to audibility or speech intelligibility. A variety of evolving factors related to technological advances, cinema production and social behaviour challenge our perception and understanding. This study seeks to formalise and give context to these influential factors under a wider and novel term referred to as Dialogue Understanda…
▽ More
Watching movies and TV shows with subtitles enabled is not simply down to audibility or speech intelligibility. A variety of evolving factors related to technological advances, cinema production and social behaviour challenge our perception and understanding. This study seeks to formalise and give context to these influential factors under a wider and novel term referred to as Dialogue Understandability. We propose a working definition for Dialogue Understandability being a listener's capacity to follow the story without undue cognitive effort or concentration being required that impacts their Quality of Experience (QoE). The paper identifies, describes and categorises the factors that influence Dialogue Understandability mapping them over the QoE framework, a media streaming lifecycle, and the stakeholders involved. We then explore available measurement tools in the literature and link them to the factors they could potentially be used for. The maturity and suitability of these tools is evaluated over a set of pilot experiments. Finally, we reflect on the gaps that still need to be filled, what we can measure and what not, future subjective experiments, and new research trends that could help us to fully characterise Dialogue Understandability.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation
Authors:
Chengshu Li,
Ruohan Zhang,
Josiah Wong,
Cem Gokmen,
Sanjana Srivastava,
Roberto Martín-Martín,
Chen Wang,
Gabrael Levine,
Wensi Ai,
Benjamin Martinez,
Hang Yin,
Michael Lingelbach,
Minjune Hwang,
Ayano Hiranaka,
Sujay Garlanka,
Arman Aydin,
Sharon Lee,
Jiankai Sun,
Mona Anvari,
Manasi Sharma,
Dhruva Bansal,
Samuel Hunter,
Kyu-Young Kim,
Alan Lou,
Caleb R Matthews
, et al. (10 additional authors not shown)
Abstract:
We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an extensive survey on "what do you want robots to do for you?". The first is the definition of 1,000 everyday activities, grounded in 50 scenes (houses, gardens, restaurants, offices, etc.) with more than 9,000 objects annotated with…
▽ More
We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an extensive survey on "what do you want robots to do for you?". The first is the definition of 1,000 everyday activities, grounded in 50 scenes (houses, gardens, restaurants, offices, etc.) with more than 9,000 objects annotated with rich physical and semantic properties. The second is OMNIGIBSON, a novel simulation environment that supports these activities via realistic physics simulation and rendering of rigid bodies, deformable bodies, and liquids. Our experiments indicate that the activities in BEHAVIOR-1K are long-horizon and dependent on complex manipulation skills, both of which remain a challenge for even state-of-the-art robot learning solutions. To calibrate the simulation-to-reality gap of BEHAVIOR-1K, we provide an initial study on transferring solutions learned with a mobile manipulator in a simulated apartment to its real-world counterpart. We hope that BEHAVIOR-1K's human-grounded nature, diversity, and realism make it valuable for embodied AI and robot learning research. Project website: https://behavior.stanford.edu.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
Ultralight vector dark matter search using data from the KAGRA O3GK run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi
, et al. (1778 additional authors not shown)
Abstract:
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…
▽ More
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Variability mitigation in epitaxial-heterostructure-based spin qubit devices via gate layout optimization
Authors:
Biel Martinez,
Silvano de Franceschi,
Yann-Michel Niquet
Abstract:
The scalability of spin qubit devices is conditioned by qubit-to-qubit variability. Disorder in the host materials indeed affects the wave functions of the confined carriers, which leads to variations in their charge and spin properties. Charge disorder in the amorphous oxides is particularly detrimental owing to its long-range influence. Here we analyze the effects of charge traps at the semicond…
▽ More
The scalability of spin qubit devices is conditioned by qubit-to-qubit variability. Disorder in the host materials indeed affects the wave functions of the confined carriers, which leads to variations in their charge and spin properties. Charge disorder in the amorphous oxides is particularly detrimental owing to its long-range influence. Here we analyze the effects of charge traps at the semiconductor/oxide interface, which are generally believed to play a dominant role in variability. We consider multiple random distributions of these interface traps and numerically calculate their impact on the chemical potentials, detuning and tunnel coupling of two adjacent quantum dots in SiGe heterostructure. Our results highlight the beneficial screening effect of the metal gates. The surface of the heterostructure shall, therefore, be covered as much as possible by the gates in order to limit variability. We propose an alternative layout with tip-shaped gates that maximizes the coverage of the semiconductor/oxide interface and outperforms the usual planar layout in some regimes. This highlights the importance of design in the management of device-to-device variability.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Direct laser acceleration: A model for the electron injection from the walls of a cylindrical guiding structure
Authors:
P. Valenta,
D. Maslarova,
R. Babjak,
B. Martinez,
S. V. Bulanov,
M. Vranic
Abstract:
We use analytical methods and particle-in-cell simulation to investigate the origin of electrons accelerated by the process of direct laser acceleration driven by high-power laser pulses in preformed narrow cylindrical plasma channels. The simulation shows that the majority of accelerated electrons are originally located along the interface between the channel wall and the channel interior. The an…
▽ More
We use analytical methods and particle-in-cell simulation to investigate the origin of electrons accelerated by the process of direct laser acceleration driven by high-power laser pulses in preformed narrow cylindrical plasma channels. The simulation shows that the majority of accelerated electrons are originally located along the interface between the channel wall and the channel interior. The analytical model based on the electron hydrodynamics illustrates the underlying physical mechanism of the release of electrons from the channel wall when irradiated by an intense laser, the subsequent electron dynamics, and the corresponding evolution of the channel density profile. The quantitative predictions of the total charge of released electrons and the average electron density inside the channel are validated by comparison with the simulation results.
△ Less
Submitted 28 May, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Authors:
Mehdi Noroozi,
Isma Hadji,
Brais Martinez,
Adrian Bulat,
Georgios Tzimiropoulos
Abstract:
In this paper, we introduce YONOS-SR, a novel stable diffusion-based approach for image super-resolution that yields state-of-the-art results using only a single DDIM step. We propose a novel scale distillation approach to train our SR model. Instead of directly training our SR model on the scale factor of interest, we start by training a teacher model on a smaller magnification scale, thereby mak…
▽ More
In this paper, we introduce YONOS-SR, a novel stable diffusion-based approach for image super-resolution that yields state-of-the-art results using only a single DDIM step. We propose a novel scale distillation approach to train our SR model. Instead of directly training our SR model on the scale factor of interest, we start by training a teacher model on a smaller magnification scale, thereby making the SR problem simpler for the teacher. We then train a student model for a higher magnification scale, using the predictions of the teacher as a target during the training. This process is repeated iteratively until we reach the target scale factor of the final model. The rationale behind our scale distillation is that the teacher aids the student diffusion model training by i) providing a target adapted to the current noise level rather than using the same target coming from ground truth data for all noise levels and ii) providing an accurate target as the teacher has a simpler task to solve. We empirically show that the distilled model significantly outperforms the model trained for high scales directly, specifically with few steps during inference. Having a strong diffusion model that requires only one step allows us to freeze the U-Net and fine-tune the decoder on top of it. We show that the combination of spatially distilled U-Net and fine-tuned decoder outperforms state-of-the-art methods requiring 200 steps with only one single step.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Graph Guided Question Answer Generation for Procedural Question-Answering
Authors:
Hai X. Pham,
Isma Hadji,
Xinnuo Xu,
Ziedune Degutyte,
Jay Rainey,
Evangelos Kazakos,
Afsaneh Fazly,
Georgios Tzimiropoulos,
Brais Martinez
Abstract:
In this paper, we focus on task-specific question answering (QA). To this end, we introduce a method for generating exhaustive and high-quality training data, which allows us to train compact (e.g., run on a mobile device), task-specific QA models that are competitive against GPT variants. The key technological enabler is a novel mechanism for automatic question-answer generation from procedural t…
▽ More
In this paper, we focus on task-specific question answering (QA). To this end, we introduce a method for generating exhaustive and high-quality training data, which allows us to train compact (e.g., run on a mobile device), task-specific QA models that are competitive against GPT variants. The key technological enabler is a novel mechanism for automatic question-answer generation from procedural text which can ingest large amounts of textual instructions and produce exhaustive in-domain QA training data. While current QA data generation methods can produce well-formed and varied data, their non-exhaustive nature is sub-optimal for training a QA model. In contrast, we leverage the highly structured aspect of procedural text and represent each step and the overall flow of the procedure as graphs. We then condition on graph nodes to automatically generate QA pairs in an exhaustive and controllable manner. Comprehensive evaluations of our method show that: 1) small models trained with our data achieve excellent performance on the target QA task, even exceeding that of GPT3 and ChatGPT despite being several orders of magnitude smaller. 2) semantic coverage is the key indicator for downstream QA performance. Crucially, while large language models excel at syntactic diversity, this does not necessarily result in improvements on the end QA model. In contrast, the higher semantic coverage provided by our method is critical for QA performance.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
The small kt-region in Drell-Yan production at next-to-leading order with the Parton Branching Method
Authors:
I. Bubanja,
A. Bermudez Martinez,
L. Favart,
F. Guzman,
F. Hautmann,
H. Jung,
A. Lelek,
M. Mendizabal,
K. Moral Figueroa,
L. Moureaux,
N. Raicevic,
M. Seidel,
S. Taheri Monfared
Abstract:
The Parton Branching (PB) method describes the evolution of transverse momentum dependent (TMD) parton distributions, covering all kinematic regions from small to large transverse momenta kT. The small kT-region is very sensitive both to the contribution of the intrinsic motion of partons (intrinsic kT) and to the resummation of soft gluons taken into account by the PB TMD evolution equations. We…
▽ More
The Parton Branching (PB) method describes the evolution of transverse momentum dependent (TMD) parton distributions, covering all kinematic regions from small to large transverse momenta kT. The small kT-region is very sensitive both to the contribution of the intrinsic motion of partons (intrinsic kT) and to the resummation of soft gluons taken into account by the PB TMD evolution equations. We study the role of soft-gluon emissions in TMD as well as integrated parton distributions. We perform a detailed investigation of the PB TMD methodology at next-to-leading order (NLO) in Drell-Yan (DY) production for low transverse momenta. We present the extraction of the nonperturbative "intrinsic-kT" distribution from recent measurements of DY transverse momentum distributions at the LHC across a wide range in DY masses, including a detailed treatment of statistical, correlated and uncorrelated uncertainties. We comment on the (in)dependence of intrinsic transverse momentum on DY mass and center-of-mass energy, and on the comparison with other approaches.
△ Less
Submitted 26 February, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Local-ECM: An empirical cubature hyper-reduction method adapted to local reduced order models
Authors:
Jose Raul Bravo Martinez,
Sebastian Ares de Parga Regalado,
Joaquin Alberto Hernandez Ortega,
Riccardo Rossi Bernecoli
Abstract:
We present the Local Empirical Cubature Method (Local-ECM), a novel algorithm tailored for creating efficient integration rules, particularly addressing clusters of intrinsically distinct functions, as observed in local reduced-order models. Local-ECM seeks to enhance existing empirical cubature methodologies by harnessing the locality of the functions to yield the sparsest outcome, while incurrin…
▽ More
We present the Local Empirical Cubature Method (Local-ECM), a novel algorithm tailored for creating efficient integration rules, particularly addressing clusters of intrinsically distinct functions, as observed in local reduced-order models. Local-ECM seeks to enhance existing empirical cubature methodologies by harnessing the locality of the functions to yield the sparsest outcome, while incurring virtually no implementation overheads. Our approach straightforwardly poses a local cubature optimization problem for the first time, out of which we also propose alternative Linear Programming (LP) strategies for its resolution. Through examination across three academic examples, we demonstrate the capability of our method to identify the sparsest cubature rules for a given tolerance, outperforming alternate methods outlined, including the LP and other global strategies. We have made our code freely available through the GitHub repository at https://github.com/Rbravo555/localECM
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Radiation-dominated injection of positrons generated by the nonlinear Breit-Wheeler process into a plasma channel
Authors:
Dominika Maslarova,
Bertrand Martinez,
Marija Vranic
Abstract:
Plasma acceleration is considered a prospective technology for building a compact multi-TeV electron-positron collider in the future. The challenge of this endeavor is greater for positrons than for the electrons because usually the self-generated fields from laser-plasma interaction are not well-suited for positron focusing and on-axis guiding. In addition, an external positron source is required…
▽ More
Plasma acceleration is considered a prospective technology for building a compact multi-TeV electron-positron collider in the future. The challenge of this endeavor is greater for positrons than for the electrons because usually the self-generated fields from laser-plasma interaction are not well-suited for positron focusing and on-axis guiding. In addition, an external positron source is required, while electrons are naturally available in the plasma. Here, we study electron-positron pair generation by an orthogonal collision of a multi-PW laser pulse and a GeV electron beam by the nonlinear Breit-Wheeler process. We studied conditions favorable for positron deflection in the direction of the laser pulse propagation, which favors injection into the plasma for further acceleration. We demonstrate using the OSIRIS particle-in-cell framework that the radiation reaction triggered by ultra-high laser intensity plays a crucial role in the positron injection. It provides a suppression of the initial transverse momentum gained by the positrons from the Breit-Wheeler process. For the parameters used in this work, the intensity of at least 2.2x1023 W/cm2 is needed in order to inject more than 1% of positrons created. Above this threshold, the percentage of injected positrons rapidly increases with intensity. Moreover, subsequent direct laser acceleration of positrons in a plasma channel, using the same laser pulse that created them, can ensure a boost of the final positron energy by a factor of two. The positron focusing and guiding on the axis is provided by significant electron beam loading that changes the internal structure of the channel fields.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi
, et al. (1750 additional authors not shown)
Abstract:
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect…
▽ More
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
Counting roots of fully triangular polynomials over finite fields
Authors:
José Gustavo Coelho,
Fabio Enrique Brochero Martínez
Abstract:
Let $\mathbb{F}_q$ be a finite field with $q$ elements, $f \in \mathbb{F}_q[x_1, \dots, x_n]$ a polynomial in $n$ variables and let us denote by $N(f)$ the number of roots of $f$ in $\mathbb{F}_q^n$. %Many authors, such as Wei Cao and Kung Jiang have used augmented degree matrices to determine $N(f)$ for different families of polynomials. In this paper we consider the family of fully triangular po…
▽ More
Let $\mathbb{F}_q$ be a finite field with $q$ elements, $f \in \mathbb{F}_q[x_1, \dots, x_n]$ a polynomial in $n$ variables and let us denote by $N(f)$ the number of roots of $f$ in $\mathbb{F}_q^n$. %Many authors, such as Wei Cao and Kung Jiang have used augmented degree matrices to determine $N(f)$ for different families of polynomials. In this paper we consider the family of fully triangular polynomials, i.e., polynomials of the form \begin{equation*}
f(x_1, \dots, x_n) = a_1 x_1^{d_{1,1}} + a_2 x_1^{d_{1,2}} x_2^{d_{2,2}} + \dots + a_n x_1^{d_{1,n}}\cdots x_n^{d_{n,n}} - b, \end{equation*} where $d_{i,j} > 0$ for all $1 \le i \le j \le n$. For these polynomials, we obtain explicit formulas for $N(f)$ when the augmented degree matrix of $f$ is row-equivalent to the augmented degree matrix of a linear polynomial or a quadratic diagonal polynomial.
△ Less
Submitted 7 December, 2023; v1 submitted 2 August, 2023;
originally announced August 2023.
-
High-speed data processing onboard sunrise chromospheric infrared spectropolarimeter for the SUNRISE III balloon telescope
Authors:
Masahito Kubo,
Yukio Katsukawa,
David Hernández Expósito,
Antonio Sánchez Gómez,
María Balaguer Jimenéz,
David Orozco Suárez,
José M. Morales Fernández,
Beatriz Aparicio del Moral,
Antonio J. Moreno Mantas,
Eduardo Bailón Martínez,
Jose Carlos del Toro Iniesta,
Yusuke Kawabata,
Carlos Quintero Noda,
Takayoshi Oba,
Ryohtaroh T. Ishikawa,
Toshifumi Shimizu
Abstract:
The Sunrise Chromospheric Infrared spectroPolarimeter (SCIP) has been developed for the third flight of the SUNRISE balloon-borne stratospheric solar observatory. The aim of SCIP is to reveal the evolution of three-dimensional magnetic fields in the solar photosphere and chromosphere using spectropolarimetric measurements with a polarimetric precision of 0.03\% (1$σ$). Multiple lines in the 770 an…
▽ More
The Sunrise Chromospheric Infrared spectroPolarimeter (SCIP) has been developed for the third flight of the SUNRISE balloon-borne stratospheric solar observatory. The aim of SCIP is to reveal the evolution of three-dimensional magnetic fields in the solar photosphere and chromosphere using spectropolarimetric measurements with a polarimetric precision of 0.03\% (1$σ$). Multiple lines in the 770 and 850 nm wavelength bands are simultaneously observed with two 2k$\times$2k CMOS cameras at a frame rate of 31.25 Hz. Stokes profiles are calculated onboard by accumulating the images modulated by a polarization modulation unit, and then compression processes are applied to the two-dimensional maps of the Stokes profiles. This onboard data processing effectively reduces the data rate. SCIP electronics can handle large data formats at high speed. Before the implementation into the flight SCIP electronics, a performance verification of the onboard data processing was performed with synthetic SCIP data that were produced with a numerical simulation modeling the solar atmospheres. Finally, we verified that the high-speed onboard data processing was realized on ground with the flight hardware by using images illuminated by natural sunlight or an LED.
△ Less
Submitted 31 July, 2023;
originally announced July 2023.
-
Aligned Unsupervised Pretraining of Object Detectors with Self-training
Authors:
Ioannis Maniadis Metaxas,
Adrian Bulat,
Ioannis Patras,
Brais Martinez,
Georgios Tzimiropoulos
Abstract:
The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised pretraining methods, however, typically rely on low-level information to define proposals that are used to train the detector. Furthermore, in the absence of class…
▽ More
The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised pretraining methods, however, typically rely on low-level information to define proposals that are used to train the detector. Furthermore, in the absence of class labels for these proposals, an auxiliary loss is used to add high-level semantics. This results in complex pipelines and a task gap between the pretraining and the downstream task. We propose a framework that mitigates this issue and consists of three simple yet key ingredients: (i) richer initial proposals that do encode high-level semantics, (ii) class pseudo-labeling through clustering, that enables pretraining using a standard object detection training pipeline, (iii) self-training to iteratively improve and enrich the object proposals. Once the pretraining and downstream tasks are aligned, a simple detection pipeline without further bells and whistles can be directly used for pretraining and, in fact, results in state-of-the-art performance on both the full and low data regimes, across detector architectures and datasets, by significant margins. We further show that our pretraining strategy is also capable of pretraining from scratch (including the backbone) and works on complex images like COCO, paving the path for unsupervised representation learning using object detection directly as a pretext task.
△ Less
Submitted 7 July, 2024; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Linear-in-momentum spin orbit interactions in planar Ge/GeSi heterostructures and spin qubits
Authors:
Esteban A. Rodríguez-Mena,
José Carlos Abadillo-Uriel,
Gaëtan Veste,
Biel Martinez,
Jing Li,
Benoît Sklénard,
Yann-Michel Niquet
Abstract:
We investigate the existence of linear-in-momentum spin-orbit interactions in the valence band of Ge/GeSi heterostructures using an atomistic tight-binding method. We show that symmetry breaking at the Ge/GeSi interfaces gives rise to a linear Dresselhaus-type interaction for heavy-holes. This interaction results from the heavy-hole/light-hole mixings induced by the interfaces and can be captured…
▽ More
We investigate the existence of linear-in-momentum spin-orbit interactions in the valence band of Ge/GeSi heterostructures using an atomistic tight-binding method. We show that symmetry breaking at the Ge/GeSi interfaces gives rise to a linear Dresselhaus-type interaction for heavy-holes. This interaction results from the heavy-hole/light-hole mixings induced by the interfaces and can be captured by a suitable correction to the minimal Luttinger-Kohn, four bands $\vec{k}\cdot\vec{p}$ Hamiltonian. It is dependent on the steepness of the Ge/GeSi interfaces, and is suppressed if interdiffusion is strong enough. Besides the Dresselhaus interaction, the Ge/GeSi interfaces also make a contribution to the in-plane gyromagnetic $g$-factors of the holes. The tight-binding calculations also highlight the existence of a small linear Rashba interaction resulting from the couplings between the heavy-hole/light-hole manifold and the conduction band enabled by the low structural symmetry of Ge/GeSi heterostructures. These interactions can be leveraged to drive the hole spin. The linear Dresselhaus interaction may, in particular, dominate the physics of the devices for out-of-plane magnetic fields. When the magnetic field lies in-plane, it is, however, usually far less efficient than the $g$-tensor modulation mechanisms arising from the motion of the dot in non-separable, inhomogeneous electric fields and strains.
△ Less
Submitted 15 December, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Transformation of transverse momentum distributions from Parton Branching to Collins-Soper-Sterman framework
Authors:
Armando Bermudez Martinez
Abstract:
Two main frameworks for defining transverse momentum dependent (TMD) parton densities are the Collins-Soper-Sterman (CSS) formalism, and the Parton Branching (PB) approach. While PB-TMDs have an explicit dependence on a single scale which is used to evolve PB-TMDs in momentum space, TMDs defined in CSS formalism present a double-scale evolution in renormalization and rapidity scales, via a pair of…
▽ More
Two main frameworks for defining transverse momentum dependent (TMD) parton densities are the Collins-Soper-Sterman (CSS) formalism, and the Parton Branching (PB) approach. While PB-TMDs have an explicit dependence on a single scale which is used to evolve PB-TMDs in momentum space, TMDs defined in CSS formalism present a double-scale evolution in renormalization and rapidity scales, via a pair of coupled evolution equations. In this letter I leverage the Collins-Soper kernel determined from simulated Drell Yan transverse momentum spectra using PB-TMDs, and provide, for the first time, the transformation of TMD parton distributions from the PB framework to the CSS formalism. The evolved PB-TMDs in $b$-space are compared to the recently released, unpolarized TMD distribution ART23.
△ Less
Submitted 25 August, 2023; v1 submitted 13 July, 2023;
originally announced July 2023.
-
Optoacoustic cooling of traveling hypersound waves
Authors:
Laura Blázquez Martínez,
Philipp Wiedemann,
Changlong Zhu,
Andreas Geilen,
Birgit Stiller
Abstract:
We experimentally demonstrate optoacoustic cooling via stimulated Brillouin-Mandelstam scattering in a 50 cm-long tapered photonic crystal fiber. For a 7.38 GHz acoustic mode, a cooling rate of 219 K from room temperature has been achieved. As anti-Stokes and Stokes Brillouin processes naturally break the symmetry of phonon cooling and heating, resolved sideband schemes are not necessary. The expe…
▽ More
We experimentally demonstrate optoacoustic cooling via stimulated Brillouin-Mandelstam scattering in a 50 cm-long tapered photonic crystal fiber. For a 7.38 GHz acoustic mode, a cooling rate of 219 K from room temperature has been achieved. As anti-Stokes and Stokes Brillouin processes naturally break the symmetry of phonon cooling and heating, resolved sideband schemes are not necessary. The experiments pave the way to explore the classical to quantum transition for macroscopic objects and could enable new quantum technologies in terms of storage and repeater schemes.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
Numerical and Experimental Investigation of A Three-Axis Free Rotation Wind Tunnel Model
Authors:
Laurène Muller,
Michel Libsig,
Bastien Martinez,
Denis Bidino,
Myriam Bastide,
Yannick Bailly,
Jean-Claude Roy
Abstract:
The current need of improving performance in terms of control and aerodynamic efficiency of ammunitions leads to the necessity of performing accurate flying geometry characterizations. Therefore, new investigation methods are developed in order to increase the aerodynamic knowledge. Free flight measurements experiments are the most common way to obtain dynamic aerodynamic coefficients. However, th…
▽ More
The current need of improving performance in terms of control and aerodynamic efficiency of ammunitions leads to the necessity of performing accurate flying geometry characterizations. Therefore, new investigation methods are developed in order to increase the aerodynamic knowledge. Free flight measurements experiments are the most common way to obtain dynamic aerodynamic coefficients. However, they do not always allow neither easy nor perfect measurements conditions. Currently ISL develops a stereovision method based wind-tunnel measurements methodology for investigation of a 3-axis free rotation model. This methods has been applied to the DREV-ISL reference model in order to compare coefficients obtained by this method with numerical results.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Black Box Few-Shot Adaptation for Vision-Language models
Authors:
Yassine Ouali,
Adrian Bulat,
Brais Martinez,
Georgios Tzimiropoulos
Abstract:
Vision-Language (V-L) models trained with contrastive learning to align the visual and language modalities have been shown to be strong few-shot learners. Soft prompt learning is the method of choice for few-shot downstream adaptation aiming to bridge the modality gap caused by the distribution shift induced by the new domain. While parameter-efficient, prompt learning still requires access to the…
▽ More
Vision-Language (V-L) models trained with contrastive learning to align the visual and language modalities have been shown to be strong few-shot learners. Soft prompt learning is the method of choice for few-shot downstream adaptation aiming to bridge the modality gap caused by the distribution shift induced by the new domain. While parameter-efficient, prompt learning still requires access to the model weights and can be computationally infeasible for large models with billions of parameters. To address these shortcomings, in this work, we describe a black-box method for V-L few-shot adaptation that (a) operates on pre-computed image and text features and hence works without access to the model's weights, (b) it is orders of magnitude faster at training time, (c) it is amenable to both supervised and unsupervised training, and (d) it can be even used to align image and text features computed from uni-modal models. To achieve this, we propose Linear Feature Alignment (LFA), a simple linear approach for V-L re-alignment in the target domain. LFA is initialized from a closed-form solution to a least-squares problem and then it is iteratively updated by minimizing a re-ranking loss. Despite its simplicity, our approach can even surpass soft-prompt learning methods as shown by extensive experiments on 11 image and 2 video datasets.
△ Less
Submitted 17 August, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Crystallization of piezoceramic films on glass via flash lamp annealing
Authors:
Longfei Song,
Juliette Cardoletti,
Alfredo Blazquez Martinez,
Andreja Bencan,
Brigita Kmet,
Stephanie Girod,
Emmanuel Defay,
Sebastjan Glinsek
Abstract:
Integration of thin-film oxide piezoelectrics on glass is imperative for the next generation of transparent electronics to attain sensing and actuating functions. However, their crystallization temperature (above 650 °C) is incompatible with most glasses. We developed a flash lamp process for growth of piezoelectric lead zirconate titanate films. The process enables crystallization on various type…
▽ More
Integration of thin-film oxide piezoelectrics on glass is imperative for the next generation of transparent electronics to attain sensing and actuating functions. However, their crystallization temperature (above 650 °C) is incompatible with most glasses. We developed a flash lamp process for growth of piezoelectric lead zirconate titanate films. The process enables crystallization on various types of glasses in a few seconds only. Functional properties of these films are comparable to the films processed with standard rapid thermal annealing at 700 °C. A surface haptic device was fabricated with a 1 $\unicode{x00B5}$m-thick film (piezoelectric e$_{33,f}$ of -5 C m$^{-2}$). Its ultrasonic surface deflection reached 1.5 $\unicode{x00B5}$m at 60 V, sufficient for its use in surface rendering applications. This flash lamp annealing process is compatible with large glass sheets and roll-to-roll processing and has the potential to significantly expand the applications of piezoelectric devices on glass.
△ Less
Submitted 29 February, 2024; v1 submitted 23 March, 2023;
originally announced March 2023.
-
Graph Neural Network contextual embedding for Deep Learning on Tabular Data
Authors:
Mario Villaizán-Vallelado,
Matteo Salvatori,
Belén Carro Martinez,
Antonio Javier Sanchez Esguevillas
Abstract:
All industries are trying to leverage Artificial Intelligence (AI) based on their existing big data which is available in so called tabular form, where each record is composed of a number of heterogeneous continuous and categorical columns also known as features. Deep Learning (DL) has constituted a major breakthrough for AI in fields related to human skills like natural language processing, but i…
▽ More
All industries are trying to leverage Artificial Intelligence (AI) based on their existing big data which is available in so called tabular form, where each record is composed of a number of heterogeneous continuous and categorical columns also known as features. Deep Learning (DL) has constituted a major breakthrough for AI in fields related to human skills like natural language processing, but its applicability to tabular data has been more challenging. More classical Machine Learning (ML) models like tree-based ensemble ones usually perform better. This paper presents a novel DL model using Graph Neural Network (GNN) more specifically Interaction Network (IN), for contextual embedding and modelling interactions among tabular features. Its results outperform those of a recently published survey with DL benchmark based on five public datasets, also achieving competitive results when compared to boosted-tree solutions.
△ Less
Submitted 4 July, 2023; v1 submitted 11 March, 2023;
originally announced March 2023.
-
Electrical manipulation of a single electron spin in CMOS with micromagnet and spin-valley coupling
Authors:
Bernhard Klemt,
Victor El-Homsy,
Martin Nurizzo,
Pierre Hamonic,
Biel Martinez,
Bruna Cardoso Paz,
Cameron spence,
Matthieu Dartiailh,
Baptiste Jadot,
Emmanuel Chanrion,
Vivien Thiney,
Renan Lethiecq,
Benoit Bertrand,
Heimanu Niebojewski,
Christopher Bäuerle,
Maud Vinet,
Yann-Michel Niquet,
Tristan Meunier,
Matias Urdampilleta
Abstract:
For semiconductor spin qubits, complementary-metal-oxide-semiconductor (CMOS) technology is the ideal candidate for reliable and scalable fabrication. Making the direct leap from academic fabrication to qubits fabricated fully by industrial CMOS standards is difficult without intermediate solutions. With a flexible back-end-of-line (BEOL) new functionalities such as micromagnets or superconducting…
▽ More
For semiconductor spin qubits, complementary-metal-oxide-semiconductor (CMOS) technology is the ideal candidate for reliable and scalable fabrication. Making the direct leap from academic fabrication to qubits fabricated fully by industrial CMOS standards is difficult without intermediate solutions. With a flexible back-end-of-line (BEOL) new functionalities such as micromagnets or superconducting circuits can be added in a post-CMOS process to study the physics of these devices or achieve proof of concepts. Once the process is established it can be incorporated in the foundry-compatible process flow. Here, we study a single electron spin qubit in a CMOS device with a micromagnet integrated in the flexible BEOL. We exploit the synthetic spin orbit coupling (SOC) to control the qubit via electric field and we investigate the spin-valley physics in the presence of SOC where we show an enhancement of the Rabi frequency at the spin-valley hotspot. Finally, we probe the high frequency noise in the system using dynamical decoupling pulse sequences and demonstrate that charge noise dominates the qubit decoherence in this range.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
A note on Bass' conjecture
Authors:
Danilo Vilela Avelar,
Fabio Enrique Brochero Martínez,
Sávio Ribas
Abstract:
For a finite group $G$, we denote by ${\sf d}(G)$ and by ${\sf E}(G)$, respectively, the small Davenport constant and the Gao constant of $G$. Let $C_n$ be the cyclic group of order $n$ and let $G_{m,n,s} = C_n \rtimes_s C_m$ be a metacyclic group. In [J. Bass; {\em Improving the Erdős-Ginzburg-Ziv theorem for some non-abelian groups.} J. Number Theory {\bf 126} (2007), 217-236, Conjecture 17], Ba…
▽ More
For a finite group $G$, we denote by ${\sf d}(G)$ and by ${\sf E}(G)$, respectively, the small Davenport constant and the Gao constant of $G$. Let $C_n$ be the cyclic group of order $n$ and let $G_{m,n,s} = C_n \rtimes_s C_m$ be a metacyclic group. In [J. Bass; {\em Improving the Erdős-Ginzburg-Ziv theorem for some non-abelian groups.} J. Number Theory {\bf 126} (2007), 217-236, Conjecture 17], Bass conjectured that ${\sf d}(G_{m,n,s}) = m+n-2$ and ${\sf E}(G_{m,n,s}) = mn+m+n-2$ provided $ord_n(s) = m$. In this paper, we show that the assumption $ord_n(s) = m$ is essential and cannot be removed. Moreover, if we suppose that Bass' conjecture holds for $G_{m,n,s}$ and the $mn$-product-one free sequences of maximal length are well behaved, then Bass conjecture also holds for $G_{2m,2n,r}$, where $r^2 \equiv s \pmod n$.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
Open data from the third observing run of LIGO, Virgo, KAGRA and GEO
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1719 additional authors not shown)
Abstract:
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti…
▽ More
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Hole spin driving by strain-induced spin-orbit interactions
Authors:
José Carlos Abadillo-Uriel,
Esteban A. Rodríguez-Mena,
Biel Martinez,
Yann-Michel Niquet
Abstract:
Hole spins in semiconductor quantum dots can be efficiently manipulated with radio-frequency electric fields owing to the strong spin-orbit interactions in the valence bands. Here we show that the motion of the dot in inhomogeneous strain fields gives rise to linear Rashba spin-orbit interactions (with spatially dependent spin-orbit lengths) and g-factor modulations that allow for fast Rabi oscill…
▽ More
Hole spins in semiconductor quantum dots can be efficiently manipulated with radio-frequency electric fields owing to the strong spin-orbit interactions in the valence bands. Here we show that the motion of the dot in inhomogeneous strain fields gives rise to linear Rashba spin-orbit interactions (with spatially dependent spin-orbit lengths) and g-factor modulations that allow for fast Rabi oscillations. Such inhomogeneous strains may build up spontaneously due to process and cool down stress. We discuss spin qubits in Ge/GeSi heterostructures as an illustration. We highlight that Rabi frequencies can be enhanced by one order of magnitude by shear strain gradients as small as $3\times 10^{-6}$ nm$^{-1}$ within the dots. This underlines that spin in solids can be very sensitive to strains and opens the way for strain engineering in hole spin devices for quantum information and spintronics.
△ Less
Submitted 1 September, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
On the number of rational points of Artin-Schreier curves and hypersurfaces
Authors:
Fabio Enrique Brochero Martínez,
Daniela Alves de Oliveira
Abstract:
Let $\mathbb F_{q^n}$ denote the finite field with $q^n$ elements. In this paper we determine the number of $\mathbb F_{q^n}$-rational points of the affine Artin-Schreier curve given by $y^q-y = x(x^{q^i}-x)-λ$ and of the Artin-Schreier hypersurface $y^q-y=\sum_{j=1}^r a_jx_j(x_j^{q^{i_j}}-x_j)-λ.$ Moreover in both cases, we show that the Weil bound is attained only in the case where the trace of…
▽ More
Let $\mathbb F_{q^n}$ denote the finite field with $q^n$ elements. In this paper we determine the number of $\mathbb F_{q^n}$-rational points of the affine Artin-Schreier curve given by $y^q-y = x(x^{q^i}-x)-λ$ and of the Artin-Schreier hypersurface $y^q-y=\sum_{j=1}^r a_jx_j(x_j^{q^{i_j}}-x_j)-λ.$ Moreover in both cases, we show that the Weil bound is attained only in the case where the trace of $λ\in\mathbb F_{q^n}$ over $\mathbb F_q$ is zero. We use quadratic forms and permutation matrices to determine the number of affine rational points of these curves and hypersurfaces.
△ Less
Submitted 6 July, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Wedderburn Decomposition and Idempotents of some finite metacyclic group algebras
Authors:
F. E. Brochero Martínez,
L. Batista de Oliveira,
C. R. Giraldo Vergara
Abstract:
In this article, we show explicitly the Wedderburn decomposition of the metacyclic group algebra $\mathbb F_qG$, where $G$ has a cyclic subgroup of index 2 and $\gcd(|G|,q)=1$. We also construct the complete set of central and left idempotents of these group algebras.
In this article, we show explicitly the Wedderburn decomposition of the metacyclic group algebra $\mathbb F_qG$, where $G$ has a cyclic subgroup of index 2 and $\gcd(|G|,q)=1$. We also construct the complete set of central and left idempotents of these group algebras.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Graph2Vid: Flow graph to Video Grounding for Weakly-supervised Multi-Step Localization
Authors:
Nikita Dvornik,
Isma Hadji,
Hai Pham,
Dhaivat Bhatt,
Brais Martinez,
Afsaneh Fazly,
Allan D. Jepson
Abstract:
In this work, we consider the problem of weakly-supervised multi-step localization in instructional videos. An established approach to this problem is to rely on a given list of steps. However, in reality, there is often more than one way to execute a procedure successfully, by following the set of steps in slightly varying orders. Thus, for successful localization in a given video, recent works r…
▽ More
In this work, we consider the problem of weakly-supervised multi-step localization in instructional videos. An established approach to this problem is to rely on a given list of steps. However, in reality, there is often more than one way to execute a procedure successfully, by following the set of steps in slightly varying orders. Thus, for successful localization in a given video, recent works require the actual order of procedure steps in the video, to be provided by human annotators at both training and test times. Instead, here, we only rely on generic procedural text that is not tied to a specific video. We represent the various ways to complete the procedure by transforming the list of instructions into a procedure flow graph which captures the partial order of steps. Using the flow graphs reduces both training and test time annotation requirements. To this end, we introduce the new problem of flow graph to video grounding. In this setup, we seek the optimal step ordering consistent with the procedure flow graph and a given video. To solve this problem, we propose a new algorithm - Graph2Vid - that infers the actual ordering of steps in the video and simultaneously localizes them. To show the advantage of our proposed formulation, we extend the CrossTask dataset with procedure flow graph information. Our experiments show that Graph2Vid is both more efficient than the baselines and yields strong step localization results, without the need for step order annotation.
△ Less
Submitted 31 October, 2022; v1 submitted 10 October, 2022;
originally announced October 2022.
-
FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Authors:
Adrian Bulat,
Ricardo Guerrero,
Brais Martinez,
Georgios Tzimiropoulos
Abstract:
This paper is on Few-Shot Object Detection (FSOD), where given a few templates (examples) depicting a novel class (not seen during training), the goal is to detect all of its occurrences within a set of images. From a practical perspective, an FSOD system must fulfil the following desiderata: (a) it must be used as is, without requiring any fine-tuning at test time, (b) it must be able to process…
▽ More
This paper is on Few-Shot Object Detection (FSOD), where given a few templates (examples) depicting a novel class (not seen during training), the goal is to detect all of its occurrences within a set of images. From a practical perspective, an FSOD system must fulfil the following desiderata: (a) it must be used as is, without requiring any fine-tuning at test time, (b) it must be able to process an arbitrary number of novel objects concurrently while supporting an arbitrary number of examples from each class and (c) it must achieve accuracy comparable to a closed system. Towards satisfying (a)-(c), in this work, we make the following contributions: We introduce, for the first time, a simple, yet powerful, few-shot detection transformer (FS-DETR) based on visual prompting that can address both desiderata (a) and (b). Our system builds upon the DETR framework, extending it based on two key ideas: (1) feed the provided visual templates of the novel classes as visual prompts during test time, and (2) ``stamp'' these prompts with pseudo-class embeddings (akin to soft prompting), which are then predicted at the output of the decoder. Importantly, we show that our system is not only more flexible than existing methods, but also, it makes a step towards satisfying desideratum (c). Specifically, it is significantly more accurate than all methods that do not require fine-tuning and even matches and outperforms the current state-of-the-art fine-tuning based methods on the most well-established benchmarks (PASCAL VOC & MSCOCO).
△ Less
Submitted 20 August, 2023; v1 submitted 10 October, 2022;
originally announced October 2022.
-
Effective Self-supervised Pre-training on Low-compute Networks without Distillation
Authors:
Fuwen Tan,
Fatemeh Saleh,
Brais Martinez
Abstract:
Despite the impressive progress of self-supervised learning (SSL), its applicability to low-compute networks has received limited attention. Reported performance has trailed behind standard supervised pre-training by a large margin, barring self-supervised learning from making an impact on models that are deployed on device. Most prior works attribute this poor performance to the capacity bottlene…
▽ More
Despite the impressive progress of self-supervised learning (SSL), its applicability to low-compute networks has received limited attention. Reported performance has trailed behind standard supervised pre-training by a large margin, barring self-supervised learning from making an impact on models that are deployed on device. Most prior works attribute this poor performance to the capacity bottleneck of the low-compute networks and opt to bypass the problem through the use of knowledge distillation (KD). In this work, we revisit SSL for efficient neural networks, taking a closer at what are the detrimental factors causing the practical limitations, and whether they are intrinsic to the self-supervised low-compute setting. We find that, contrary to accepted knowledge, there is no intrinsic architectural bottleneck, we diagnose that the performance bottleneck is related to the model complexity vs regularization strength trade-off. In particular, we start by empirically observing that the use of local views can have a dramatic impact on the effectiveness of the SSL methods. This hints at view sampling being one of the performance bottlenecks for SSL on low-capacity networks. We hypothesize that the view sampling strategy for large neural networks, which requires matching views in very diverse spatial scales and contexts, is too demanding for low-capacity architectures. We systematize the design of the view sampling mechanism, leading to a new training methodology that consistently improves the performance across different SSL methods (e.g. MoCo-v2, SwAV, DINO), different low-size networks (e.g. MobileNetV2, ResNet18, ResNet34, ViT-Ti), and different tasks (linear probe, object detection, instance segmentation and semi-supervised learning). Our best models establish a new state-of-the-art for SSL methods on low-compute networks despite not using a KD loss term.
△ Less
Submitted 2 October, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Bayesian Prompt Learning for Image-Language Model Generalization
Authors:
Mohammad Mahdi Derakhshani,
Enrique Sanchez,
Adrian Bulat,
Victor Guilherme Turrisi da Costa,
Cees G. M. Snoek,
Georgios Tzimiropoulos,
Brais Martinez
Abstract:
Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Minimization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt generaliza…
▽ More
Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Minimization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt generalizability to prompts unseen during training. By leveraging the regularization ability of Bayesian methods, we frame prompt learning from the Bayesian perspective and formulate it as a variational inference problem. Our approach regularizes the prompt space, reduces overfitting to the seen prompts and improves the prompt generalization on unseen prompts. Our framework is implemented by modeling the input prompt space in a probabilistic manner, as an a priori distribution which makes our proposal compatible with prompt learning approaches that are unconditional or conditional on the image. We demonstrate empirically on 15 benchmarks that Bayesian prompt learning provides an appropriate coverage of the prompt space, prevents learning spurious features, and exploits transferable invariant features. This results in better generalization of unseen prompts, even across different datasets and domains. Code available at: https://github.com/saic-fi/Bayesian-Prompt-Learning
△ Less
Submitted 20 August, 2023; v1 submitted 5 October, 2022;
originally announced October 2022.
-
REST: REtrieve & Self-Train for generative action recognition
Authors:
Adrian Bulat,
Enrique Sanchez,
Brais Martinez,
Georgios Tzimiropoulos
Abstract:
This work is on training a generative action/video recognition model whose output is a free-form action-specific caption describing the video (rather than an action class label). A generative approach has practical advantages like producing more fine-grained and human-readable output, and being naturally open-world. To this end, we propose to adapt a pre-trained generative Vision & Language (V&L)…
▽ More
This work is on training a generative action/video recognition model whose output is a free-form action-specific caption describing the video (rather than an action class label). A generative approach has practical advantages like producing more fine-grained and human-readable output, and being naturally open-world. To this end, we propose to adapt a pre-trained generative Vision & Language (V&L) Foundation Model for video/action recognition. While recently there have been a few attempts to adapt V&L models trained with contrastive learning (e.g. CLIP) for video/action, to the best of our knowledge, we propose the very first method that sets outs to accomplish this goal for a generative model. We firstly show that direct fine-tuning of a generative model to produce action classes suffers from severe overfitting. To alleviate this, we introduce REST, a training framework consisting of two key components: an unsupervised method for adapting the generative model to action/video by means of pseudo-caption generation and Self-training, i.e. without using any action-specific labels; (b) a Retrieval approach based on CLIP for discovering a diverse set of pseudo-captions for each video to train the model. Importantly, we show that both components are necessary to obtain high accuracy. We evaluate REST on the problem of zero-shot action recognition where we show that our approach is very competitive when compared to contrastive learning-based methods. Code will be made available.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Boson-jet and jet-jet azimuthal correlations at high transverse momenta
Authors:
A. M. van Kampen,
A. Bermudez Martinez,
L. I. Estevez Banos,
F. Hautmann,
H. Jung,
M. Mendizabal,
K. Moral Figueroa,
S. Prestel,
S. Taheri Monfared,
Q. Wang,
K. Wichmann,
H. Yang
Abstract:
We discuss our recent results on azimuthal distributions in vector boson + jets and multi-jet production at the LHC, obtained from the matching of next-to-leading order (NLO) perturbative matrix elements with transverse momentum dependent (TMD) parton branching. We present a comparative analysis of boson-jet and jet-jet correlations in the back to-back region, and a study of the theoretical system…
▽ More
We discuss our recent results on azimuthal distributions in vector boson + jets and multi-jet production at the LHC, obtained from the matching of next-to-leading order (NLO) perturbative matrix elements with transverse momentum dependent (TMD) parton branching. We present a comparative analysis of boson-jet and jet-jet correlations in the back to-back region, and a study of the theoretical systematic uncertainties associated with the matching scale in the cases of TMD and collinear parton showers.
△ Less
Submitted 28 September, 2022;
originally announced September 2022.
-
Hole spin manipulation in inhomogeneous and non-separable electric fields
Authors:
Biel Martinez,
José Carlos Abadillo-Uriel,
Esteban A. Rodríguez-Mena,
Yann-Michel Niquet
Abstract:
The usual models for electrical spin manipulation in semiconductor quantum dots assume that the confinement potential is separable in the three spatial dimensions and that the AC drive field is homogeneous. However, the electric field induced by the gates in quantum dot devices is not fully separable and displays significant inhomogeneities. Here, we address the electrical manipulation of hole spi…
▽ More
The usual models for electrical spin manipulation in semiconductor quantum dots assume that the confinement potential is separable in the three spatial dimensions and that the AC drive field is homogeneous. However, the electric field induced by the gates in quantum dot devices is not fully separable and displays significant inhomogeneities. Here, we address the electrical manipulation of hole spins in semiconductor heterostructures subject to inhomogeneous vertical electric fields and/or in-plane AC electric fields. We consider Ge quantum dots electrically confined in a Ge/GeSi quantum well as an illustration. We show that the lack of separability between the vertical and in-plane motions gives rise to an additional spin-orbit coupling mechanism (beyond the usual linear and cubic in momentum Rashba terms) that modulates the principal axes of the hole gyromagnetic g-matrix. This non-separability mechanism can be of the same order of magnitude as Rashba-type interactions, and enables spin manipulation when the magnetic field is applied in the plane of the heterostructure even if the dot is symmetric (disk-shaped). More generally, we show that Rabi oscillations in strongly patterned electric fields harness a variety of g-factor modulations. We discuss the implications for the design, modeling and understanding of hole spin qubit devices.
△ Less
Submitted 28 December, 2022; v1 submitted 21 September, 2022;
originally announced September 2022.
-
The number of rational points of a class of superelliptic curves
Authors:
José Alves Oliveira,
Daniela Oliveira,
F. E. Brochero Martínez
Abstract:
In this paper, we study the number of $\mathbb F_{q^n}$-rational points on the affine curve $\mathcal{X}_{d,a,b}$ given by the equation $$ y^d=ax\text{Tr}(x)+b,$$ where $\text{Tr}$ denote the trace function from $\mathbb F_{q^n}$ to $\mathbb F_{q}$ and $d$ is a positive integer. In particular, we present bounds for the number of $\mathbb F_{q}$-rational points on $\mathcal{X}_{d,a,b}$ and, for the…
▽ More
In this paper, we study the number of $\mathbb F_{q^n}$-rational points on the affine curve $\mathcal{X}_{d,a,b}$ given by the equation $$ y^d=ax\text{Tr}(x)+b,$$ where $\text{Tr}$ denote the trace function from $\mathbb F_{q^n}$ to $\mathbb F_{q}$ and $d$ is a positive integer. In particular, we present bounds for the number of $\mathbb F_{q}$-rational points on $\mathcal{X}_{d,a,b}$ and, for the cases where $d$ satisfies a natural condition, explicit formulas for the number of rational points are obtained. Particularly, a complete characterization is given for the case $d=2$. As a consequence of our results, we compute the number of elements $α$ in $\mathbb F_{q^n}$ such that $α$ and $\text{Tr}(α)$ are quadratic residues in $\mathbb F_{q^n}$.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
Efficient Attention-free Video Shift Transformers
Authors:
Adrian Bulat,
Brais Martinez,
Georgios Tzimiropoulos
Abstract:
This paper tackles the problem of efficient video recognition. In this area, video transformers have recently dominated the efficiency (top-1 accuracy vs FLOPs) spectrum. At the same time, there have been some attempts in the image domain which challenge the necessity of the self-attention operation within the transformer architecture, advocating the use of simpler approaches for token mixing. How…
▽ More
This paper tackles the problem of efficient video recognition. In this area, video transformers have recently dominated the efficiency (top-1 accuracy vs FLOPs) spectrum. At the same time, there have been some attempts in the image domain which challenge the necessity of the self-attention operation within the transformer architecture, advocating the use of simpler approaches for token mixing. However, there are no results yet for the case of video recognition, where the self-attention operator has a significantly higher impact (compared to the case of images) on efficiency. To address this gap, in this paper, we make the following contributions: (a) we construct a highly efficient \& accurate attention-free block based on the shift operator, coined Affine-Shift block, specifically designed to approximate as closely as possible the operations in the MHSA block of a Transformer layer. Based on our Affine-Shift block, we construct our Affine-Shift Transformer and show that it already outperforms all existing shift/MLP--based architectures for ImageNet classification. (b) We extend our formulation in the video domain to construct Video Affine-Shift Transformer (VAST), the very first purely attention-free shift-based video transformer. (c) We show that VAST significantly outperforms recent state-of-the-art transformers on the most popular action recognition benchmarks for the case of models with low computational and memory footprint. Code will be made available.
△ Less
Submitted 23 August, 2022;
originally announced August 2022.
-
Azimuthal di-jet correlations with parton branching TMD distributions
Authors:
A. Bermudez Martinez,
F. Hautmann
Abstract:
The parton branching formulation of TMD evolution has recently been used to make predictions for jet observables at the Large Hadron Collider (LHC), including perturbative matching at next-to-leading order (NLO). This contribution presents results for the azimuthal Δ-φcorrelations in events with di-jets at large transverse momentum. It focuses on the back-to-back region of large Δ-φand discusses p…
▽ More
The parton branching formulation of TMD evolution has recently been used to make predictions for jet observables at the Large Hadron Collider (LHC), including perturbative matching at next-to-leading order (NLO). This contribution presents results for the azimuthal Δ-φcorrelations in events with di-jets at large transverse momentum. It focuses on the back-to-back region of large Δ-φand discusses prospects for detailed studies of QCD dynamics in this region at the LHC.
△ Less
Submitted 17 August, 2022;
originally announced August 2022.
-
Challenges and Opportunities for Simultaneous Multi-functional Networks in the UHF Bands
Authors:
Xavier Vilajosana,
Guillem Boquet,
Joan Melià,
Pere Tuset-Peiró,
Borja Martinez,
Ferran Adelantado
Abstract:
Multi-functional wireless networks are rapidly evolving and aspire to become a promising attribute of the upcoming 6G networks. Enabling multiple simultaneous networking functions with a single radio fosters the development of more integrated and simpler equipment, overcoming design and technology barriers inherited from radio systems of the past. We are seeing numerous trends exploiting these fea…
▽ More
Multi-functional wireless networks are rapidly evolving and aspire to become a promising attribute of the upcoming 6G networks. Enabling multiple simultaneous networking functions with a single radio fosters the development of more integrated and simpler equipment, overcoming design and technology barriers inherited from radio systems of the past. We are seeing numerous trends exploiting these features in newly designed radios, such as those operating on the mmWave band. In this article, however, we carefully analyze the challenges and opportunities for multi-functional wireless networks in UHF bands, advocating the reuse of existing infrastructures and technologies, and exploring the possibilities of expanding their functionality without requiring architectural changes. We believe that both modern and legacy technologies can be turned into multi-functional systems if the right scientific and technological challenges are properly addressed. This transformation can foster the development of new applications and extend the useful life of these systems, contributing to a more sustainable digitization by delaying equipment obsolescence.
△ Less
Submitted 8 August, 2022;
originally announced August 2022.
-
Multi-jet Merging with TMD Parton Branching
Authors:
A. Bermudez Martinez,
F. Hautmann,
M. L. Mangano
Abstract:
One of the main theoretical systematics in studies of final states with large jet multiplicities at high-energy hadron colliders is associated with the merging of QCD parton showers and hard-scattering matrix elements. We present a method to incorporate the physics of transverse momentum recoils due to initial-state shower evolution into multi-jet merging algorithms by using the concept of transve…
▽ More
One of the main theoretical systematics in studies of final states with large jet multiplicities at high-energy hadron colliders is associated with the merging of QCD parton showers and hard-scattering matrix elements. We present a method to incorporate the physics of transverse momentum recoils due to initial-state shower evolution into multi-jet merging algorithms by using the concept of transverse momentum dependent (TMD) distributions and the associated parton branching. We investigate the dependence on the merging scale and illustrate the impact of the new method at the level of both exclusive and inclusive final-state observables by studying differential jet rates, transverse momentum spectra and multiplicity distributions, using vector boson + jets events at the LHC as a case study.
△ Less
Submitted 14 September, 2022; v1 submitted 3 August, 2022;
originally announced August 2022.
-
$γ$ rays run on time
Authors:
Daniel Beltrán Martínez,
Felipe J. Llanes-Estrada,
Gloria Tejedor-García
Abstract:
Significant absorption of radiation is usually accompanied by refraction. This is not the case for $γ$ rays travelling cosmic distances. We show that the real and imaginary parts of the refraction index are indeed commensurable, as they are related by dispersion relations, but when turning to physical observables, the (finite) optical depth is way larger than the (infinitesimal) time delay of the…
▽ More
Significant absorption of radiation is usually accompanied by refraction. This is not the case for $γ$ rays travelling cosmic distances. We show that the real and imaginary parts of the refraction index are indeed commensurable, as they are related by dispersion relations, but when turning to physical observables, the (finite) optical depth is way larger than the (infinitesimal) time delay of the gamma rays relative to gravitational radiation. The numerically large factor solving the apparent contradiction is $E_γ/H_0$ arising from basic wave properties (Bouguer-Beer-Lambert law) and the standard cosmological model, respectively.
△ Less
Submitted 3 August, 2022;
originally announced August 2022.
-
Birefringence induced by antiferroelectric switching in transparent polycrystalline $PbZr_{0.95}Ti_{0.05}O_{3}$ film
Authors:
Pranab Parimal Biswas,
Cosme Milesi-Brault,
Alfredo Blázquez Martínez,
Naveen Aruchamy,
Longfei Song,
Veronika Kovacova,
Sebastjan Glinsek,
Torsten Granzow,
Emmanuel Defay,
Mael Guennou
Abstract:
The most characteristic functional property of antiferroelectric materials is the possibility to induce a phase transition from a non-polar to a polar phase by an electric field. Here, we investigate the effect of this field-induced phase transition on the birefringence change of $PbZr_{0.95}Ti_{0.05}O_{3}$. We use a transparent polycrystalline $PbZr_{0.95}Ti_{0.05}O_{3}$ film grown on…
▽ More
The most characteristic functional property of antiferroelectric materials is the possibility to induce a phase transition from a non-polar to a polar phase by an electric field. Here, we investigate the effect of this field-induced phase transition on the birefringence change of $PbZr_{0.95}Ti_{0.05}O_{3}$. We use a transparent polycrystalline $PbZr_{0.95}Ti_{0.05}O_{3}$ film grown on $PbTiO_{3}/HfO_{2}/SiO_{2}$ with interdigitated electrodes to directly investigate changes in birefringence in a simple transmission geometry. In spite of the polycrystalline nature of the film and its moderate thickness, the field-induced transition produces a sizeable effect observable under a polarized microscope. The film in its polar phase is found to behave like a homogeneous birefringent medium. The time evolution of this field-induced birefringence provides information about irreversibilities in the antiferroelectric switching process and its slow dynamics. The change in birefringence has two main contributions, one that responds briskly (~ 0.5 s), and a slower one that rises and saturates over a period of as long as 30 minutes. Possible origins for this long saturation and relaxation times are discussed.
△ Less
Submitted 3 August, 2022;
originally announced August 2022.
-
Creation and direct laser acceleration of positrons in a single stage
Authors:
B. Martinez,
B. Barbosa,
M. Vranic
Abstract:
Relativistic positron beams are required for fundamental research in nonlinear strong field QED, plasma physics, and laboratory astrophysics. Positrons are difficult to create and manipulate due to their short lifetime, and their energy gain is limited by the accelerator size in conventional facilities. Alternative compact accelerator concepts in plasmas are becoming more and more mature for elect…
▽ More
Relativistic positron beams are required for fundamental research in nonlinear strong field QED, plasma physics, and laboratory astrophysics. Positrons are difficult to create and manipulate due to their short lifetime, and their energy gain is limited by the accelerator size in conventional facilities. Alternative compact accelerator concepts in plasmas are becoming more and more mature for electrons, but positron generation and acceleration remain an outstanding challenge. Here we propose a new setup where we can generate, inject and accelerate them in a single stage during the propagation of an intense laser in a plasma channel. The positrons are created from a laser-electron collision at 90 degrees, where the injection and guiding are made possible by an 800 nC electron beam loading which reverses the sign of the background electrostatic field. We obtain a 20 fC positron beam, with GeV-level central energy within 0.5 mm of plasma.
△ Less
Submitted 18 January, 2023; v1 submitted 18 July, 2022;
originally announced July 2022.
-
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
Authors:
Fatemeh Saleh,
Fuwen Tan,
Adrian Bulat,
Georgios Tzimiropoulos,
Brais Martinez
Abstract:
Learning visual representations through self-supervision is an extremely challenging task as the network needs to sieve relevant patterns from spurious distractors without the active guidance provided by supervision. This is achieved through heavy data augmentation, large-scale datasets and prohibitive amounts of compute. Video self-supervised learning (SSL) suffers from added challenges: video da…
▽ More
Learning visual representations through self-supervision is an extremely challenging task as the network needs to sieve relevant patterns from spurious distractors without the active guidance provided by supervision. This is achieved through heavy data augmentation, large-scale datasets and prohibitive amounts of compute. Video self-supervised learning (SSL) suffers from added challenges: video datasets are typically not as large as image datasets, compute is an order of magnitude larger, and the amount of spurious patterns the optimizer has to sieve through is multiplied several fold. Thus, directly learning self-supervised representations from video data might result in sub-optimal performance. To address this, we propose to utilize a strong image-based model, pre-trained with self- or language supervision, in a video representation learning framework, enabling the model to learn strong spatial and temporal information without relying on the video labeled data. To this end, we modify the typical video-based SSL design and objective to encourage the video encoder to \textit{subsume} the semantic content of an image-based model trained on a general domain. The proposed algorithm is shown to learn much more efficiently (i.e. in less epochs and with a smaller batch) and results in a new state-of-the-art performance on standard downstream tasks among single-modality SSL methods.
△ Less
Submitted 16 June, 2022;
originally announced June 2022.
-
Determination of Collins-Soper kernel from cross-sections ratios
Authors:
Armando Bermudez Martinez,
Alexey Vladimirov
Abstract:
We present a novel method of extraction of the Collins-Soper kernel directly from the comparison of differential cross-sections measured at different energies. Using this method, we analyze the pseudo-data generated by the CASCADE event generator and extract the Collins-Soper kernel predicted by the parton-branching model in the wide range of transverse distances. The procedure can be applied, wit…
▽ More
We present a novel method of extraction of the Collins-Soper kernel directly from the comparison of differential cross-sections measured at different energies. Using this method, we analyze the pseudo-data generated by the CASCADE event generator and extract the Collins-Soper kernel predicted by the parton-branching model in the wide range of transverse distances. The procedure can be applied, with minor modifications, to the real measured data for Drell-Yan and SIDIS processes.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Knowledge Distillation Meets Open-Set Semi-Supervised Learning
Authors:
Jing Yang,
Xiatian Zhu,
Adrian Bulat,
Brais Martinez,
Georgios Tzimiropoulos
Abstract:
Existing knowledge distillation methods mostly focus on distillation of teacher's prediction and intermediate activation. However, the structured representation, which arguably is one of the most critical ingredients of deep models, is largely overlooked. In this work, we propose a novel {\em \modelname{}} ({\bf\em \shortname{})} method dedicated for distilling representational knowledge semantica…
▽ More
Existing knowledge distillation methods mostly focus on distillation of teacher's prediction and intermediate activation. However, the structured representation, which arguably is one of the most critical ingredients of deep models, is largely overlooked. In this work, we propose a novel {\em \modelname{}} ({\bf\em \shortname{})} method dedicated for distilling representational knowledge semantically from a pretrained teacher to a target student. The key idea is that we leverage the teacher's classifier as a semantic critic for evaluating the representations of both teacher and student and distilling the semantic knowledge with high-order structured information over all feature dimensions. This is accomplished by introducing a notion of cross-network logit computed through passing student's representation into teacher's classifier. Further, considering the set of seen classes as a basis for the semantic space in a combinatorial perspective, we scale \shortname{} to unseen classes for enabling effective exploitation of largely available, arbitrary unlabeled training data. At the problem level, this establishes an interesting connection between knowledge distillation with open-set semi-supervised learning (SSL). Extensive experiments show that our \shortname{} outperforms significantly previous state-of-the-art knowledge distillation methods on both coarse object classification and fine face recognition tasks, as well as less studied yet practically crucial binary network distillation. Under more realistic open-set SSL settings we introduce, we reveal that knowledge distillation is generally more effective than existing Out-Of-Distribution (OOD) sample detection, and our proposed \shortname{} is superior over both previous distillation and SSL competitors. The source code is available at \url{https://github.com/jingyang2017/SRD\_ossl}.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers
Authors:
Junting Pan,
Adrian Bulat,
Fuwen Tan,
Xiatian Zhu,
Lukasz Dudziak,
Hongsheng Li,
Georgios Tzimiropoulos,
Brais Martinez
Abstract:
Self-attention based models such as vision transformers (ViTs) have emerged as a very competitive architecture alternative to convolutional neural networks (CNNs) in computer vision. Despite increasingly stronger variants with ever-higher recognition accuracies, due to the quadratic complexity of self-attention, existing ViTs are typically demanding in computation and model size. Although several…
▽ More
Self-attention based models such as vision transformers (ViTs) have emerged as a very competitive architecture alternative to convolutional neural networks (CNNs) in computer vision. Despite increasingly stronger variants with ever-higher recognition accuracies, due to the quadratic complexity of self-attention, existing ViTs are typically demanding in computation and model size. Although several successful design choices (e.g., the convolutions and hierarchical multi-stage structure) of prior CNNs have been reintroduced into recent ViTs, they are still not sufficient to meet the limited resource requirements of mobile devices. This motivates a very recent attempt to develop light ViTs based on the state-of-the-art MobileNet-v2, but still leaves a performance gap behind. In this work, pushing further along this under-studied direction we introduce EdgeViTs, a new family of light-weight ViTs that, for the first time, enable attention-based vision models to compete with the best light-weight CNNs in the tradeoff between accuracy and on-device efficiency. This is realized by introducing a highly cost-effective local-global-local (LGL) information exchange bottleneck based on optimal integration of self-attention and convolutions. For device-dedicated evaluation, rather than relying on inaccurate proxies like the number of FLOPs or parameters, we adopt a practical approach of focusing directly on on-device latency and, for the first time, energy efficiency. Specifically, we show that our models are Pareto-optimal when both accuracy-latency and accuracy-energy trade-offs are considered, achieving strict dominance over other ViTs in almost all cases and competing with the most efficient CNNs. Code is available at https://github.com/saic-fi/edgevit.
△ Less
Submitted 21 July, 2022; v1 submitted 6 May, 2022;
originally announced May 2022.