Search | arXiv e-print repository

Path Percolation in Quantum Communication Networks

Authors: Xiangyi Meng, Bingjie Hao, Balázs Ráth, István A. Kovács

Abstract: In a quantum communication network, links represent entanglement between qubits located at different nodes. Even if two nodes are not directly linked by shared entanglement, communication channels can be established between them via quantum routing protocols. However, in contrast to classical communication networks, each communication event removes all participating links along the communication path, disrupting the quantum network. Here, we propose a simple model, where randomly selected pairs of nodes communicate through shortest paths. Each time such a path is used, all participating links are eliminated, leading to a correlated percolation process that we call ``path percolation.'' We study path percolation both numerically and analytically and present the phase diagram of the steady states as a function of the rate at which new links are being added to the quantum communication network. As a key result, the steady state is found to be independent of the initial network topology when new links are added randomly between disconnected components. We close by discussing extensions of path percolation and their potential applications.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 figures
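
The removal rule described in the abstract above (a randomly chosen pair communicates along a shortest path, and every link on that path is then eliminated) can be illustrated with a minimal sketch. This is not the authors' code: the adjacency-dict representation and the function names `shortest_path`, `communicate`, and `edge_count` are assumptions made for illustration, and the sketch leaves out the link-addition process the paper also studies.

```python
from collections import deque


def shortest_path(adj, source, target):
    """Return one shortest path from source to target via BFS, or None."""
    parent = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            # Walk the parent pointers back to the source.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nbr in adj[node]:
            if nbr not in parent:
                parent[nbr] = node
                queue.append(nbr)
    return None


def communicate(adj, source, target):
    """One communication event: use a shortest path, then delete its links.

    Hypothetical helper: `adj` maps each node to a set of neighbours and
    is modified in place. Returns the path used, or None if disconnected.
    """
    path = shortest_path(adj, source, target)
    if path is None:
        return None
    for a, b in zip(path, path[1:]):
        adj[a].discard(b)
        adj[b].discard(a)
    return path


def edge_count(adj):
    """Number of undirected links currently in the network."""
    return sum(len(nbrs) for nbrs in adj.values()) // 2
```

On a 4-cycle, for example, two communication events between opposite nodes consume both available two-link paths, after which the pair is disconnected and `communicate` returns `None`.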

arXiv:2406.11183 [pdf, ps, other]

Arithmetical Structures on Coconut Trees

Authors: Alexander Diaz-Lopez, Brian Ha, Pamela E. Harris, Jonathan Rogers, Theo Koss, Dorian Smith

Abstract: If $G$ is a finite connected graph, then an arithmetical structure on $G$ is a pair of vectors $(\mathbf{d}, \mathbf{r})$ with positive integer entries such that $(\diag(\mathbf{d}) - A)\cdot \mathbf{r} = \mathbf{0}$, where $A$ is the adjacency matrix of $G$ and the entries of $\mathbf{r}$ have no common factor other than $1$. In this paper, we generalize the result of Archer, Bishop, Diaz-Lopez, García Puente, Glass, and Louwsma on enumerating arithmetical structures on bidents (also called coconut tree graphs $\CT{p}{2}$) to all coconut tree graphs $\CT{p}{s}$, which consist of a path on $p>0$ vertices to which we append $s>0$ leaves to the rightmost vertex on the path. We also give a characterization of smooth arithmetical structures on coconut trees when given number assignments to the leaf nodes.

Submitted 16 June, 2024; originally announced June 2024.

Comments: 18 pages, 9 figures, comments are welcomed

MSC Class: 05C50; 05C30
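
The definition in the abstract above can be checked directly on small cases. The following is a minimal sketch, not taken from the paper: the helper names and the vertex ordering (path vertices first, then the leaves attached to the last path vertex) are assumptions chosen for illustration.

```python
from functools import reduce
from math import gcd


def coconut_tree_adjacency(p, s):
    """Adjacency matrix of CT(p, s): a path on p vertices (0..p-1)
    with s leaves (p..p+s-1) attached to the last path vertex p-1."""
    n = p + s
    A = [[0] * n for _ in range(n)]
    for i in range(p - 1):
        A[i][i + 1] = A[i + 1][i] = 1
    for leaf in range(p, n):
        A[p - 1][leaf] = A[leaf][p - 1] = 1
    return A


def is_arithmetical_structure(A, d, r):
    """Check that (diag(d) - A) r = 0 with d, r positive and gcd(r) = 1."""
    n = len(A)
    if len(d) != n or len(r) != n:
        return False
    if any(x <= 0 for x in d) or any(x <= 0 for x in r):
        return False
    if reduce(gcd, r) != 1:
        return False
    # Row i of (diag(d) - A) r is d_i * r_i minus the sum of r over neighbours.
    return all(
        d[i] * r[i] == sum(A[i][j] * r[j] for j in range(n))
        for i in range(n)
    )
```

For the bident CT(2, 2), taking d to be the vertex degrees (1, 3, 1, 1) with r = (1, 1, 1, 1) passes the check, as does d = (3, 1, 3, 3) with r = (1, 3, 1, 1).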

arXiv:2406.10950 [pdf, other]

E-Bench: Towards Evaluating the Ease-of-Use of Large Language Models

Authors: Zhenyu Zhang, Bingguang Hao, Jinpeng Li, Zekai Zhang, Dongyan Zhao

Abstract: Most large language models (LLMs) are sensitive to prompts, and another synonymous expression or a typo may lead to unexpected results for the model. Composing an optimal prompt for a specific demand lacks theoretical support and relies entirely on human experimentation, which poses a considerable obstacle to popularizing generative artificial intelligence. However, there is no systematic analysis of the stability of LLMs in resisting prompt perturbations in real-world scenarios. In this work, we propose to evaluate the ease-of-use of LLMs and construct E-Bench, simulating the actual situation of human use from synonymous perturbation (including paraphrasing, simplification, and colloquialism) and typographical perturbation (such as typing). On this basis, we also discuss the combination of these two types of perturbation and analyze the main reasons for performance degradation. Experimental results indicate that with the increase of model size, although the ease-of-use is significantly improved, there is still a long way to go to build a sufficiently user-friendly model.

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2404.16189 [pdf, other]

Structure Preserving PINN for Solving Time Dependent PDEs with Periodic Boundary

Authors: Baoli Hao, Ulisses Braga-Neto, Chun Liu, Lifan Wang, Ming Zhong

Abstract: We present a structure preserving PINN for solving a series of time dependent PDEs with periodic boundary. Our method can incorporate the periodic boundary condition as the natural output of any deep neural net, hence significantly improving the training accuracy of baseline PINN. Together with mini-batching and other PINN variants (SA-PINN, RBA-PINN, etc.), our structure preserving PINN can even handle stiff PDEs for modeling a wide range of convection-diffusion and reaction-diffusion processes. We demonstrate the effectiveness of our PINNs on various PDEs, from Allen-Cahn and Gray-Scott to nonlinear Schrödinger.

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.06219 [pdf, other]

doi 10.5220/0011986300003497

Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector

Authors: Bach Ha, Birgit Schalter, Laura White, Joachim Koehler

Abstract: Maintaining sewer systems in large cities is important, but also time and effort consuming, because visual inspections are currently done manually. To reduce the amount of aforementioned manual work, defects within sewer pipes should be located and classified automatically. In the past, multiple works have attempted solving this problem using classical image processing, machine learning, or a combination of those. However, each provided solution focuses only on detecting a limited set of defect/structure types, such as fissure, root, and/or connection. Furthermore, due to the use of hand-crafted features and small training datasets, generalization is also problematic. In order to overcome these deficits, a sizable dataset with 14.7 km of various sewer pipes was annotated by sewer maintenance experts in the scope of this work. On top of that, an object detector (EfficientDet-D0) was trained for automatic defect detection. From the result of several experiments, peculiar natures of defects in the context of object detection, which greatly affect the annotation and training process, are found and discussed. At the end, the final detector was able to detect 83% of defects in the test set; out of the missing 17%, only 0.77% are very severe defects. This work provides an example of applying deep learning-based object detection into an important but quiet engineering field. It also gives some practical pointers on how to annotate peculiar "objects", such as defects.

Submitted 9 April, 2024; originally announced April 2024.

Journal ref: (2023) In Proceedings of the 3rd International Conference on Image Processing and Vision Engineering - IMPROVE; ISBN 978-989-758-642-2; ISSN 2795-4943, SciTePress, pages 188-198

arXiv:2403.07344 [pdf]

Electronic Structure of Superconducting Infinite-Layer Lanthanum Nickelates

Authors: Wenjie Sun, Zhicheng Jiang, Chengliang Xia, Bo Hao, Yueying Li, Shengjun Yan, Maosen Wang, Hongquan Liu, Jianyang Ding, Jiayu Liu, Zhengtai Liu, Jishan Liu, Hanghui Chen, Dawei Shen, Yuefeng Nie

Abstract: Revealing the momentum-resolved electronic structure of infinite-layer nickelates is essential for understanding this new class of unconventional superconductors, but has been hindered by the formidable challenges in improving the sample quality. In this work, we report for the first time the angle-resolved photoemission spectroscopy of superconducting La$_{0.8}$Sr$_{0.2}$NiO$_{2}$ films prepared by molecular beam epitaxy and ${\mathrm{\textit{in situ}}}$ atomic-hydrogen reduction. The measured Fermi topology closely matches theoretical calculations, showing a large Ni-$d_{x^2-y^2}$ derived Fermi sheet that evolves from hole-like to electron-like along $k_{z}$, and a three-dimensional (3D) electron pocket centered at the Brillouin zone corner. The Ni-$d_{x^2-y^2}$ derived bands show a mass enhancement ($m^*/m_{\rm{DFT}}$) of 2-3, while the 3D electron band shows negligible band renormalization. Moreover, the Ni-$d_{x^2-y^2}$ derived states also display a band dispersion anomaly at higher binding energy, reminiscent of the waterfall feature and kinks observed in cuprates.

Submitted 12 March, 2024; originally announced March 2024.

Comments: 29 pages, 13 figures

arXiv:2402.01287 [pdf, other]

Spiking CenterNet: A Distillation-boosted Spiking Neural Network for Object Detection

Authors: Lennard Bodden, Franziska Schwaiger, Duc Bach Ha, Lars Kreuzberg, Sven Behnke

Abstract: In the era of AI at the edge, self-driving cars, and climate change, the need for energy-efficient, small, embedded AI is growing. Spiking Neural Networks (SNNs) are a promising approach to address this challenge, with their event-driven information flow and sparse activations. We propose Spiking CenterNet for object detection on event data. It combines an SNN CenterNet adaptation with an efficient M2U-Net-based decoder. Our model significantly outperforms comparable previous work on Prophesee's challenging GEN1 Automotive Detection Dataset while using less than half the energy. Distilling the knowledge of a non-spiking teacher into our SNN further increases performance. To the best of our knowledge, our work is the first approach that takes advantage of knowledge distillation in the field of spiking object detection.

Submitted 6 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 8 pages, 5 figures. Accepted at IJCNN 2024

arXiv:2402.00396 [pdf, other]

Efficient Exploration for LLMs

Authors: Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy

Abstract: We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fitting a reward model to the feedback received. Our best-performing agent generates queries using double Thompson sampling, with uncertainty represented by an epistemic neural network. Our results demonstrate that efficient exploration enables high levels of performance with far fewer queries. Further, both uncertainty estimation and the choice of exploration scheme play critical roles.

Submitted 4 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted at ICML 2024

arXiv:2401.15980 [pdf]

Superconductivity in freestanding infinite-layer nickelate membranes

Authors: Shengjun Yan, Wei Mao, Wenjie Sun, Yueying Li, Haoying Sun, Jiangfeng Yang, Bo Hao, Wei Guo, Leyan Nian, Zhengbin Gu, Peng Wang, Yuefeng Nie

Abstract: The observation of superconductivity in infinite-layer nickelates has attracted significant attention due to its potential as a new platform for exploring high $ \mathrm{\textit{T}}_{c} $ superconductivity. However, thus far, superconductivity has only been observed in epitaxial thin films, which limits the manipulation capabilities and modulation methods compared to two-dimensional exfoliated materials. Given the exceptionally giant strain tunability and stacking capability of freestanding membranes, separating superconducting nickelates from the as-grown substrate is a novel way to engineer the superconductivity and uncover the underlying physics. Herein, we report the synthesis of the superconducting freestanding $ \mathrm{La}_{0.8}\mathrm{Sr}_{0.2}\mathrm{Ni}\mathrm{O}_{2} $ membranes ($ \mathrm{\textit{T}}_{c}\mathrm{=}\mathrm{10.9}\;\mathrm{K} $), emphasizing the crucial roles of the interface engineering in the precursor phase film growth and the quick transfer process in achieving superconductivity. Our work offers a new versatile platform for investigating the superconductivity in nickelates, such as the pairing symmetry via constructing Josephson tunneling junctions and higher $ \mathrm{\textit{T}}_{c} $ values via high-pressure experiments.

Submitted 29 January, 2024; originally announced January 2024.

Comments: 20 pages, 9 figures

arXiv:2401.15979 [pdf, other]

doi 10.1002/adma.202401342

${\mathrm{\textit{In situ}}}$ preparation of superconducting infinite-layer nickelate thin films with atomically flat surface

Authors: Wenjie Sun, Zhichao Wang, Bo Hao, Shengjun Yan, Haoying Sun, Zhengbin Gu, Yu Deng, Yuefeng Nie

Abstract: Since their discovery, the infinite-layer nickelates have been regarded as an appealing system for gaining deeper insights into high temperature superconductivity (HTSC). However, the synthesis of superconducting samples has proved challenging. Here, we develop an ultrahigh vacuum (UHV) ${\mathrm{\textit{in situ}}}$ reduction method using atomic hydrogen as the reducing agent and apply it to the lanthanum nickelate system. The reduction parameters, including the reduction temperature (${\mathrm{\textit{T}_{R}}}$) and hydrogen pressure (${\mathrm{\textit{P}_{H}}}$), are systematically explored. We found that the reduction window for achieving superconducting transition is quite wide, reaching nearly 80$^\circ$C in ${\mathrm{\textit{T}_{R}}}$ and 3 orders of magnitude in ${\mathrm{\textit{P}_{H}}}$ when the reduction time is set to 30 min. Moreover, there exists an optimal ${\mathrm{\textit{P}_{H}}}$ for achieving the highest ${\mathrm{\textit{T}_{c}}}$ if both ${\mathrm{\textit{T}_{R}}}$ and reduction time are fixed. More prominently, as confirmed by atomic force microscopy and scanning transmission electron microscopy, the atomically flat surface can be preserved during the ${\mathrm{\textit{in situ}}}$ reduction process, providing advantages over the ${\mathrm{\textit{ex situ}}}$ CaH$_2$ method for surface-sensitive experiments.

Submitted 29 January, 2024; originally announced January 2024.

Comments: 5 pages, 4 figures

Journal ref: Adv. Mater. 2024, 2401342

arXiv:2401.06223 [pdf]

Production of Martian fiber by in-situ resource utilization strategy

Authors: Ze-Shi Guo, Dan Xing, Xiong-Yu Xi, Cun-Guang Liang, Bin Hao, Xiaojia Zeng, Hong Tang, Huaican Chen, Wen Yin, Peng Zhang, Kefa Zhou, Qingbin Zheng, Peng-Cheng Ma

Abstract: Many countries and commercial organizations have shown great interest in constructing a Martian base. In-situ resource utilization (ISRU) provides a cost-effective way to achieve this ambitious goal. In this paper, we propose to use Martian soil simulant to produce fiber to satisfy the material requirements for the construction of a Martian base. The composition, melting behavior and fiber forming process of the soil simulant were studied, and continuous fiber with a maximum strength of 1320 MPa was obtained on a spinning facility. The findings of this study demonstrate the feasibility of ISRU to prepare Martian fiber from the soil on Mars, offering a new way to get key materials for the construction of a Martian base.

Submitted 27 October, 2023; originally announced January 2024.

arXiv:2312.14862 [pdf, other]

YAYI 2: Multilingual Open-Source Large Language Models

Authors: Yin Luo, Qingchao Kong, Nan Xu, Jia Cao, Bao Hao, Baoyu Qu, Bo Chen, Chao Zhu, Chenyang Zhao, Donglei Zhang, Fan Feng, Feifei Zhao, Hailong Sun, Hanxuan Yang, Haojun Pan, Hongyu Liu, Jianbin Guo, Jiangtao Du, Jingyi Wang, Junfeng Li, Lei Sun, Liduo Liu, Lifeng Dong, Lili Liu, Lin Wang , et al. (28 additional authors not shown)

Abstract: As the latest advancements in natural language processing, large language models (LLMs) have achieved human-level language understanding and generation abilities in many real-world tasks, and have even been regarded as a potential path to artificial general intelligence. To better facilitate research on LLMs, many open-source LLMs, such as Llama 2 and Falcon, have recently been proposed and gained comparable performances to proprietary models. However, these models are primarily designed for English scenarios and exhibit poor performances in Chinese contexts. In this technical report, we propose YAYI 2, including both base and chat models, with 30 billion parameters. YAYI 2 is pre-trained from scratch on a multilingual corpus which contains 2.65 trillion tokens filtered by our pre-training data processing pipeline. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback. Extensive experiments on multiple benchmarks, such as MMLU and CMMLU, consistently demonstrate that the proposed YAYI 2 outperforms other similar sized open-source models.

Submitted 22 December, 2023; originally announced December 2023.

arXiv:2312.14456 [pdf, other]

doi 10.1103/PhysRevX.14.011047

Spontaneous gap opening and potential excitonic states in an ideal Dirac semimetal Ta$_2$Pd$_3$Te$_5$

Authors: Peng Zhang, Yuyang Dong, Dayu Yan, Bei Jiang, Tao Yang, Jun Li, Zhaopeng Guo, Yong Huang, Bo Hao, Qing Li, Yupeng Li, Kifu Kurokawa, Rui Wang, Yuefeng Nie, Makoto Hashimoto, Donghui Lu, Wen-He Jiao, Jie Shen, Tian Qian, Zhijun Wang, Youguo Shi, Takeshi Kondo

Abstract: The opening of an energy gap in the electronic structure generally indicates the presence of interactions. In materials with low carrier density and short screening length, long-range Coulomb interaction favors the spontaneous formation of electron-hole pairs, so-called excitons, opening an excitonic gap at the Fermi level. Excitonic materials host unique phenomena associated with pair excitations. However, there is still no generally recognized single-crystal material with excitonic order, which is, therefore, awaited in condensed matter physics. Here, we show that excitonic states may exist in the quasi-one-dimensional material Ta$_2$Pd$_3$Te$_5$, which has an almost ideal Dirac-like band structure, with the Dirac point located exactly at the Fermi level. We find that an energy gap appears at 350 K, and it grows with decreasing temperature. The spontaneous gap opening is absent in a similar material Ta$_2$Ni$_3$Te$_5$. Intriguingly, the gap is destroyed by the potassium deposition on the crystal, likely due to extra-doped carriers. Furthermore, we observe a pair of in-gap flat bands, which is an analog of the impurity states in a superconducting gap. All these observations can be properly explained by an excitonic order, providing Ta$_2$Pd$_3$Te$_5$ as a new and promising candidate realizing excitonic states.

Submitted 15 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. X 14, 011047 (2024)

arXiv:2310.13303 [pdf, other]

Motif-Based Prompt Learning for Universal Cross-Domain Recommendation

Authors: Bowen Hao, Chaoqun Yang, Lei Guo, Junliang Yu, Hongzhi Yin

Abstract: Cross-Domain Recommendation (CDR) stands as a pivotal technology addressing issues of data sparsity and cold start by transferring general knowledge from the source to the target domain. However, existing CDR models suffer limitations in adaptability across various scenarios due to their inherent complexity. To tackle this challenge, recent advancements introduce universal CDR models that leverage shared embeddings to capture general knowledge across domains and transfer it through "Multi-task Learning" or "Pre-train, Fine-tune" paradigms. However, these models often overlook the broader structural topology that spans domains and fail to align training objectives, potentially leading to negative transfer. To address these issues, we propose a motif-based prompt learning framework, MOP, which introduces motif-based shared embeddings to encapsulate generalized domain knowledge, catering to both intra-domain and inter-domain CDR tasks. Specifically, we devise three typical motifs: butterfly, triangle, and random walk, and encode them through a Motif-based Encoder to obtain motif-based shared embeddings. Moreover, we train MOP under the "Pre-training \& Prompt Tuning" paradigm. By unifying pre-training and recommendation tasks as a common motif-based similarity learning task and integrating adaptable prompt parameters to guide the model in downstream recommendation tasks, MOP excels in transferring domain knowledge effectively. Experimental results on four distinct CDR tasks demonstrate that MOP is more effective than state-of-the-art models.

Submitted 20 October, 2023; originally announced October 2023.

Comments: 10 pages

arXiv:2310.11531 [pdf, ps, other]

Efficient Online Learning with Offline Datasets for Infinite Horizon MDPs: A Bayesian Approach

Authors: Dengwang Tang, Rahul Jain, Botao Hao, Zheng Wen

Abstract: In this paper, we study the problem of efficient online reinforcement learning in the infinite horizon setting when there is an offline dataset to start with. We assume that the offline dataset is generated by an expert but with unknown level of competence, i.e., it is not perfect and not necessarily using the optimal policy. We show that if the learning agent models the behavioral policy (parameterized by a competence parameter) used by the expert, it can do substantially better in terms of minimizing cumulative regret, than if it doesn't do that. We establish an upper bound on regret of the exact informed PSRL algorithm that scales as $\tilde{O}(\sqrt{T})$. This requires a novel prior-dependent regret analysis of Bayesian online learning algorithms for the infinite horizon setting. We then propose the Informed RLSVI algorithm to efficiently approximate the iPSRL algorithm.

Submitted 1 February, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 22 pages

MSC Class: 93E35

arXiv:2309.12393 [pdf, other]

doi 10.1103/PhysRevResearch.6.L022013

Constraining work fluctuations of non-Hermitian dynamics across the exceptional point of a superconducting qubit

Authors: Serra Erdamar, Maryam Abbasi, Byung Ha, Weijian Chen, Jacob Muldoon, Yogesh Joglekar, Kater W. Murch

Abstract: Thermodynamics constrains changes to the energy of a system, both deliberate and random, via its first and second laws. When the system is not in equilibrium, fluctuation theorems such as the Jarzynski equality further restrict the distributions of deliberate work done. Such fluctuation theorems have been experimentally verified in small, non-equilibrium quantum systems undergoing unitary or decohering dynamics. Yet, their validity in systems governed by a non-Hermitian Hamiltonian has long been contentious, due to the false premise of the Hamiltonian's dual and equivalent roles in dynamics and energetics. Here we show that work fluctuations in a non-Hermitian qubit obey the Jarzynski equality even if its Hamiltonian has complex or purely imaginary eigenvalues. With post-selection on a dissipative superconducting circuit undergoing a cyclic parameter sweep, we experimentally quantify the work distribution using projective energy measurements and show that the fate of the Jarzynski equality is determined by the parity-time symmetry of, and the energetics that result from, the corresponding non-Hermitian, Floquet Hamiltonian. By distinguishing the energetics from non-Hermitian dynamics, our results provide the recipe for investigating the non-equilibrium quantum thermodynamics of such open systems.

Submitted 21 September, 2023; originally announced September 2023.

Comments: 7 pages, 5 figures

arXiv:2309.03799 [pdf, other]

FisheyePP4AV: A privacy-preserving method for autonomous vehicles on fisheye camera images

Authors: Linh Trinh, Bach Ha, Tu Tran

Abstract: In many parts of the world, the use of vast amounts of data collected on public roadways for autonomous driving has increased. In order to detect and anonymize pedestrian faces and nearby car license plates in actual road-driving scenarios, there is an urgent need for effective solutions. As more data is collected, privacy concerns regarding it increase, including but not limited to pedestrian faces and surrounding vehicle license plates. Normal and fisheye cameras are the two common camera types that are typically mounted on collection vehicles. With complex camera distortion models, fisheye camera images are deformed in contrast to regular images. This causes computer vision tasks to perform poorly when using numerous deep learning models. In this work, we pay particular attention to protecting privacy while still adhering to several laws for fisheye camera photos taken by driverless vehicles. First, we suggest a framework for extracting face and plate identification knowledge from several teacher models. Our second suggestion is to transform both the image and the label from a regular image to fisheye-like data using a varied and realistic fisheye transformation. Finally, we run a test using the open-source PP4AV dataset. The experimental findings demonstrated that our model outperformed baseline methods when trained on data from autonomous vehicles, even when the data were softly labeled. The implementation code is available at our github: https://github.com/khaclinh/FisheyePP4AV.

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2309.02342 [pdf, ps, other]

doi 10.1103/PhysRevE.108.064214

Attractive and repulsive interactions in the one-dimensional swarmalator model

Authors: Baoli Hao, Ming Zhong, Kevin O'Keeffe

Abstract: We study a population of swarmalators, mobile variants of phase oscillators, which run on a ring and have both attractive and repulsive interactions. This one-dimensional (1D) swarmalator model produces several collective states: the standard sync and async states as well as a splaylike "polarized" state and several unsteady states such as active bands or swirling. The model's simplicity allows us to describe some of the states analytically. The model can be considered as a toy model for real-world swarmalators such as vinegar eels and sperm which swarm in quasi-1D geometries.

Submitted 4 January, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

arXiv:2308.10239 [pdf, other]

From Global to Local: Multi-scale Out-of-distribution Detection

Authors: Ji Zhang, Lianli Gao, Bingguang Hao, Hao Huang, Jingkuan Song, Hengtao Shen

Abstract: Out-of-distribution (OOD) detection aims to detect "unknown" data whose labels have not been seen during the in-distribution (ID) training process. Recent progress in representation learning gives rise to distance-based OOD detection that recognizes inputs as ID/OOD according to their relative distances to the training data of ID classes. Previous approaches calculate pairwise distances relying only on global image representations, which can be sub-optimal as the inevitable background clutter and intra-class variation may drive image-level representations from the same ID class far apart in a given representation space. In this work, we overcome this challenge by proposing Multi-scale OOD DEtection (MODE), a first framework leveraging both global visual information and local region details of images to maximally benefit OOD detection. Specifically, we first find that existing models pretrained by off-the-shelf cross-entropy or contrastive losses are incompetent to capture valuable local representations for MODE, due to the scale-discrepancy between the ID training and OOD detection processes. To mitigate this issue and encourage locally discriminative representations in ID training, we propose Attention-based Local PropAgation (ALPA), a trainable objective that exploits a cross-attention mechanism to align and highlight the local regions of the target objects for pairwise examples. During test-time OOD detection, a Cross-Scale Decision (CSD) function is further devised on the most discriminative multi-scale representations to distinguish ID/OOD data more faithfully. We demonstrate the effectiveness and flexibility of MODE on several benchmarks -- on average, MODE outperforms the previous state-of-the-art by up to 19.24% in FPR, 2.77% in AUROC. Code is available at https://github.com/JimZAI/MODE-OOD.

Submitted 20 August, 2023; originally announced August 2023.

Comments: 13 pages

arXiv:2306.12365 [pdf]

Attention Hybrid Variational Net for Accelerated MRI Reconstruction

Authors: Guoyao Shen, Boran Hao, Mengyu Li, Chad W. Farris, Ioannis Ch. Paschalidis, Stephan W. Anderson, Xin Zhang

Abstract: The application of compressed sensing (CS)-enabled data reconstruction for accelerating magnetic resonance imaging (MRI) remains a challenging problem. This is due to the fact that the information lost in k-space from the acceleration mask makes it difficult to reconstruct an image similar to the quality of a fully sampled image. Multiple deep learning-based structures have been proposed for MRI reconstruction using CS, both in the k-space and image domains as well as using unrolled optimization methods. However, the drawback of these structures is that they are not fully utilizing the information from both domains (k-space and image). Herein, we propose a deep learning-based attention hybrid variational network that performs learning in both the k-space and image domain. We evaluate our method on a well-known open-source MRI dataset and a clinical MRI dataset of patients diagnosed with strokes from our institution to demonstrate the performance of our network. In addition to quantitative evaluation, we undertook a blinded comparison of image quality across networks performed by a subspecialty trained radiologist. Overall, we demonstrate that our network achieves a superior performance among others under multiple reconstruction tasks.

Submitted 21 June, 2023; originally announced June 2023.

Comments: 22 pages, 4 figures, 3 tables

arXiv:2305.16561 [pdf, other]

Proper network randomization is key to assessing social balance

Authors: Bingjie Hao, István A. Kovács

Abstract: Studying significant network patterns, known as graphlets (or motifs), has been a popular approach to understand the underlying organizing principles of complex networks. Statistical significance is routinely assessed by comparing to null models that randomize the connections while preserving some key aspects of the data. However, in signed networks, capturing both positive (friendly) and negative (hostile) relations, the results have been controversial and also at odds with the classical theory of structural balance. We show that this is largely due to the fact that large-scale signed networks exhibit a poor correlation between the number of positive and negative ties of each node. As a solution, here we propose a null model based on the maximum entropy framework that preserves both the signed degrees and the network topology (STP randomization). With STP randomization the results change qualitatively and most social networks consistently satisfy strong structural balance, both at the level of triangles and larger graphlets. We propose a potential underlying mechanism of the observed patterns in signed social networks and outline further applications of STP randomization.

Submitted 25 May, 2023; originally announced May 2023.

arXiv:2305.11908 [pdf, other]

Sequential Best-Arm Identification with Application to Brain-Computer Interface

Authors: Xin Zhou, Botao Hao, Jian Kang, Tor Lattimore, Lexin Li

Abstract: A brain-computer interface (BCI) is a technology that enables direct communication between the brain and an external device or computer system. It allows individuals to interact with the device using only their thoughts, and holds immense potential for a wide range of applications in medicine, rehabilitation, and human augmentation. An electroencephalogram (EEG) and event-related potential (ERP)-based speller system is a type of BCI that allows users to spell words without using a physical keyboard, but instead by recording and interpreting brain signals under different stimulus presentation paradigms. Conventional non-adaptive paradigms treat each word selection independently, leading to a lengthy learning process. To improve the sampling efficiency, we cast the problem as a sequence of best-arm identification tasks in multi-armed bandits. Leveraging pre-trained large language models (LLMs), we utilize the prior knowledge learned from previous tasks to inform and facilitate subsequent tasks. To do so in a coherent way, we propose a sequential top-two Thompson sampling (STTS) algorithm under the fixed-confidence setting and the fixed-budget setting. We study the theoretical property of the proposed algorithm, and demonstrate its substantial empirical improvement through both synthetic data analysis as well as a P300 BCI speller simulator example.

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2303.11369 [pdf, other]

Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale

Authors: Botao Hao, Rahul Jain, Dengwang Tang, Zheng Wen

Abstract: In this paper, we address the following problem: Given an offline demonstration dataset from an imperfect expert, what is the best way to leverage it to bootstrap online learning performance in MDPs. We first propose an Informed Posterior Sampling-based RL (iPSRL) algorithm that uses the offline dataset, and information about the expert's behavioral policy used to generate the offline dataset. Its cumulative Bayesian regret goes down to zero exponentially fast in N, the offline dataset size if the expert is competent enough. Since this algorithm is computationally impractical, we then propose the iRLSVI algorithm that can be seen as a combination of the RLSVI algorithm for online RL, and imitation learning. Our empirical results show that the proposed iRLSVI algorithm is able to achieve significant reduction in regret as compared to two baselines: no offline data, and offline dataset but used without information about the generative policy. Our algorithm bridges online RL and imitation learning for the first time.

Submitted 16 July, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

Comments: Alphabetical order. Corresponding to Rahul Jain

arXiv:2302.03319 [pdf, ps, other]

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Authors: Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen

Abstract: We investigate the extent to which offline demonstration data can improve online learning. It is natural to expect some improvement, but the question is how, and by how much? We show that the degree of improvement must depend on the quality of the demonstration data. To generate portable insights, we focus on Thompson sampling (TS) applied to a multi-armed bandit as a prototypical online learning algorithm and model. The demonstration data is generated by an expert with a given competence level, a notion we introduce. We propose an informed TS algorithm that utilizes the demonstration data in a coherent way through Bayes' rule and derive a prior-dependent Bayesian regret bound. This offers insight into how pretraining can greatly improve online performance and how the degree of improvement increases with the expert's competence level. We also develop a practical, approximate informed TS algorithm through Bayesian bootstrapping and show substantial empirical regret reduction through experiments.

Submitted 17 May, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

Comments: Accepted at ICML 2023

arXiv:2301.12579 [pdf, other]

Sample Efficient Deep Reinforcement Learning via Local Planning

Authors: Dong Yin, Sridhar Thiagarajan, Nevena Lazic, Nived Rajaraman, Botao Hao, Csaba Szepesvari

Abstract: The focus of this work is sample-efficient deep reinforcement learning (RL) with a simulator. One useful property of simulators is that it is typically easy to reset the environment to a previously observed state. We propose an algorithmic framework, named uncertainty-first local planning (UFLP), that takes advantage of this property. Concretely, in each data collection iteration, with some probability, our meta-algorithm resets the environment to an observed state which has high uncertainty, instead of sampling according to the initial-state distribution. The agent-environment interaction then proceeds as in the standard online RL setting. We demonstrate that this simple procedure can dramatically improve the sample cost of several baseline RL algorithms on difficult exploration tasks. Notably, with our framework, we can achieve super-human performance on the notoriously hard Atari game, Montezuma's Revenge, with a simple (distributional) double DQN. Our work can be seen as an efficient approximate implementation of an existing algorithm with theoretical guarantees, which offers an interpretation of the positive empirical results.

Submitted 3 July, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

Comments: 25 pages, 11 figures

arXiv:2212.11945 [pdf, ps, other]

On the Diophantine equations of the form $λ_1U_{n_1} + λ_2U_{n_2} +\ldots + λ_kU_{n_k} = wp_1^{z_1}p_2^{z_2} \cdots p_s^{z_s}$

Authors: Eva Goedhart, Brian Ha, Lily McBeath, Luisa Velasco

Abstract: In this paper, we consider the Diophantine equation $λ_1U_{n_1}+\ldots+λ_kU_{n_k}=wp_1^{z_1} \cdots p_s^{z_s},$ where $\{U_n\}_{n\geq 0}$ is a fixed non-degenerate linear recurrence sequence of order greater than or equal to 2; $w$ is a fixed non-zero integer; $p_1,\dots,p_s$ are fixed, distinct prime numbers; $λ_1,\dots,λ_k$ are strictly positive integers; and $n_1,\dots,n_k,z_1,\dots,z_s$ are non-negative integer unknowns. We prove the existence of an effectively computable upper-bound on the solutions $(n_1,\dots,n_k,z_1,\dots,z_s)$. In our proof, we use lower bounds for linear forms in logarithms, extending the work of Pink and Ziegler (2016), Mazumdar and Rout (2019), Meher and Rout (2017), and Ziegler (2019).

Submitted 22 December, 2022; originally announced December 2022.

Comments: 14 pages, 1 table

MSC Class: 11B37; 11D61; 11D45; 11D72; 11J86

arXiv:2210.08198

Distributionally Robust Multiclass Classification and Applications in Deep Image Classifiers

Authors: Ruidi Chen, Boran Hao, Ioannis Ch. Paschalidis

Abstract: We develop a Distributionally Robust Optimization (DRO) formulation for Multiclass Logistic Regression (MLR), which could tolerate data contaminated by outliers. The DRO framework uses a probabilistic ambiguity set defined as a ball of distributions that are close to the empirical distribution of the training set in the sense of the Wasserstein metric. We relax the DRO formulation into a regularized learning problem whose regularizer is a norm of the coefficient matrix. We establish out-of-sample performance guarantees for the solutions to our model, offering insights on the role of the regularizer in controlling the prediction error. We apply the proposed method in rendering deep Vision Transformer (ViT)-based image classifiers robust to random and adversarial attacks. Specifically, using the MNIST and CIFAR-10 datasets, we demonstrate reductions in test error rate by up to 83.5% and loss by up to 91.3% compared with baseline methods, by adopting a novel random training method.

Submitted 25 March, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

Comments: This work was intended as a replacement of arXiv:2109.12772 and any subsequent updates will appear there

arXiv:2208.12657 [pdf, other]

Multi tasks RetinaNet for mitosis detection

Authors: Chen Yang, Wang Ziyue, Fang Zijie, Bian Hao, Zhang Yongbing

Abstract: The count of mitotic cells is a key feature in tumor diagnosis. However, due to the variability of mitotic cell morphology, it is a highly challenging task to detect mitotic cells in tumor tissues. At the same time, although advanced deep learning methods have achieved great success in cell detection, the performance is often unsatisfactory when tested on data from another domain (i.e. different tumor types and different scanners). Therefore, it is necessary to develop algorithms for detecting mitotic cells that are robust in domain-shift scenarios. Our work further proposes a foreground detection and tumor classification task based on the baseline (RetinaNet), and utilizes data augmentation to improve the domain generalization performance of our model. We achieve state-of-the-art performance (F1 score: 0.5809) on the challenging preliminary test dataset.

Submitted 26 August, 2022; originally announced August 2022.

arXiv:2206.04640 [pdf, ps, other]

Regret Bounds for Information-Directed Reinforcement Learning

Authors: Botao Hao, Tor Lattimore

Abstract: Information-directed sampling (IDS) has revealed its potential as a data-efficient algorithm for reinforcement learning (RL). However, theoretical understanding of IDS for Markov Decision Processes (MDPs) is still limited. We develop novel information-theoretic tools to bound the information ratio and cumulative information gain about the learning target. Our theoretical results shed light on the importance of choosing the learning target such that the practitioners can balance the computation and regret bounds. As a consequence, we derive prior-free Bayesian regret bounds for vanilla-IDS which learns the whole environment under tabular finite-horizon MDPs. In addition, we propose a computationally-efficient regularized-IDS that maximizes an additive form rather than the ratio form and show that it enjoys the same regret bound as vanilla-IDS. With the aid of rate-distortion theory, we improve the regret bound by learning a surrogate, less informative environment. Furthermore, we extend our analysis to linear MDPs and prove similar regret bounds for Thompson sampling as a by-product.

Submitted 24 November, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

Comments: Accepted at NeurIPS 2022

arXiv:2205.10895 [pdf, ps, other]

Contextual Information-Directed Sampling

Authors: Botao Hao, Tor Lattimore, Chao Qin

Abstract: Information-directed sampling (IDS) has recently demonstrated its potential as a data-efficient reinforcement learning algorithm. However, it is still unclear what is the right form of information ratio to optimize when contextual information is available. We investigate the IDS design through two contextual bandit problems: contextual bandits with graph feedback and sparse linear contextual bandits. We provably demonstrate the advantage of contextual IDS over conditional IDS and emphasize the importance of considering the context distribution. The main message is that an intelligent agent should invest more on the actions that are beneficial for the future unseen contexts while the conditional IDS can be myopic. We further propose a computationally-efficient version of contextual IDS based on Actor-Critic and evaluate it empirically on a neural network contextual bandit.

Submitted 9 June, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

Comments: Accepted at ICML 2022

arXiv:2202.09867 [pdf, other]

Interacting Contour Stochastic Gradient Langevin Dynamics

Authors: Wei Deng, Siqi Liang, Botao Hao, Guang Lin, Faming Liang

Abstract: We propose an interacting contour stochastic gradient Langevin dynamics (ICSGLD) sampler, an embarrassingly parallel multiple-chain contour stochastic gradient Langevin dynamics (CSGLD) sampler with efficient interactions. We show that ICSGLD can be theoretically more efficient than a single-chain CSGLD with an equivalent computational budget. We also present a novel random-field function, which facilitates the estimation of self-adapting parameters in big data and obtains free mode explorations. Empirically, we compare the proposed algorithm with popular benchmark methods for posterior sampling. The numerical results show a great potential of ICSGLD for large-scale uncertainty estimation tasks.

Submitted 20 February, 2022; originally announced February 2022.

Comments: ICLR 2022

arXiv:2202.06839 [pdf, other]

doi 10.1145/3491102.3517563

Close-up and Whispering: An Understanding of Multimodal and Parasocial Interactions in YouTube ASMR videos

Authors: Shuo Niu, Hugh S. Manon, Ava Bartolome, Nguyen B. Ha, Keegan Veazey

Abstract: ASMR (Autonomous Sensory Meridian Response) has grown to immense popularity on YouTube and drawn HCI designers' attention to its effects and applications in design. YouTube ASMR creators incorporate visual elements, sounds, motifs of touching and tasting, and other scenarios in multisensory video interactions to deliver enjoyable and relaxing experiences to their viewers. ASMRtists engage viewers by social, physical, and task attractions. Research has identified the benefits of ASMR in mental wellbeing. However, ASMR remains an understudied phenomenon in the HCI community, constraining designers' ability to incorporate ASMR in video-based designs. This work annotates and analyzes the interaction modalities and parasocial attractions of 2663 videos to identify unique experiences. YouTube comment sections are also analyzed to compare viewers' responses to different ASMR interactions. We find that ASMR videos are experiences of multimodal social connection, relaxing physical intimacy, and sensory-rich activity observation. Design implications are discussed to foster future ASMR-augmented video interactions.

Submitted 23 October, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

Comments: 4 pages

arXiv:2201.06868 [pdf, other]

A Study on the Ambiguity in Human Annotation of German Oral History Interviews for Perceived Emotion Recognition and Sentiment Analysis

Authors: Michael Gref, Nike Matthiesen, Sreenivasa Hikkal Venugopala, Shalaka Satheesh, Aswinkumar Vijayananth, Duc Bach Ha, Sven Behnke, Joachim Köhler

Abstract: For research in audiovisual interview archives often it is not only of interest what is said but also how. Sentiment analysis and emotion recognition can help capture, categorize and make these different facets searchable. In particular, for oral history archives, such indexing technologies can be of great interest. These technologies can help understand the role of emotions in historical remembering. However, humans often perceive sentiments and emotions ambiguously and subjectively. Moreover, oral history interviews have multi-layered levels of complex, sometimes contradictory, sometimes very subtle facets of emotions. Therefore, the question arises of the chance machines and humans have capturing and assigning these into predefined categories. This paper investigates the ambiguity in human perception of emotions and sentiment in German oral history interviews and the impact on machine learning systems. Our experiments reveal substantial differences in human perception for different emotions. Furthermore, we report from ongoing machine learning experiments with different modalities. We show that the human perceptual ambiguity and other challenges, such as class imbalance and lack of training data, currently limit the opportunities of these technologies for oral history archives. Nonetheless, our work uncovers promising observations and possibilities for further research.

Submitted 18 January, 2022; originally announced January 2022.

Comments: Submitted to LREC 2022

arXiv:2112.02275 [pdf, other]

A Multi-Strategy based Pre-Training Method for Cold-Start Recommendation

Authors: Bowen Hao, Hongzhi Yin, Jing Zhang, Cuiping Li, Hong Chen

Abstract: The cold-start problem is a fundamental challenge for recommendation tasks. The recent self-supervised learning (SSL) on Graph Neural Networks (GNNs) model, PT-GNN, pre-trains the GNN model to reconstruct the cold-start embeddings and has shown great potential for cold-start recommendation. However, due to the over-smoothing problem, PT-GNN can only capture up to 3-order relations, which cannot provide much useful auxiliary information to depict the target cold-start user or item. Besides, the embedding reconstruction task only considers the intra-correlations within the subgraph of users and items, while ignoring the inter-correlations across different subgraphs. To solve the above challenges, we propose a multi-strategy based pre-training method for cold-start recommendation (MPT), which extends PT-GNN from the perspective of model architecture and pretext tasks to improve the cold-start recommendation performance. Specifically, in terms of the model architecture, in addition to the short-range dependencies of users and items captured by the GNN encoder, we introduce a Transformer encoder to capture long-range dependencies. In terms of the pretext task, in addition to considering the intra-correlations of users and items by the embedding reconstruction task, we add an embedding contrastive learning task to capture inter-correlations of users and items. We train the GNN and Transformer encoders on these pretext tasks under the meta-learning setting to simulate the real cold-start scenario, so that the model can be easily and rapidly adapted to new cold-start users and items. Experiments on three public recommendation datasets show the superiority of the proposed MPT model against the vanilla GNN models, the pre-training GNN model on user/item embedding inference and the recommendation task.

Submitted 21 May, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

arXiv:2112.02274 [pdf, other]

Self-supervised Graph Learning for Occasional Group Recommendation

Authors: Bowen Hao, Hongzhi Yin, Cuiping Li, Hong Chen

Abstract: As an important branch of recommender systems, occasional group recommendation has received more and more attention. In this scenario, each occasional group (cold-start group) has no or few historical interacted items. As each occasional group has extremely sparse interactions with items, traditional group recommendation methods cannot learn high-quality group representations. The recently proposed Graph Neural Networks (GNNs), which incorporate the high-order neighbors of the target occasional group, can alleviate the above problem to some extent. However, these GNNs still cannot explicitly strengthen the embedding quality of the high-order neighbors with few interactions. Motivated by the Self-supervised Learning technique, which is able to find the correlations within the data itself, we propose a self-supervised graph learning framework, which takes the user/item/group embedding reconstruction as the pretext task to enhance the embeddings of the cold-start users/items/groups. In order to explicitly enhance the high-order cold-start neighbors' embedding quality, we further introduce an embedding enhancer, which leverages the self-attention mechanism to improve the embedding quality for them. Comprehensive experiments show the advantages of our proposed framework over the state-of-the-art methods.

Submitted 21 July, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

Comments: This paper uses self-supervised learning technique to enhance the embeddings of users/groups/items, the idea is novel in group recommendation scenario. However, some presentations need to be revised, so as to let the readers understand

arXiv:2111.11721 [pdf, ps, other]

doi 10.1063/5.0069792

Effects of resonant magnetic perturbations on neutral beam heating in a tokamak

Authors: Youjun Hu, Yingfeng Xu, Baolong Hao, Guoqiang Li, Kaiyang He, Youwen Sun, Li Li, Jinfang Wang, Juan Huang, Lei Ye, Xiaotao Xiao, Feng Wang, Chengkang Pan, Yongjian Xu

Abstract: Effects of resonant magnetic perturbations (RMPs) on tangential neutral beam heating in the EAST tokamak are studied numerically. RMPs with linear resistive magnetohydrodynamics response are used in the modeling. A variety of representing configurations of RMP coil currents are examined and their effects on the NBI heating efficiency are compared, in order to find a parameter window where deleterious effects of RMPs on NBI heating efficiency are minimized. It is found that the internal redistribution of fast ions by RMPs induces local accumulation of fast ions, resulting in higher local fast ion pressure than the case without RMPs. It is also found that the toroidal phasing of the RMP with respect to the fast ion source has slight effects on the steady-state radial profile of fast ions. The dependence of fast ion loss fraction on the RMP up-down phase difference shows similar behavior as the dependence of the radial width of chaotic magnetic field on the phase difference. A statistical method of identifying resonances between RMPs and lost fast ions is proposed and the results indicate that some resonances between RMPs and lost passing particles may be of non-integer fractional order, rather than the usual integer order.

Submitted 23 November, 2021; originally announced November 2021.

Journal ref: Phys. Plasmas 28, 122502 (2021)

arXiv:2111.04754 [pdf, other]

doi 10.1103/PhysRevLett.128.110402

Decoherence Induced Exceptional Points in a Dissipative Superconducting Qubit

Authors: Weijian Chen, Maryam Abbasi, Byung Ha, Serra Erdamar, Yogesh N. Joglekar, Kater W. Murch

Abstract: Open quantum systems interacting with an environment exhibit dynamics described by the combination of dissipation and coherent Hamiltonian evolution. Taken together, these effects are captured by a Liouvillian superoperator. The degeneracies of the (generically non-Hermitian) Liouvillian are exceptional points, which are associated with critical dynamics as the system approaches steady state. We use a superconducting transmon circuit coupled to an engineered environment to observe two different types of Liouvillian exceptional points that arise either from the interplay of energy loss and decoherence or purely due to decoherence. By dynamically tuning the Liouvillian superoperators in real time we observe a non-Hermiticity-induced chiral state transfer. Our study motivates a new look at open quantum system dynamics from the vantage of Liouvillian exceptional points, enabling applications of non-Hermitian dynamics in the understanding and control of open quantum systems.

Submitted 8 November, 2021; originally announced November 2021.

Comments: 11 pages, 10 figures

Journal ref: Phys. Rev. Lett. 128, 110402 (2022)

arXiv:2110.04629 [pdf, other]

The Neural Testbed: Evaluating Joint Predictions

Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy

Abstract: Predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed: an open-source benchmark for controlled and principled evaluation of agents that generate such predictions. Crucially, the testbed assesses agents not only on the quality of their marginal predictions per input, but also on their joint predictions across many inputs. We evaluate a range of agents using a simple neural network data generating process. Our results indicate that some popular Bayesian deep learning agents do not fare well with joint predictions, even when they can produce accurate marginal predictions. We also show that the quality of joint predictions drives performance in downstream decision tasks. We find these results are robust across a wide range of generative models, and highlight the practical importance of joint predictions to the community.

Submitted 1 November, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

arXiv:2109.12772 [pdf, other]

Distributionally Robust Multiclass Classification and Applications in Deep Image Classifiers

Authors: Ruidi Chen, Boran Hao, Ioannis Paschalidis

Abstract: We develop a Distributionally Robust Optimization (DRO) formulation for Multiclass Logistic Regression (MLR), which could tolerate data contaminated by outliers. The DRO framework uses a probabilistic ambiguity set defined as a ball of distributions that are close to the empirical distribution of the training set in the sense of the Wasserstein metric. We relax the DRO formulation into a regularized learning problem whose regularizer is a norm of the coefficient matrix. We establish out-of-sample performance guarantees for the solutions to our model, offering insights on the role of the regularizer in controlling the prediction error. We apply the proposed method in rendering deep Vision Transformer (ViT)-based image classifiers robust to random and adversarial attacks. Specifically, using the MNIST and CIFAR-10 datasets, we demonstrate reductions in test error rate by up to 83.5% and loss by up to 91.3% compared with baseline methods, by adopting a novel random training method.

Submitted 25 March, 2023; v1 submitted 26 September, 2021; originally announced September 2021.

Comments: 9 pages; Previously this version appeared as arXiv:2210.08198 which was submitted as a new work by accident

arXiv:2108.05533 [pdf, ps, other]

Efficient Local Planning with Linear Function Approximation

Authors: Dong Yin, Botao Hao, Yasin Abbasi-Yadkori, Nevena Lazić, Csaba Szepesvári

Abstract: We study query and computationally efficient planning algorithms with linear function approximation and a simulator. We assume that the agent only has local access to the simulator, meaning that the agent can only query the simulator at states that have been visited before. This setting is more practical than many prior works on reinforcement learning with a generative model. We propose two algorithms, named confident Monte Carlo least square policy iteration (Confident MC-LSPI) and confident Monte Carlo Politex (Confident MC-Politex) for this setting. Under the assumption that the Q-functions of all policies are linear in known features of the state-action pairs, we show that our algorithms have polynomial query and computational costs in the dimension of the features, the effective planning horizon, and the targeted sub-optimality, while these costs are independent of the size of the state space. One technical contribution of our work is the introduction of a novel proof technique that makes use of a virtual policy iteration algorithm. We use this method to leverage existing results on $\ell_\infty$-bounded approximate policy iteration to show that our algorithm can learn the optimal policy for the given initial state even only with local access to the simulator. We believe that this technique can be extended to broader settings beyond this work.

Submitted 4 February, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

Comments: Algorithmic Learning Theory 2022

arXiv:2107.02017 [pdf]

Chaotic diffusion in multi-scale turbulence

Authors: Yueheng Huang, Nong Xiang, Jiale Chen, Zechen Wang, Xiang Gao, Xiaodong Lin, Junle Qu, Yuanxi Wan, Zong Xu, Muquan Wu, Xiang Zhu, Baolong Hao

Abstract: Turbulence plays a very important role in determining the transport of energy and particles in tokamaks. This work is devoted to studying the chaotic diffusion in multi-scale turbulence in the context of the nonlinear wave-particle interaction. Turbulent waves with different scales of characteristic wavelengths can interact with the same group of charged particles when their phase velocity is close to the velocities of the charged particles. A multi-wavenumber standard mapping is developed to model the chaotic diffusion in multi-scale turbulence. The diffusion coefficient is obtained by calculating the correlation functions analytically. It is found that the contribution of the largest scale turbulence dominates the deviation from the quasi-linear diffusion coefficient. Increasing the overlap parameters of the smaller scale turbulence by just increasing the wavenumber cannot make the diffusion coefficient equal to the quasi-linear diffusion coefficient for a finite wave amplitude. Especially, in two-scale turbulence, the diffusion coefficient is mostly over the quasi-linear diffusion coefficient in the large wavenumber (of the smaller scale turbulence) limit. As more scales of components are added in the turbulence, the diffusion coefficient approaches the quasi-linear diffusion coefficient. The results can also be applied to other resonance-induced multi-scale turbulence in Hamiltonian systems with 1.5 or 2 degrees of freedom.

Submitted 30 June, 2021; originally announced July 2021.

Comments: 8 pages, 3 figures

MSC Class: 37-02

arXiv:2106.06467 [pdf, other]

A Large-Scale Rich Context Query and Recommendation Dataset in Online Knowledge-Sharing

Authors: Bin Hao, Min Zhang, Weizhi Ma, Shaoyun Shi, Xinxing Yu, Houzhi Shan, Yiqun Liu, Shaoping Ma

Abstract: Data plays a vital role in machine learning studies. In the research of recommendation, both user behaviors and side information are helpful to model users. So, large-scale real scenario datasets with abundant user behaviors will contribute a lot. However, it is not easy to get such datasets as most of them are only held and protected by companies. In this paper, a new large-scale dataset collected from a knowledge-sharing platform is presented, which is composed of around 100M interactions collected within 10 days, 798K users, 165K questions, 554K answers, 240K authors, 70K topics, and more than 501K user query keywords. There are also descriptions of users, answers, questions, authors, and topics, which are anonymous. Note that each user's latest query keywords have not been included in previous open datasets, which reveal users' explicit information needs. We characterize the dataset and demonstrate its potential applications for recommendation study. Multiple experiments show the dataset can be used to evaluate algorithms in general top-N recommendation, sequential recommendation, and context-aware recommendation. This dataset can also be used to integrate search and recommendation and recommendation with negative feedback. Besides, tasks beyond recommendation, such as user gender prediction, most valuable answerer identification, and high-quality answer recognition, can also use this dataset. To the best of our knowledge, this is the largest real-world interaction dataset for personalized recommendation.

Submitted 11 June, 2021; originally announced June 2021.

Comments: 7 pages

arXiv:2106.01660 [pdf, ps, other]

Bandit Phase Retrieval

Authors: Tor Lattimore, Botao Hao

Abstract: We study a bandit version of phase retrieval where the learner chooses actions $(A_t)_{t=1}^n$ in the $d$-dimensional unit ball and the expected reward is $\langle A_t, θしーた_\star\rangle^2$ where $θしーた_\star \in \mathbb R^d$ is an unknown parameter vector. We prove that the minimax cumulative regret in this problem is $\smash{\tilde Θしーた(d \sqrt{n})}$, which improves on the best known bounds by a factor of $\smash{\sqrt{d}}$. We also show that the minimax simple regret is $\smash{\tilde Θしーた(d / \sqrt{n})}$ and that this is only achievable by an adaptive algorithm. Our analysis shows that an apparently convincing heuristic for guessing lower bounds can be misleading and that uniform bounds on the information ratio for information-directed sampling are not sufficient for optimal regret.

Submitted 4 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

arXiv:2105.14267 [pdf, other]

Information Directed Sampling for Sparse Linear Bandits

Authors: Botao Hao, Tor Lattimore, Wei Deng

Abstract: Stochastic sparse linear bandits offer a practical model for high-dimensional online decision-making problems and have a rich information-regret structure. In this work we explore the use of information-directed sampling (IDS), which naturally balances the information-regret trade-off. We develop a class of information-theoretic Bayesian regret bounds that nearly match existing lower bounds on a variety of problem instances, demonstrating the adaptivity of IDS. To efficiently implement sparse IDS, we propose an empirical Bayesian approach for sparse posterior sampling using a spike-and-slab Gaussian-Laplace prior. Numerical results demonstrate significant regret reductions by sparse IDS relative to several baselines.

Submitted 29 May, 2021; originally announced May 2021.

arXiv:2105.14231 [pdf, other]

Development, Implementation, and Experimental Outdoor Evaluation of Quadcopter Controllers for Computationally Limited Embedded Systems

Authors: Juan Paredes, Prashin Sharma, Brian Ha, Manuel Lanchares, Ella Atkins, Peter Gaskell, Ilya Kolmanovsky

Abstract: Quadcopters are increasingly used for applications ranging from hobby to industrial products and services. This paper serves as a tutorial on the design, simulation, implementation, and experimental outdoor testing of digital quadcopter flight controllers, including Explicit Model Predictive Control, Linear Quadratic Regulator, and Proportional Integral Derivative. A quadcopter was flown in an outdoor testing facility and made to track an inclined, circular path at different tangential velocities under ambient wind conditions. Controller performance was evaluated via multiple metrics, such as position tracking error, velocity tracking error, and onboard computation time. Challenges related to the use of computationally limited embedded hardware and flight in an outdoor environment are addressed with proposed solutions.

Submitted 1 June, 2021; v1 submitted 29 May, 2021; originally announced May 2021.

arXiv:2102.06234 [pdf, other]

Optimization Issues in KL-Constrained Approximate Policy Iteration

Authors: Nevena Lazić, Botao Hao, Yasin Abbasi-Yadkori, Dale Schuurmans, Csaba Szepesvári

Abstract: Many reinforcement learning algorithms can be seen as versions of approximate policy iteration (API). While standard API often performs poorly, it has been shown that learning can be stabilized by regularizing each policy update by the KL-divergence to the previous policy. Popular practical algorithms such as TRPO, MPO, and VMPO replace regularization by a constraint on KL-divergence of consecutive policies, arguing that this is easier to implement and tune. In this work, we study this implementation choice in more detail. We compare the use of KL divergence as a constraint vs. as a regularizer, and point out several optimization issues with the widely-used constrained approach. We show that the constrained algorithm is not guaranteed to converge even on simple problem instances where the constrained problem can be solved exactly, and in fact incurs linear expected regret. With approximate implementation using softmax policies, we show that regularization can improve the optimization landscape of the original objective. We demonstrate these issues empirically on several bandit and RL environments.

Submitted 11 February, 2021; originally announced February 2021.

arXiv:2102.03607 [pdf, other]

Bootstrapping Fitted Q-Evaluation for Off-Policy Inference

Authors: Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvári, Mengdi Wang

Abstract: Bootstrapping provides a flexible and effective approach for assessing the quality of batch reinforcement learning, yet its theoretical property is less understood. In this paper, we study the use of bootstrapping in off-policy evaluation (OPE), and in particular, we focus on the fitted Q-evaluation (FQE) that is known to be minimax-optimal in the tabular and linear-model cases. We propose a bootstrapping FQE method for inferring the distribution of the policy evaluation error and show that this method is asymptotically efficient and distributionally consistent for off-policy statistical inference. To overcome the computation limit of bootstrapping, we further adapt a subsampling procedure that improves the runtime by an order of magnitude. We numerically evaluate the bootstrapping method in classical RL environments for confidence interval estimation, estimating the variance of off-policy evaluator, and estimating the correlation between multiple off-policy evaluators.

Submitted 22 May, 2022; v1 submitted 6 February, 2021; originally announced February 2021.

Comments: Accepted at ICML 2021

arXiv:2101.03706 [pdf, other]

doi 10.1145/3411764.3445397

#StayHome #WithMe: How Do YouTubers Help with COVID-19 Loneliness?

Authors: Shuo Niu, Ava Bartolome, Cat Mai, Nguyen B. Ha

Abstract: Loneliness threatens public mental wellbeing during COVID-19. In response, YouTube creators participated in the #StayHome #WithMe movement (SHWM) and made myriad videos for people experiencing loneliness or boredom at home. User-shared videos generate parasocial attachment and virtual connectedness. However, there is limited knowledge of how creators contributed videos during disasters to provide social provisions as disaster-relief. Grounded on Weiss's loneliness theory, this work analyzed 1488 SHWM videos to examine video sharing as a pathway to social provisions. Findings suggested that skill and knowledge sharing, entertaining arts, homelife activities, live chatting, and gameplay were the most popular video styles. YouTubers utilized parasocial relationships to form a space for staying away from the disaster. SHWM YouTubers provided friend-like, mentor-like, and family-like provisions through videos in different styles. Family-like provisions led to the highest overall viewer engagement. Based on the findings, design implications for supporting viewers' mental wellbeing in disasters are discussed.

Submitted 13 January, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

Comments: CHI Conference on Human Factors in Computing Systems (CHI '21), May 8--13, 2021, Yokohama, Japan

Journal ref: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems

arXiv:2012.14234 [pdf, other]

Recommending Courses in MOOCs for Jobs: An Auto Weak Supervision Approach

Authors: Bowen Hao, Jing Zhang, Cuiping Li, Hong Chen, Hongzhi Yin

Abstract: The proliferation of massive open online courses (MOOCs) demands an effective way of course recommendation for jobs posted in recruitment websites, especially for the people who take MOOCs to find new jobs. Despite the advances of supervised ranking models, the lack of enough supervised signals prevents us from directly learning a supervised ranking model. This paper proposes a general automated weak supervision framework AutoWeakS via reinforcement learning to solve the problem. On the one hand, the framework enables training multiple supervised ranking models upon the pseudo labels produced by multiple unsupervised ranking models. On the other hand, the framework enables automatically searching the optimal combination of these supervised and unsupervised models. Systematically, we evaluate the proposed model on several datasets of jobs from different recruitment websites and courses from a MOOCs platform. Experiments show that our model significantly outperforms the classical unsupervised, supervised and weak supervision baselines.

Submitted 28 December, 2020; originally announced December 2020.

arXiv:2012.07064 [pdf, other]

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation

Authors: Bowen Hao, Jing Zhang, Hongzhi Yin, Cuiping Li, Hong Chen

Abstract: Cold-start problem is a fundamental challenge for recommendation tasks. Despite the recent advances on Graph Neural Networks (GNNs) incorporate the high-order collaborative signal to alleviate the problem, the embeddings of the cold-start users and items aren't explicitly optimized, and the cold-start neighbors are not dealt with during the graph convolution in GNNs. This paper proposes to pre-train a GNN model before applying it for recommendation. Unlike the goal of recommendation, the pre-training GNN simulates the cold-start scenarios from the users/items with sufficient interactions and takes the embedding reconstruction as the pretext task, such that it can directly improve the embedding quality and can be easily adapted to the new cold-start users/items. To further reduce the impact from the cold-start neighbors, we incorporate a self-attention-based meta aggregator to enhance the aggregation ability of each graph convolution step, and an adaptive neighbor sampler to select the effective neighbors according to the feedbacks from the pre-training GNN model. Experiments on three public recommendation datasets show the superiority of our pre-training GNN model against the original GNN models on user/item embedding inference and the recommendation task.

Submitted 13 December, 2020; originally announced December 2020.

Showing 1–50 of 97 results for author: Ha, B