(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 80 results for author: Ke, T

.
  1. arXiv:2408.06987  [pdf, other

    stat.ME

    Optimal Network Pairwise Comparison

    Authors: Jiashun Jin, Zheng Tracy Ke, Shengming Luo, Yucong Ma

    Abstract: We are interested in the problem of two-sample network hypothesis testing: given two networks with the same set of nodes, we wish to test whether the underlying Bernoulli probability matrices of the two networks are the same or not. We propose Interlacing Balance Measure (IBM) as a new two-sample testing approach. We consider the {\it Degree-Corrected Mixed-Membership (DCMM)} model for undirected… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 92 pages

    MSC Class: 62H30; 91C20

  2. arXiv:2407.17889  [pdf

    cs.NE

    An Error Discovery and Correction for the Family of V-Shaped BPSO Algorithms

    Authors: Qing Zhao, Chengkui Zhang, Hao Li, Ting Ke

    Abstract: BPSO algorithm is a swarm intelligence optimization algorithm, which has the characteristics of good optimization effect, high efficiency and easy to implement. In recent years, it has been used to optimize a variety of machine learning and deep learning models, such as CNN, LSTM, SVM, etc. But it is easy to fall into local optimum for the lack of exploitation ability. It is found that in the arti… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 25 pages, 11 figures

  3. arXiv:2406.17746  [pdf, other

    cs.CL cs.AI

    Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

    Authors: USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien, Jyothir S V, Mohammad Aflah Khan, Jaydeep Borkar, Christopher A. Choquette-Choo, Jacob Ray Fuehne, Stella Biderman, Tracy Ke, Katherine Lee, Naomi Saphra

    Abstract: Memorization in language models is typically treated as a homogenous phenomenon, neglecting the specifics of the memorized data. We instead model memorization as the effect of a set of complex factors that describe each sample and relate it to the model and corpus. To build intuition around these factors, we break memorization down into a taxonomy: recitation of highly duplicated sequences, recons… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.08698  [pdf, other

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  5. arXiv:2405.17806  [pdf, other

    math.ST

    Entry-Wise Eigenvector Analysis and Improved Rates for Topic Modeling on Short Documents

    Authors: Zheng Tracy Ke, Jingming Wang

    Abstract: Topic modeling is a widely utilized tool in text analysis. We investigate the optimal rate for estimating a topic model. Specifically, we consider a scenario with $n$ documents, a vocabulary of size $p$, and document lengths at the order $N$. When $N\geq c\cdot p$, referred to as the long-document case, the optimal rate is established in the literature at $\sqrt{p/(Nn)}$. However, when $N=o(p)$, r… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 50 pages

    MSC Class: 62H12

    Journal ref: MDPI Mathematics, 2024

  6. arXiv:2405.07691  [pdf, other

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  7. arXiv:2405.01507  [pdf, other

    cs.LG stat.ML

    Accelerating Convergence in Bayesian Few-Shot Classification

    Authors: Tianjun Ke, Haoqun Cao, Feng Zhou

    Abstract: Bayesian few-shot classification has been a focal point in the field of few-shot learning. This paper seamlessly integrates mirror descent-based variational inference into Gaussian process-based few-shot classification, addressing the challenge of non-conjugate inference. By leveraging non-Euclidean geometry, mirror descent achieves accelerated convergence by providing the steepest descent directi… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  8. arXiv:2404.04801  [pdf, ps, other

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  9. arXiv:2403.11013  [pdf, other

    cs.LG math.ST

    Improved Algorithm and Bounds for Successive Projection

    Authors: Jiashun Jin, Zheng Tracy Ke, Gabriel Moryoussef, Jiajun Tang, Jingming Wang

    Abstract: Given a $K$-vertex simplex in a $d$-dimensional space, suppose we measure $n$ points on the simplex with noise (hence, some of the observed points fall outside the simplex). Vertex hunting is the problem of estimating the $K$ vertices of the simplex. A popular vertex hunting algorithm is successive projection algorithm (SPA). However, SPA is observed to perform unsatisfactorily under strong noise… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 32 pages, 5 figures

  10. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at… ▽ More

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  11. arXiv:2402.10885  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    3D Diffuser Actor: Policy Diffusion with 3D Scene Representations

    Authors: Tsung-Wei Ke, Nikolaos Gkanatsios, Katerina Fragkiadaki

    Abstract: Diffusion policies are conditional diffusion models that learn robot action distributions conditioned on the robot and environment state. They have recently shown to outperform both deterministic and alternative action distribution learning formulations. 3D robot policies use 3D scene feature representations aggregated from a single or multiple camera views using sensed depth. They have shown to g… ▽ More

    Submitted 25 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: First two authors contributed equally

  12. arXiv:2402.06559  [pdf, other

    cs.LG cs.AI cs.CL cs.RO

    Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following

    Authors: Brian Yang, Huangyuan Su, Nikolaos Gkanatsios, Tsung-Wei Ke, Ayush Jain, Jeff Schneider, Katerina Fragkiadaki

    Abstract: Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable reward function and the likelihood under the data distribution captured by a diffusion model. Reward-gradient guided denoising requires a differentiable reward fun… ▽ More

    Submitted 16 July, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  13. Recent Advances in Text Analysis

    Authors: Zheng Tracy Ke, Pengsheng Ji, Jiashun Jin, Wanshan Li

    Abstract: Text analysis is an interesting research area in data science and has various applications, such as in artificial intelligence, biomedical research, and engineering. We review popular methods for text analysis, ranging from topic modeling to the recent neural language models. In particular, we review Topic-SCORE, a statistical approach to topic modeling, and discuss how to use it to analyze MADSta… ▽ More

    Submitted 7 February, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Journal ref: Annual Review of Statistics and Its Application 2024 11:1

  14. arXiv:2311.16102  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback

    Authors: Mihir Prabhudesai, Tsung-Wei Ke, Alexander C. Li, Deepak Pathak, Katerina Fragkiadaki

    Abstract: The advancements in generative modeling, particularly the advent of diffusion models, have sparked a fundamental question: how can these models be effectively used for discriminative tasks? In this work, we find that generative models can be great test-time adapters for discriminative models. Our method, Diffusion-TTA, adapts pre-trained discriminative models such as image classifiers, segmenters… ▽ More

    Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 Webpage with Code: https://diffusion-tta.github.io/

  15. arXiv:2310.17082  [pdf, ps, other

    astro-ph.HE

    Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures, Accepted by the APJL

  16. arXiv:2310.10379  [pdf, other

    cs.LG stat.ML

    Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification

    Authors: Tianjun Ke, Haoqun Cao, Zenan Ling, Feng Zhou

    Abstract: Meta-learning has demonstrated promising results in few-shot classification (FSC) by learning to solve new problems using prior knowledge. Bayesian methods are effective at characterizing uncertainty in FSC, which is crucial in high-risk fields. In this context, the logistic-softmax likelihood is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classificat… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  17. Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

    Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t… ▽ More

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49pages, 11figures

    Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

  18. arXiv:2306.05363  [pdf, other

    stat.ME cs.LG math.ST stat.AP

    Subject clustering by IF-PCA and several recent methods

    Authors: Dieyi Chen, Jiashun Jin, Zheng Tracy Ke

    Abstract: Subject clustering (i.e., the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of great interest. In recent years, many approaches were proposed, among which unsupervised deep learning (UDL) has received a great deal of attention. Two interesting questions are (a) how to combine the strengths of UDL and other approaches, and (b) how these… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  19. arXiv:2306.01089  [pdf, other

    cs.SI cs.LG stat.ME stat.ML

    Semi-supervised Community Detection via Structural Similarity Metrics

    Authors: Yicong Jiang, Tracy Ke

    Abstract: Motivated by social network analysis and network-based recommendation systems, we study a semi-supervised community detection problem in which the objective is to estimate the community label of a new node using the network topology and partially observed community labels of existing nodes. The network is modeled using a degree-corrected stochastic block model, which allows for severe degree heter… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 9 pages, 8 figures, accepted by the 11th International Conference on Learning Representations (ICLR 2023)

  20. arXiv:2305.17030  [pdf, other

    astro-ph.HE hep-ph

    The First LHAASO Catalog of Gamma-Ray Sources

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.… ▽ More

    Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 40 pages, 13 figures, 4 tables

    Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

  21. Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The diffuse Galactic $γがんま$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γがんま$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer ar… ▽ More

    Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

    Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

  22. arXiv:2303.05024  [pdf, other

    math.ST cs.LG cs.SI stat.ML

    Phase transition for detecting a small community in a large network

    Authors: Jiashun Jin, Zheng Tracy Ke, Paxton Turner, Anru R. Zhang

    Abstract: How to detect a small community in a large network is an interesting problem, including clique detection as a special case, where a naive degree-based $χかい^2$-test was shown to be powerful in the presence of an Erdős-Renyi background. Using Sinkhorn's theorem, we show that the signal captured by the $χかい^2$-test may be a modeling artifact, and it may disappear once we replace the Erdős-Renyi model by… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  23. arXiv:2301.01381  [pdf, other

    stat.ME math.ST stat.ML

    Testing High-dimensional Multinomials with Applications to Text Analysis

    Authors: T. Tony Cai, Zheng Tracy Ke, Paxton Turner

    Abstract: Motivated by applications in text mining and discrete distribution inference, we investigate the testing for equality of probability mass functions of $K$ groups of high-dimensional multinomial distributions. A test statistic, which is shown to have an asymptotic standard normal distribution under the null, is proposed. The optimal detection boundary is established, and the proposed test is shown… ▽ More

    Submitted 24 November, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

  24. arXiv:2210.00314  [pdf, other

    cs.CV cs.AI cs.LG

    Learning Hierarchical Image Segmentation For Recognition and By Recognition

    Authors: Tsung-Wei Ke, Sangwoo Mo, Stella X. Yu

    Abstract: Large vision and language models learned directly through image-text associations often lack detailed visual substantiation, whereas image segmentation tasks are treated separately from recognition, supervisedly learned without interconnections. Our key observation is that, while an image can be recognized in multiple ways, each has a consistent part-and-whole visual organization. Segmentation thu… ▽ More

    Submitted 2 May, 2024; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: ICLR 2024 (spotlight). First two authors contributed equally. Code available at https://github.com/twke18/CAST

    ACM Class: I.4.6; I.4.10; I.5.3

  25. arXiv:2207.12601  [pdf

    astro-ph.HE hep-ex

    Flux Variations of Cosmic Ray Air Showers Detected by LHAASO-KM2A During a Thunderstorm on 10 June 2021

    Authors: LHAASO Collaboration, F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Zhe Cao, Zhen Cao, J. Chang, J. F. Chang, E. S. Chen, Liang Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, S. H. Chen, S. Z. Chen, T. L. Chen, X. J. Chen , et al. (248 additional authors not shown)

    Abstract: The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations… ▽ More

    Submitted 6 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 18 pages, 11 figures

    Journal ref: Chinese Phys. C 47 015001 (2023)

  26. arXiv:2207.06933  [pdf, other

    cond-mat.mes-hall cond-mat.supr-con

    Controlling Andreev bound states with the magnetic vector potential

    Authors: Christian M. Moehle, Prasanna K. Rout, Nayan A. Jainandunsing, Dibyendu Kuiri, Chung Ting Ke, Di Xiao, Candice Thomas, Michael J. Manfra, Michal P. Nowak, Srijit Goswami

    Abstract: Tunneling spectroscopy measurements are often used to probe the energy spectrum of Andreev bound states (ABSs) in semiconductor-superconductor hybrids. Recently, this spectroscopy technique has been incorporated into planar Josephson junctions (JJs) formed in two-dimensional electron gases, a potential platform to engineer phase-controlled topological superconductivity. Here, we perform ABS spectr… ▽ More

    Submitted 15 November, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

    Journal ref: Nano Lett. 22, 8601 (2022)

  27. arXiv:2204.12087  [pdf, other

    math.ST

    Optimal Network Membership Estimation Under Severe Degree Heterogeneity

    Authors: Zheng Tracy Ke, Jingming Wang

    Abstract: Real networks often have severe degree heterogeneity, with the maximum, average, and minimum node degrees differing significantly. This paper examines the impact of degree heterogeneity on statistical limits of network data analysis. Introducing the heterogeneity distribution (HD) under a degree-corrected mixed-membership network model, we show that the optimal rate of mixed membership estimation… ▽ More

    Submitted 22 July, 2024; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 109 pages, 8 figures

  28. arXiv:2204.11432  [pdf, other

    cs.CV cs.LG

    Unsupervised Hierarchical Semantic Segmentation with Multiview Cosegmentation and Clustering Transformers

    Authors: Tsung-Wei Ke, Jyh-Jing Hwang, Yunhui Guo, Xudong Wang, Stella X. Yu

    Abstract: Unsupervised semantic segmentation aims to discover groupings within and across images that capture object and view-invariance of a category without external supervision. Grouping naturally has levels of granularity, creating ambiguity in unsupervised segmentation. Existing methods avoid this ambiguity and treat it as a factor outside modeling, whereas we embrace it and desire hierarchical groupin… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: In CVPR 2022. Webpage & Code: https://twke18.github.io/projects/hsg.html

  29. arXiv:2204.11194  [pdf, other

    cs.DL

    Co-citation and Co-authorship Networks of Statisticians

    Authors: Pengsheng Ji, Jiashun Jin, Zheng Tracy Ke, Wanshan Li

    Abstract: We collected and cleaned a large data set on publications in statistics. The data set consists of the coauthor relationships and citation relationships of 83, 331 papers published in 36 representative journals in statistics, probability, and machine learning, spanning 41 years. The data set allows us to construct many different networks, and motivates a number of research problems about the resear… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: 61 pages, 16 figures

  30. arXiv:2204.11109  [pdf, other

    math.ST

    Power Enhancement and Phase Transitions for Global Testing of the Mixed Membership Stochastic Block Model

    Authors: Louis Cammarata, Zheng Tracy Ke

    Abstract: The mixed-membership stochastic block model (MMSBM) is a common model for social networks. Given an $n$-node symmetric network generated from a $K$-community MMSBM, we would like to test $K=1$ versus $K>1$. We first study the degree-based $χかい^2$ test and the orthodox Signed Quadrilateral (oSQ) test. These two statistics estimate an order-2 polynomial and an order-4 polynomial of a "signal" matrix,… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 78 pages, 6 figures

  31. arXiv:2204.11097  [pdf, other

    cs.SI stat.ME

    The SCORE normalization, especially for highly heterogeneous network and text data

    Authors: Zheng Tracy Ke, Jiashun Jin

    Abstract: SCORE was introduced as a spectral approach to network community detection. Since many networks have severe degree heterogeneity, the ordinary spectral clustering (OSC) approach to community detection may perform unsatisfactorily. SCORE alleviates the effect of degree heterogeneity by introducing a new normalization idea in the spectral domain and makes OSC more effective. SCORE is easy to use and… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 34 pages, 5 figures, 7 tables

  32. arXiv:2203.15075  [pdf, other

    math.ST

    A Comparison of Hamming Errors of Representative Variable Selection Methods

    Authors: Zheng Tracy Ke, Longlin Wang

    Abstract: Lasso is a celebrated method for variable selection in linear models, but it faces challenges when the variables are moderately or strongly correlated. This motivates alternative approaches such as using a non-convex penalty, adding a ridge regularization, or conducting a post-Lasso thresholding. In this paper, we compare Lasso with 5 other methods: Elastic net, SCAD, forward selection, thresholde… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 87 pages; 13 figures

    Journal ref: Tenth International Conference on Learning Representations (ICLR 2022)

  33. Peta-electron volt gamma-ray emission from the Crab Nebula

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, H. Cai, J. T. Cai, Zhe Cao, J. Chang, J. F. Chang, B. M. Chen, E. S. Chen, J. Chen, Liang Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen , et al. (250 additional authors not shown)

    Abstract: The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γがんま$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ pet… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 43 pages, 13 figures, 2 tables; Published in Science

    Journal ref: Science, 2021, Vol 373, Issue 6553, pp. 425-430

  34. arXiv:2110.04381  [pdf, other

    stat.ME stat.AP

    Allocation of COVID-19 Testing Budget on a Commute Network of Counties

    Authors: Yaxuan Huang, Zheng Tracy Ke, Jiashun Jin

    Abstract: The screening testing is an effective tool to control the early spread of an infectious disease such as COVID-19. When the total testing capacity is limited, we aim to optimally allocate testing resources among n counties. We build a (weighted) commute network on counties, with the weight between two counties a decreasing function of their traffic distance. We introduce a network-based disease mod… ▽ More

    Submitted 24 March, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

  35. InSbAs two-dimensional electron gases as a platform for topological superconductivity

    Authors: Christian M. Moehle, Chung Ting Ke, Qingzhen Wang, Candice Thomas, Di Xiao, Saurabh Karwal, Mario Lodari, Vincent van de Kerkhof, Ruben Termaat, Geoffrey C. Gardner, Giordano Scappucci, Michael J. Manfra, Srijit Goswami

    Abstract: Topological superconductivity can be engineered in semiconductors with strong spin-orbit interaction coupled to a superconductor. Experimental advances in this field have often been triggered by the development of new hybrid material systems. Among these, two-dimensional electron gases (2DEGs) are of particular interest due to their inherent design flexibility and scalability. Here we discuss resu… ▽ More

    Submitted 4 November, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  36. arXiv:2105.00957  [pdf, other

    cs.CV

    Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning

    Authors: Tsung-Wei Ke, Jyh-Jing Hwang, Stella X. Yu

    Abstract: Weakly supervised segmentation requires assigning a label to every pixel based on training instances with partial annotations such as image-level tags, object bounding boxes, labeled points and scribbles. This task is challenging, as coarse annotations (tags, boxes) lack precise pixel localization whereas sparse annotations (points, scribbles) lack broad region coverage. Existing methods tackle th… ▽ More

    Submitted 10 May, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: In ICLR 2021. Webpage & Code: https://twke18.github.io/projects/spml.html

  37. arXiv:2011.11730  [pdf, other

    cs.RO cs.CV

    RISE-SLAM: A Resource-aware Inverse Schmidt Estimator for SLAM

    Authors: Tong Ke, Kejian J. Wu, Stergios I. Roumeliotis

    Abstract: In this paper, we present the RISE-SLAM algorithm for performing visual-inertial simultaneous localization and mapping (SLAM), while improving estimation consistency. Specifically, in order to achieve real-time operation, existing approaches often assume previously-estimated states to be perfectly known, which leads to inconsistent estimates. Instead, based on the idea of the Schmidt-Kalman filter… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: IROS 2019

  38. arXiv:2011.09594  [pdf, other

    cs.CV cs.RO

    Deep Multi-view Depth Estimation with Predicted Uncertainty

    Authors: Tong Ke, Tien Do, Khiem Vuong, Kourosh Sartipi, Stergios I. Roumeliotis

    Abstract: In this paper, we address the problem of estimating dense depth from a sequence of images using deep neural networks. Specifically, we employ a dense-optical-flow network to compute correspondences and then triangulate the point cloud to obtain an initial depth map.Parts of the point cloud, however, may be less accurate than others due to lack of common observations or small parallax. To further i… ▽ More

    Submitted 27 March, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: IEEE International Conference on Robotics and Automation (ICRA 2021)

  39. arXiv:2010.08132  [pdf, other

    math.ST

    Power of Knockoff: The Impact of Ranking Algorithm, Augmented Design, and Symmetric Statistic

    Authors: Zheng Tracy Ke, Jun S. Liu, Yucong Ma

    Abstract: The knockoff filter is a recent false discovery rate (FDR) control method for high-dimensional linear models. We point out that knockoff has three key components: ranking algorithm, augmented design, and symmetric statistic, and each component admits multiple choices. By considering various combinations of the three components, we obtain a collection of variants of knockoff. All these variants gua… ▽ More

    Submitted 13 February, 2024; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 67 pages, 13 figures

    Journal ref: Journal of Machine Learning Research, 2024

  40. arXiv:2009.09177  [pdf, other

    stat.ME math.ST

    Optimal Estimation of the Number of Communities

    Authors: Jiashun Jin, Zheng Tracy Ke, Shengming Luo, Minzhe Wang

    Abstract: In network analysis, how to estimate the number of communities $K$ is a fundamental problem. We consider a broad setting where we allow severe degree heterogeneity and a wide range of sparsity levels, and propose Stepwise Goodness-of-Fit (StGoF) as a new approach. This is a stepwise algorithm, where for $m = 1, 2, \ldots$, we alternately use a community detection step and a goodness-of-fit (GoF) s… ▽ More

    Submitted 25 January, 2022; v1 submitted 19 September, 2020; originally announced September 2020.

    MSC Class: 62H12; 62H30; 91C20

  41. arXiv:2008.00092  [pdf, other

    cs.CV

    Deep Depth Estimation from Visual-Inertial SLAM

    Authors: Kourosh Sartipi, Tien Do, Tong Ke, Khiem Vuong, Stergios I. Roumeliotis

    Abstract: This paper addresses the problem of learning to complete a scene's depth from sparse depth points and images of indoor scenes. Specifically, we study the case in which the sparse depth is computed from a visual-inertial simultaneous localization and mapping (VI-SLAM) system. The resulting point cloud has low density, it is noisy, and has non-uniform spatial distribution, as compared to the input f… ▽ More

    Submitted 14 August, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: 9 pages

  42. arXiv:2007.07498  [pdf, other

    stat.ML cs.LG stat.ME

    Measurement error models: from nonparametric methods to deep neural networks

    Authors: Zhirui Hu, Zheng Tracy Ke, Jun S Liu

    Abstract: The success of deep learning has inspired recent interests in applying neural networks in statistical inference. In this paper, we investigate the use of deep neural networks for nonparametric regression with measurement errors. We propose an efficient neural network design for estimating measurement error models, in which we use a fully connected feed-forward neural network (FNN) to approximate t… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 37 pages, 8 figures

  43. arXiv:2006.00436  [pdf, other

    stat.ME

    Estimation of the number of spiked eigenvalues in a covariance matrix by bulk eigenvalue matching analysis

    Authors: Zheng Tracy Ke, Yucong Ma, Xihong Lin

    Abstract: The spiked covariance model has gained increasing popularity in high-dimensional data analysis. A fundamental problem is determination of the number of spiked eigenvalues, $K$. For estimation of $K$, most attention has focused on the use of $top$ eigenvalues of sample covariance matrix, and there is little investigation into proper ways of utilizing $bulk$ eigenvalues to estimate $K$. We propose a… ▽ More

    Submitted 5 January, 2021; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: 48 pages, 8 figures, 5 tables

  44. Stable quantum dots in an InSb two-dimensional electron gas

    Authors: Ivan Kulesh, Chung Ting Ke, Candice Thomas, Saurabh Karwal, Christian M. Moehle, Sara Metti, Ray Kallaher, Geoffrey C. Gardner, Michael J. Manfra, Srijit Goswami

    Abstract: Indium antimonide (InSb) two-dimensional electron gases (2DEGs) have a unique combination of material properties: high electron mobility, strong spin-orbit interaction, large Landé g-factor, and small effective mass. This makes them an attractive platform to explore a variety of mesoscopic phenomena ranging from spintronics to topological superconductivity. However, there exist limited studies of… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Includes supplementary information. Data files available at https://doi.org/10.4121/uuid:28a121af-1e08-429d-9d53-1cc53764e91e

    Journal ref: Phys. Rev. Applied 13, 041003 (2020)

  45. arXiv:1909.06503  [pdf, other

    stat.ME math.ST

    Community Detection for Hypergraph Networks via Regularized Tensor Power Iteration

    Authors: Zheng Tracy Ke, Feng Shi, Dong Xia

    Abstract: To date, social network analysis has been largely focused on pairwise interactions. The study of higher-order interactions, via a hypergraph network, brings in new insights. We study community detection in a hypergraph network. A popular approach is to project the hypergraph to a graph and then apply community detection methods for graph networks, but we show that this approach may cause unwanted… ▽ More

    Submitted 2 January, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: 53 pages, 5 figures

  46. 2$Φふぁい_{0}$-periodic magnetic interference in ballistic graphene Josephson junctions

    Authors: C. T. Ke, A. W. Draelos, A. Seredinski, M. T. Wei, H. Li, M. Hernandez-Rivera, K. Watanabe, T. Taniguchi, M. Yamamoto, S. Tarucha, Y. Bomze, I. V. Borzenets, F. Amet, G. Finkelstein

    Abstract: We investigate supercurrent interference patterns measured as a function of magnetic field in ballistic graphene Josephson junctions. At high doping, the expected $Φふぁい_{0}$-periodic "Fraunhofer" pattern is observed, indicating a uniform current distribution. Close to the Dirac point, we find anomalous interference patterns with an apparent 2$Φふぁい_{0}$ periodicity, similar to that predicted for topologi… ▽ More

    Submitted 19 June, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: Main text+ supplementary

    Journal ref: Phys. Rev. Research 1, 033084 (2019)

  47. arXiv:1906.00051  [pdf, other

    stat.ME stat.CO

    Diagonally-Dominant Principal Component Analysis

    Authors: Zheng Tracy Ke, Lingzhou Xue, Fan Yang

    Abstract: We consider the problem of decomposing a large covariance matrix into the sum of a low-rank matrix and a diagonally dominant matrix, and we call this problem the "Diagonally-Dominant Principal Component Analysis (DD-PCA)". DD-PCA is an effective tool for designing statistical methods for strongly correlated data. We showcase the use of DD-PCA in two statistical problems: covariance matrix estimati… ▽ More

    Submitted 31 May, 2019; originally announced June 2019.

  48. arXiv:1905.06485  [pdf, other

    econ.TH math.AP math.PR

    Parallel Search for Information

    Authors: T. Tony Ke, Wenpin Tang, J. Miguel Villas-Boas, Yuming Zhang

    Abstract: We consider the problem of a decision-maker searching for information on multiple alternatives when information is learned on all alternatives simultaneously. The decision-maker has a running cost of searching for information, and has to decide when to stop searching for information and choose one alternative. The expected payoff of each alternative evolves as a diffusion process when information… ▽ More

    Submitted 9 April, 2020; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: 33 pages, 3 figures

  49. arXiv:1904.11689  [pdf, other

    cond-mat.mes-hall cond-mat.supr-con

    Chiral Quasiparticle Tunneling Between Quantum Hall Edges in Proximity with a Superconductor

    Authors: M. T. Wei, A. W. Draelos, A. Seredinski, C. T. Ke, H. Li, Y. Mehta, K. Watanabe, T. Taniguchi, M. Yamamoto, S. Tarucha, G. Finkelstein, F. Amet, I. V. Borzenets

    Abstract: We study a two-terminal graphene Josephson junction with contacts shaped to form a narrow constriction, less than 100nm in length. The contacts are made from type II superconducting contacts and able to withstand magnetic fields high enough to reach the quantum Hall (QH) regime in graphene. In this regime, the device conductance is determined by edge states, plus the contribution from the constric… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: 4 pages, 3 figures

    Journal ref: Phys. Rev. B 100, 121403 (2019)

  50. arXiv:1904.09532  [pdf, other

    math.ST stat.ME

    Optimal Adaptivity of Signed-Polygon Statistics for Network Testing

    Authors: Jiashun Jin, Zheng Tracy Ke, Shengming Luo

    Abstract: Given a symmetric social network, we are interested in testing whether it has only one community or multiple communities. The desired tests should (a) accommodate severe degree heterogeneity, (b) accommodate mixed-memberships, (c) have a tractable null distribution, and (d) adapt automatically to different levels of sparsity, and achieve the optimal phase diagram. How to find such a test is a chal… ▽ More

    Submitted 21 May, 2019; v1 submitted 20 April, 2019; originally announced April 2019.

    MSC Class: 62H15; 62H20; 62C20