Search | arXiv e-print repository

Showing 1–50 of 64 results for author: Principe, J C

Searching in archive cs. Search in all archives.
  1. arXiv:2404.17951  [pdf, other]

    cs.LG cs.IT stat.ML

    Cauchy-Schwarz Divergence Information Bottleneck for Regression

    Authors: Shujian Yu, Xi Yu, Sigurd Løkse, Robert Jenssen, Jose C. Principe

    Abstract: The information bottleneck (IB) approach is popular to improve the generalization, robustness and explainability of deep neural networks. Essentially, it aims to find a minimum sufficient representation $\mathbf{t}$ by striking a trade-off between a compression term $I(\mathbf{x};\mathbf{t})$ and a prediction term $I(y;\mathbf{t})$, where $I(\cdot;\cdot)$ refers to the mutual information (MI). MI…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: accepted by ICLR-24, project page: https://github.com/SJYuCNEL/Cauchy-Schwarz-Information-Bottleneck

  2. arXiv:2401.11313  [pdf, other]

    cs.CV cs.LG eess.IV

    Weakly-Supervised Semantic Segmentation of Circular-Scan, Synthetic-Aperture-Sonar Imagery

    Authors: Isaac J. Sledge, Dominic M. Byrne, Jonathan L. King, Steven H. Ostertag, Denton L. Woods, James L. Prater, Jermaine L. Kennedy, Timothy M. Marston, Jose C. Principe

    Abstract: We propose a weakly-supervised framework for the semantic segmentation of circular-scan synthetic-aperture-sonar (CSAS) imagery. The first part of our framework is trained in a supervised manner, on image-level labels, to uncover a set of semi-sparse, spatially-discriminative regions in each image. The classification uncertainty of each region is then evaluated. Those areas with the lowest uncerta…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Submitted to the IEEE Journal of Oceanic Engineering

  3. arXiv:2312.12318  [pdf, other]

    cs.LG eess.SP

    An Alternate View on Optimal Filtering in an RKHS

    Authors: Benjamin Colburn, Jose C. Principe, Luis G. Sanchez Giraldo

    Abstract: Kernel Adaptive Filtering (KAF) are mathematically principled methods which search for a function in a Reproducing Kernel Hilbert Space. While they work well for tasks such as time series prediction and system identification they are plagued by a linear relationship between number of training samples and model size, hampering their use on the very large data sets common in today's data saturated w…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures

  4. arXiv:2305.20074  [pdf, other]

    cs.CV cs.AI cs.IT cs.LG

    Feature Learning in Image Hierarchies using Functional Maximal Correlation

    Authors: Bo Hu, Yuheng Bu, José C. Príncipe

    Abstract: This paper proposes the Hierarchical Functional Maximal Correlation Algorithm (HFMCA), a hierarchical methodology that characterizes dependencies across two hierarchical levels in multiview systems. By framing view similarities as dependencies and ensuring contrastivity by imposing orthonormality, HFMCA achieves faster convergence and increased stability in self-supervised learning. HFMCA defines…

    Submitted 31 May, 2023; originally announced May 2023.

  5. arXiv:2301.08970  [pdf, other]

    cs.LG cs.IT stat.ML

    The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making

    Authors: Shujian Yu, Hongming Li, Sigurd Løkse, Robert Jenssen, José C. Príncipe

    Abstract: The Cauchy-Schwarz (CS) divergence was developed by Príncipe et al. in 2000. In this paper, we extend the classic CS divergence to quantify the closeness between two conditional distributions and show that the developed conditional CS divergence can be simply estimated by a kernel density estimator from given samples. We illustrate the advantages (e.g., rigorous faithfulness guarantee, lower compu…

    Submitted 26 April, 2024; v1 submitted 21 January, 2023; originally announced January 2023.

    Comments: 27 pages, 10 figures, under 2nd round review

  6. arXiv:2212.11083  [pdf, other]

    cs.LG cs.AI cs.IT

    Adapting the Exploration Rate for Value-of-Information-Based Reinforcement Learning

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we consider the problem of adjusting the exploration rate when using value-of-information-based exploration. We do this by converting the value-of-information optimization into a problem of finding equilibria of a flow for a changing exploration rate. We then develop an efficient path-following scheme for converging to these equilibria and hence uncovering optimal action-selection p…

    Submitted 30 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Submitted to the IEEE Transactions on Information Theory

  7. arXiv:2212.04631  [pdf, other]

    cs.LG cs.AI cs.IT

    The Normalized Cross Density Functional: A Framework to Quantify Statistical Dependence for Random Processes

    Authors: Bo Hu, Jose C. Principe

    Abstract: This paper presents a novel approach to measuring statistical dependence between two random processes (r.p.) using a positive-definite function called the Normalized Cross Density (NCD). NCD is derived directly from the probability density functions of two r.p. and constructs a data-dependent Hilbert space, the Normalized Cross-Density Hilbert Space (NCD-HS). By Mercer's Theorem, the NCD norm can…

    Submitted 20 February, 2024; v1 submitted 8 December, 2022; originally announced December 2022.

  8. arXiv:2211.02005  [pdf, other]

    eess.SP cs.IT

    Robust Dependence Measure using RKHS based Uncertainty Moments and Optimal Transport

    Authors: Rishabh Singh, Jose C. Principe

    Abstract: Reliable measurement of dependence between variables is essential in many applications of statistics and machine learning. Current approaches for dependence estimation, especially density-based approaches, lack in precision, robustness and/or interpretability (in terms of the type of dependence being estimated). We propose a two-step approach for dependence quantification between random variables:…

    Submitted 3 November, 2022; originally announced November 2022.

  9. arXiv:2211.01999  [pdf, other]

    cs.CV cs.IT cs.LG eess.IV

    Quantifying Model Uncertainty for Semantic Segmentation using Operators in the RKHS

    Authors: Rishabh Singh, Jose C. Principe

    Abstract: Deep learning models for semantic segmentation are prone to poor performance in real-world applications due to the highly challenging nature of the task. Model uncertainty quantification (UQ) is one way to address this issue of lack of model trustworthiness by enabling the practitioner to know how much to trust a segmentation output. Current UQ methods in this application domain are mainly restric…

    Submitted 3 November, 2022; originally announced November 2022.

  10. arXiv:2206.00118  [pdf, other]

    cs.LG cs.IT

    Principle of Relevant Information for Graph Sparsification

    Authors: Shujian Yu, Francesco Alesiani, Wenzhe Yin, Robert Jenssen, Jose C. Principe

    Abstract: Graph sparsification aims to reduce the number of edges of a graph while maintaining its structural properties. In this paper, we propose the first general and effective information-theoretic formulation of graph sparsification, by taking inspiration from the Principle of Relevant Information (PRI). To this end, we extend the PRI from a standard scalar random variable setting to structured data (i…

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: accepted by UAI-22

  11. arXiv:2202.02951  [pdf, other]

    eess.IV cs.CV

    Deep Deterministic Independent Component Analysis for Hyperspectral Unmixing

    Authors: Hongming Li, Shujian Yu, Jose C. Principe

    Abstract: We develop a new neural network based independent component analysis (ICA) method by directly minimizing the dependence amongst all extracted components. Using the matrix-based Rényi's $\alpha$-order entropy functional, our network can be directly optimized by stochastic gradient descent (SGD), without any variational approximation or adversarial training. As a solid application, we evaluate our ICA…

    Submitted 14 February, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Accepted by ICASSP 2022

  12. arXiv:2110.05794  [pdf, other]

    cs.LG cs.IT stat.ML

    Information Theoretic Structured Generative Modeling

    Authors: Bo Hu, Shujian Yu, Jose C. Principe

    Abstract: Rényi's information provides a theoretical foundation for tractable and data-efficient non-parametric density estimation, based on pair-wise evaluations in a reproducing kernel Hilbert space (RKHS). This paper extends this framework to parametric probabilistic modeling, motivated by the fact that Rényi's information can be estimated in closed-form for Gaussian mixtures. Based on this special conne…

    Submitted 7 March, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  13. arXiv:2109.11737  [pdf, other]

    cs.IT cs.LG

    Estimating Rényi's $\alpha$-Cross-Entropies in a Matrix-Based Way

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Conventional information-theoretic quantities assume access to probability distributions. Estimating such distributions is not trivial. Here, we consider function-based formulations of cross entropy that sidesteps this a priori estimation requirement. We propose three measures of Rényi's $\alpha$-cross-entropies in the setting of reproducing-kernel Hilbert spaces. Each measure has its appeals. We prove…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Submitted to the IEEE Transactions on Information Theory

  14. arXiv:2109.10888  [pdf, other]

    cs.LG cs.IT

    A Physics inspired Functional Operator for Model Uncertainty Quantification in the RKHS

    Authors: Rishabh Singh, Jose C. Principe

    Abstract: Accurate uncertainty quantification of model predictions is a crucial problem in machine learning. Existing Bayesian methods, being highly iterative, are expensive to implement and often fail to accurately capture a model's true posterior because of their tendency to select only central moments. We propose a fast single-shot uncertainty quantification framework where, instead of working with the c…

    Submitted 29 May, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: Abstract and title modified for more clarity. Updated version of arXiv:2103.01374

  15. arXiv:2107.10504  [pdf, other]

    cs.CV cs.LG

    External-Memory Networks for Low-Shot Learning of Targets in Forward-Looking-Sonar Imagery

    Authors: Isaac J. Sledge, Christopher D. Toole, Joseph A. Maestri, Jose C. Principe

    Abstract: We propose a memory-based framework for real-time, data-efficient target analysis in forward-looking-sonar (FLS) imagery. Our framework relies on first removing non-discriminative details from the imagery using a small-scale DenseNet-inspired network. Doing so simplifies ensuing analyses and permits generalizing from few labeled examples. We then cascade the filtered imagery into a novel NeuralRAM…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE Journal of Oceanic Engineering

  16. An Information-Theoretic Approach for Automatically Determining the Number of States when Aggregating Markov Chains

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: A fundamental problem when aggregating Markov chains is the specification of the number of state groups. Too few state groups may fail to sufficiently capture the pertinent dynamics of the original, high-order Markov chain. Too many state groups may lead to a non-parsimonious, reduced-order Markov chain whose complexity rivals that of the original. In this paper, we show that an augmented value-of…

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE ICASSP. arXiv admin note: substantial text overlap with arXiv:1903.09266

  17. arXiv:2104.09015  [pdf, other]

    cs.LG cs.AI

    Labels, Information, and Computation: Efficient Learning Using Sufficient Labels

    Authors: Shiyu Duan, Spencer Chang, Jose C. Principe

    Abstract: In supervised learning, obtaining a large set of fully-labeled training data is expensive. We show that we do not always need full label information on every single training example to train a competent classifier. Specifically, inspired by the principle of sufficiency in statistics, we present a statistic (a summary) of the fully-labeled training set that captures almost all the relevant informat…

    Submitted 17 January, 2023; v1 submitted 18 April, 2021; originally announced April 2021.

  18. arXiv:2103.01374  [pdf, other]

    cs.LG cs.IT

    A Kernel Framework to Quantify a Model's Local Predictive Uncertainty under Data Distributional Shifts

    Authors: Rishabh Singh, Jose C. Principe

    Abstract: Traditional Bayesian approaches for model uncertainty quantification rely on notoriously difficult processes of marginalization over each network parameter to estimate its probability density function (PDF). Our hypothesis is that internal layer outputs of a trained neural network contain all of the information related to both its mapping function (quantified by its weights) as well as the input d…

    Submitted 1 March, 2021; originally announced March 2021.

  19. arXiv:2102.12017  [pdf, other]

    cs.LG cs.AI cs.RO

    Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning

    Authors: Isaac J. Sledge, Darshan W. Bryner, Jose C. Principe

    Abstract: Reinforcement learning in large-scale environments is challenging due to the many possible actions that can be taken in specific situations. We have previously developed a means of constraining, and hence speeding up, the search process through the use of motion primitives; motion primitives are sequences of pre-specified actions taken across a state series. As a byproduct of this work, we have fo…

    Submitted 26 November, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: IEEE Transactions on Emerging Topics in Computational Intelligence

  20. arXiv:2102.00533  [pdf, other]

    cs.LG cs.IT

    Deep Deterministic Information Bottleneck with Matrix-based Entropy Functional

    Authors: Xi Yu, Shujian Yu, Jose C. Principe

    Abstract: We introduce the matrix-based Renyi's $\alpha$-order entropy functional to parameterize Tishby et al. information bottleneck (IB) principle with a neural network. We term our methodology Deep Deterministic Information Bottleneck (DIB), as it avoids variational inference and distribution assumption. We show that deep neural networks trained with DIB outperform the variational objective counterpart and t…

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: Accepted at ICASSP-21. Code available at https://github.com/yuxi120407/DIB. Extended version of the supplementary material in "Measuring the Dependence with Matrix-based Entropy Functional", AAAI-21, arXiv:2101.10160

  21. arXiv:2101.10160  [pdf, other]

    cs.LG cs.IT stat.ML

    Measuring Dependence with Matrix-based Entropy Functional

    Authors: Shujian Yu, Francesco Alesiani, Xi Yu, Robert Jenssen, Jose C. Principe

    Abstract: Measuring the dependence of data plays a central role in statistics and machine learning. In this work, we summarize and generalize the main idea of existing information-theoretic dependence measures into a higher-level perspective by the Shearer's inequality. Based on our generalization, we then propose two measures, namely the matrix-based normalized total correlation ($T_\alpha^*$) and the matrix-ba…

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted at AAAI-21. An interpretable and differentiable dependence (or independence) measure that can be used to 1) train deep network under covariate shift and non-Gaussian noise; 2) implement a deep deterministic information bottleneck; and 3) understand the dynamics of learning of CNN. Code available at https://bit.ly/AAAI-dependence

  22. Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Deep-predictive-coding networks (DPCNs) are hierarchical, generative models. They rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial element of DPCNs is a forward-backward inference procedure to uncover sparse, invariant features. However, this inference is a major computational bottleneck. It se…

    Submitted 23 September, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: Submitted to the IEEE Transactions on Neural Networks and Learning Systems

  23. Target Detection and Segmentation in Circular-Scan Synthetic-Aperture-Sonar Images using Semi-Supervised Convolutional Encoder-Decoders

    Authors: Isaac J. Sledge, Matthew S. Emigh, Jonathan L. King, Denton L. Woods, J. Tory Cobb, Jose C. Principe

    Abstract: We propose a framework for saliency-based, multi-target detection and segmentation of circular-scan, synthetic-aperture-sonar (CSAS) imagery. Our framework relies on a multi-branch, convolutional encoder-decoder network (MB-CEDN). The encoder portion of the MB-CEDN extracts visual contrast features from CSAS images. These features are fed into dual decoders that perform pixel-level segmentation to…

    Submitted 17 February, 2022; v1 submitted 10 January, 2021; originally announced January 2021.

    Comments: Submitted to IEEE Journal of Oceanic Engineering

  24. arXiv:2101.03419  [pdf, other]

    cs.LG cs.NE stat.ML

    Training Deep Architectures Without End-to-End Backpropagation: A Survey on the Provably Optimal Methods

    Authors: Shiyu Duan, Jose C. Principe

    Abstract: This tutorial paper surveys provably optimal alternatives to end-to-end backpropagation (E2EBP) -- the de facto standard for training deep architectures. Modular training refers to strictly local training without both the forward and the backward pass, i.e., dividing a deep architecture into several nonoverlapping modules and training them separately without any end-to-end operation. Between the f…

    Submitted 9 August, 2022; v1 submitted 9 January, 2021; originally announced January 2021.

    Comments: Accepted by IEEE Computational Intelligence Magazine

  25. arXiv:2010.09103  [pdf]

    cs.LG cs.CV

    Unsupervised Foveal Vision Neural Networks with Top-Down Attention

    Authors: Ryan Burt, Nina N. Thigpen, Andreas Keil, Jose C. Principe

    Abstract: Deep learning architectures are an extremely powerful tool for recognizing and classifying images. However, they require supervised learning and normally work on vectors the size of image pixels and produce the best results when trained on millions of object images. To help mitigate these issues, we propose the fusion of bottom-up saliency and top-down attention employing only unsupervised learnin…

    Submitted 18 October, 2020; originally announced October 2020.

    Comments: 29 pages, 15 figures

  26. arXiv:2007.06503  [pdf, other]

    cs.LG stat.ML

    PRI-VAE: Principle-of-Relevant-Information Variational Autoencoders

    Authors: Yanjun Li, Shujian Yu, Jose C. Principe, Xiaolin Li, Dapeng Wu

    Abstract: Although substantial efforts have been made to learn disentangled representations under the variational autoencoder (VAE) framework, the fundamental properties to the dynamics of learning of most VAE models still remain unknown and under-investigated. In this work, we first propose a novel learning objective, termed the principle-of-relevant-information variational autoencoder (PRI-VAE), to learn…

    Submitted 13 July, 2020; originally announced July 2020.

  27. arXiv:2005.02196  [pdf, other]

    cs.LG cs.IT stat.ML

    Measuring the Discrepancy between Conditional Distributions: Methods, Properties and Applications

    Authors: Shujian Yu, Ammar Shaker, Francesco Alesiani, Jose C. Principe

    Abstract: We propose a simple yet powerful test statistic to quantify the discrepancy between two conditional distributions. The new statistic avoids the explicit estimation of the underlying distributions in high-dimensional space and it operates on the cone of symmetric positive semidefinite (SPS) matrix using the Bregman matrix divergence. Moreover, it inherits the merits of the correntropy function to ex…

    Submitted 28 December, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: manuscript accepted at IJCAI 20; added additional notes on computational complexity and auto-differentiable property; code is available at https://github.com/SJYuCNEL/Bregman-Correntropy-Conditional-Divergence

  28. arXiv:2001.11495  [pdf, other]

    cs.LG eess.SP stat.ML

    Towards a Kernel based Uncertainty Decomposition Framework for Data and Models

    Authors: Rishabh Singh, Jose C. Principe

    Abstract: This paper introduces a new framework for quantifying predictive uncertainty for both data and models that relies on projecting the data into a Gaussian reproducing kernel Hilbert space (RKHS) and transforming the data probability density function (PDF) in a way that quantifies the flow of its gradient as a topological potential field quantified at all points in the sample space. This enables the…

    Submitted 1 December, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

    Journal ref: Neural Computation, 33(5):1164-1198, 2021

  29. arXiv:2001.00265  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Fast Estimation of Information Theoretic Learning Descriptors using Explicit Inner Product Spaces

    Authors: Kan Li, Jose C. Principe

    Abstract: Kernel methods form a theoretically-grounded, powerful and versatile framework to solve nonlinear problems in signal processing and machine learning. The standard approach relies on the kernel trick to perform pairwise evaluations of a kernel function, leading to scalability issues for large datasets due to its linear and superlinear growth with respect to the training data. Recently, we pr…

    Submitted 1 January, 2020; originally announced January 2020.

    Comments: 10 pages, 3 figures, 2 tables. arXiv admin note: text overlap with arXiv:1912.04530

  30. arXiv:1912.04530  [pdf, ps, other]

    cs.LG math.NA stat.ML

    No-Trick (Treat) Kernel Adaptive Filtering using Deterministic Features

    Authors: Kan Li, Jose C. Principe

    Abstract: Kernel methods form a powerful, versatile, and theoretically-grounded unifying framework to solve nonlinear problems in signal processing and machine learning. The standard approach relies on the kernel trick to perform pairwise evaluations of a kernel function, which leads to scalability issues for large datasets due to its linear and superlinear growth with respect to the training data. A popula…

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: 12 pages, 7 figures

  31. arXiv:1911.10606  [pdf, ps, other]

    eess.SP cs.IT cs.LG

    Functional Bayesian Filter

    Authors: Kan Li, Jose C. Principe

    Abstract: We present a general nonlinear Bayesian filter for high-dimensional state estimation using the theory of reproducing kernel Hilbert space (RKHS). Applying kernel method and the representer theorem to perform linear quadratic estimation in a functional space, we derive a Bayesian recursive state estimator for a general nonlinear dynamical system in the original input space. Unlike existing nonlinea…

    Submitted 24 November, 2019; originally announced November 2019.

    Comments: 15 pages, 8 figures

  32. arXiv:1911.03267  [pdf, other]

    eess.IV cs.CV cs.LG eess.SP

    Algorithmic Design and Implementation of Unobtrusive Multistatic Serial LiDAR Image

    Authors: Chi Ding, Zheng Cao, Matthew S. Emigh, Jose C. Principe, Bing Ouyang, Anni Vuorenkoski, Fraser Dalgleish, Brian Ramos, Yanjun Li

    Abstract: To fully understand interactions between marine hydrokinetic (MHK) equipment and marine animals, a fast and effective monitoring system is required to capture relevant information whenever underwater animals appear. A new automated underwater imaging system composed of LiDAR (Light Detection and Ranging) imaging hardware and a scene understanding software module named Unobtrusive Multistatic Seria…

    Submitted 8 November, 2019; originally announced November 2019.

  33. arXiv:1907.06022  [pdf, other]

    cs.LG eess.IV stat.ML

    Multiscale Principle of Relevant Information for Hyperspectral Image Classification

    Authors: Yantao Wei, Shujian Yu, Luis Sanchez Giraldo, Jose C. Principe

    Abstract: This paper proposes a novel architecture, termed multiscale principle of relevant information (MPRI), to learn discriminative spectral-spatial features for hyperspectral image (HSI) classification. MPRI inherits the merits of the principle of relevant information (PRI) to effectively extract multiscale information embedded in the given data, and also takes advantage of the multilayer structure to…

    Submitted 4 June, 2021; v1 submitted 13 July, 2019; originally announced July 2019.

    Comments: Manuscript to be published in Machine Learning Journal (Springer). Code available at https://github.com/SJYuCNEL/Principle-of-Relevant-Information-and-HSI-Classification

  34. arXiv:1904.06617  [pdf, ps, other]

    eess.SY cs.IT

    Minimum Error Entropy Kalman Filter

    Authors: Badong Chen, Lujuan Dang, Yuantao Gu, Nanning Zheng, Jose C. Príncipe

    Abstract: To date most linear and nonlinear Kalman filters (KFs) have been developed under the Gaussian assumption and the well-known minimum mean square error (MMSE) criterion. In order to improve the robustness with respect to impulsive (or heavy-tailed) non-Gaussian noises, the maximum correntropy criterion (MCC) has recently been used to replace the MMSE criterion in developing several robust Kalman-typ…

    Submitted 17 April, 2019; v1 submitted 13 April, 2019; originally announced April 2019.

    Comments: 12 pages, 4 figures

  35. Maximum Correntropy Criterion with Variable Center

    Authors: Badong Chen, Xin Wang, Yingsong Li, Jose C. Principe

    Abstract: Correntropy is a local similarity measure defined in kernel space and the maximum correntropy criterion (MCC) has been successfully applied in many areas of signal processing and machine learning in recent years. The kernel function in correntropy is usually restricted to the Gaussian function with center located at zero. However, zero-mean Gaussian function may not be a good choice for many pract…

    Submitted 13 April, 2019; originally announced April 2019.

    Comments: 5 pages, 1 figure

  36. Reduction of Markov Chains using a Value-of-Information-Based Approach

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we propose an approach to obtain reduced-order models of Markov chains. Our approach is composed of two information-theoretic processes. The first is a means of comparing pairs of stationary chains on different state spaces, which is done via the negative Kullback-Leibler divergence defined on a model joint space. Model reduction is achieved by solving a value-of-information criteri…

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: Submitted to Entropy

  37. arXiv:1901.07484  [pdf, other]

    cs.LG stat.ML

    An Exact Reformulation of Feature-Vector-based Radial-Basis-Function Networks for Graph-based Observations

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Radial-basis-function networks are traditionally defined for sets of vector-based observations. In this short paper, we reformulate such networks so that they can be applied to adjacency-matrix representations of weighted, directed graphs that represent the relationships between object pairs. We re-state the sum-of-squares objective function so that it is purely dependent on entries from the adjac…

    Submitted 1 August, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: Submitted to the IEEE Transactions on Neural Networks and Learning Systems

  38. arXiv:1901.01140  [pdf]

    eess.SP cs.NE

    Theory and Algorithms for Pulse Signal Processing

    Authors: Gabriel Nallathambi, Jose C. Principe

    Abstract: The integrate and fire converter transforms an analog signal into train of biphasic pulses. The pulse train has information encoded in the timing and polarity of pulses. While it has been shown that any finite bandwidth analog signal can be reconstructed from these pulse trains with an error as small as desired, there is a need for fundamental signal processing techniques to operate directly on pu…

    Submitted 31 December, 2018; originally announced January 2019.

  39. arXiv:1811.11971  [pdf, other]

    cs.CV cs.IT stat.ML

    Simple stopping criteria for information theoretic feature selection

    Authors: Shujian Yu, Jose C. Principe

    Abstract: Feature selection aims to select the smallest feature subset that yields the minimum generalization error. In the rich literature in feature selection, information theory-based approaches seek a subset of features such that the mutual information between the selected features and the class labels is maximized. Despite the simplicity of this objective, there still remain several open problems in op…

    Submitted 29 January, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Paper published in the journal of Entropy

    Journal ref: Entropy 2019, 21(1), 99

  40. arXiv:1808.07912  [pdf, other]

    cs.IT cs.LG stat.ML

    Multivariate Extension of Matrix-based Renyi's α-order Entropy Functional

    Authors: Shujian Yu, Luis Gonzalo Sanchez Giraldo, Robert Jenssen, Jose C. Principe

    Abstract: The matrix-based Renyi's α-order entropy functional was recently introduced using the normalized eigenspectrum of a Hermitian matrix of the projected data in a reproducing kernel Hilbert space (RKHS). However, the current theory in the matrix-based Renyi's α-order entropy functional only defines the entropy of a single variable or mutual information between two random variables. In information the…

    Submitted 31 July, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

    Comments: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence. Matlab code is available from Google drive at https://drive.google.com/open?id=1SlxzEOX8RbnLwCgRyqGwMOL7vuT90Gje or Baidu Cloud at https://pan.baidu.com/s/1xupfXCmIV20gXPr0TicGkg (access code: d1sa)

  41. arXiv:1806.10131  [pdf, other]

    cs.LG cs.AI stat.ML

    Request-and-Reverify: Hierarchical Hypothesis Testing for Concept Drift Detection with Expensive Labels

    Authors: Shujian Yu, Xiaoyang Wang, Jose C. Principe

    Abstract: One important assumption underlying common classification models is the stationarity of the data. However, in real-world streaming applications, the data concept indicated by the joint distribution of feature and label is not stationary but drifting over time. Concept drift detection aims to detect such drifts and adapt the model so as to mitigate any deterioration in the model's predictive perfor…

    Submitted 28 June, 2018; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at IJCAI 2018

    Report number: ITD-18-58133N

    Journal ref: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (2018) 3033-3039

  42. arXiv:1804.06537  [pdf, other]

    cs.LG cs.IT stat.ML

    Understanding Convolutional Neural Networks with Information Theory: An Initial Exploration

    Authors: Shujian Yu, Kristoffer Wickstrøm, Robert Jenssen, Jose C. Principe

    Abstract: The matrix-based Renyi's α-entropy functional and its multivariate extension were recently developed in terms of the normalized eigenspectrum of a Hermitian matrix of the projected data in a reproducing kernel Hilbert space (RKHS). However, the utility and possible applications of these new estimators are rather new and mostly unknown to practitioners. In this paper, we first show that our estimat…

    Submitted 23 January, 2020; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: Paper accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS). Code for 1) estimating information quantities, 2) plotting the information plane, and 3) selecting convolutional filters, is available from (MATLAB) https://drive.google.com/drive/folders/1DJYshWIiijKWrFKrztW9FgTzGfMV3D8M?usp=sharing or (Python) https://github.com/Wickstrom/InfExperiment

  43. arXiv:1804.00057  [pdf, other]

    cs.LG cs.IT stat.ML

    Understanding Autoencoders with Information Theoretic Concepts

    Authors: Shujian Yu, Jose C. Principe

    Abstract: Despite their great success in practical applications, there is still a lack of theoretical and systematic methods to analyze deep neural networks. In this paper, we illustrate an advanced information theoretic methodology to understand the dynamics of learning and the design of autoencoders, a special type of deep learning architectures that resembles a communication channel. By generalizing the…

    Submitted 7 May, 2019; v1 submitted 30 March, 2018; originally announced April 2018.

    Comments: Paper accepted by Neural Networks. Code for estimating information quantities and drawing the information plane is available from https://drive.google.com/drive/folders/1e5sIywZfmWp4Dn0WEesb6fqQRM0DIGxZ?usp=sharing

  44. Guided Policy Exploration for Markov Decision Processes using an Uncertainty-Based Value-of-Information Criterion

    Authors: Isaac J. Sledge, Matthew S. Emigh, Jose C. Principe

    Abstract: Reinforcement learning in environments with many action-state pairs is challenging. At issue is the number of episodes needed to thoroughly search the policy space. Most conventional heuristics address this search problem in a stochastic manner. This can leave large portions of the policy space unvisited during the early training stages. In this paper, we propose an uncertainty-based, information-…

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: IEEE Transactions on Neural Networks and Learning Systems

  45. arXiv:1802.00174  [pdf, other]

    cs.LG

    Augmented Space Linear Model

    Authors: Zhengda Qin, Badong Chen, Nanning Zheng, Jose C. Principe

    Abstract: The linear model uses the space defined by the input to project the target or desired signal and find the optimal set of model parameters. When the problem is nonlinear, the adaption requires nonlinear models for good performance, but it becomes slower and more cumbersome. In this paper, we propose a linear model called Augmented Space Linear Model (ASLM), which uses the full joint space of input…

    Submitted 2 February, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: 5 pages and 1 figure

  46. arXiv:1710.10381  [pdf, other]

    cs.AI cs.LG stat.ML

    Partitioning Relational Matrices of Similarities or Dissimilarities using the Value of Information

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we provide an approach to clustering relational matrices whose entries correspond to either similarities or dissimilarities between objects. Our approach is based on the value of information, a parameterized, information-theoretic criterion that measures the change in costs associated with changes in information. Optimizing the value of information yields a deterministic annealing s…

    Submitted 27 October, 2017; originally announced October 2017.

    Comments: Submitted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

  47. arXiv:1710.04089  [pdf, ps, other]

    stat.ML cs.LG

    Quantized Minimum Error Entropy Criterion

    Authors: Badong Chen, Lei Xing, Nanning Zheng, Jose C. Príncipe

    Abstract: Compared with traditional learning criteria, such as mean square error (MSE), the minimum error entropy (MEE) criterion is superior in nonlinear and non-Gaussian signal processing and machine learning. The argument of the logarithm in Renyi's entropy estimator, called information potential (IP), is a popular MEE cost in information theoretic learning (ITL). The computational complexity of IP is ho…

    Submitted 12 October, 2017; v1 submitted 11 October, 2017; originally announced October 2017.

  48. arXiv:1710.02869  [pdf, other]

    cs.AI cs.LG stat.ML

    An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we propose an information-theoretic exploration strategy for stochastic, discrete multi-armed bandits that achieves optimal regret. Our strategy is based on the value of information criterion. This criterion measures the trade-off between policy information and obtainable rewards. High amounts of policy information are associated with exploration-dominant searches of the space and y…

    Submitted 3 March, 2018; v1 submitted 8 October, 2017; originally announced October 2017.

    Comments: Entropy

  49. arXiv:1709.03541  [pdf, other]

    astro-ph.IM cs.IT

    Robust period estimation using mutual information for multi-band light curves in the synoptic survey era

    Authors: Pablo Huijse, Pablo A. Estevez, Francisco Forster, Scott F. Daniel, Andrew J. Connolly, Pavlos Protopapas, Rodrigo Carrasco, Jose C. Principe

    Abstract: The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a part…

    Submitted 11 September, 2017; originally announced September 2017.

    Comments: Accepted for publication in ApJ Supplement Series: Special Issue on Solar/Stellar Astronomy Big Data

  50. arXiv:1708.01541  [pdf, ps, other]

    cs.CV

    Associations among Image Assessments as Cost Functions in Linear Decomposition: MSE, SSIM, and Correlation Coefficient

    Authors: Jianji Wang, Nanning Zheng, Badong Chen, Jose C. Principe

    Abstract: The traditional methods of image assessment, such as mean squared error (MSE), signal-to-noise ratio (SNR), and Peak signal-to-noise ratio (PSNR), are all based on the absolute error of images. Pearson's inner-product correlation coefficient (PCC) is also usually used to measure the similarity between images. Structural similarity (SSIM) index is another important measurement which has been shown…

    Submitted 4 August, 2017; originally announced August 2017.

    Comments: 11 pages, 0 figures