-
Adaptive warped kernel estimation for nonparametric regression with circular responses
Authors:
Tien Dat Nguyen,
Thanh Mai Pham Ngoc,
Vincent Rivoirard
Abstract:
In this paper, we deal with nonparametric regression for circular data, meaning that observations are represented by points lying on the unit circle. We propose a kernel estimation procedure with data-driven selection of the bandwidth parameter. For this purpose, we use a warping strategy combined with a Goldenshluger-Lepski type estimator. To study optimality of our methodology, we consider the minimax setting and prove, by establishing upper and lower bounds, that our procedure is nearly optimal on anisotropic Hölder classes of functions for pointwise estimation. The obtained rates also reveal the specific nature of regression for circular responses. Finally, a numerical study is conducted, illustrating the good performance of our approach.
Submitted 30 June, 2023; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Semiparametric inference for mixtures of circular data
Authors:
Claire Lacour,
Thanh Mai Pham Ngoc
Abstract:
We consider $X_1, \dots, X_n$, a sample of data on the circle $\mathbb{S}^1$, whose distribution is a two-component mixture. Denoting by $R$ and $Q$ two rotations on $\mathbb{S}^1$, the density of the $X_i$'s is assumed to be $g(x) = p f(R^{-1}x) + (1-p) f(Q^{-1}x)$, where $p \in (0, 1)$ and $f$ is an unknown density on the circle. In this paper we estimate both the parametric part $\theta = (p, R, Q)$ and the nonparametric part $f$. The specific problems of identifiability on the circle are studied. A consistent estimator of $\theta$ is introduced and its asymptotic normality is proved. We propose a Fourier-based estimator of $f$ with a penalized criterion to choose the resolution level. We show that our adaptive estimator is optimal from the oracle and minimax points of view when the density belongs to a Sobolev ball. Our method is illustrated by numerical simulations.
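As an illustration of the mixture model in this abstract, the following minimal sketch evaluates and samples from a two-component rotated mixture on the circle. It is not the authors' estimator. It assumes that points on the circle are represented by angles in radians, that the rotations R and Q are given by hypothetical angles `alpha` and `beta`, and that a von Mises density with concentration `kappa` stands in for the unknown density f.

```python
import numpy as np

def von_mises_pdf(x, kappa):
    """Von Mises density centred at 0; an assumed stand-in for the unknown f."""
    return np.exp(kappa * np.cos(x)) / (2.0 * np.pi * np.i0(kappa))

def mixture_pdf(x, p, alpha, beta, kappa):
    """g(x) = p f(x - alpha) + (1 - p) f(x - beta), with rotations written as angle shifts."""
    first = p * von_mises_pdf(x - alpha, kappa)
    second = (1.0 - p) * von_mises_pdf(x - beta, kappa)
    return first + second

def sample_mixture(n, p, alpha, beta, kappa, rng):
    """Draw n angles from the mixture, reduced modulo 2*pi."""
    base = rng.vonmises(0.0, kappa, size=n)      # draws from f
    from_first = rng.random(n) < p               # component labels
    shift = np.where(from_first, alpha, beta)    # rotate by R or by Q
    return np.mod(base + shift, 2.0 * np.pi)
```

For example, `sample_mixture(500, 0.3, 0.5, 2.0, 4.0, np.random.default_rng(0))` returns 500 angles whose histogram can be compared with `mixture_pdf` evaluated on a grid.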
Submitted 31 May, 2022; v1 submitted 12 March, 2021;
originally announced March 2021.
-
Statistical deconvolution of the free Fokker-Planck equation at fixed time
Authors:
Mylène Maïda,
Tien Dat Nguyen,
Thanh Mai Pham Ngoc,
Vincent Rivoirard,
Viet Chi Tran
Abstract:
We are interested in reconstructing the initial condition of a non-linear partial differential equation (PDE), namely the Fokker-Planck equation, from the observation of a Dyson Brownian motion at a given time $t>0$. The Fokker-Planck equation describes the evolution of electrostatic repulsive particle systems, and can be seen as the large particle limit of correctly renormalized Dyson Brownian motions. The solution of the Fokker-Planck equation can be written as the free convolution of the initial condition and the semi-circular distribution. We propose a nonparametric estimator for the initial condition obtained by performing the free deconvolution via the subordination functions method. This statistical estimator is original as it involves the resolution of a fixed point equation, and a classical deconvolution by a Cauchy distribution. This is due to the fact that, in free probability, the analogue of the Fourier transform is the R-transform, related to the Cauchy transform. In past literature, there has been a focus on the estimation of the initial conditions of linear PDEs such as the heat equation, but to the best of our knowledge, this is the first time that the problem is tackled for a non-linear PDE. The convergence of the estimator is proved and the integrated mean square error is computed, providing rates of convergence similar to the ones known for non-parametric deconvolution methods. Finally, a simulation study illustrates the good performance of our estimator.
Submitted 21 June, 2020;
originally announced June 2020.
-
Adaptive optimal kernel density estimation for directional data
Authors:
Thanh Mai Pham Ngoc
Abstract:
We focus on the nonparametric density estimation problem with directional data. We propose a new rule for bandwidth selection for kernel density estimation. Our procedure is automatic, fully data-driven and adaptive to the smoothness degree of the density. We obtain an oracle inequality and optimal rates of convergence for the L2 error. Our theoretical results are illustrated with simulations.
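To make the object of this abstract concrete, here is a minimal sketch of a kernel density estimator in the simplest directional setting, the circle. It is an illustration under assumptions, not the paper's procedure: the von Mises kernel is a common choice assumed here, and the concentration `kappa` (which acts as an inverse bandwidth) is fixed by hand, whereas the paper proposes a data-driven selection rule that is not reproduced.

```python
import numpy as np

def circular_kde(grid, sample, kappa):
    """Average of von Mises kernels centred at each observed angle (radians)."""
    grid = np.asarray(grid, dtype=float)
    sample = np.asarray(sample, dtype=float)
    diffs = grid[:, None] - sample[None, :]          # pairwise angular differences
    kernels = np.exp(kappa * np.cos(diffs)) / (2.0 * np.pi * np.i0(kappa))
    return kernels.mean(axis=1)                      # one density value per grid point
```

A larger `kappa` gives a narrower kernel and a rougher estimate; a smaller `kappa` smooths more. Choosing it automatically from the data is the problem the paper addresses.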
Submitted 7 August, 2018;
originally announced August 2018.
-
Nonparametric estimation of the fragmentation kernel based on a PDE stationary distribution approximation
Authors:
Van Ha Hoang,
Thanh Mai Pham Ngoc,
Vincent Rivoirard,
Viet Chi Tran
Abstract:
We consider a stochastic individual-based model in continuous time to describe a size-structured population for cell divisions. This model is motivated by the detection of cellular aging in biology. We address here the problem of nonparametric estimation of the kernel ruling the divisions based on the eigenvalue problem related to the asymptotic behavior in large populations. This inverse problem involves a multiplicative deconvolution operator. Using Fourier techniques, we derive a nonparametric estimator whose consistency is studied. The main difficulty comes from the non-standard equations connecting the Fourier transforms of the kernel and the parameters of the model. A numerical study is carried out and we pay special attention to the derivation of bandwidths by using resampling.
Submitted 25 September, 2020; v1 submitted 25 October, 2017;
originally announced October 2017.
-
Adaptive Estimation of Nonparametric Geometric Graphs
Authors:
Yohann De Castro,
Claire Lacour,
Thanh Mai Pham Ngoc
Abstract:
This article studies the recovery of graphons when they are convolution kernels on compact (symmetric) metric spaces. This case is of particular interest since it covers the situation where the probability of an edge depends only on some unknown nonparametric function of the distance between latent points, referred to as Nonparametric Geometric Graphs (NGG). In this setting, adaptive estimation of NGG is possible using a spectral procedure combined with a Goldenshluger-Lepski adaptation method. The latent spaces covered by our framework encompass (among others) compact symmetric spaces of rank one, namely real spheres and projective spaces. For the latter, explicit computations of the eigen-basis and of the model complexity can be achieved, leading to quantitative non-asymptotic results. The time complexity of our method scales cubically in the size of the graph and exponentially in the regularity of the graphon. Hence, this paper offers an algorithmically and theoretically efficient procedure to estimate smooth NGG. As a by-product, this paper shows a non-asymptotic concentration result on the spectrum of integral operators defined by symmetric kernels (not necessarily positive).
Submitted 6 April, 2020; v1 submitted 7 August, 2017;
originally announced August 2017.
-
Adaptive wavelet multivariate regression with errors in variables
Authors:
Michaël Chichignoud,
Van Ha Hoang,
Thanh Mai Pham Ngoc,
Vincent Rivoirard
Abstract:
In the multidimensional setting, we consider the errors-in-variables model. We aim at estimating the unknown nonparametric multivariate regression function with errors in the covariates. We devise an adaptive estimator based on projection kernels on wavelets and a deconvolution operator. We propose an automatic and fully data-driven procedure to select the wavelet resolution level. We obtain an oracle inequality and optimal rates of convergence over anisotropic Hölder classes. Our theoretical results are illustrated by some simulations.
Submitted 12 January, 2016;
originally announced January 2016.
-
Goodness-of-fit test for noisy directional data
Authors:
Claire Lacour,
Thanh Mai Pham Ngoc
Abstract:
We consider spherical data $X_i$ noised by a random rotation $\varepsilon_i\in$ SO(3) so that only the sample $Z_i=\varepsilon_iX_i$, $i=1,\dots, N$ is observed. We define a nonparametric test procedure to distinguish $H_0:$ ''the density $f$ of $X_i$ is the uniform density $f_0$ on the sphere'' and $H_1:$ ''$\|f-f_0\|_2^2\geq Cψ_N$ and $f$ is in a Sobolev space with smoothness $s$''. For a noise density $f_\varepsilon$ with smoothness index $ν$, we show that an adaptive procedure (i.e. $s$ is not assumed to be known) cannot have a faster rate of separation than $ψ_N^{ad}(s)=(N/\sqrt{\log\log(N)})^{-2s/(2s+2ν+1)}$ and we provide a procedure which reaches this rate. We also deal with the case of super smooth noise. We illustrate the theory by implementing our test procedure for various kinds of noise on SO(3) and by comparing it to other procedures. Applications to real data in astrophysics and paleomagnetism are provided.
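The adaptive separation rate quoted in this abstract can be evaluated numerically to see how it behaves with the sample size and the smoothness indices. The sketch below only transcribes the formula $ψ_N^{ad}(s)=(N/\sqrt{\log\log(N)})^{-2s/(2s+2ν+1)}$; the function name and the guard on N are illustrative choices, not taken from the paper.

```python
import math

def adaptive_separation_rate(n, s, nu):
    """Evaluate (N / sqrt(log log N)) ** (-2s / (2s + 2nu + 1)) for N > e."""
    if n <= math.e:
        # log log N must be strictly positive for the square root to make sense
        raise ValueError("need N > e so that log log N > 0")
    exponent = -2.0 * s / (2.0 * s + 2.0 * nu + 1.0)
    return (n / math.sqrt(math.log(math.log(n)))) ** exponent
```

For fixed `s` and `nu`, the value decreases as `n` grows, and a larger noise smoothness `nu` makes the exponent smaller in absolute value, so the rate degrades as the noise becomes smoother.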
Submitted 15 November, 2013; v1 submitted 9 March, 2012;
originally announced March 2012.
-
Regression in random design and Bayesian warped wavelets estimators
Authors:
Thanh Mai Pham Ngoc
Abstract:
In this paper we deal with the regression problem in a random design setting. We investigate asymptotic optimality, from the minimax point of view, of various Bayesian rules based on warped wavelets and show that they nearly attain optimal minimax rates of convergence over the Besov smoothness class considered. Warped wavelets have been introduced recently; they offer very good computable and easy-to-implement properties while being well adapted to the statistical problem at hand. We particularly put emphasis on Bayesian rules leaning on small and large variance Gaussian priors and discuss their simulation performances, comparing them with a hard thresholding procedure.
Submitted 20 August, 2009;
originally announced August 2009.
-
Localized spherical deconvolution
Authors:
Gérard Kerkyacharian,
Thanh Mai Pham Ngoc,
Dominique Picard
Abstract:
We provide a new algorithm for the treatment of the deconvolution problem on the sphere which combines the traditional SVD inversion with an appropriate thresholding technique in a well-chosen new basis. We establish upper bounds for the behavior of our procedure for any $\mathbb{L}_p$ loss. It is important to emphasize the adaptation properties of our procedures with respect to the regularity (sparsity) of the object to recover as well as to inhomogeneous smoothness. We also perform a numerical study which proves that the procedure shows very promising properties in practice as well.
Submitted 13 June, 2011; v1 submitted 13 August, 2009;
originally announced August 2009.
-
Statistical minimax approach of the Hausdorff moment problem
Authors:
Thanh Mai Pham Ngoc
Abstract:
The purpose of this paper is to study the problem of estimating a compactly supported probability density from noisy observations of its moments. In fact, we provide a statistical approach to the famous classical Hausdorff moment problem. We prove an upper bound and a lower bound on the rate of convergence of the mean squared error, showing that the considered estimator attains the minimax rate over the corresponding smoothness classes.
Submitted 23 April, 2009; v1 submitted 9 May, 2007;
originally announced May 2007.