No Arabic abstract
Nonparametric statistics for distribution functions F or densities f=F under qualitative shape constraints provides an interesting alternative to classical parametric or entirely nonparametric approaches. We contribute to this area by considering a new shape constraint: F is said to be bi-log-concave, if both log(F) and log(1 - F) are concave. Many commonly considered distributions are compatible with this constraint. For instance, any c.d.f. F with log-concave density f = F is bi-log-concave. But in contrast to the latter constraint, bi-log-concavity allows for multimodal densities. We provide various characterizations. It is shown that combining any nonparametric confidence band for F with the new shape-constraint leads to substantial improvements, particularly in the tails. To pinpoint this, we show that these confidence bands imply non-trivial confidence bounds for arbitrary moments and the moment generating function of F.
Shape-constrained density estimation is an important topic in mathematical statistics. We focus on densities on $mathbb{R}^d$ that are log-concave, and we study geometric properties of the maximum likelihood estimator (MLE) for weighted samples. Cule, Samworth, and Stewart showed that the logarithm of the optimal log-concave density is piecewise linear and supported on a regular subdivision of the samples. This defines a map from the space of weights to the set of regular subdivisions of the samples, i.e. the face poset of their secondary polytope. We prove that this map is surjective. In fact, every regular subdivision arises in the MLE for some set of weights with positive probability, but coarser subdivisions appear to be more likely to arise than finer ones. To quantify these results, we introduce a continuous version of the secondary polytope, whose dual we name the Samworth body. This article establishes a new link between geometric combinatorics and nonparametric statistics, and it suggests numerous open problems.
In this paper, the functional Quermassintegrals of log-concave functions in $mathbb R^n$ are discussed, we obtain the integral expression of the $i$-th functional mixed Quermassintegrals, which are similar to the integral expression of the $i$-th Quermassintegrals of convex bodies.
Affine invariant points and maps for sets were introduced by Grunbaum to study the symmetry structure of convex sets. We extend these notions to a functional setting. The role of symmetry of the set is now taken by evenness of the function. We show that among the examples for affine invariant points are the classical center of gravity of a log-concave function and its Santalo point. We also show that the recently introduced floating functions and the John- and Lowner functions are examples of affine invariant maps. Their centers provide new examples of affine invariant points for log-concave functions.
The aim of this paper is to develop the $L_p$ John ellipsoid for the geometry of log-concave functions. Using the results of the $L_p$ Minkowski theory for log-concave function established in cite{fan-xin-ye-geo2020}, we characterize the $L_p$ John ellipsoid for log-concave function, and establish some inequalities of the $L_p$ John ellipsoid for log-concave function. Finally, the analog of Balls volume ratio inequality for the $L_p$ John ellipsoid of log-concave function is established.
Let X_1, ..., X_n be independent and identically distributed random vectors with a log-concave (Lebesgue) density f. We first prove that, with probability one, there exists a unique maximum likelihood estimator of f. The use of this estimator is attractive because, unlike kernel density estimation, the method is fully automatic, with no smoothing parameters to choose. Although the existence proof is non-constructive, we are able to reformulate the issue of computation in terms of a non-differentiable convex optimisation problem, and thus combine techniques of computational geometry with Shors r-algorithm to produce a sequence that converges to the maximum likelihood estimate. For the moderate or large sample sizes in our simulations, the maximum likelihood estimator is shown to provide an improvement in performance compared with kernel-based methods, even when we allow the use of a theoretical, optimal fixed bandwidth for the kernel estimator that would not be available in practice. We also present a real data clustering example, which shows that our methodology can be used in conjunction with the Expectation--Maximisation (EM) algorithm to fit finite mixtures of log-concave densities. An R version of the algorithm is available in the package LogConcDEAD -- Log-Concave Density Estimation in Arbitrary Dimensions.