Mondon, Camille, Trinh, Thi Huong
, Ruiz-Gazen, Anne
and Thomas-Agnan, Christine
(2024)
ICS for complex data with application to outlier detection for density data.
TSE Working Paper, n. 24-1585, Toulouse
Preview |
Text
Download (3MB) | Preview |
Abstract
Invariant coordinate selection (ICS) is a dimension reduction method, used as a preliminary step for clustering and outlier detection. It has been primarily applied to multivariate data. This work introduces a coordinate-free definition of ICS in an abstract Euclidean space and extends the method to complex data. Functional and distributional data are preprocessed into a finite-dimensional subspace. For example, in the framework of Bayes Hilbert spaces, distributional data are smoothed into compositional spline functions through the Maximum Penalised Likelihood method. We describe an outlier detection procedure for complex data and study the impact of some preprocessing parameters on the results. We compare our approach with other outlier detection methods through simulations, producing promising results in scenarios with a low proportion of outliers. ICS allows detecting abnormal climate events in a sample of daily maximum temperature distributions recorded across the provinces of Northern Vietnam between 1987 and 2016.
Item Type: | Monograph (Working Paper) |
---|---|
Language: | English |
Date: | October 2024 |
Place of Publication: | Toulouse |
Uncontrolled Keywords: | Bayes spaces, Distributional data, Extreme weather, Functional data, Invariant coordinate selection, Outlier detection, Temperature distribution |
Subjects: | B- ECONOMIE ET FINANCE |
Divisions: | TSE-R (Toulouse) |
Institution: | Université Toulouse Capitole |
Site: | UT1 |
Date Deposited: | 23 May 2025 08:06 |
Last Modified: | 23 May 2025 08:06 |
OAI Identifier: | oai:tse-fr.eu:129830 |
URI: | https://publications.ut-capitole.fr/id/eprint/50853 |