Mondon, Camille
, Trinh, Thi Huong
, Ruiz-Gazen, Anne
ORCID: https://orcid.org/0000-0001-8970-8061 and Thomas-Agnan, Christine
ORCID: https://orcid.org/0000-0002-7845-5385
(2024)
ICS for complex data with application to outlier detection for density data.
TSE Working Paper, n. 24-1585, Toulouse

Preview |
Text
Download (3MB) | Preview |
Abstract
Invariant coordinate selection (ICS) is a dimension reduction method, used as a preliminary step for clustering and outlier detection. It has been primarily applied to multivariate data. This work introduces a coordinate-free definition of ICS in an abstract Euclidean space and extends the method to complex data. Functional and distributional data are preprocessed into a finite-dimensional subspace. For example, in the framework of Bayes Hilbert spaces, distributional data are smoothed into compositional spline functions through the Maximum Penalised Likelihood method. We describe an outlier detection procedure for complex data and study the impact of some preprocessing parameters on the results. We compare our approach with other outlier detection methods through simulations, producing promising results in scenarios with a low proportion of outliers. ICS allows detecting abnormal climate events in a sample of daily maximum temperature distributions recorded across the provinces of Northern Vietnam between 1987 and 2016.
| Item Type: | Monograph (Working Paper) |
|---|---|
| Language: | English |
| Date: | October 2024 |
| Place of Publication: | Toulouse |
| Uncontrolled Keywords: | Bayes spaces, Distributional data, Extreme weather, Functional data, Invariant coordinate selection, Outlier detection, Temperature distribution |
| Subjects: | B- ECONOMIE ET FINANCE |
| Divisions: | TSE-R (Toulouse) |
| Institution: | Université Toulouse Capitole |
| Site: | UT1 |
| Date Deposited: | 23 May 2025 08:06 |
| Last Modified: | 18 Dec 2025 09:46 |
| OAI Identifier: | oai:tse-fr.eu:129830 |
| URI: | https://publications.ut-capitole.fr/id/eprint/50853 |
Available Versions of this Item
- ICS for complex data with application to outlier detection for density data. (deposited 23 May 2025 08:06) [Currently Displayed]

Tools
Tools