Mondon, CamilleIdRef, Trinh, Thi HuongIdRef, Ruiz-Gazen, AnneIdRefORCIDORCID: https://orcid.org/0000-0001-8970-8061 and Thomas-Agnan, ChristineIdRefORCIDORCID: https://orcid.org/0000-0002-7845-5385 (2024) ICS for complex data with application to outlier detection for density data. TSE Working Paper, n. 24-1585, Toulouse

Warning
There is a more recent version of this item available.
[thumbnail of wp_tse_1585.pdf]
Preview
Text
Download (3MB) | Preview

Abstract

Invariant coordinate selection (ICS) is a dimension reduction method, used as a preliminary step for clustering and outlier detection. It has been primarily applied to multivariate data. This work introduces a coordinate-free definition of ICS in an abstract Euclidean space and extends the method to complex data. Functional and distributional data are preprocessed into a finite-dimensional subspace. For example, in the framework of Bayes Hilbert spaces, distributional data are smoothed into compositional spline functions through the Maximum Penalised Likelihood method. We describe an outlier detection procedure for complex data and study the impact of some preprocessing parameters on the results. We compare our approach with other outlier detection methods through simulations, producing promising results in scenarios with a low proportion of outliers. ICS allows detecting abnormal climate events in a sample of daily maximum temperature distributions recorded across the provinces of Northern Vietnam between 1987 and 2016.

Item Type: Monograph (Working Paper)
Language: English
Date: October 2024
Place of Publication: Toulouse
Uncontrolled Keywords: Bayes spaces, Distributional data, Extreme weather, Functional data, Invariant coordinate selection, Outlier detection, Temperature distribution
Subjects: B- ECONOMIE ET FINANCE
Divisions: TSE-R (Toulouse)
Institution: Université Toulouse Capitole
Site: UT1
Date Deposited: 23 May 2025 08:06
Last Modified: 18 Dec 2025 09:46
OAI Identifier: oai:tse-fr.eu:129830
URI: https://publications.ut-capitole.fr/id/eprint/50853

Available Versions of this Item

View Item

Downloads

Downloads per month over past year