HyperAIHyperAI

Command Palette

Search for a command to run...

il y a 3 ans

Le titre est vide. Veuillez fournir le titre à traduire.

Analyse des séries temporelles — Du niveau débutant au niveau avancé

20 heures de calcul sur RTX 5090 pour seulement $1 (valeur $7)
Aller à Notebook

Résumé

Please provide the title and abstract you would like me to translate into French.

One-sentence Summary

This article proposes a discriminant analysis framework for replicated time series that models transfer functions as stochastic variables to account for within-group spectral variability, analyzes stochastic cepstra to derive parsimonious measures of relative power for optimal group separation, and validates the approach through a simulation study and an analysis of gait variability.

Key Contributions

  • Proposes a statistical model for replicated time series that treats transfer functions as stochastic variables to account for both between-group and within-group spectral variability.
  • Develops a discriminant analysis of stochastic cepstra that yields parsimonious relative power measures for optimal group separation and consistent classification using a finite number of estimated cepstral coefficients.
  • Demonstrates the advantages of accounting for within-group spectral variability through a simulation study and an empirical analysis of gait variability data.

Introduction

Researchers routinely analyze replicated time series across distinct groups to extract distinguishing frequency domain patterns, a practice essential for applications like gait analysis and biomedical signal processing. Traditional discriminant methods, however, typically assume spectral characteristics remain uniform within each group, effectively ignoring the natural cyclical variations that occur among individual replicates. To bridge this gap, the authors leverage a modeling framework that treats transfer functions as stochastic variables, enabling the simultaneous capture of both between-group and within-group spectral differences. They then apply discriminant analysis to stochastic cepstra, yielding compact relative power metrics that reliably classify new observations while properly accounting for internal group variability. The approach remains computationally straightforward, as it relies on standard discriminant procedures applied to a finite set of estimated cepstral coefficients.

Dataset

  • Dataset Composition and Sources The authors analyze gait variability data originally collected by Hausdorff et al. (2000) and publicly hosted on PhysioNet. The dataset records stride intervals from individuals across three neurological cohorts to support pathological characterization and diagnostic classification.

  • Subset Details

    • Healthy controls: 16 participants
    • Amyotrophic lateral sclerosis (ALS) patients: 11 participants
    • Huntington’s disease patients: 18 participants
    • Total cohort: 45 individuals
  • Cropping and Processing Pipeline

    • Participants walked at a normal pace while wearing plantar pressure sensors.
    • The analysis isolates left foot stride intervals, discarding a 20-second start-up period and retaining a fixed 3.5-minute window per subject.
    • A 3 standard deviation median filter removes artifacts caused by hallway turns.
    • The signal is interpolated using cubic smoothing splines, resampled to 2 Hz, and linearly detrended.
    • This yields a standardized, detrended stride interval series of length 420 for each participant.
  • Model Usage and Analysis

    • The processed series are transformed into log-spectral estimates using 7 multitapers, with cross-validation selecting 4 spectral coefficients.
    • Two discriminant functions are constructed to contrast power across specific frequency bands, mapping each subject into a 2D feature space.
    • Classification performance is evaluated using leave-one-out cross-validation rather than a fixed training-test split.
    • The resulting discriminant scores drive the comparative assessment against alternative information-theoretic classifiers and serve as an interpretable diagnostic tool for separating the three neurological groups.

Method

The authors leverage a cepstral-based framework for discriminant analysis of time series data, designed to identify low-dimensional measures that best separate groups while accounting for within-group spectral variability. The core of the method revolves around transforming time series into a functional representation through log-spectra and their associated cepstral coefficients, enabling a rigorous and interpretable analysis of group differences. The replicate-specific log-spectrum is defined as γjk(λ)=logAjk(λ)2\gamma_{jk}(\lambda) = \log |A_{jk}(\lambda)|^2γjk(λ)=logAjk(λ)2, where Ajk(λ)A_{jk}(\lambda)Ajk(λ) represents the Fourier transform of the kkkth replicate from the jjjth group. The group mean log-spectrum is denoted by αj(λ)=E{γjk(λ)}\alpha_j(\lambda) = \mathbb{E}\{\gamma_{jk}(\lambda)\}αj(λ)=E{γjk(λ)}, and the deviation from this mean is βjk(λ)=γjk(λ)αj(λ)\beta_{jk}(\lambda) = \gamma_{jk}(\lambda) - \alpha_j(\lambda)βjk(λ)=γjk(λ)αj(λ). These log-spectra are then transformed into cepstral coefficients via cosine series, resulting in cjk=01γjk(λ)2cos(2πλ)dλc_{jk\ell} = \int_0^1 \gamma_{jk}(\lambda) \sqrt{2} \cos(2\pi\lambda\ell) d\lambdacjk=01γjk(λ)2cos(2πλ)dλ for 1\ell \geq 11, and cjk0=01γjk(λ)dλc_{jk0} = \int_0^1 \gamma_{jk}(\lambda) d\lambdacjk0=01γjk(λ)dλ, forming a sequence cjkRNc_{jk} \in \mathbb{R}^{\mathbb{N}}cjkRN. The group-average and deviation cepstra aja_jaj and bjkb_{jk}bjk are similarly derived from αj\alpha_jαj and βjk\beta_{jk}βjk, respectively.

The method proceeds by formulating a cepstral Fisher’s discriminant analysis, which seeks linear combinations of cepstral coefficients that maximize separation between group means relative to within-group variability. Let y0RNy_0 \in \mathbb{R}^{\mathbb{N}}y0RN be a weight vector, and define the separation of the linear function =0y0cjk\sum_{\ell=0}^\infty y_{0\ell} c_{jk\ell}=0y0cjk as y0Λ2=,m=0y0Λ(,m)y0m||y_0||_\Lambda^2 = \sum_{\ell,m=0}^\infty y_{0\ell} \Lambda(\ell,m) y_{0m}∣∣y0Λ2=,m=0y0Λ(,m)y0m, where Λ(,m)\Lambda(\ell,m)Λ(,m) is the between-group kernel. The within-group kernel Γ(,m)=E(bjklbjkm)\Gamma(\ell,m) = \mathbb{E}(b_{jkl}b_{jkm})Γ(,m)=E(bjklbjkm) defines the covariance between linear combinations, with y0Γ2=y0,y0Γ||y_0||_\Gamma^2 = \langle y_0, y_0 \rangle_\Gamma∣∣y0Γ2=y0,y0Γ. The first discriminant is obtained by maximizing y1Λ||y_1||_\Lambda∣∣y1Λ under the constraint y1Γ=1||y_1||_\Gamma = 1∣∣y1Γ=1, yielding djk1==0y1cjkd_{jk1} = \sum_{\ell=0}^\infty y_{1\ell} c_{jk\ell}djk1==0y1cjk. Higher-order discriminants are defined sequentially, ensuring orthogonality to lower-order discriminants with respect to Γ\GammaΓ, such that yqy_qyq satisfies yq,ymΓ=0\langle y_q, y_m \rangle_\Gamma = 0yq,ymΓ=0 for m<qm < qm<q. The number of non-trivial discriminants QQQ is bounded by J1J-1J1, the number of groups, and the ranks of Λ\LambdaΛ and Γ\GammaΓ. This framework produces parsimonious, interpretable measures that capture the most discriminative features across groups.

The approach can be extended to log-spectral weight functions. When =0yq<\sum_{\ell=0}^\infty |y_{q\ell}| < \infty=0yq<, the weight function ξq(λ)=yq0+=1yq2cos(2πλ)\xi_q(\lambda) = y_{q0} + \sum_{\ell=1}^\infty y_{q\ell} \sqrt{2} \cos(2\pi\lambda\ell)ξq(λ)=yq0+=1yq2cos(2πλ) exists, and the discriminant djkqd_{jkq}djkq can be expressed as 01ξq(λ)γjk(λ)dλ\int_0^1 \xi_q(\lambda) \gamma_{jk}(\lambda) d\lambda01ξq(λ)γjk(λ)dλ. This formulation connects the cepstral method to integral-based discriminant analysis, though the cepstral approach is more general, as it encompasses integral log-spectral discriminants when they exist.

Estimation is performed using a finite-dimensional approximation. Given nnn independent time series epochs of length NNN, the authors consider estimating cepstral coefficients via c^jk=N1m=0N1γ^jkm2cos(2πλm)\hat{c}_{jk\ell} = N^{-1} \sum_{m=0}^{N-1} \hat{\gamma}_{jkm} \sqrt{2} \cos(2\pi\lambda_m \ell)c^jk=N1m=0N1γ^jkm2cos(2πλm), where λm=m/N\lambda_m = m/Nλm=m/N and γ^jkm\hat{\gamma}_{jkm}γ^jkm is a log-spectral estimator. The multitaper method, which uses RRR orthonormal data tapers hrth_{rt}hrt, is advocated for its favorable bias, variance, and resolution properties. The rrrth tapered periodogram is Ijkrm=N1/2t=1NhrtXjkte2πiλmt2I_{jkrm} = \left|N^{-1/2} \sum_{t=1}^N h_{rt} X_{jkt} e^{-2\pi i \lambda_m t}\right|^2Ijkrm=N1/2t=1NhrtXjkte2πiλmt2, and the multitaper log-spectral estimator is γ^jkm=log(R1r=1RIjkrm)\hat{\gamma}_{jkm} = \log \left( R^{-1} \sum_{r=1}^R I_{jkrm} \right)γ^jkm=log(R1r=1RIjkrm).

For practical implementation, the authors truncate the cepstral coefficients to LLL dimensions, forming c^jkL=(c^jk0,,c^jkL1)T\hat{\mathbf{c}}_{jk}^L = (\hat{c}_{jk0}, \ldots, \hat{c}_{jkL-1})^Tc^jkL=(c^jk0,,c^jkL1)T. Discriminants and weight functions are estimated using classical Fisher’s discriminant analysis on this finite-dimensional vector. The estimated group mean is a^jL=nj1k=1njc^jkL\hat{\mathbf{a}}_j^L = n_j^{-1} \sum_{k=1}^{n_j} \hat{\mathbf{c}}_{jk}^La^jL=nj1k=1njc^jkL, and the overall mean is a^L=j=1Jπ^ja^jL\hat{\mathbf{a}}^L = \sum_{j=1}^J \hat{\pi}_j \hat{\mathbf{a}}_j^La^L=j=1Jπ^ja^jL. The between-group kernel is estimated as Λ^L=j=1Jπ^j(a^jLa^L)(a^jLa^L)T\hat{\mathbf{\Lambda}}_L = \sum_{j=1}^J \hat{\pi}_j (\hat{\mathbf{a}}_j^L - \hat{\mathbf{a}}^L)(\hat{\mathbf{a}}_j^L - \hat{\mathbf{a}}^L)^TΛ^L=j=1Jπ^j(a^jLa^L)(a^jLa^L)T. The within-group covariance is Γ^L=j=1Jπ^jΓ^Lj\hat{\mathbf{\Gamma}}_L = \sum_{j=1}^J \hat{\pi}_j \hat{\mathbf{\Gamma}}_{Lj}Γ^L=j=1Jπ^jΓ^Lj, where Γ^Lj=(nj1)1k=1njb^jkL(b^jkL)T\hat{\mathbf{\Gamma}}_{Lj} = (n_j - 1)^{-1} \sum_{k=1}^{n_j} \hat{\mathbf{b}}_{jk}^L (\hat{\mathbf{b}}_{jk}^L)^TΓ^Lj=(nj1)1k=1njb^jkL(b^jkL)T and b^jkL=c^jkLa^jL\hat{\mathbf{b}}_{jk}^L = \hat{\mathbf{c}}_{jk}^L - \hat{\mathbf{a}}_j^Lb^jkL=c^jkLa^jL. The qqqth estimated weight function y^qL\hat{\mathbf{y}}_q^Ly^qL is the qqqth eigenvector of Γ^L1Λ^L\hat{\mathbf{\Gamma}}_L^{-1} \hat{\mathbf{\Lambda}}_LΓ^L1Λ^L, and the discriminant is d^jkqL=(y^qL)Tc^jkL\hat{d}_{jkq}^L = (\hat{\mathbf{y}}_q^L)^T \hat{\mathbf{c}}_{jk}^Ld^jkqL=(y^qL)Tc^jkL. The corresponding weight function ξ^qL(λ)=y^q0L+=1L1y^qL2cos(2πλ)\hat{\xi}_q^L(\lambda) = \hat{y}_{q0}^L + \sum_{\ell=1}^{L-1} \hat{y}_{q\ell}^L \sqrt{2} \cos(2\pi\lambda\ell)ξ^qL(λ)=y^q0L+=1L1y^qL2cos(2πλ) is interpretable even if the theoretical version ξq\xi_qξq exists only in a limiting sense.

Classification of a new time series with unknown group membership proceeds by estimating its cepstrum c^L\hat{\mathbf{c}}_*^Lc^L and computing its discriminants d^qL\hat{d}_{*q}^Ld^qL. The classification rule assigns the observation to the group jjj that minimizes q=1Q(d^qLμjq)22log(πj)\sum_{q=1}^Q (\hat{d}_{*q}^L - \mu_{jq})^2 - 2 \log(\pi_j)q=1Q(d^qLμjq)22log(πj), where μjq==0yqaj\mu_{jq} = \sum_{\ell=0}^\infty y_{q\ell} a_{j\ell}μjq==0yqaj is the qqqth group-mean discriminant. This rule is optimal under Gaussianity and corresponds to the centroid classifier of Delaigle and Hall (2012). The number of cepstral coefficients LLL is selected via leave-one-out cross-validation, minimizing classification error across all replicates.

Theoretical consistency is established under regularity conditions. Assumptions on the smoothness of log-spectra ensure that the largest eigenvalue of the within-group covariance is bounded and the smallest eigenvalue σL\sigma_LσL decays slowly. Consistency of the estimated weight functions and discriminants requires σL2n1/20\sigma_L^{-2} n^{-1/2} \to 0σL2n1/20, σL2N1/20\sigma_L^{-2} N^{-1/2} \to 0σL2N1/20, and Ln1/20L n^{-1/2} \to 0Ln1/20 as n,N,Ln, N, L \to \inftyn,N,L. Under these conditions, y^qL\hat{\mathbf{y}}_q^Ly^qL converges in the Γ\GammaΓ-norm to the true weight function yqy_qyq, and d^jkqL\hat{d}_{jkq}^Ld^jkqL converges in probability to djkqd_{jkq}djkq. Consequently, the classification rule Π^(c^L)\hat{\Pi}(\hat{\mathbf{c}}_*^L)Π^(c^L) converges in probability to the true group membership Π(c)\Pi(c_*)Π(c). The framework accommodates heterogeneity in within-group covariance by using a pooled estimate j=1JπjΓj\sum_{j=1}^J \pi_j \Gamma_jj=1JπjΓj, and the estimator remains valid under this setting, though classification rates may require adjustment. The method provides a robust and interpretable approach to discriminant analysis in the presence of within-group spectral variability, offering advantages over traditional methods that assume identical replicates within groups.

Experiment

A simulation study evaluated time series classification procedures under varying levels of within-group spectral variability by testing conditional autoregressive processes across different training sample sizes and sequence lengths. The proposed cepstral Fisher's discriminant analysis consistently achieved higher out-of-sample classification accuracy than traditional methods that ignore group-level spectral differences, with multitaper-based estimation yielding the strongest performance. While the cepstral approach maintained robust accuracy regardless of spectral variability levels, conventional information criterion methods experienced significant performance degradation as within-group variability increased. These results validate the superior stability and effectiveness of the proposed framework for handling heterogeneous spectral patterns in time series classification.

The authors conducted a simulation study to evaluate classification procedures under varying levels of within-group spectral variability. Results show that cepstral Fisher's discriminant analysis consistently outperforms information criterion-based methods across all conditions, with performance differences influenced by the choice of log-spectral estimator. The classification rates of methods that do not account for within-group variability are more sensitive to increases in spectral variability. Cepstral Fisher's discriminant analysis achieves higher classification rates than information criterion-based methods across all experimental conditions. Among cepstral methods, the multitaper estimator performs best, while the direct estimator performs poorest. Methods ignoring within-group spectral variability show reduced classification performance as variability increases.

This simulation study evaluates classification procedures under varying levels of within-group spectral variability to validate their robustness against spectral fluctuations. The findings indicate that explicitly modeling within-group variability is critical, as approaches that ignore it suffer substantial performance degradation when variability increases. Cepstral Fisher's discriminant analysis consistently outperforms information criterion-based methods across all conditions, with overall accuracy heavily influenced by the selection of the log-spectral estimator. Among the tested estimators, the multitaper approach delivers the most reliable results, whereas the direct method proves least effective.


Créer de l'IA avec l'IA

De l'idée au lancement — accélérez votre développement IA avec le co-codage IA gratuit, un environnement prêt à l'emploi et le meilleur prix pour les GPU.

Codage assisté par IA
GPU prêts à l’emploi
Tarifs les plus avantageux

HyperAI Newsletters

Abonnez-vous à nos dernières mises à jour
Nous vous enverrons les dernières mises à jour de la semaine dans votre boîte de réception à neuf heures chaque lundi matin
Propulsé par MailChimp