Kruskal-Wallis Test for Functional Data Based on Random Projections Generated from a Simulation of a Brownian Motion

Keywords: Functional data, random projections, Kruskal-Wallis test, non-parametric statistics, brownian motion

Abstract

The k-sample problem for functional data has been widely studied from theoretical and applied perspectives. In literature, Gaussianity of the generating process is generally assumed, which may be impractical in some situations. This work proposes an extension of the Kruskal-Wallis test to the case of functional data as an alternative to the problem of non- Gaussianity. The methodology used consisted of transforming each group's functional data into scalars using random projections and subsequently performing classical Kruskal-Wallis tests. The main results were the extension of the Kruskal-Wallis test to the case of functional data and the verification of its unbiased and consistency properties. Reducing dimensionality from random projections allows us to extend the classical Kruskal-Wallis test to the functional context and solve problems of non-Gaussianity and atypical observations.

Author Biographies

Rafael Meléndez Surmay, Universidad de la Guajira, Colombia

Universidad de la Guajira, Riohacha-Colombia, rmelendez@uniguajira.edu.co

Ramón Giraldo Henao, Universidad Nacional de Colombia, Colombia

Universidad Nacional de Colombia, Bogotá-Colombia, rgiraldoh@unal.edu.co

Francisco Rodríguez Cortes, Universidad Nacional de Colombia, Colombia

Universidad Nacional de Colombia, Medellín-Colombia, frrodriguezc@unal.edu.co

References

T. Górecki and Ł. Smaga, “A comparison of tests for the one-way ANOVA problem for functional data,” Comput. Stat., vol. 30, no. 4, pp. 987–1010, Dec. 2015. https://doi.org/10.1007/s00180-015-0555-0

J. T. Zhang, Analysis of variance for functional data, 1st ed. New York, NY, USA: Chapman and Hall/CRC, 2013. https://doi.org/10.1201/b15005

F. Ferraty, P. Vieu, and S. Viguier-Pla, “Factor-based comparison of groups of curves,” Comput. Stat. Data Anal., vol. 51, no. 10, pp. 4903–4910, Jun. 2007. https://doi.org/https://doi.org/10.1016/j.csda.2006.10.001

M. L. Bourbonnais et al., “Characterizing spatial-temporal patterns of landscape disturbance and recovery in western Alberta, Canada using a functional data analysis approach and remotely sensed data,” Ecol. Inform., vol. 39, pp. 140–150, May. 2017. https://doi.org/https://doi.org/10.1016/j.ecoinf.2017.04.010

A. Roy, T. Nelson, and P. Turaga, “Functional data analysis approach for mapping change in time series: A case study using bicycle ridership patterns,” Transp. Res. Interdiscip. Perspect., vol. 17, p. 100752, Jan. 2023. https://doi.org/https://doi.org/10.1016/j.trip.2022.100752

J. M. Torres, P. J. G. Nieto, L. Alejano, and A. N. Reyes, “Detection of outliers in gas emissions from urban areas using functional data analysis,” J. Hazard. Mater., vol. 186, no. 1, pp. 144–149, Feb. 2011. https://doi.org/https://doi.org/10.1016/j.jhazmat.2010.10.091

M. Tang, Z. Li, and G. Tian, “A Data-Driven-Based Wavelet Support Vector Approach for Passenger Flow Forecasting of the Metropolitan Hub,” IEEE Access, vol. 7, pp. 7176-7183, Jan. 2019. https://ieeexplore.ieee.org/abstract/document/8600312

Z. Jin-Ting, and X. Liang, “One-way ANOVA for functional data via globalizing the pointwise F-test,” Scand. Stat. Theory Appl., vol. 41, no. 1, pp. 51–71, Mar. 2014. https://doi.org/10.1111/sjos.12025

A. Cuevas, M. Febrero, and R. Fraiman, “An anova test for functional data,” Comput. Stat. Data Anal., vol. 47, no. 1, pp. 111–122, Aug. 2004. https://doi.org/https://doi.org/10.1016/j.csda.2003.10.021

J. O. Ramsay, and B. W. Silverman, Functional Data Analysis, 2nd ed. New York, NY, USA: Springer-Verlag New York, 2005. https://doi.org/10.1007/b98888

C. G. Kaufman, and S. R. Sain, “Bayesian Functional ANOVA Modeling Using Gaussian Process Prior Distributions,” Bayesian Anal., vol. 5 no. 1, pp. 123–149, Mar. 2010. https://doi.org/10.1214/10-BA505

Q. Shen, and J. J. Faraway, “An F test for linear models with functional responses,” Statistica Sinica, vol. 14, pp. 1239–1257, 2004. https://api.semanticscholar.org/CorpusID:55106079

P. Delicado, “Functional k-sample problem when data are density functions,” Comput. Stat., vol. 22, no. 3, pp. 391–410, Sep. 2007. https://doi.org/10.1007/s00180-007-0047-y

M. Myllymäki, T. Mrkvička, P. Grabarnik, H. Seijo, and U. Hahn, “Global envelope tests for spatial processes,” J. R. Stat. Soc. Series B Stat. Methodol., vol. 79, no. 2, pp. 381–404, Mar. 2017. https://doi.org/10.1111/rssb.12172

O. A. Vsevolozhskaya, M. C. Greenwood, and D. B. Holodov, “Pairwise comparison of treatment levels in functional analysis of variance with application to erythrocyte hemolysis,” Ann. Appl. Stat., vol. 8, pp. 905–925, Jun. 2014. https://api.semanticscholar.org/CorpusID:38476665

A. Pini, S. Vantini, B. M. Colosimo, and M. Grasso, “Domain-selective functional analysis of variance for supervised statistical profile monitoring of signal data,” J. R. Stat. Soc. Ser. C Appl. Stat., vol. 67, no. 1, pp. 55–81, Jan. 2018. https://doi.org/10.1111/rssc.12218

A. B. Kashlak, S. Myroshnychenko, and S. Spektor, “Analytic Permutation Testing for Functional Data ANOVA,” J. Comput. Graph. Stat., vol. 32, no. 1, pp. 294–303, May. 2023. https://doi.org/10.1080/10618600.2022.2069780

M. Hollander, D. A. Wolfe, and E. Chicken, “The onw-Way Layout Introduction,” in Nonparametric Statistical Methods, D. J. Balding et al., Eds., Hoboken, New Jersey: John Wiley & Sons, 2013. https://books.google.es/books?hl=es&lr=&id=Y5s3AgAAQBAJ&oi=fnd&pg=PP10&dq=E.+Hollander,+M.,+Wolfe,+d.+and+Chicken,+Nonparametric+statistical+methods,+John+Wiley.+Londres,+2013.&ots=a-h-k6diyR&sig=I_655cMRqPSiDdGABrn8nLSOa98

D. Achlioptas, “Database-friendly random projections,” in Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, New York, NY, USA, 2001. https://api.semanticscholar.org/CorpusID:2640788

A. Nieto-Reyes, “Random Projections: Applications to Statistical Data Depth and Goodness of Fit Test,” BEIO Rev. Of. la Soc. Estadística e Investig. Oper., vol. 35, no. 1, pp. 7–22, Mar. 2019. https://www.seio.es/beio/BEIOVol35Num1.pdf#page=13

J. A. Cuesta-Albertos, R. Fraiman, and T. Ransford, “Random projections and goodness-of-fit tests in infinite-dimensional spaces,” Bull. Brazilian Math. Soc., vol. 37, no. 4, pp. 477–501, Dec. 2006. https://doi.org/10.1007/s00574-006-0023-0

R. Ihaka, R. Gentleman. The R Project for Statistical Computing. (V R.4.2.1 2022). Accessed: Apr.. 16, 2023. [Online]. Available: https://cran.r-project.org/bin/windows/base/old/4.2.1/

J. Ramsay, G. Hooker, and S. Graves, Functional Data Analysis with R and MATLAB. New York, NY, USA: Springer New York, 2009. https://doi.org/10.1007/978-0-387-98185-7

T. Pohlert, The Pairwise Multiple Comparison of Mean Ranks Package (PMCMR) v4.4. 2016. Accessed: Apr.16, 2023. [Online]. Available: http://cran.r-project.org/package=PMCMR

How to Cite
[1]
Meléndez Surmay R., R. Giraldo Henao, and Rodríguez Cortes F., “Kruskal-Wallis Test for Functional Data Based on Random Projections Generated from a Simulation of a Brownian Motion”, TecnoL., vol. 27, no. 59, p. e2986, Apr. 2024.

Downloads

Download data is not yet available.
Published
2024-04-29
Section
Research Papers

Altmetric

Funding data

Crossref Cited-by logo