<?xml version="1.0" encoding="UTF-8"?><?xml-model type="application/xml-dtd" href="http://jats.nlm.nih.gov/publishing/1.1d3/JATS-journalpublishing1.dtd"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1d3 20150301//EN" "http://jats.nlm.nih.gov/publishing/1.1d3/JATS-journalpublishing1.dtd">
<article xmlns:ali="http://www.niso.org/schemas/ali/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" dtd-version="1.1d3" specific-use="Marcalyc 1.2" article-type="research-article" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="redalyc">3442</journal-id>
<journal-title-group>
<journal-title specific-use="original" xml:lang="es">TecnoLógicas</journal-title>
</journal-title-group>
<issn pub-type="ppub">0123-7799</issn>
<issn pub-type="epub">2256-5337</issn>
<publisher>
<publisher-name>Instituto Tecnológico Metropolitano</publisher-name>
<publisher-loc>
<country>Colombia</country>
<email>tecnologicas@itm.edu.co</email>
</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="art-access-id" specific-use="redalyc">344276634010</article-id>
<article-id pub-id-type="doi">https://doi.org/10.22430/22565337.2986</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artículos de investigación</subject>
</subj-group>
</article-categories>
<title-group>
<article-title xml:lang="en">Kruskal-Wallis Test for Functional Data Based on Random Projections Generated from a Simulation of a Brownian Motion</article-title>
<trans-title-group>
<trans-title xml:lang="es">Prueba de Kruskal-Wallis para datos funcionales basada en proyecciones aleatorias generadas a partir de una simulación de un movimiento browniano</trans-title>
</trans-title-group>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-6449-0358</contrib-id>
<name name-style="western">
<surname>Meléndez Surmay</surname>
<given-names>Rafael</given-names>
</name>
<xref ref-type="corresp" rid="corresp1"/>
<xref ref-type="aff" rid="aff1"/>
<email>rmelendez@uniguajira.edu.co</email>
</contrib>
<contrib contrib-type="author" corresp="no">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-3010-169X</contrib-id>
<name name-style="western">
<surname>Giraldo Henao</surname>
<given-names>Ramón</given-names>
</name>
<xref ref-type="aff" rid="aff2"/>
<email>rgiraldoh@unal.edu.co</email>
</contrib>
<contrib contrib-type="author" corresp="no">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-2152-8619</contrib-id>
<name name-style="western">
<surname>Rodríguez Cortes</surname>
<given-names>Francisco</given-names>
</name>
<xref ref-type="aff" rid="aff3"/>
<email>frrodriguezc@unal.edu.co</email>
</contrib>
</contrib-group>
<aff id="aff1">
<institution content-type="original">Rafael Meléndez Surmay; Universidad de la Guajira, Riohacha-Colombia. ORCID: 0000-0002-6449-0358 Correo: rmelendez@uniguajira.edu.co</institution>
<institution content-type="orgname">Universidad de la Guajira</institution>
<country country="CO">Colombia</country>
</aff>
<aff id="aff2">
<institution content-type="original">Ramón Giraldo Henao; Universidad Nacional de Colombia, Bogotá-Colombia. ORCID: 0000-0003-3010-169X Correo: rgiraldoh@unal.edu.co</institution>
<institution content-type="orgname">Universidad Nacional de Colombia</institution>
<country country="CO">Colombia</country>
</aff>
<aff id="aff3">
<institution content-type="original">Francisco Rodríguez Cortes Universidad Nacional de Colombia, Medellín-Colombia. ORCID: 0000-0002-2152-8619 Correo: frrodriguezc@unal.edu.co</institution>
<institution content-type="orgname">Universidad Nacional de Colombia</institution>
<country country="CO">Colombia</country>
</aff>
<author-notes>
<corresp id="corresp1">
<email>rmelendez@uniguajira.edu.co</email>
</corresp>
</author-notes>
<pub-date pub-type="epub-ppub">
<year>2024</year>
</pub-date>
<volume>27</volume>
<issue>59</issue>
<fpage>1</fpage>
<lpage>11</lpage>
<history>
<date date-type="received" publication-format="dd mes yyyy">
<day>10</day>
<month>01</month>
<year>2024</year>
</date>
<date date-type="accepted" publication-format="dd mes yyyy">
<day>22</day>
<month>04</month>
<year>2024</year>
</date>
<date date-type="pub" publication-format="dd mes yyyy">
<day>29</day>
<month>04</month>
<year>2024</year>
</date>
</history>
<permissions>
<copyright-year>2024</copyright-year>
<copyright-holder>Instituto Tecnológico Metropolitano</copyright-holder>
<ali:free_to_read/>
<license xlink:href="https://creativecommons.org/licenses/by-nc-sa/4.0/">
<ali:license_ref>https://creativecommons.org/licenses/by-nc-sa/4.0/</ali:license_ref>
<license-p>Esta obra está bajo una Licencia Creative Commons Atribución-NoComercial-CompartirIgual 4.0 Internacional.</license-p>
</license>
</permissions>
<self-uri content-type="html" xlink:href="https://revistas.itm.edu.co/index.php/tecnologicas/issue/view/135">https://revistas.itm.edu.co/index.php/tecnologicas/issue/view/135</self-uri>
<abstract xml:lang="en">
<title>Abstract</title>
<p>The <italic>k</italic>-sample problem for functional data has been widely studied from theoretical and applied perspectives. In literature, Gaussianity of the generating process is generally assumed, which may be impractical in some situations. This work proposes an extension of the Kruskal-Wallis test to the case of functional data as an alternative to the problem of non-Gaussianity. The methodology used consisted of transforming each group's functional data into scalars using random projections and subsequently performing classical Kruskal-Wallis tests. The main results were the extension of the Kruskal-Wallis test to the case of functional data and the verification of its unbiased and consistency properties. Reducing dimensionality from random projections allows us to extend the classical Kruskal-Wallis test to the functional context and solve problems of non-Gaussianity and atypical observations.</p>
</abstract>
<trans-abstract xml:lang="es">
<title>Resumen</title>
<p>El problema de<italic> k</italic> muestras de datos funcionales se ha estudiado ampliamente desde perspectivas teóricas y aplicadas. En la literatura se asume generalmente el supuesto de Gaussianidad del proceso generador, el cual puede ser impráctico en algunas situaciones particulares. Este trabajo tuvo como objetivo proponer una extensión de la prueba de Kruskal-Wallis al caso de datos funcionales, como alternativa al problema de no Gaussianidad. La metodología empleada consistió en transformar los datos funcionales de cada grupo en escalares empleando proyecciones aleatorias y en realizar posteriormente pruebas de Kruskal-Wallis clásicas. Los principales resultados fueron la extensión de la prueba de Kruskal-Wallis al caso de datos funcionales y la comprobación de las propiedades de insesgadez y consistencia de esta misma. Se puede concluir que la reducción de la dimensionalidad a partir de las proyecciones aleatorias permite extender la prueba de Kruskal-Wallis clásica al contexto funcional y por ende solucionar problemas de no Gaussianidad y observaciones atípicas.</p>
</trans-abstract>
<kwd-group xml:lang="en">
<title>Keywords</title>
<kwd>Functional data</kwd>
<kwd>random projections</kwd>
<kwd>Kruskal-Wallis test</kwd>
<kwd>non-parametric statistics</kwd>
<kwd>brownian motion</kwd>
</kwd-group>
<kwd-group xml:lang="es">
<title>Palabras clave</title>
<kwd>Datos funcionales</kwd>
<kwd>proyecciones aleatorias</kwd>
<kwd>prueba de Kruskal-Wallis</kwd>
<kwd>estadística no paramétrica</kwd>
<kwd>movimiento Browniano</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="1"/>
<equation-count count="6"/>
<ref-count count="24"/>
</counts>
<custom-meta-group>
<custom-meta>
<meta-name>How to cite / Cómo citar</meta-name>
<meta-value>R. Meléndez Surmay, R. Giraldo Henao, and F. Rodríguez Cortes, “Kruskal-Wallis Test for Functional Data Based on Random Projections Generated from a Simulation of a Brownian Motion,” <italic>TecnoLógicas</italic>, vol. 27, no. 59, e2986, Apr. 2024. https://doi.org/10.22430/22565337.2986</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body>
<sec>
<title>
<bold>Highllights inglés</bold>:</title>
<p>The Kruskal-Wallis test resolves the issue of non-normality in functional data.</p>
<p>The Kruskal-Wallis test addresses the treatment of outliers in functional data.</p>
<p>The use of random projections reduces the dimensionality of the problem of k samples in functional data.</p>
</sec>
<sec>
<title>
<bold>Highlights español</bold>:</title>
<p>La prueba de Kruskal-Wallis resuelve el problema de no normalidad en datos funcionales.</p>
<p>La prueba de Kruskal-Wallis soluciona el tratamiento de datos atípicos en datos funcionales.</p>
<p>El uso de proyecciones aleatorias reduce la dimensionalidad del problema de k muestras en datos funcionales</p>
</sec>
<sec>
<title>
<bold>1.     INTRODUCTION</bold>
</title>
<p>Advances in computational and analytical techniques allow for continuous monitoring of many processes. New statistical methods are needed to analyze large data sets arising from these processes. Functional data analysis (FDA) has emerged in recent decades as an alternative to statistical modeling of large data volumes. FDA is a framework for analyzing data consisting of random functions (usually curves) rather than observations of a few variables or random vectors [<xref ref-type="bibr" rid="redalyc_344276634010_ref1">1</xref>]. New challenges have arisen in extracting meaningful information hidden in functional data [<xref ref-type="bibr" rid="redalyc_344276634010_ref2">2</xref>]. As in classical statistics, in FDA data preprocessing, modeling, hypothesis testing, parameter estimation, and predictive analysis using parametric or nonparametric models are fields of interest. Many theoretical and applied contributions have been proposed in these areas [<xref ref-type="bibr" rid="redalyc_344276634010_ref2">2</xref>], [<xref ref-type="bibr" rid="redalyc_344276634010_ref3">3</xref>]. In the last decade, the FDA has already found applications in several areas of research, including ecology [<xref ref-type="bibr" rid="redalyc_344276634010_ref26">4</xref>], epidemiology [<xref ref-type="bibr" rid="redalyc_344276634010_ref5">5</xref>], remote sensing [<xref ref-type="bibr" rid="redalyc_344276634010_ref26">4</xref>], outlier detection in environmental applications [<xref ref-type="bibr" rid="redalyc_344276634010_ref6">6</xref>], and traffic volume forecasting [<xref ref-type="bibr" rid="redalyc_344276634010_ref7">7</xref>].</p>
<p>To construct a functional observation X<italic>ij</italic>(.) from the discretely observed data one can employ a standard smoothing technique such as cubic B-splines [<xref ref-type="bibr" rid="redalyc_344276634010_ref8">8</xref>]. The FDA package [<xref ref-type="bibr" rid="redalyc_344276634010_ref9">9</xref>] implements the smoothing techniques in R [<xref ref-type="bibr" rid="redalyc_344276634010_ref10">10</xref>].</p>
<p>This work focuses mainly on proposing a methodology for comparing groups when the same functional variable has been observed in several individuals in each of these. Specifically, a traditional nonparametric tool to solve the .-sample problem for a functional response is adapted to the FDA scenario. Let <italic>X<sub>i</sub>
</italic>
<sub>1</sub> (<italic>t</italic>), <italic>X<sub>i</sub>
</italic>
<sub>2</sub>(<italic>t</italic>), <italic> X<sub>i</sub>
</italic>
<sub>n</sub>(<italic>t</italic>) …  <italic>i</italic> = 1,2, …,<italic> k</italic>   random set of functions defined over an interval <italic>T</italic> = [a,b] which come from Gaussian processes GP (<italic>μ<sub>k</sub>
</italic> (<italic>t</italic>), γ<sub>
<italic>k</italic>
</sub> (<italic>s,t</italic>)) [<xref ref-type="bibr" rid="redalyc_344276634010_ref8">8</xref>]. The hypothesis of interest is given in <xref ref-type="disp-formula" rid="e1">(1)</xref>
</p>
<p>
<disp-formula id="e1">
<label>(1)</label>
<graphic xlink:href="344276634010_ee3.png" position="anchor" orientation="portrait"/>
</disp-formula>
</p>
<p>Against the alternative that at least two functional means are different. The statistical literature has a widely considered hypothesis established in <xref ref-type="disp-formula" rid="e1">(1)</xref>The proposed approaches are proposed for point-wise t-tests, functional ANOVA, functional principal components analysis, and permutation tests.</p>
<p>Some authors have extensively studied the functional ANOVA problem. For example, [<xref ref-type="bibr" rid="redalyc_344276634010_ref9">9</xref>] introduced an asymptotic version of the ANOVA F-test, and [<xref ref-type="bibr" rid="redalyc_344276634010_ref2">2</xref>] considered asymptotic or bootstrapped versions of a <italic> L</italic>
<sup>2</sup> norm based test, F-type statistic-based test, and globalizing pointwise F-test. Furthermore, [<xref ref-type="bibr" rid="redalyc_344276634010_ref1">1</xref>] introduced a method based on a representation of a basis function, and [<xref ref-type="bibr" rid="redalyc_344276634010_ref10">10</xref>] described a bootstrap procedure based on pointwise F-tests. However, Bayesian functional ANOVA has received less attention. But, [<xref ref-type="bibr" rid="redalyc_344276634010_ref11">11</xref>] introduced a Gaussian process ANOVA modeling approach under a Bayesian framework.</p>
<p>Other approaches were considered by [<xref ref-type="bibr" rid="redalyc_344276634010_ref12">12</xref>], [<xref ref-type="bibr" rid="redalyc_344276634010_ref9">9</xref>], and [<xref ref-type="bibr" rid="redalyc_344276634010_ref13">13</xref>]. Furthermore, [<xref ref-type="bibr" rid="redalyc_344276634010_ref14">14</xref>] proposed a new method using a graphical interface based on the global rank test, and this procedure for functional ANOVA was applied using permutations. Other authors have proposed other approaches, such as that used the Westfall-Young randomization to correct for multiple tests. However, this method cannot obtain an overall p-value. Meanwhile, [<xref ref-type="bibr" rid="redalyc_344276634010_ref15">15</xref>] divided the domain of interest into regions. However, a disadvantage is that the partition must be respected. Furthermore, [<xref ref-type="bibr" rid="redalyc_344276634010_ref16">16</xref>] developed a multi-way functional ANOVA to determine rejection regions. Our interest is to provide an alternative to the case where the Gaussian assumption is unrealistic, and [<xref ref-type="bibr" rid="redalyc_344276634010_ref17">17</xref>] presented a unified methodology for performing computation-free permutation tests for the testing of the <italic>k</italic> sample in commutative and noncommutative <italic>Lq.</italic>  spaces, which includes multivariate and functional data.</p>
<p>This work is organized as follows. Sections 2.1 and 2.2 review the Kruskal-Wallis test and random projections. Section 3 presents an extension of the Kruskal-Wallis test for functional data and shows its respective pseudocode. In Section 4.1, we present the simulation study and in Section 4.2, we present the application with real data. Finally, we present the discussion and some conclusions.</p>
</sec>
<sec>
<title>
<bold>2.     BACKGROUND</bold>
</title>
<sec>
<title>
<bold>2.1   Kruskal-Wallis test</bold>
</title>
<p>This section briefly reviews the main statistical technique used in the analysis. Kruskal-Wallis [<xref ref-type="bibr" rid="redalyc_344276634010_ref18">18</xref>] is a non-parametric statistical test that compares the median values of two or more independent samples. The null hypothesis for the Kruskal-Wallis test is that all the samples come from the same population, and the alternative hypothesis is that at least one group's sample comes from a population with a different median than the others. The test is based on the ranks of the observations within each group. It is an alternative to ANOVA when the normality assumption is unrealistic. The hypothesis of interest is shown in <xref ref-type="disp-formula" rid="e2">(2)</xref>
</p>
<p>
<disp-formula id="e2">
<label>(2)</label>
<graphic xlink:href="344276634010_ee4.png" position="anchor" orientation="portrait"/>
</disp-formula>
</p>
<p>Which establishes that there are no significant differences in the effects of the treatments. The null hypothesis states that the following distributions 𝐹1= 𝐹2 = ⋯ = 𝐹𝑘 are equal. To calculate the Kruskal-Wallis statistic, all . observations from the k-samples are combined and ordered from smallest to largest. Let r<italic>
<sub>ij</sub>
</italic> be the rank of r<italic>
<sub>ij</sub>
</italic> in this joint classification, and <italic>Rj</italic> defined as <xref ref-type="disp-formula" rid="e4">(3)</xref>
</p>
<p>
<disp-formula id="e4">
<label>(3)</label>
<graphic xlink:href="344276634010_ee5.png" position="anchor" orientation="portrait"/>
</disp-formula>
</p>
<p>Thus, for example,<italic> R</italic>
<sub>1</sub> is the sum of the ranks received by the observations of group 1 and <italic>R</italic>
<sub>1</sub> is the average rank for these same observations. Kruskal-Wallis H statistics are given by [<xref ref-type="bibr" rid="redalyc_344276634010_ref18">18</xref>] as shown in <xref ref-type="disp-formula" rid="e5">(4)</xref>
</p>
<p>
<disp-formula id="e5">
<label>(4)</label>
<graphic xlink:href="344276634010_ee6.png" position="anchor" orientation="portrait"/>
</disp-formula>
</p>
<p>At a significance level of α, H0 is rejected if H ≥ hα   otherwise, do not reject. The values of hα are given in Table A.12 of [<xref ref-type="bibr" rid="redalyc_344276634010_ref18">18</xref>]. When H0 is true, the statistic H has, as min(n., ⋯. nk) tends to infinity, an asymptotic chi-square X<sup>2</sup> distribution with <italic>k</italic> - 1 degrees of freedom. Under this assumption, the reject rule is.</p>
<p>Reject 𝐻≥𝜒<sup>2</sup>
<sub>𝑘−1</sub>,<sub>𝛼</sub> ; otherwise, do not reject.</p>
<p>When the null hypothesis is rejected and it is concluded that at least one sample comes from a population with a different median, some post-hoc tests (e.g., Dunn's test) can be used to identify which samples differ significantly.</p>
</sec>
<sec>
<title>
<bold>2.2   Random Projections</bold>
</title>
<p>The hypothesis of interest (see hypothesis in <xref ref-type="disp-formula" rid="e1">(1)</xref>) can be tested using the projections of the functions. These involve mapping high-dimensional data points into a lower-dimensional space using a randomly generated projection matrix [<xref ref-type="bibr" rid="redalyc_344276634010_ref20">19</xref>]. The basic idea is to use a randomly generated projection matrix to map each high-dimensional data point onto a lower-dimensional space. By doing this, we can reduce the number of dimensions of the data while still retaining important information about the data structure.</p>
<p>Random projections are often used in situations where the dimensionality of the data makes it difficult to work with or analyze. In other words, random projections can be a handy tool for reducing the complexity of the data without losing important information. Given a set of data or a distribution in spaces of dimension greater than one, random projections consist of projecting the data or calculating the marginal of the distribution in a lower-dimensional subspace that has been chosen randomly [<xref ref-type="bibr" rid="redalyc_344276634010_ref21">20</xref>]. Random projections preserve certain properties that are very important in the FDA. One of them is that it preserves distances with a high degree of probability if a projected subspace is the uniform distribution. This result is extended to the standard Gaussian distribution [<xref ref-type="bibr" rid="redalyc_344276634010_ref10">10</xref>]. In this sense, [<xref ref-type="bibr" rid="redalyc_344276634010_ref22">21</xref>] showed that if two distributions are defined in a separable Hilbert space and have finite moments of some order, then projecting the distributions onto a random one-dimensional subspace is sufficient to distinguish them with high probability, as long as the moments of one of the distributions match those of the random projection. In other words, if we have two distributions with similar moments up to some order, projecting them onto a random one-dimensional subspace will produce similar one-dimensional marginal distributions. However, if the moments of one of the distributions differ from those of the random projection, then the one-dimensional marginal distributions will be different, and the two distributions can be distinguished with high probability.</p>
<p>Once the functional data have been projected onto a lower-dimensional space, a hypothesis test can be performed to determine whether the functional means are equal. The choice of hypothesis test depends on the specific application, but a common approach is to use a t-test or an ANOVA test. One advantage of using random projections to test the equality of functional means is that it can be computationally efficient, mainly when dealing with high-dimensional functional data. It can also be robust to noise and outliers in the data, as random projections can help filter out some of the noise.</p>
</sec>
</sec>
<sec>
<title>
<bold>3.     KRUSKAL-WALLIS TEST FOR FUNCTIONAL DATA </bold>
</title>
<p>This research presents an extension of the Kruskal-Wallis test for functional data based on random projections.</p>
<p>We propose extending the Kruskal-Wallis test to the case of functional data (the observation for each individual in the sample corresponds to a functional datum). As in the univariate case, in the context of functional data analysis, statistical tests require the fulfillment of some assumptions. When the samples are small and the curves do not underlie a Gaussian stochastic process, the functional ANOVA could be inappropriate, and a non-parametric method may be used as a valid alternative. Specifically, a Kruskal-Wallis test for functional data based on random projections (KWFD) is proposed as an alternative methodology to the one-way functional ANOVA when the Gaussianity assumption is unrealistic. The KWFD is a non-parametric alternative for comparing the medians of functional data of three or more groups. We extended the KW test by randomly projecting the functional data onto a low-dimensional subspace.</p>
<p>Let <italic>X<sub>ij</sub>
</italic> (<italic>t</italic>), <italic>i</italic> = 1,2, ⋯, <italic>n<sub>j</sub>
</italic>, j = 1, ⋯,<italic> k</italic> a functional random sample of curves, where . <italic>t</italic> ∈ [<italic>a, b</italic>] is the domain (generally time), <italic>i</italic> correspond to an individual, and j the index for the level factor. The functional random variables are considered independent trajectories of the stochastic processes SP(<italic>μj</italic> (t),<italic>γ</italic>(<italic>s,t</italic>)),<italic>j</italic> = 1,⋯ ,<italic>k</italic>  with a common covariance function  <italic>γ</italic>(<italic>s, t</italic>).  Let <italic>x<sub>ij </sub>
</italic>(<italic>t</italic>),<italic> i</italic> = 1,2, ⋯, <italic>n</italic>;   <italic>j</italic> = 1,⋯, <italic>k</italic>   .  be the recorded set of curves under the <italic>k</italic>. treatments. In the following, we describe the procedure for calculating the <italic>H</italic> statistic to test the null hypothesis in <xref ref-type="disp-formula" rid="e1">(1)</xref>
</p>
<p>· Generate one Brownian motion 𝜐(𝑡) in the interval of interest 𝑇∈ℝ.</p>
<p>· Calculate the random projections 𝑥𝑖𝑗=∫𝑥𝑖𝑗(𝑡)𝜐(𝑡)𝑑𝑡,𝑏𝑎 𝑖=1,⋯,𝑛; 𝑗=1,2,⋯,𝑘.</p>
<p>· Calculate the rank of each projected curve within its group.</p>
<p>· Using the random projections, proceed as in the usual way to calculate 𝑟𝑖𝑗,𝑅𝑖𝑗, and the statistic 𝐻 in <xref ref-type="disp-formula" rid="e4">(3)</xref>
</p>
<p>· Reject the null hypothesis in <xref ref-type="disp-formula" rid="e2">(2)</xref> at the level 𝛼 if 𝐻𝑐≥𝜒<sup>2</sup>
<sub>𝑘−1;1−𝛼</sub>. An alternative is calculating the p-value using a permutation test.</p>
<p>The Kruskal-Wallis test for functional data based on random projections is calculated similarly to the univariate Kruskal-Wallis test. It is based on the sum of the ranks of the projected curves within each group. The test assumes no specific distribution for the functional data and can be robust to atypical curves.</p>
</sec>
<sec>
<title>
<bold>4.     RESULTS AND DISCUSSION</bold>
</title>
<p>Section 4.1 presents a simulation study based on a single Brownian motion simulation. Section 4.2 shows the p-values obtained by generating 1000 random projections.</p>
<sec>
<title>
<bold>4.1   Simulation study indicators</bold>
</title>
<p>We assess the power of the test to detect differences between medians of .-samples of functional data. To establish the performance, we show the results of a simulation study. We follow the procedure given in [<xref ref-type="bibr" rid="redalyc_344276634010_ref15">15</xref>] to perform the analysis. For simplicity, just three groups of curves are considered.</p>
<p>
<disp-formula id="e6">
<label>(5)</label>
<graphic xlink:href="344276634010_ee12.png" position="anchor" orientation="portrait"/>
</disp-formula>
</p>
<p>Where 𝜇(𝑡)=sin(2𝜋𝑡),𝑡 ∈ (0,10), is the mean function and the errors 𝜀𝑖𝑗(𝑡)=1,2,3, follow a uniform distribution on [−1,1]. As an initial illustration, a graph of a Brownian motion and 120 simulated curves according to the equations given in <xref ref-type="disp-formula" rid="e6">(5)</xref> are shown in <xref ref-type="fig" rid="gf1">Figure 1.</xref>nbsp;The curves in red and green are very similar (these come from analogous models (rows 1 and 2 of the equations in 5, and the curves in blue involve an additional parameter 𝛿(𝑡)=𝛿=1.2 that makes these different from the previous ones. Notice in Figure 1 that the highest periodic peaks of the blue curves are close to 3, while in the other two cases (red and green curves), these are close to 2, i.e., the null hypothesis should be rejected. The errors are assumed to be uniform in the interval (1,1). Performing a hypothesis test on the means of functional data assuming that the processes are Gaussian with data such as those presented in <xref ref-type="fig" rid="gf1">Figure 1</xref> would be inappropriate.</p>
<p>
<fig id="gf1">
<label>Figure 1.</label>
<caption>
<title>Brownian motion v(t)  =  v(t - 1) + ϵ(t),ϵ(t) ∼  Normal (0,0.5),t ∈ (0,10) (above left) and curves simulated under the models     Xi1 (t) = μ(t) + εi (t)  (above right), Xi2 (t) = μ(t) + εi (t)  (below left), and Xi3 (t) = μ(t) + δ(t) + εi (t)  (below right), with μ(t) = sin(2πt),δ(t) = 1.2 and ε(t) ∼ uniform(-1,1).</title>
</caption>
<alt-text>Figure 1.  Brownian motion v(t)  =  v(t - 1) + ϵ(t),ϵ(t) ∼  Normal (0,0.5),t ∈ (0,10) (above left) and curves simulated under the models     Xi1 (t) = μ(t) + εi (t)  (above right), Xi2 (t) = μ(t) + εi (t)  (below left), and Xi3 (t) = μ(t) + δ(t) + εi (t)  (below right), with μ(t) = sin(2πt),δ(t) = 1.2 and ε(t) ∼ uniform(-1,1).</alt-text>
<graphic xlink:href="344276634010_gf2.png" position="anchor" orientation="portrait"/>
<attrib>
<bold>Source: </bold>Created by the authors.</attrib>
</fig>
</p>
<p>To evaluate the power of the test, we considered 𝛿(𝑡) = 𝛿, for all 𝑡 ∈ [0,10], with 𝛿 = 0.0,⋯,0.7. Four sample size scenarios are considered (𝑛 = 10,30,80,120) for each sample group. In each case, 1000 realizations are generated. Based on each sample size, we performed a Kruskal-Wallis test as defined in Section 3. In each case, the power of the test is obtained as the percentage of 𝑝−𝑣𝑎𝑙𝑢𝑒𝑠 less than 0.05. We used the libraries fda.usc and stats of R to perform the analysis [<xref ref-type="bibr" rid="redalyc_344276634010_ref23">22</xref>]. <xref ref-type="fig" rid="gf2">Figure 2</xref> shows the empirical power curves for each of the sampling sizes 𝑛 and 𝛿(𝑡) = 𝛿 values. Note that the power of the test increases when 𝛿 and 𝑛 increase; that is, the simulation study provides evidence that the Kruskal-Wallis test for functional data is unbiased and consistent<sup>1</sup>.</p>
<p>
<sup>1</sup> The R code used is available at https://github.com/frajaroco/KWfdRP/blob/main/KWtest.R</p>
<p>
<sup>2</sup> See Canada's Climate Regions at the link https://sites.google.com/a/ocsb.ca/cgc-1d/a-unit-4-climate/1-canadas-climate-regions).</p>
<p>
<fig id="gf2">
<label>Figure 2.</label>
<caption>
<title>Empirical power curves of the Kruskal-Wallis test according to the variation function δ(t) = δ and the sample size n. n = 10 (blue line), n = 30 (green line) n = 80 (red line), and n = 100 (black line) for each sample group. The bottom dashed line corresponds to the significance level 𝛼 = 5 %. Created by the authors.</title>
</caption>
<alt-text>Figure 2.  Empirical power curves of the Kruskal-Wallis test according to the variation function δ(t) = δ and the sample size n. n = 10 (blue line), n = 30 (green line) n = 80 (red line), and n = 100 (black line) for each sample group. The bottom dashed line corresponds to the significance level 𝛼 = 5 %. Created by the authors.</alt-text>
<graphic xlink:href="344276634010_gf3.png" position="anchor" orientation="portrait"/>
<attrib>
<bold>Source</bold>:Created by the authors.</attrib>
</fig>
</p>
</sec>
<sec>
<title>
<bold>4.2   Real data analysis: Temperature curves in Canada</bold>
</title>
<p>We apply the Kruskal-Wallis test for functional data from Section 3 to a widely used meteorological data set in the context of the FDA [<xref ref-type="bibr" rid="redalyc_344276634010_ref24">23</xref>]. This corresponds to the average daily (30-year) temperature (in degrees Celsius) at each of the 35 weather stations located in four climatic zones of Canada (in brackets the number of stations in each zone): Arctic (4) Pacific (7), Continental (9), and Atlantic (15) (see <xref ref-type="fig" rid="gf3">Figure 3</xref>). The Pacific zone is located on the west coast of Canada, including British Columbia and parts of Yukon and the Northwest territories. This area is defined by mild, rainy winters and cool, dry summers. The continental region covers the central parts of Canada, including Manitoba, Saskatchewan, and parts of Alberta and Ontario. Its climate is marked by cold winters and short and hot summers. The Atlantic zone covers the eastern parts of Canada, including Nova Scotia, New Brunswick, and Prince Edward Island. It has mild, wet winters and cool, moist summers. The Arctic region covers the northernmost parts of Canada, including Nunavut, the northwest territories, and parts of Yukon, Quebec, and Labrador. This zone has long, harsh winters and short, cool summers (see Canada's Climate Regions at the link <ext-link ext-link-type="uri" xlink:href="https://sites.google.com/a/ocsb.ca/cgc-1d/a-unit-4-climate/1-canadas-climate-regions">https://sites.google.com/a/ocsb.ca/cgc-1d/a-unit-4-climate/1-canadas-climate-regions</ext-link>). The daily temperature data for the four climatic zones were smoothed using a Fourier basis function. The curves obtained after smoothing are shown in <xref ref-type="fig" rid="gf3">Figure 3.</xref>The interest is to determine whether there are significant differences between the mean (median) curves of these areas. For this purpose, we apply the Kruskal-Wallis test presented in Section 3. We generate random projections using (<xref ref-type="disp-formula" rid="e7">6</xref>) with 𝑖 the index corresponding to the weather station in each one of the four climatic zones (𝑗=1 (Arctic), 2 (Pacific), 3 (Continental), 4 (Atlantic) ) and 𝜈(𝑡) a Brownian motion. The number of stations in each zone is 4 (Arctic), 7 (Pacific), 9 (Continental), and 15 (Atlantic).</p>
<p>
<disp-formula id="e7">
<label>(6)</label>
<graphic xlink:href="344276634010_ee9.png" position="anchor" orientation="portrait"/>
</disp-formula>
</p>
<p>After obtaining the random projections, we conduct a classical Kruskal-Wallis test with these values. For this case, a  <italic>p</italic> - value = 0.00361 was obtained, and consequently, in concordance with Canada's Climatic description above, the null hypothesis is rejected. Note that there are some atypical curves in each panel of <xref ref-type="fig" rid="gf3">Figure 3.</xref>Using a classical ANOVA test based on random projections can be limited in this case. A robust methodology, as proposed here, could be more appropriate. Wilcoxon’s post-hoc tests [<xref ref-type="bibr" rid="redalyc_344276634010_ref25">24</xref>] (<xref ref-type="table" rid="gt7">Table 1</xref>) at a 10 % significance level of 10 % show that the medians of the Atlantic and Pacific zones are significantly different from the median of the Arctic region. At the same level, there are differences between the medians of the Atlantic and Continental regions. A graphical comparison (<xref ref-type="fig" rid="gf3">Figure 3</xref>) indicates marked differences between the curves of these regions.</p>
<p>
<fig id="gf3">
<label>Figure 3.</label>
<caption>
<title>Temperature curves (<italic>x<sub>ij</sub>
</italic> (<italic>t</italic>)) for the Atlantic, Continental, Pacific, and Arctic climate zones obtained after daily data (averages of 30 years) are smoothed using Fourier basis functions. Created by the authors.</title>
</caption>
<alt-text>Figure 3. Temperature curves (xij (t)) for the Atlantic, Continental, Pacific, and Arctic climate zones obtained after daily data (averages of 30 years) are smoothed using Fourier basis functions. Created by the authors.</alt-text>
<graphic xlink:href="344276634010_gf4.png" position="anchor" orientation="portrait"/>
<attrib>
<bold>Source:</bold> Created by the authors.</attrib>
</fig>
</p>
<p>
<table-wrap id="gt7">
<label>Table 1</label>
<caption>
<title>Wilcoxon posthoc tests. </title>
</caption>
<alt-text>Table 1 Wilcoxon posthoc tests. </alt-text>
<alternatives>
<graphic xlink:href="344276634010_gt8.png" position="anchor" orientation="portrait"/>
<table style="border-collapse:collapse;border:none;  " id="gt8-526564616c7963">
<tbody>
<tr style="height:14.15pt">
<td style="width:78.45pt;border:none;border-top:solid windowtext 1.0pt;   padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt"/>
<td style="width:91.65pt;border-top:solid windowtext 1.0pt;   border-left:none;border-bottom:solid windowtext 1.0pt;border-right:none;      padding:0cm 5.4pt 0cm 5.4pt;height:14.15pt">Atlantic</td>
<td style="width:106.3pt;border-top:solid windowtext 1.0pt;   border-left:none;border-bottom:solid windowtext 1.0pt;border-right:none;      padding:0cm 5.4pt 0cm 5.4pt;height:14.15pt">Continental</td>
<td style="width:92.15pt;border-top:solid windowtext 1.0pt;   border-left:none;border-bottom:solid windowtext 1.0pt;border-right:none;      padding:0cm 5.4pt 0cm 5.4pt;height:14.15pt">Pacific</td>
</tr>
<tr style="height:14.15pt">
<td style="width:78.45pt;border:none;padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">Continental</td>
<td style="width:91.65pt;border:none;   padding:0cm 5.4pt 0cm 5.4pt;height:14.15pt">0.09</td>
<td style="width:106.3pt;border:none;   padding:0cm 5.4pt 0cm 5.4pt;height:14.15pt">--</td>
<td style="width:92.15pt;border:none;   padding:0cm 5.4pt 0cm 5.4pt;height:14.15pt">--</td>
</tr>
<tr style="height:14.15pt">
<td style="width:78.45pt;border:none;padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">Pacific</td>
<td style="width:91.65pt;border:none;padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">0.95</td>
<td style="width:106.3pt;border:none;padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">0.25</td>
<td style="width:92.15pt;border:none;padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">--</td>
</tr>
<tr style="height:14.15pt">
<td style="width:78.45pt;border:none;border-bottom:solid windowtext 1.0pt;   padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">Artic</td>
<td style="width:91.65pt;border:none;border-bottom:solid windowtext 1.0pt;   padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">0.01</td>
<td style="width:106.3pt;border:none;border-bottom:solid windowtext 1.0pt;   padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">0.20</td>
<td style="width:92.15pt;border:none;border-bottom:solid windowtext 1.0pt;   padding:0cm 5.4pt 0cm 5.4pt;   height:14.15pt">0.07</td>
</tr>
</tbody>
</table>
</alternatives>
<attrib>
<bold>Source:</bold> Created by the authors.</attrib>
</table-wrap>
</p>
<p>The results described above are based on random projections from a particular BM. The attached R code<sup>3</sup>, shows the values found with 1000 Brownian motions, and the general conclusion is the same.</p>
<p>
<sup>3</sup> https://github.com/frajaroco/KWfdRP/blob/main/KWCanadianWeather.R</p>
</sec>
<sec>
<title>
<bold>4.3   Discussion</bold>
</title>
<p>ANOVA for functional data has been widely discussed, and several approaches have been considered [<xref ref-type="bibr" rid="redalyc_344276634010_ref1">1</xref>], [<xref ref-type="bibr" rid="redalyc_344276634010_ref2">2</xref>]. Many of these are based on the Gaussianity assumption [<xref ref-type="bibr" rid="redalyc_344276634010_ref8">8</xref>, <xref ref-type="bibr" rid="redalyc_344276634010_ref8">10</xref>]. Here, we adapt a classical non-parametric test to this scenario. The strength of the Kruskal-Wallis test for functional data proposed here lies in its versatility. It does not depend on the assumption of Gaussianity, thus extending its applicability to various real-world scenarios where data may deviate from a Gaussian distribution. This test is flexible and can be used with various types of functional data, including curves and time series. It does not impose strict assumptions on the data distribution, making it suitable for analyzing diverse datasets. This approach is particularly advantageous when dealing with data that may not conform to normality or have unknown distributions. Like other statistical tests, the Kruskal-Wallis test assumes the independence of observations within and between groups. Violations of this assumption could potentially affect the accuracy of the test results. If the Kruskal-Wallis test indicates significant differences between groups, post-hoc tests can be conducted to identify differences between groups. Many other non-parametric methods are available for post-hoc testing, each with strengths and limitations.</p>
</sec>
</sec>
<sec>
<title>
<bold>5.     CONCLUSIONS</bold>
</title>
<p>We propose a non-parametric method for the <italic>k</italic>-functional problem, which is useful when the sample size is small, the assumption of normality is not reasonable, or when there are atypical curves. We propose the use of one-dimensional random projections to solve the problem. After obtaining scalars from functions using random projections, a classical Kruskal-Wallis test can be used to test the hypothesis. The results obtained from the simulated and real data show a good performance of the methodology. The results (<xref ref-type="fig" rid="gf2">Figure 2</xref>) illustrate that the Kruskal Wallis test extension performs well under the null hypothesis. Power increases for larger sample sizes and distance parameter. This plot allows us to validate that the proposed test is unbiased and consistent. Some authors consider using points-wise test statistics for functional data problems with two samples and similarly for the .-sample problem, although they are not global tests. Our approach is a helpful alternative when the sample is small, and the Gaussian assumption is inappropriate.</p>
</sec>
</body>
<back>
<ref-list>
<title>
<bold>        REFERENCES</bold>
</title>
<ref id="redalyc_344276634010_ref1">
<label>[1]</label>
<mixed-citation>[1] T. Górecki and Ł. Smaga, “A comparison of tests for the one-way ANOVA problem for functional data,” <italic>Comput. Stat.</italic>, vol. 30, no. 4, pp. 987–1010, Dec. 2015. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00180-015-0555-0">https://doi.org/10.1007/s00180-015-0555-0</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Górecki</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Smaga</surname>
<given-names>Ł.</given-names>
</name>
</person-group>
<article-title>A comparison of tests for the one-way ANOVA problem for functional data</article-title>
<source>Comput. Stat.</source>
<year>2015</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00180-015-0555-0">https://doi.org/10.1007/s00180-015-0555-0</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref2">
<label>[2]</label>
<mixed-citation>[2] J. T. Zhang, <italic>Analysis of variance for functional data</italic>, 1st ed. New York, NY, USA: Chapman and Hall/CRC, 2013. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1201/b15005">https://doi.org/10.1201/b15005</ext-link>
</mixed-citation>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>J. T.</given-names>
</name>
</person-group>
<source>Analysis of variance for functional data</source>
<year>2013</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1201/b15005">https://doi.org/10.1201/b15005</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref3">
<label>[3]</label>
<mixed-citation>[3] F. Ferraty, P. Vieu, and S. Viguier-Pla, “Factor-based comparison of groups of curves,” <italic>Comput. Stat. Data Anal.</italic>, vol. 51, no. 10, pp. 4903–4910, Jun. 2007. <ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.csda.2006.10.001">https://doi.org/https://doi.org/10.1016/j.csda.2006.10.001</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ferraty</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Vieu</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Viguier-Pla</surname>
<given-names>S.</given-names>
</name>
</person-group>
<article-title>Factor-based comparison of groups of curves</article-title>
<source>Comput. Stat. Data Anal.</source>
<year>2007</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.csda.2006.10.001">https://doi.org/https://doi.org/10.1016/j.csda.2006.10.001</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref26">
<label>[4]</label>
<mixed-citation>[4] M. L. Bourbonnais et al., “Characterizing spatial-temporal patterns of landscape disturbance and recovery in western Alberta, Canada using a functional data analysis approach and remotely sensed data,” <italic>Ecol. Inform.</italic>, vol. 39, pp. 140–150, May. 2017. https://doi.org/https://doi.org/10.1016/j.ecoinf.2017.04.010</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bourbonnais</surname>
<given-names>M. L.</given-names>
</name>
</person-group>
<article-title>Characterizing spatial-temporal patterns of landscape disturbance and recovery in western Alberta, Canada using a functional data analysis approach and remotely sensed data</article-title>
<source>Ecol. Inform.</source>
<year>2017</year>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref5">
<label>[5]</label>
<mixed-citation>[5] A. Roy, T. Nelson, and P. Turaga, “Functional data analysis approach for mapping change in time series: A case study using bicycle ridership patterns,” <italic>Transp. Res. Interdiscip. Perspect.</italic>, vol. 17, p. 100752, Jan. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.trip.2022.100752">https://doi.org/https://doi.org/10.1016/j.trip.2022.100752</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Roy</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Nelson</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Turaga</surname>
<given-names>P.</given-names>
</name>
</person-group>
<article-title>Functional data analysis approach for mapping change in time series: A case study using bicycle ridership patterns</article-title>
<source>Transp. Res. Interdiscip. Perspect.</source>
<year>2022</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.trip.2022.100752">https://doi.org/https://doi.org/10.1016/j.trip.2022.100752</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref6">
<label>[6]</label>
<mixed-citation>[6] J. M. Torres, P. J. G. Nieto, L. Alejano, and A. N. Reyes, “Detection of outliers in gas emissions from urban areas using functional data analysis,” <italic>J. Hazard. Mater.</italic>, vol. 186, no. 1, pp. 144–149, Feb. 2011. <ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.jhazmat.2010.10.091">https://doi.org/https://doi.org/10.1016/j.jhazmat.2010.10.091</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Torres</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Nieto</surname>
<given-names>P. J. G.</given-names>
</name>
<name>
<surname>Alejano</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Reyes</surname>
<given-names>A. N.</given-names>
</name>
</person-group>
<article-title>Detection of outliers in gas emissions from urban areas using functional data analysis</article-title>
<source>J. Hazard. Mater.</source>
<year>2010</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.jhazmat.2010.10.091">https://doi.org/https://doi.org/10.1016/j.jhazmat.2010.10.091</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref7">
<label>[7]</label>
<mixed-citation>[7] M. Tang, Z. Li, and G. Tian, “A Data-Driven-Based Wavelet Support Vector Approach for Passenger Flow Forecasting of the Metropolitan Hub,” <italic>IEEE Access</italic>, vol. 7, pp. 7176-7183, Jan. 2019. <ext-link ext-link-type="uri" xlink:href="https://ieeexplore.ieee.org/abstract/document/8600312">https://ieeexplore.ieee.org/abstract/document/8600312</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
</person-group>
<article-title>A Data-Driven-Based Wavelet Support Vector Approach for Passenger Flow Forecasting of the Metropolitan Hub</article-title>
<source>IEEE Access</source>
<year>2019</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://ieeexplore.ieee.org/abstract/document/8600312">https://ieeexplore.ieee.org/abstract/document/8600312</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref8">
<label>[8]</label>
<mixed-citation>[8] Z. Jin-Ting, and X. Liang, “One-way ANOVA for functional data via globalizing the pointwise F-test,” <italic>Scand. Stat. Theory Appl., </italic>vol. 41, no. 1, pp. 51–71, Mar. 2014. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/sjos.12025">https://doi.org/10.1111/sjos.12025</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jin-Ting</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>X.</given-names>
</name>
</person-group>
<article-title>One-way ANOVA for functional data via globalizing the pointwise F-test</article-title>
<source>Scand. Stat. Theory Appl.</source>
<year>2014</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/sjos.12025">https://doi.org/10.1111/sjos.12025</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref9">
<label>[9]</label>
<mixed-citation>[9] A. Cuevas, M. Febrero, and R. Fraiman, “An anova test for functional data,” <italic>Comput. Stat. Data Anal.</italic>, vol. 47, no. 1, pp. 111–122, Aug. 2004. <ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.csda.2003.10.021">https://doi.org/https://doi.org/10.1016/j.csda.2003.10.021</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cuevas</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Febrero</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Fraiman</surname>
<given-names>R.</given-names>
</name>
</person-group>
<article-title>An anova test for functional data</article-title>
<source>Comput. Stat. Data Anal.</source>
<year>2003</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/https://doi.org/10.1016/j.csda.2003.10.021">https://doi.org/https://doi.org/10.1016/j.csda.2003.10.021</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref10">
<label>[10]</label>
<mixed-citation>[10] J. O. Ramsay, and B. W. Silverman, <italic>Functional Data Analysis</italic>, 2nd ed. New York, NY, USA: Springer-Verlag New York, 2005. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/b98888">https://doi.org/10.1007/b98888</ext-link>
</mixed-citation>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Ramsay</surname>
<given-names>J. O.</given-names>
</name>
<name>
<surname>Silverman</surname>
<given-names>B. W.</given-names>
</name>
</person-group>
<source>Functional Data Analysis</source>
<year>2005</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/b98888">https://doi.org/10.1007/b98888</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref11">
<label>[11]</label>
<mixed-citation>[11] C. G. Kaufman, and S. R. Sain, “Bayesian Functional ANOVA Modeling Using Gaussian Process Prior Distributions,” <italic>Bayesian Anal.</italic>, vol. 5 no. 1, pp. 123–149, Mar. 2010. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1214/10-BA505">https://doi.org/10.1214/10-BA505</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaufman</surname>
<given-names>C. G.</given-names>
</name>
<name>
<surname>Sain</surname>
<given-names>S. R.</given-names>
</name>
</person-group>
<article-title>Bayesian Functional ANOVA Modeling Using Gaussian Process Prior Distributions</article-title>
<source>Bayesian Anal.</source>
<year>2010</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1214/10-BA505">https://doi.org/10.1214/10-BA505</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref12">
<label>[12]</label>
<mixed-citation>[12] Q. Shen, and J. J. Faraway, “An F test for linear models with functional responses,” <italic>Statistica Sinica, </italic>vol. 14, pp. 1239–1257, 2004. <ext-link ext-link-type="uri" xlink:href="https://api.semanticscholar.org/CorpusID:55106079">https://api.semanticscholar.org/CorpusID:55106079</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shen</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Faraway</surname>
<given-names>J. J.</given-names>
</name>
</person-group>
<article-title>An F test for linear models with functional responses</article-title>
<source>Statistica Sinica</source>
<year>2004</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://api.semanticscholar.org/CorpusID:55106079">https://api.semanticscholar.org/CorpusID:55106079</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref13">
<label>[13]</label>
<mixed-citation>[13] P. Delicado, “Functional k-sample problem when data are density functions,” <italic>Comput. Stat.</italic>, vol. 22, no. 3, pp. 391–410, Sep. 2007. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00180-007-0047-y">https://doi.org/10.1007/s00180-007-0047-y</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Delicado</surname>
<given-names>P.</given-names>
</name>
</person-group>
<article-title>Functional k-sample problem when data are density functions</article-title>
<source>Comput. Stat.</source>
<year>2007</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00180-007-0047-y">https://doi.org/10.1007/s00180-007-0047-y</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref14">
<label>[14]</label>
<mixed-citation>[14] M. Myllymäki, T. Mrkvička, P. Grabarnik, H. Seijo, and U. Hahn, “Global envelope tests for spatial processes,” <italic>J. R. Stat. Soc. Series B Stat. Methodol., </italic>vol. 79, no. 2, pp. 381–404, Mar. 2017. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/rssb.12172">https://doi.org/10.1111/rssb.12172</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Myllymäki</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mrkvička</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Grabarnik</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Seijo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Hahn</surname>
<given-names>U.</given-names>
</name>
</person-group>
<article-title>Global envelope tests for spatial processes</article-title>
<source>J. R. Stat. Soc. Series B Stat. Methodol.</source>
<year>2017</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/rssb.12172">https://doi.org/10.1111/rssb.12172</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref15">
<label>[15]</label>
<mixed-citation>[15] O. A. Vsevolozhskaya, M. C. Greenwood, and D. B. Holodov, “Pairwise comparison of treatment levels in functional analysis of variance with application to erythrocyte hemolysis,” <italic>Ann. Appl. Stat.</italic>, vol. 8, pp. 905–925, Jun. 2014. <ext-link ext-link-type="uri" xlink:href="https://api.semanticscholar.org/CorpusID:38476665">https://api.semanticscholar.org/CorpusID:38476665</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vsevolozhskaya</surname>
<given-names>O. A.</given-names>
</name>
<name>
<surname>Greenwood</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Holodov</surname>
<given-names>D. B.</given-names>
</name>
</person-group>
<article-title>Pairwise comparison of treatment levels in functional analysis of variance with application to erythrocyte hemolysis</article-title>
<source>Ann. Appl. Stat.</source>
<year>2014</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://api.semanticscholar.org/CorpusID:38476665">https://api.semanticscholar.org/CorpusID:38476665</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref16">
<label>[16]</label>
<mixed-citation>[16] A. Pini, S. Vantini, B. M. Colosimo, and M. Grasso, “Domain-selective functional analysis of variance for supervised statistical profile monitoring of signal data,” <italic>J. R. Stat. Soc. Ser. C Appl. Stat.</italic>, vol. 67, no. 1, pp. 55–81, Jan. 2018. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/rssc.12218">https://doi.org/10.1111/rssc.12218</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pini</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Vantini</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Colosimo</surname>
<given-names>B. M.</given-names>
</name>
<name>
<surname>Grasso</surname>
<given-names>M.</given-names>
</name>
</person-group>
<article-title>Domain-selective functional analysis of variance for supervised statistical profile monitoring of signal data</article-title>
<source>J. R. Stat. Soc. Ser. C Appl. Stat.</source>
<year>2018</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/rssc.12218">https://doi.org/10.1111/rssc.12218</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref17">
<label>[17]</label>
<mixed-citation>[17] A. B. Kashlak, S. Myroshnychenko, and S. Spektor, “Analytic Permutation Testing for Functional Data ANOVA,” <italic>J. Comput. Graph. Stat.</italic>, vol. 32, no. 1, pp. 294–303, May. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/10618600.2022.2069780">https://doi.org/10.1080/10618600.2022.2069780</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kashlak</surname>
<given-names>A. B.</given-names>
</name>
<name>
<surname>Myroshnychenko</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Spektor</surname>
<given-names>S.</given-names>
</name>
</person-group>
<article-title>Analytic Permutation Testing for Functional Data ANOVA</article-title>
<source>J. Comput. Graph. Stat.</source>
<year>2022</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/10618600.2022.2069780">https://doi.org/10.1080/10618600.2022.2069780</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref18">
<label>[18]</label>
<mixed-citation>[18] M. Hollander, D. A. Wolfe, and E. Chicken, “The onw-Way Layout Introduction,” in <italic>Nonparametric Statistical Methods</italic>, D. J. Balding <italic>et al.,</italic> Eds., Hoboken, New Jersey: John Wiley &amp; Sons, 2013.</mixed-citation>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Hollander</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wolfe</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Chicken</surname>
<given-names>E.</given-names>
</name>
</person-group>
<source>The onw-Way Layout Introduction</source>
<year>2013</year>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref20">
<label>[19]</label>
<mixed-citation>[19] D. Achlioptas, “Database-friendly random projections,” in <italic>Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, </italic> New York, NY, USA, 2001. <ext-link ext-link-type="uri" xlink:href="https://api.semanticscholar.org/CorpusID:2640788">https://api.semanticscholar.org/CorpusID:2640788</ext-link>
</mixed-citation>
<element-citation publication-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Achlioptas</surname>
<given-names>D.</given-names>
</name>
</person-group>
<source>Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems</source>
<year>2001</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://api.semanticscholar.org/CorpusID:2640788">https://api.semanticscholar.org/CorpusID:2640788</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref21">
<label>[20]</label>
<mixed-citation>[20] A. Nieto-Reyes, “Random Projections: Applications to Statistical Data Depth and Goodness of Fit Test,” <italic>BEIO Rev. Of. la Soc. Estadística e Investig. Oper.</italic>, vol. 35, no. 1, pp. 7–22, Mar. 2019. <ext-link ext-link-type="uri" xlink:href="https://www.seio.es/beio/BEIOVol35Num1.pdf#page=13">https://www.seio.es/beio/BEIOVol35Num1.pdf#page=13</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nieto-Reyes</surname>
<given-names>A.</given-names>
</name>
</person-group>
<article-title>Random Projections: Applications to Statistical Data Depth and Goodness of Fit Test</article-title>
<source>BEIO Rev. Of. la Soc. Estadística e Investig. Oper.</source>
<year>2019</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://www.seio.es/beio/BEIOVol35Num1.pdf#page=13">https://www.seio.es/beio/BEIOVol35Num1.pdf#page=13</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref22">
<label>[21]</label>
<mixed-citation>[21] J. A. Cuesta-Albertos, R. Fraiman, and T. Ransford, “Random projections and goodness-of-fit tests in infinite-dimensional spaces,” <italic>Bull. Brazilian Math. Soc.</italic>, vol. 37, no. 4, pp. 477–501, Dec. 2006. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00574-006-0023-0">https://doi.org/10.1007/s00574-006-0023-0</ext-link>
</mixed-citation>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cuesta-Albertos</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Fraiman</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Ransford</surname>
<given-names>T.</given-names>
</name>
</person-group>
<article-title>Random projections and goodness-of-fit tests in infinite-dimensional spaces</article-title>
<source>Bull. Brazilian Math. Soc.</source>
<year>2006</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00574-006-0023-0">https://doi.org/10.1007/s00574-006-0023-0</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref23">
<label>[22]</label>
<mixed-citation>[22] R. Ihaka, R. Gentleman. <italic>The R Project for Statistical Computing. </italic>(V R.4.2.1 2022). Accessed: Apr.. 16, 2023. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/bin/windows/base/old/4.2.1">https://cran.r-project.org/bin/windows/base/old/4.2.1</ext-link>/</mixed-citation>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<name>
<surname>Ihaka</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Gentleman</surname>
<given-names>R.</given-names>
</name>
</person-group>
<source>The R Project for Statistical Computing. (V R.4.2.1 2022)</source>
<year>2023</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/bin/windows/base/old/4.2.1">https://cran.r-project.org/bin/windows/base/old/4.2.1</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref24">
<label>[23]</label>
<mixed-citation>[23] J. Ramsay, G. Hooker, and S. Graves, <italic>Functional Data Analysis with R and MATLAB.</italic> New York, NY, USA: Springer New York, 2009. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-0-387-98185-7">https://doi.org/10.1007/978-0-387-98185-7</ext-link>
</mixed-citation>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<name>
<surname>Ramsay</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hooker</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Graves</surname>
<given-names>S.</given-names>
</name>
</person-group>
<source>Functional Data Analysis with R and MATLAB</source>
<year>2009</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-0-387-98185-7">https://doi.org/10.1007/978-0-387-98185-7</ext-link>
</comment>
</element-citation>
</ref>
<ref id="redalyc_344276634010_ref25">
<label>[24]</label>
<mixed-citation>[24] T. Pohlert, <italic>The Pairwise Multiple Comparison of Mean Ranks Package (PMCMR) v4.4.</italic> 2016. Accessed: Apr.16, 2023. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="http://cran.r-project.org/package=PMCMR">http://cran.r-project.org/package=PMCMR</ext-link>
</mixed-citation>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<name>
<surname>Pohlert</surname>
<given-names>T.</given-names>
</name>
</person-group>
<source>The Pairwise Multiple Comparison of Mean Ranks Package (PMCMR) v4.4. 2016.</source>
<year>2023</year>
<comment>
<ext-link ext-link-type="uri" xlink:href="http://cran.r-project.org/package=PMCMR">http://cran.r-project.org/package=PMCMR</ext-link>
</comment>
</element-citation>
</ref>
</ref-list>
<fn-group>
<title>Notes</title>
<fn id="fn8" fn-type="other">
<label>
<bold>ACKNOWLEDGEMENT AND FUNDING</bold>
</label>
<p>:</p>
<p>The authors thank the Editor and reviewers for their constructive comments, which improved the article's presentation. Francisco J. Rodríguez-Cortés and Ramón Giraldo has been partially supported by Universidad Nacional de Colombia, HERMES projects, Grant/Award Number: 612113.</p>
</fn>
<fn id="fn9" fn-type="other">
<label>
<bold>CONFLICTS OF INTEREST</bold>
</label>
<p>:</p>
<p>The authors declare no conflict of interest.</p>
</fn>
<fn id="fn10" fn-type="other">
<label>
<bold>AUTHORS CONTRIBUTIONS</bold>
</label>
<p>:</p>
<p>
<bold>Rafael Meléndez Surmay</bold>, <bold>Ramón Giraldo Henao</bold>, and <bold>Francisco Rodríguez Cortes</bold>, performed data processing, formal analysis, investigation, methodology, and original draft writing.</p>
</fn>
</fn-group>
</back>
</article>