Geometrically designed, variable knot regression splines

Kaishev, Vladimir K.; Dimitrova, Dimitrina S.; Haberman, Steven; Verrall, Richard J.

doi:10.1007/s00180-015-0621-7

Geometrically designed, variable knot regression splines

Original Paper
Published: 14 September 2015

Volume 31, pages 1079–1105, (2016)
Cite this article

Computational Statistics Aims and scope Submit manuscript

Vladimir K. Kaishev¹,
Dimitrina S. Dimitrova¹,
Steven Haberman¹ &
…
Richard J. Verrall¹

463 Accesses
8 Citations
Explore all metrics

Abstract

A new method of Geometrically Designed least squares (LS) splines with variable knots, named GeDS, is proposed. It is based on the property that the spline regression function, viewed as a parametric curve, has a control polygon and, due to the shape preserving and convex hull properties, it closely follows the shape of this control polygon. The latter has vertices whose x-coordinates are certain knot averages and whose y-coordinates are the regression coefficients. Thus, manipulation of the position of the control polygon may be interpreted as estimation of the spline curve knots and coefficients. These geometric ideas are implemented in the two stages of the GeDS estimation method. In stage A, a linear LS spline fit to the data is constructed, and viewed as the initial position of the control polygon of a higher order ($n>2$) smooth spline curve. In stage B, the optimal set of knots of this higher order spline curve is found, so that its control polygon is as close to the initial polygon of stage A as possible and finally, the LS estimates of the regression coefficients of this curve are found. The GeDS method produces simultaneously linear, quadratic, cubic (and possibly higher order) spline fits with one and the same number of B-spline coefficients. Numerical examples are provided and further supplemental materials are available online.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Artificial Intelligence in Physical Sciences: Symbolic Regression Trends and Perspectives

Article Open access 19 April 2023

A Guide for Sparse PCA: Model Comparison and Applications

Article Open access 29 June 2021

Genetic Algorithms in the Fields of Artificial Intelligence and Data Sciences

Article 30 August 2021

References

Antoniadis A, Gijbels I, Verhasselt A (2012) Variable selection in additive models using P-splines. Technometrics 54(4):425–438
Article MathSciNet Google Scholar
Beliakov G (2004) Least squares splines with free knots: global optimization approach. Appl Math Comput 149:783–798
MathSciNet MATH Google Scholar
Belitser E, Serra P (2014) Adaptive priors based on splines with random knots. Bayesian Anal 9(4):859–882
Article MathSciNet MATH Google Scholar
Biller C (2000) Adaptive Bayesian regression splines in semiparametric generalized linear models. J Comput Graph Stat 9:122–140
MathSciNet Google Scholar
Cohen E, Riesenfeld RF, Elber G (2001) Geometric modelling with splines: an introduction. A K Peters, Natick
MATH Google Scholar
De Boor C (2001) A practical guide to splines, revised Edition. Springer, New York
MATH Google Scholar
Denison D, Mallick B, Smith A (1998) Automatic Bayesian curve fitting. J R Stat Soc B 60:333–350
Article MathSciNet MATH Google Scholar
Donoho D, Johnstone I (1994) Ideal spatial adaptation by wavelet shrinkage. Biometrika 81:425–455
Article MathSciNet MATH Google Scholar
Eubank R (1988) Spline smoothing and nonparametric regression. Dekker, New York
MATH Google Scholar
Fan J, Gijbels I (1995) Data-driven bandwidth selection in local polynomial fitting: variable bandwidth and spatial adaptation. J R Stat Soc B 57:371–394
MathSciNet MATH Google Scholar
Farin G (2002) Curves and surfaces for CAGD, 5th edn. Morgan Kaufmann, San Francisco
Google Scholar
Friedman JH (1991) Multivariate adaptive regression splines (with discussion). Ann Stat 19:1–141
Article MATH Google Scholar
Friedman JH, Silverman BW (1989) Flexible parsimonious smoothing and additive modeling (with discussion). Technometrics 31:3–39
Article MathSciNet MATH Google Scholar
Hansen MH, Kooperberg C (2002) Spline adaptation in extended linear models (with comments and a rejoinder by the authors). Stat Sci 17(1):2–51
Article MathSciNet MATH Google Scholar
Hastie T (1989) [Flexible Parsimonious Smoothing and Additive Modeling]: Discussion. Technometrics 31:23–29
Huang JZ (2003) Local assymptotics for polynomial spline regression. Ann Stat 31:1600–1635
Article MATH Google Scholar
Jupp D (1978) Approximation to data by splines with free knots. SIAM J Numer Anal 15:328–343
Article MathSciNet MATH Google Scholar
Kaishev VK (1984) A computer program package for solving spline regression problems. In: Havranek T, Sidak Z, Novak M (eds) Proceedings in computational statistics, COMPSTAT. Physica-verlag, Wien, pp 409–414
Google Scholar
Kang H, Chen F, Li Y, Deng J, Yang Z (2015) Knot calculation for spline fitting via sparse optimization. Comput Aided Des 58:179–188
Article MathSciNet Google Scholar
Kimber SAJ, Kreyssig A, Zhang YZ, Jeschke HO, Valenti R, Yokaichiya F, Colombier E, Yan J, Hansen TC, Chatterji T, McQueeney RJ, Canfield PC, Goldman AI, Argyriou DN (2009) Similarities between structural distortions under pressure and chemical doping in superconducting $\text{ BaFe }_2\text{ As }_2$. Nat Mater 8:471–475
Article Google Scholar
Lee TCM (2000) Regression spline smoothing using the minimum description length principle. Stat Probab Lett 48:71–82
Article MathSciNet MATH Google Scholar
Lee TCM (2002a) Automatic smoothing for discontinuous regression functions. Stat Sin 12:823–842
MathSciNet MATH Google Scholar
Lee TCM (2002b) On algorithms for ordinary least squares regression spline fitting: a comparative study. J Stat Comput Simul 72:647–663
Article MathSciNet MATH Google Scholar
Lindstrom MJ (1999) Penalized estimation of free-knot splines. J Comput Graph Stat 8(2):333–352
MathSciNet Google Scholar
Luo Z, Wahba G (1997) Hybrid adaptive splines. J Am Stat Assoc 92:107–115
Article MathSciNet MATH Google Scholar
Mammen E, Van der Geer S (1997) Locally adaptive regression splines. Ann Stat 25(1):387–413
Article MathSciNet MATH Google Scholar
Marx BD, Eilers PHC (1996) Flexible smoothing with B-splines and penalties. Stat Sci 11(2):89–121
Article MathSciNet MATH Google Scholar
Miyata S, Shen X (2003) Adaptive free-knot splines. J Comput Graph Stat 12(1):197–231
Article MathSciNet Google Scholar
Molinari N, Durand J-F, Sabatier R (2004) Bounded optimal knots for regression splines. Comput Stat Data Anal 45(2):159–178
Article MathSciNet MATH Google Scholar
Pittman J (2002) Adaptive splines and genetic algorithms. J Comput Graph Stat 11(3):1–24
Article MathSciNet Google Scholar
Rupert D (2002) Selecting the number of knots for penalized splines. J Comput Graph Stat 11(4):735–757
Article MathSciNet Google Scholar
Rupert D, Carroll RJ (2000) Spatially-adaptive penalties for spline fitting. Aust N Z J Stat 42:205–223
Article Google Scholar
Schwetlick H, Schütze T (1995) Least squares approximation by splines with free knots. BIT Numer Math 35:854–866
Article MathSciNet MATH Google Scholar
Smith PL (1982) Curve fitting and modeling with splines using statistical variable selection techniques. Report NASA 166034, Langley Research Center, Hampton
Smith M, Kohn R (1996) Nonparametric regression using Bayesian variable selection. J Econom 75:317–344
Article MATH Google Scholar
Stone CJ, Hansen MH, Kooperberg C, Truong YK (1997) Polynomial splines and their tensor products in extended linear modeling. Ann Stat 25:1371–1470
Article MathSciNet MATH Google Scholar
Van Loock W, Pipeleers G, De Schutter J, Swevers J (2011) A convex optimization approach to curve fitting with B-splines. In: Preprints of the 18th international federation of automatic control (IFAC), Milano (Italy), 2290–2295
Wahba G (1990) Spline models for observational data. SIAM, Philadelphia
Book MATH Google Scholar
Will G (2006) Powder diffraction: the rietveld method and the two stage method. Springer, Berlin
Google Scholar
Wood SN (2003) Thin plate regression splines. J R Stat Soc B 65(1):95–114
Article MathSciNet MATH Google Scholar
Yuan Y, Chen N, Zhou S (2013) Adaptive B-spline knots selection using multi-resolution basis set. IIE Trans 45(12):1263–1277
Article Google Scholar
Zhou S, Shen X (2001) Spatially adaptive regression splines and accurate knot selection schemes. J Am Stat Assoc 96:247–259
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

The authors would like to acknowledge support received through a research grant from the UK Institute of Actuaries. The authors would also like to thank Simon Kimber for providing them with the $\hbox {BaFe}_2\hbox {As}_2$ dataset and the results from the Rietveld fit given in Kimber et al. (2009). The sincere encouragement received by David van Dyk, and his help in discussing and providing invaluable advice on ways to improve the paper are greatly appreciated.

Author information

Authors and Affiliations

Faculty of Actuarial Science and Insurance, Cass Business School, City University London, 106 Bunhill Row, London, EC1Y 8TZ, UK
Vladimir K. Kaishev, Dimitrina S. Dimitrova, Steven Haberman & Richard J. Verrall

Authors

Vladimir K. Kaishev
View author publications
You can also search for this author in PubMed Google Scholar
Dimitrina S. Dimitrova
View author publications
You can also search for this author in PubMed Google Scholar
Steven Haberman
View author publications
You can also search for this author in PubMed Google Scholar
Richard J. Verrall
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vladimir K. Kaishev.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 2031 KB)

Appendices

1.1 Proofs of the results of section 3.1

Proof of Theorem 3.4

Note that, for $n=2$, $\xi _i\equiv \xi _i^*$, $i=1,\ldots ,p$, hence $V^a[g]\equiv V[g]$ and the bound in (23), which is zero, is sharp. For $n>2$, from (4) it follows that $\xi _1^*\equiv a\equiv \xi _1$ and $\xi _p^*\equiv b\equiv \xi _p$, and from the definitions of V[g] and $V^a[g]$, (9) and (22) respectively, we have

$$\begin{aligned} \Vert V[g]-V^a[g]\Vert _{\infty }= & {} \text{ max }_{t\in [a,b]}\left| \sum _{i=1}^p\left( g\left( \xi _i^*\right) -g\left( \xi _i\right) \right) N_{i,n}(t)\right| \nonumber \\\le & {} \text{ max }_{t\in [a,b]}\sum _{i=1}^p\left| \left( g\left( \xi _i^*\right) -g\left( \xi _i\right) \right) \right| N_{i,n}(t)\nonumber \\\le & {} \text{ max }_{t\in [a,b]}\sum _{i=1}^p\left\{ \text{ max }_{j\in \{2,\ldots ,p-1\}} \left| \left( g\left( \xi _j^*\right) -g\left( \xi _j\right) \right) \right| \right\} N_{i,n}(t)\nonumber \\\le & {} \text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \left( g\left( \xi _j^*\right) -g(\xi _j)\right) \right| \text{ max }_{t\in [a,b]}\sum _{i=1}^pN_{i,n}(t)\nonumber \\= & {} \text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \left( g\left( \xi _j^*\right) -g(\xi _j)\right) \right| , \end{aligned}$$

(27)

where the last equality follows from the partition of unity property of B-splines (see Sect. 2). Applying the definition of the modulus of continuity to (27) we have

$$\begin{aligned} \Vert V[g]-V^a[g]\Vert _{\infty }\le & {} \text{ max }_{j\in \{2,\ldots ,p-1\}} \left| \left( g\left( \xi _j^*\right) -g(\xi _j)\right) \right| \nonumber \\\le & {} \omega \left( g;\text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \xi _j^*-\xi _j\right| \right) . \end{aligned}$$

(28)

From the definition (4) of the Greville sites $\xi _i^*$ we have $\xi _j^*=(t_{j+1}+\ldots + t_{j+n-1})/(n-1)$, $j=2,\ldots ,p-1$. From (21), it follows that $t_{j+1}=\left( \xi _{j-(n-2)}+\ldots +\xi _j \right) /(n-1),\ldots , t_{j+n-1}=\left( \xi _{j}+\ldots +\xi _{j+(n-2)} \right) /(n-1)$, where we have defined $\xi _{1-l}:=a$ and $\xi _{p+l}:=b$, $l=1,2,\ldots $. Consider the $\text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \xi _j-\xi _j^*\right| $ and assume it is achieved for some $j^m$, $2\le j^m<p-1$. Expressing $\xi _{j^m}^*$ in terms of $\xi _{j^m}$, using the above equalities, after some algebra, it is not difficult to see that

$$\begin{aligned} \left| \xi _{j^m}-\xi _{j^m}^*\right| =\frac{1}{(n-1)^2} \left| \sum _{i=1}^{n-2}i\left( \xi _{j^m+(n-1-i)}+\xi _{j^m-(n-1-i)}\right) -(n-1)(n-2) \xi _{j^m}\right| \end{aligned}$$

(29)

and if we now rearrange the terms in the sum in (29), we obtain

$$\begin{aligned} \left| \xi _{j^m}-\xi _{j^m}^*\right| =\frac{1}{(n-1)^2}\left| \sum _{i=1}^{n-2}i\left( \left( \xi _{j^m+(n-1-i)}-\xi _{j^m}\right) -\left( \xi _{j^m}-\xi _{j^m-(n-1-i)}\right) \right) \right| . \end{aligned}$$

(30)

Assume that $\sum _{i=1}^{n-2}i\left( \xi _{j^m+(n-1-i)}-\xi _{j^m}\right) >\sum _{i=1}^{n-2}i\left( \xi _{j^m}-\xi _{j^m-(n-1-i)}\right) $. In this case, it is not difficult to see that (30) is bounded by

$$\begin{aligned} \left| \xi _{j^m}-\xi _{j^m}^*\right|\le & {} \frac{1}{(n-1)^2}\sum _{i=1}^{n-2}i \left( \xi _{j^m+(n-1-i)}-\xi _{j^m}\right) \nonumber \\\le & {} \,\frac{1}{(n-1)^2}\frac{(n-2)(n-1)}{2}\left( \xi _{j^m+(n-2)}-\xi _{j^m} \right) \nonumber \\\le & {} \,\frac{(n-2)}{2(n-1)}\left( \xi _{j^m+(n-2)}-\xi _{j^m}\right) \nonumber \\\le & {} \,\frac{(n-2)^2}{2(n-1)}\text{ max }_{j\in \{1,\ldots ,p-1\}}(\xi _{j+1}-\xi _j). \end{aligned}$$

(31)

Similarly, it can be shown that if $\sum _{i=1}^{n-2}i\left( \xi _{j^m+(n-1-i)}-\xi _{j^m}\right) \le \sum _{i=1}^{n-2}i\Big (\xi _{j^m}- \xi _{j^m-(n-1-i)}\Big )$ the bound in (31) also holds. Thus, from (31) and (28) we have

$$\begin{aligned} \Vert V[g]-V^a[g]\Vert _{\infty }\le \omega \left( g;\frac{(n-2)^2}{2(n-1)} \text{ max }_{j\in \{1,\ldots ,p-1\}}(\xi _{j+1}-\xi _j)\right) . \end{aligned}$$

(32)

Using the monotonicity and subadditivity of $\omega (g;h)$ in h, from (32) we finally obtain

$$\begin{aligned} \Vert V[g]-V^a[g]\Vert _{\infty }\le \left\lceil \frac{(n-2)^2}{2(n-1)}\right\rceil \omega \left( g;\text{ max }_{j\in \{1, \ldots ,p-1\}}(\xi _{j+1}-\xi _j)\right) \end{aligned}$$

where $\lceil \nu \rceil :=\hbox {min}\{z\in \mathbb {Z}:\nu \le z\}$. This completes the proof of Theorem 3.4. $\square $

Proof of Corollary 3.5

This follows directly from (32) and from the definition, (24) of $\omega (g;h)$, i.e.

$$\begin{aligned} \Vert V[t]-V^a[t]\Vert _{\infty }=\left\| t-\sum _{i=1}^p\delta _{i+1} N_{i,n}(t)\right\| _{\infty }\le \frac{(n-2)^2}{2(n-1)} \text{ max }_{j\in \{1,\ldots ,p-1\}}(\delta _{j+2}-\delta _{j+1}). \end{aligned}$$

$\square $

Proof of Corollary 3.6

From (27), for $n=3$ and $g=\hat{f}$, we have

$$\begin{aligned} \Vert V[\hat{f}]-V^a[\hat{f}]\Vert _{\infty }\le & {} \text{ max }_{j\in \{2,\ldots ,p-1\}} \left| \hat{f}\left( \varvec{\delta }_{l,2},\varvec{\hat{\alpha }};\xi _j^*\right) -\hat{f}\left( \varvec{\delta }_{l,2},\varvec{\hat{\alpha }};\delta _{j+1}\right) \right| \end{aligned}$$

(33)

$$\begin{aligned}= & {} \text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \sum _{i=1}^p\hat{\alpha }_i N_{i,2}\left( \xi _j^*\right) -\sum _{i=1}^p\hat{\alpha }_i N_{i,2} (\delta _{j+1})\right| \nonumber \\= & {} \text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \sum _{i=1}^p\hat{\alpha }_i N_{i,2}\left( \xi _j^*\right) -\hat{\alpha }_j\right| \nonumber \\= & {} \text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \sum _{i=j-1}^{j+1}\hat{\alpha }_i N_{i,2}\left( \xi _j^*\right) -\hat{\alpha }_j\right| \end{aligned}$$

(34)

Recall that $n=3$ and hence, $\xi _j^*=(t_{j+1}+t_{j+2})/2$, and $t_{j+1}=(\delta _j+\delta _{j+1})/2$, $t_{j+2}=(\delta _{j+1}+\delta _{j+2})/2$. Therefore, we need to consider the cases when $\delta _{j}<\xi _j^*\le \delta _{j+1}$, or $\delta _{j+1}\le \xi _j^*<\delta _{j+2}$, $2\le j\le p-1$. In the first case, applying the Mansfield-De Boor-Cox recurrence formula we know that if $\delta _j<\xi _j^*<\delta _{j+1}$, then $\sum _{i=j-1}^{j+1}\hat{\alpha }_i N_{i,2}\left( \xi _j^*\right) =\hat{\alpha }_{j-1} N_{j-1,2}\left( \xi _j^*\right) +\hat{\alpha }_{j}N_{j,2}\left( \xi _j^*\right) $, which is a convex combination of only two B-spline coefficients. Thus, (34) becomes

$$\begin{aligned}&\text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \hat{\alpha }_{j-1}N_{j-1,2} \left( \xi _j^*\right) +\hat{\alpha }_j N_{j,2}\left( \xi _j^*\right) -\hat{\alpha }_j\right| \nonumber \\&\quad =\text{ max }_{j\in \{2,\ldots ,p-1\}}\left| \hat{\alpha }_{j-1} \frac{\delta _{j+1}-\xi _j^*}{\delta _{j+1}-\delta _{j}} +\hat{\alpha }_j\frac{\xi _j^*-\delta _{j}}{\delta _{j+1}-\delta _{j}} -\hat{\alpha }_j\frac{\delta _{j+1}-\delta _{j}}{\delta _{j+1} -\delta _{j}}\right| \nonumber \\&\quad =\text{ max }_{j\in \{2,\ldots ,p-1\}}\left| (\hat{\alpha }_{j-1} -\hat{\alpha }_j)\right| \left( \frac{\delta _{j+1}-\xi _j^*}{\delta _{j+1} -\delta _{j}}\right) \nonumber \\&\quad <\text{ max }_{j\in \{2,\ldots ,p-1\}}\left| (\hat{\alpha }_{j-1} -\hat{\alpha }_j)\right| \left( \frac{\frac{1}{4}\left( \delta _{j+1} -\delta _{j}\right) }{\delta _{j+1}-\delta _{j}}\right) \nonumber \\&\quad =\frac{1}{4}\text{ max }_{j\in \{2,\ldots ,p-1\}} \left| (\hat{\alpha }_{j-1}-\hat{\alpha }_j)\right| , \end{aligned}$$

(35)

where we have used the fact that $\delta _{j+2}-\delta _{j+1}>0$ to arrive at the last inequality. Similarly, it is not difficult to see that the same bound as in (35) holds in the case when $\delta _{j+1}\le \xi _j^*\le \delta _{j+2}$. This completes the proof of Corollary 3.6. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kaishev, V.K., Dimitrova, D.S., Haberman, S. et al. Geometrically designed, variable knot regression splines. Comput Stat 31, 1079–1105 (2016). https://doi.org/10.1007/s00180-015-0621-7

Download citation

Received: 19 April 2015
Accepted: 30 August 2015
Published: 14 September 2015
Issue Date: September 2016
DOI: https://doi.org/10.1007/s00180-015-0621-7

Keywords

JEL Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Geometrically designed, variable knot regression splines

Abstract

Access this article

Similar content being viewed by others

Artificial Intelligence in Physical Sciences: Symbolic Regression Trends and Perspectives

A Guide for Sparse PCA: Model Comparison and Applications

Genetic Algorithms in the Fields of Artificial Intelligence and Data Sciences

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material 1 (pdf 2031 KB)

Appendices

Appendices

1.1 Proofs of the results of section 3.1

Proof of Theorem 3.4

Proof of Corollary 3.5

Proof of Corollary 3.6

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Geometrically designed, variable knot regression splines

Abstract

Access this article

Similar content being viewed by others

Artificial Intelligence in Physical Sciences: Symbolic Regression Trends and Perspectives

A Guide for Sparse PCA: Model Comparison and Applications

Genetic Algorithms in the Fields of Artificial Intelligence and Data Sciences

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material 1 (pdf 2031 KB)

Appendices

Appendices

1.1 Proofs of the results of section 3.1

Proof of Theorem 3.4

Proof of Corollary 3.5

Proof of Corollary 3.6

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation