View Article

Abstract

In this paper we will discuss Generalized Estimating Equations modelling with natural cubic splines application for an example discrete dataset using R software ‘geepack’ and ‘splines’ package. We are taken data of 298 points for binary variables like gender and age for either new treatment or drug control.

Keywords

Generalized Linear Models, Cubic Splines

Introduction

In medical research new medicine and drug control plays vital role according to age and their body immune system. A medicine have different association with 70 year old and 45 years old person with diabetes and blood pressure patients. For example, if a model suggests that an increase in fasting plasma glucose (FPG) leads to an increase in HbA1c, this increase is the same if FPG increase from 120 to 165 mg/dL or if FPG change from 180 to 220 mg/dL. Assumption of linearity would be true often than the assumption of dichotomising data. Many scenarios this assumption would not be true. In longitudinal randomized clinical trials collection of efficacy or safety data often occur off-schedule beyond protocol allowed visit windows, considering this data at scheduled visits introduce potential bias due to clinical observations being carried forward or backward to the closest planned visits. In this situation it might be advantageous to treat time as continuous (actual time from baseline) rather than category variable in analysis of repeated measures mixed modelling. In these scenarios alternative modelling strategy would assume and explore non-linear continuous associations. There are many regression spline models are available to explore, but here we focus use of natural cubic splines. Loic Desquilbet and Fralcoismariotti [1] in this paper non-linear dose response association are widely criticized and restricted cubic spline functions are powerful tools to characterize a dosce response association between a Continues exposer and an outcome.  SAS software is used for fitting Linear, logistic and cox model as wall as linear and logistic generalized estimating equations and statistical tests for overall non-linear associations.   Hansnyquist [2] Penalty function approach as iterate procedure for obtaining the estimates of generalize linear model under linear restrictions on parameters by using likelihood ratio test, Wald test and Lagrange’s Multiple tests are considered as alternatives for testing hypothesis about linear restrictions on the parameters. Mary C. Meyar Et.al [3] explains about non-parametrical modeling using regression splines with Bayesian Approach to generalized partial linear regression model where Knots may be modeled as fixed or free the R code to implement the methods is described. Arisperoglou Et.al [4], in their paper the discussed about popular splines like cubic splines, B-splines, Penalized splines, natural cubic and cardinal splines and these splines where fitted and tested using R. Laralusa and CRT Ahlin[5], in their paper periodic restricted cubical splines (RCS ) and cubical splines are fitted using R packages for different knots for 500 units of data and obtained simulation  results, model accuracy tested using AKAIKE information criteria. Ulrich Halekoh Et.al [6], explains about generalized estimates fact for R and its applications by using cluster binary data was listed. Chin-schang Li [7], explains about a penalized log-likelihood ratio test statistic is constructed for a null hypothesis of the non-parametric components of a semi parametric generalized linear model component estimates using cubic B splines. The knots for this splines is fixed and its limiting its null distribution is the distribution of a linear combination of independent chi-square random variables each with 1 degree of freedom. A real life data is used for practical use of problem.

MEHODOLOGY:

Spline Regression Modelling

In general cubic splines are parabola curves fitted to data

C(Y|X1) = β0 + β1X1 + β2 X12

Piece wise regression in basic form is linear function. Data is divided into parts, the intersection part is called knots. Generally, piece wise regression is as follows

f(x) = β0 + β1x + β2(x − a)+ + β3 (x − b)+ + β4 (x − c)+

where (u)+ = u, u > 0,

                  = 0, u ≤ 0.

According to knots is increasing the function f(x) is as follows

f(x) = β0 + β1x, x ≤ a

        = β0 + β1x + β2(x − a),  a< x≤ b

       = β0 + β1x + β2(x− a) + β3(x− b), b< x≤ c

       = β0 + β1x + β2(x − a) +β3(x− b) + β4(x − c), c < x

A linear spline function with knots at a=1, b=3, c=5

Cubic Splines (or) cubic piece wise regression:

In this situation cubic polynomial regression is fitted for each and every part of data. If data is divided into three knots is as follows

f(X) = β0 + β1X + β2X2 + β3X3 + β4(X − a)3+ + β5(X − b)3+ + β6(X − c)3+

       = Xβ

with constructed variables: X1 = X, X2 = X2, X3 = X3, X4 = (X − a)3+, X5 = (X − b)3+, X6 = (X − c)3+

Restricted Cubic Splines:

The restricted spline function with k knots t1,...,tk is given by

f(X) = β0 + β1X1 + β2X2 + ... + βk−1Xk−1,

where X1 = X and for s = 1,...,m− 2, Xj+1 = (X – ts)3+ − (X – tm−1)3(tm – ts) / (tm – tm−1) + (X – tm)3+ (tm−1 – ts )/(tm – tm−1)

It can be shown that Xj is linear in X for X≥tm.  For numerical behaviour and to put all basis functions for X on the same scale, the above terms divide by τ = (tk − t1)2

Once β0,...,βk−1 are estimated, the restricted cubic spline can be restated in the form

f(X) = β0 + β1X + β2(X − t1)3+ + β3(X − t2)3+ + ... + βm+1(X – tm)3+

by dividing β2,...,βk−1 by τ and computing

βk = [β2(t1 – tm) + β3(t2 – tm)+ ... + βm−1(tm−2 – tm)]/(tm – tm−1)

βk+1 = [β2(t1 − tm−1)+ β3(t2 – tm−1)+ ... + βm−1(tm−2 – tm−1)]/(tm−1 – tm)

A test of linearity in X can be obtained by testing H0: β2 = β3 = ... = βm−1 = 0

Default values for knots

m

Quantiles

3

4

5

6

7

0.10   0.5   0.90

0.05   0.35   0.65   0.95

0.05   0.275   0.5   0.725   0.95

0.05   0.23   0.41   0.59   0.77   0.95

0.025   0.1833   0.3417   0.5   0.6583   0.8167   0.975

Reference

  1. Loic Desquilbet and Francois Mariotti, Statistics in Medicine, 2010, 29, 1034-1057.
  2. Hans Nyquist, Restricted Estimation of generalized linear Models, Appl. statist, 19, 91,40 (1), pp 133-141
  3. Mary C Meyer, Amber J. Hanstadt and Jennifer A-Hoeting, Bayesian estimation and inference for generalized partial linear models using shape-restricted Splines, Journal of Nonparametric Statistics, Vol23(4) December 11 pp 867-884
  4. Aris Perperogloa, Welli Sauerbrei, Michal Abrahnowica and Matthias Schmis, A review of spline function procedures in R, BMC Medical Research Methodology, 2019, 19:46, PP 1 to 16.
  5. Lara Lusa and Crt Ahlin (2020) Restricted Cubic Splines for modeling periodic data, PLOS ONE 15 (10), pp 1 to 17.
  6. Ulrich Halekoh, Soren Hojsgaard and Jun Yan (2006), The R package geepack for Generalized Estimating Equations, Journal & Statistical Software, Vol 15 (2), 1-11
  7. Chin-Shang Li (2012), Lack-of-Fit Tests for generalized Linear Modls via splines, communications in statistics theory and Methods, 41, 4240-4250.

Photo
B. Sarojamma
Corresponding author

Research scholar, Department of Statistics, S V University, Tirupati.

Photo
C. Suresh
Co-author

Research scholar, Department of Statistics, S V University, Tirupati.

Photo
N. Jayalakshmi
Co-author

Department of Computer Science, S.G.S Arts College, TTD, Tirupati.

Photo
Kalava Harish
Co-author

Research scholar, Department of Statistics, S V University, Tirupati.

Photo
B. Triveni
Co-author

Department of Computer Science, Sri Govinda raja Swamy Arts College(A), Tirupati.

Photo
R. V. S. S. Nagabhushana Rao
Co-author

Department of Statistics, Vikrama Simhapuri University, Nellore.

Photo
P. Manohar
Co-author

Department of HAS, Sri Venkateswara College of Engineering & Technology, Chittoor.

C. Suresh, N. Jayalakshmi, Kalava Harish, B. Triveni, R. V. S. S. Nagabhushana Rao, P. Manohar, B. Sarojamma*, Generalized Linear Models with Restricted Cubic Splines, Int. J. Sci. R. Tech., 2026, 3 (4), 258-262. https://doi.org/10.5281/zenodo.19479817

More related articles
Overview Of In Vitro – Antioxidant Models...
Vishal Shewale , Shubham Pawar, Aakanksha Shewale , Nikita Sandha...
Design and Fabrication of a Pedal-Driven Rope Twis...
Karthikeyan K., Dins Milton J., Yuvanesh Kumar V., Manikandan M.,...
A Fruit Review on Marvelous Milberry With Its Nutr...
Safid Halim Khan, Prachi Desale, Vivek Waghere, ...
A Study on Partial Replacement of Cement with Rice Husk Ash and Bamboo Biochar i...
Dr. Pranab Jyoti Barman, Abdul Masud, Anadi Krishna Saikia, Bhanujita Kutum, Chiranjib Gogoi, Madhur...
In-Depth In-Silico Functional, And Structural Screening Of IL-4 Gene Variants Li...
Hamna Tariq, Aniqa Amir, Muhammad Saleem, Kainat Ramzan, Tuba Aslam, Mehmooda Asif, ...
Assessing the Anatomical Variations in Route CT Scan of Paranasal Sinuses in Pat...
Mohit Sharma, Saurav Singh Gusain , Amisha Kharola, Arushi Thakur, ...
Related Articles
Overview Of In Vitro – Antioxidant Models...
Vishal Shewale , Shubham Pawar, Aakanksha Shewale , Nikita Sandhan , Priti Patle, Vaidehi Pawar , ...
More related articles
Overview Of In Vitro – Antioxidant Models...
Vishal Shewale , Shubham Pawar, Aakanksha Shewale , Nikita Sandhan , Priti Patle, Vaidehi Pawar , ...
Design and Fabrication of a Pedal-Driven Rope Twisting Machine with Integrated P...
Karthikeyan K., Dins Milton J., Yuvanesh Kumar V., Manikandan M., Annamalai K., Sankarnarayanan V., ...
Overview Of In Vitro – Antioxidant Models...
Vishal Shewale , Shubham Pawar, Aakanksha Shewale , Nikita Sandhan , Priti Patle, Vaidehi Pawar , ...
Design and Fabrication of a Pedal-Driven Rope Twisting Machine with Integrated P...
Karthikeyan K., Dins Milton J., Yuvanesh Kumar V., Manikandan M., Annamalai K., Sankarnarayanan V., ...