What is centering of data?

glmboost

#1

Hello,

The highlighted portion mentions that the data should be centered internally, as this allows much faster convergence of the algorithm. I understand that if the data is centered, fewer iterations will be required, but what is centering of data?


#2

Centering simply means subtracting a constant, most commonly the variable's mean, from every value of a variable. This redefines the zero point of that predictor to be whatever value you subtracted: the scale is shifted, but the units and the spread of the values stay the same.
The following covers the ways in which centering can be done in R:
http://gastonsanchez.com/blog/how-to/2014/01/15/Center-data-in-R.html
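
To make the definition above concrete, here is a minimal sketch in plain Python (not R, and not what glmboost does internally). The `center` helper and the `heights_cm` values are made up for illustration; the function subtracts the mean by default, or any other constant you pass in.

```python
from statistics import mean, pstdev

def center(values, constant=None):
    """Subtract a constant (by default the mean) from every value."""
    if constant is None:
        constant = mean(values)
    return [v - constant for v in values]

# Hypothetical predictor values, in centimetres.
heights_cm = [150.0, 160.0, 170.0, 180.0]

# Mean-centering: the new zero point is the old mean (165 cm).
centered = center(heights_cm)
print(centered)                     # [-15.0, -5.0, 5.0, 15.0]
print(mean(centered))               # 0.0

# The spread is unchanged; only the location has moved.
print(pstdev(heights_cm), pstdev(centered))

# Centering on another constant just moves the zero point elsewhere.
print(center(heights_cm, 150.0))    # [0.0, 10.0, 20.0, 30.0]
```

After mean-centering, a value of 0 means "an average observation" and the values are still in centimetres, which is what "shifts the scale, but retains the units" refers to.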

The following explains why it isn't always a good idea to center or standardize the data:
http://www.unt.edu/benchmarks/archives/2008/june08/rss.htm

Hope this helps!