Gamma parameter tuning

quant108 · October 16, 2018, 10:43pm

Can you provide some guidance on how to tune the gamma parameter? In the API docs, its range is [0, inf). Does it depend on the training sample size?

hcho3 · October 23, 2018, 7:03am

A good place to start is to plot the distribution of loss changes over all splits in all trees. Use get_dump() with with_stats=True: https://xgboost.readthedocs.io/en/latest/python/python_api.html#xgboost.Booster.get_dump

Set gamma=0 for this step so that all splits are allowed. Look at the typical size of the loss changes and adjust gamma accordingly.
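
As a rough sketch of that idea: the text dump produced by get_dump(with_stats=True) includes a `gain=...` field on each split line, so the loss changes can be pulled out with a regular expression and summarized. The helper names (`extract_gains`, `summarize`) and the sample dump below are hypothetical and only there for illustration; the sample just mimics the text dump layout so the snippet runs without a trained model.

```python
import re
import statistics

# Split lines in the text dump carry "gain=<number>"; leaf lines do not.
_GAIN_RE = re.compile(r"gain=([^,\s]+)")


def extract_gains(tree_dumps):
    """Collect the gain of every split from a list of per-tree text dumps."""
    gains = []
    for tree in tree_dumps:
        gains.extend(float(g) for g in _GAIN_RE.findall(tree))
    return gains


def summarize(gains):
    """Return a few summary statistics of the split gains."""
    ordered = sorted(gains)
    return {
        "n_splits": len(ordered),
        "min": ordered[0],
        "median": statistics.median(ordered),
        "max": ordered[-1],
    }


# Hypothetical two-tree dump, made up to resemble the with_stats=True layout.
sample_dump = [
    "0:[f2<2.45] yes=1,no=2,missing=1,gain=33.3,cover=100\n"
    "\t1:leaf=0.43,cover=33\n"
    "\t2:[f3<1.75] yes=3,no=4,missing=3,gain=20.5,cover=67\n"
    "\t\t3:leaf=-0.2,cover=36\n"
    "\t\t4:leaf=-0.4,cover=31\n",
    "0:[f0<5.5] yes=1,no=2,missing=1,gain=5.0,cover=100\n"
    "\t1:leaf=0.1,cover=40\n"
    "\t2:leaf=-0.1,cover=60\n",
]

gains = extract_gains(sample_dump)
summary = summarize(gains)
print(summary)
```

With a real model trained at gamma=0 (say a Booster named `bst`, an assumed variable name), the same helper would be called as `extract_gains(bst.get_dump(with_stats=True))`, and the resulting list can be passed to a histogram plot to see which gain values are typical before picking a gamma.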

JiaxiangBU · October 24, 2018, 10:08am

In the R package, is there a function like get_dump?

hcho3 · October 24, 2018, 4:35pm

You should use xgb.dump, which also accepts with_stats = TRUE.