Hello,
I have run into a reproducibility issue with the code below. I am working with a friend who uses a Windows machine, while I use macOS. Running the same code with the same seed, we get different results, which is a big problem for collaborative work on this model.
By experimenting with the parameters, we found that the discrepancy appears to be caused by subsample and colsample_bytree.
I’ve seen a few conversations on the subject, as well as the PR https://github.com/dmlc/xgboost/pull/735, which was supposed to fix this, but the problem still occurs for us. We both have version 1.1.1.1 of the xgboost package installed. Could you help us with this issue?
data(agaricus.train, package='xgboost')
data(agaricus.test, package='xgboost')
dtrain <- xgb.DMatrix(agaricus.train$data, label = agaricus.train$label)
dtest <- xgb.DMatrix(agaricus.test$data, label = agaricus.test$label)
watchlist <- list(train = dtrain, eval = dtest)
param <- list(max_depth = 2, eta = 1, silent = 1, nthread = 1,
              subsample = 0.5, colsample_bytree = 0.1,
              objective = "binary:logistic", eval_metric = "auc")
set.seed(2020)
bst <- xgb.train(param, dtrain, nrounds = 5, watchlist)
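To make the discrepancy easier to pin down, here is a minimal sketch of how the evaluation logs from the two machines could be compared. It assumes each machine exports `bst$evaluation_log` as a data frame with an `iter` column plus one column per metric. The helper `compare_eval_logs` and the numbers in the two example logs are hypothetical placeholders, not real xgboost output.

```r
# Hypothetical helper (not part of xgboost): compare two evaluation logs
# with a numeric tolerance and report the first iteration where they differ.
compare_eval_logs <- function(log_a, log_b, tol = 1e-8) {
  log_a <- as.data.frame(log_a)
  log_b <- as.data.frame(log_b)
  # Both logs must have the same columns and the same number of rounds
  stopifnot(identical(names(log_a), names(log_b)),
            nrow(log_a) == nrow(log_b))
  metric_cols <- setdiff(names(log_a), "iter")
  # Absolute difference per metric, then the largest one per iteration
  diffs <- abs(as.matrix(log_a[metric_cols]) - as.matrix(log_b[metric_cols]))
  row_max <- apply(diffs, 1, max)
  mismatched <- which(row_max > tol)
  list(
    all_match = length(mismatched) == 0,
    first_mismatch = if (length(mismatched) > 0) log_a$iter[mismatched[1]] else NA_integer_,
    max_abs_diff = max(row_max)
  )
}

# Placeholder values for illustration only (NOT real results from either machine)
log_mac <- data.frame(iter = 1:3,
                      train_auc = c(0.90, 0.95, 0.97),
                      eval_auc  = c(0.89, 0.94, 0.96))
log_win <- data.frame(iter = 1:3,
                      train_auc = c(0.90, 0.93, 0.97),
                      eval_auc  = c(0.89, 0.94, 0.96))

res <- compare_eval_logs(log_mac, log_win)
print(res)
```

In practice, each machine could write its log with `saveRDS(as.data.frame(bst$evaluation_log), "eval_log.rds")` (the file name is arbitrary), and one of us would load both files with `readRDS()` and pass them to the helper. The first mismatching iteration shows whether the runs diverge from the very first tree or only later.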
Thank you