I ran into a reproducibility issue with the code below. I am working with a friend who uses a Windows machine, while I am on macOS. We cannot reproduce each other's results, which is a big problem for collaborative work on this model.
By varying the parameters, we found that the discrepancy is tied to subsample and colsample_bytree.
I’ve seen a few conversations on the subject, as well as the PR https://github.com/dmlc/xgboost/pull/735, which was supposed to fix this, but the problem still occurs. We both have version 22.214.171.124 of the xgboost package on our machines. Could you help us with this issue?
library(xgboost)

data(agaricus.train, package = 'xgboost')
data(agaricus.test, package = 'xgboost')
dtrain <- xgb.DMatrix(agaricus.train$data, label = agaricus.train$label)
dtest <- xgb.DMatrix(agaricus.test$data, label = agaricus.test$label)
watchlist <- list(train = dtrain, eval = dtest)
param <- list(max_depth = 2, eta = 1, silent = 1, nthread = 1,
              subsample = 0.5, colsample_bytree = 0.1,
              objective = "binary:logistic", eval_metric = "auc")
set.seed(2020)
bst <- xgb.train(param, dtrain, nrounds = 5, watchlist)
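To make the comparison between the two machines easier, a minimal sketch like the one below could be run on each side and the printed lines compared. It is only a diagnostic, not a fix. The helper name `fit_and_predict` and the control run with subsample = 1 and colsample_bytree = 1 are assumptions added for illustration, and the `silent` flag is left out. If the control lines match across machines while the sampled lines differ, that would support the idea that the random row/column sampling is the source of the discrepancy.

```r
library(xgboost)

data(agaricus.train, package = "xgboost")
data(agaricus.test, package = "xgboost")
dtrain <- xgb.DMatrix(agaricus.train$data, label = agaricus.train$label)
dtest <- xgb.DMatrix(agaricus.test$data, label = agaricus.test$label)

# Settings shared by both runs (taken from the snippet above).
base_param <- list(max_depth = 2, eta = 1, nthread = 1,
                   objective = "binary:logistic", eval_metric = "auc")

# Hypothetical helper: train with the given sampling settings under a
# fixed R seed and return the predictions on the test set.
fit_and_predict <- function(subsample, colsample_bytree, seed = 2020) {
  param <- modifyList(base_param,
                      list(subsample = subsample,
                           colsample_bytree = colsample_bytree))
  set.seed(seed)
  bst <- xgb.train(param, dtrain, nrounds = 5)
  predict(bst, dtest)
}

pred_sampled <- fit_and_predict(0.5, 0.1)  # settings from the report
pred_control <- fit_and_predict(1, 1)      # assumed control: no sampling

# Print platform, package version, and the first few predictions at fixed
# precision so the output can be compared line by line across machines.
cat("platform:", R.version$platform, "\n")
cat("xgboost: ", as.character(packageVersion("xgboost")), "\n")
cat("sampled: ", sprintf("%.8f", head(pred_sampled)), "\n")
cat("control: ", sprintf("%.8f", head(pred_control)), "\n")
```

Including the platform string and package version in the output makes it clear which machine and build each set of numbers came from.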