I am using the following code to create my daskdmatrix and train on it. The kernel crashes without an error message on the last line:
cluster = dask.distributed.LocalCluster(n_workers = 48, threads_per_worker=1)
client = dask.distributed.Client(cluster)
xTrain = dd.from_pandas(db.iloc[:,1:],chunksize=1000)
yTrain = dd.from_pandas(db.iloc[:,0:1],chunksize=1000)
dTrain = xgb.dask.DaskDMatrix(client=client, data=xTrain, label=yTrain)
params = {'tree_method':'hist','objective':'reg:squarederror'}
reg = xgb.dask.train(client, params, dTrain, num_boost_round=numRounds,verbose_eval=1)
the dataset is ~100GB. I am training it on a AWS instance with 480GB of RAM and 48 CPUs. I have no idea how to fix this since there is no error message.