Out-of-memory when using sparse matrix (python)


#1

I’m using the following test script:

When increasing SAMPLES from 100_000 to 1_000_000, it crashes with the following message:
terminate called after throwing an instance of 'thrust::system::system_error'
  what():  parallel_for failed: out of memory

(it’s using ~19G, nowhere near the total memory available).

XGBoost v0.90

Any ideas on how I should approach debugging this?

Thanks,


#2

Thrust is a GPU library, so I'm assuming you are training on the GPU and it is running out of GPU memory rather than system memory.
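
One way to check this is to look at the parameters the script passes to XGBoost. The sketch below is only an illustration: the test script is not shown in this thread, so the `params` dict is hypothetical, and `gpu_settings` and `force_cpu` are helper names made up for this example, not part of XGBoost. It relies on the v0.90 parameter values `gpu_hist` / `gpu_exact` for `tree_method` and `gpu_predictor` for `predictor` being the ones that select the GPU code paths.

```python
# Parameter values that select GPU code paths in XGBoost 0.90.
GPU_TREE_METHODS = {"gpu_exact", "gpu_hist"}
GPU_PREDICTOR = "gpu_predictor"


def gpu_settings(params):
    """Return the (key, value) pairs in params that request GPU execution."""
    found = []
    if params.get("tree_method") in GPU_TREE_METHODS:
        found.append(("tree_method", params["tree_method"]))
    if params.get("predictor") == GPU_PREDICTOR:
        found.append(("predictor", params["predictor"]))
    return found


def force_cpu(params):
    """Return a copy of params with GPU settings swapped for CPU equivalents."""
    cpu = dict(params)
    if cpu.get("tree_method") in GPU_TREE_METHODS:
        # Dropping the "gpu_" prefix gives the CPU method: "hist" or "exact".
        cpu["tree_method"] = cpu["tree_method"][len("gpu_"):]
    if cpu.get("predictor") == GPU_PREDICTOR:
        cpu["predictor"] = "cpu_predictor"
    return cpu


# Hypothetical parameters; the original test script is not shown in this thread.
params = {"tree_method": "gpu_hist", "predictor": "gpu_predictor", "max_depth": 6}

print(gpu_settings(params))  # which settings ask for the GPU
print(force_cpu(params))     # the same parameters, restricted to the CPU
```

If `gpu_settings` reports anything, passing the output of `force_cpu` to the training call instead should keep training on the CPU, so the GPU memory limit no longer applies.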


#3

I was not trying to use GPUs; I didn't know it would try to use them by default with the above test script. Thanks for the pointer.