Sorry to disturb. I’m a beginner to distributed xgboost version. Recently, I’m planning to use distributed xgboost to train rank task.
I find some docs about rank docs for data format here. https://xgboost.readthedocs.io/en/latest/tutorials/input_format.html#embedding-additional-information-inside-libsvm-file
But I’am confused about the details. I wonder that if I should split the whole dataset into small and place it onto different node or whole dataset onto each node?
Another question is not related to distributed xgboost. When I do ranking task, how can I plot the rank pairwise loss when training?