XGBoost4J PySpark with Python Examples

Daniel8hen · March 2, 2020, 8:07am

Hi,

I noticed there are no PySpark examples showing how to use XGBoost4J.
Can someone provide an example of a full pipeline, i.e. with VectorAssembler (or DMatrix?), StringIndexer, one-hot encoding (OHE), or other methods?

Now that XGBoost has reached 1.0.0, I thought someone might be able to help with this; it would be great to pick up the new version and migrate fully to Python from now on.

Looking forward!

Thank you,
Daniel

Daniel8hen · March 4, 2020, 9:49am

@hcho3 can you assist?

Daniel8hen · March 10, 2020, 6:51am

@hcho3 do you think this is relevant, or should I close it for now? Thanks.

hcho3 · March 10, 2020, 5:09pm

I don’t think the PySpark API made it into 1.0.

Daniel8hen · March 11, 2020, 8:30am

Thank you for the reply.