I’d like to be able to group features and then define subsample probabilities (or absolute counts) based on these groups. This would be equivalent to the ‘sampsize’ argument in R’s randomForest package (https://github.com/cran/randomForest/blob/master/man/randomForest.Rd#L64).
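To make the request concrete, here is a rough sketch of the behaviour I have in mind. It is written in Python purely for illustration, and it assumes the groups are defined over observations, which is how I understand ‘sampsize’ to work. The function name `stratified_subsample` and its arguments are made up for this example and are not part of any package.

```python
import random
from collections import defaultdict


def stratified_subsample(groups, sampsize, seed=None):
    """Draw row indices for one tree, taking sampsize[g] rows from each group g.

    groups   -- list with one group label per observation
    sampsize -- dict mapping group label -> number of rows to draw
                (hypothetical argument, named after randomForest's 'sampsize')
    """
    rng = random.Random(seed)

    # Collect the row indices that belong to each group.
    by_group = defaultdict(list)
    for idx, label in enumerate(groups):
        by_group[label].append(idx)

    drawn = []
    for label, n in sampsize.items():
        # Sample without replacement inside each group.
        drawn.extend(rng.sample(by_group[label], n))
    return sorted(drawn)


# 90 rows in group "a", 10 rows in group "b"; draw 5 rows from each per tree.
groups = ["a"] * 90 + ["b"] * 10
idx = stratified_subsample(groups, {"a": 5, "b": 5}, seed=1)
```

With plain subsampling, group "b" would contribute about 10% of each tree's rows; here each tree sees the two groups in equal numbers regardless of their size in the data.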
This question seems to be similar (equivalent?) to these discussions:
Is this possible in the current R implementation? If not, does anyone know if it would be difficult to implement?
Thanks in advance,