In this module, users can choose a ML algorithm to train a customized scoring function based on the descriptors
they submitted.
The dataset provided by the user is divided into the
training set and the test set according to the ratio inputted by the user and will be preprocessed using
sklearn.
Three ML algorithms (i.e., eXtreme Gradient Boosting, Support Vector Machine and Random Forest) are provided.
Users can choose a ML algorithm and do settings about hyper-parameter optimization (which hyper-parameter to be
optimized, the hyper-parameter range, how to generate the search space and the optimization rounds). Finally,
according to the
user's input, the server uses hyperopt to find the optimal hyper-parameter
combinations and chooses the
corresponding ML algorithm for training (10-fold cross validation) and testing.
The suggestions for each set of hyper-parameters in this page and the information provided on the result page
include the model performance, the hyper-parameters of the model, the feature importance, the validation curve,
the loss-hype_parameter scatter, the hyper_parameter-tuning_round scatter offer users the directions for the
improvement of model accuracy. And thanks to the multiple options offered by this server, users are able to
continuously use this module to customize their models to achieve better performance.