DEPARTMENT OF COMPUTING

Course Home | Syllabus | Assignments | Schedule | Notes | Downloads | [print]

CS 4320: Machine Learning

Assignment: Hyper Parameter Search

Use the March 2021 Playground Series data set at Kaggle. Use hyper parameter search with cross validation to create a decision tree classification model (such as sklearn.tree.DecisionTreeClassifier), to obtain the best F1 score possible.

It is expected that you will use the Titantic hyper parameter search with cross validation decision tree source code as a starting point for your code development.

Create a report that includes the data exploration plots and analysis, which hyper parameters were used in the search, the range or set of values used for each hyper parameter, the hyper parameters selected, the number of cross validation sets, the F1 cross-validation score obtained, the training F1 score of the model when trained on all training data, and finally, the F1 score of the model on the testing data.

Include a comparison of the three F1 scores, interpret the meaning of these comparisons.

Required Steps

Last Updated 01/16/2023