Adjust the train_test_split parts in pipelines #91
momegas changed the title from "Train test split" to "Adjust the train_test_split parts in pipelines" on Feb 15, 2023
I am willing to work on this as my first issue, @stavrostheocharis @momegas.
Feel free to open a PR. I'll assign this to you as well.
Hey @aditkay95 please tell me when to move to
I would like to work on this issue, @momegas and @stavrostheocharis.
I'm assigning this to you.
We use "train_test_split" in "src/analytics/models/pipelines" at lines 40 & 98. If only one class remains in the train set after the split, the calculations that follow (ROC AUC, etc.) will raise an error.
We need to adjust the train_test_split calls, possibly by using the "stratify" argument as described in the docs: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html
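A minimal sketch of what the adjusted split could look like, assuming a classification target `y`. The helper name `split_with_stratify`, the toy data, and the fallback to an unstratified split when a class has fewer than two samples are illustrative assumptions, not the project's actual pipeline code. Only `train_test_split` and its `stratify` argument come from scikit-learn.

```python
import numpy as np
from sklearn.model_selection import train_test_split


def split_with_stratify(X, y, test_size=0.25, random_state=0):
    """Hypothetical helper: stratify on y when every class has >= 2 samples."""
    _, counts = np.unique(y, return_counts=True)
    # Stratification needs at least two samples per class; otherwise
    # fall back to a plain split (an assumption made for this sketch).
    stratify = y if counts.min() >= 2 else None
    return train_test_split(
        X, y, test_size=test_size, random_state=random_state, stratify=stratify
    )


# Imbalanced toy data (assumed): 18 negatives and 2 positives.
X = np.arange(40).reshape(20, 2)
y = np.array([0] * 18 + [1] * 2)

X_train, X_test, y_train, y_test = split_with_stratify(X, y, test_size=0.5)

# Both classes should be present on each side of the split.
print(np.unique(y_train), np.unique(y_test))
```

With stratification, class proportions are preserved on both sides of the split, so metrics such as ROC AUC have both classes available. When the fallback path is taken, a single-class train set is still possible and would need separate handling.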