train_test_split doesn't support split of 0.0 #182

robmarkcole · 2024-06-25T12:41:51Z

🐛 Bug

The check 0 < _f <= 1 is failed for value of 0.0

To Reproduce

Pass a value of 0.0 as a split to train_test_split

train_test_split(dataset, splits=[0.0, 0.0, 1.0])

Expected behavior

I can have a split of 0.0

Environment

Master

The text was updated successfully, but these errors were encountered:

deependujha · 2024-06-26T06:58:25Z

Hi @robmarkcole, thanks for pointing out the issue.

I'm working on another issue. When the PR is ready to be merged, I'll try to fix this issue too. I don't think fixing this will require much work to be done.

deependujha · 2024-06-26T20:23:10Z

Btw, why would someone even want a split of 0.0?

This even makes sense: [0.01, 0.01, 0.98] and it works fine.

If I remember correctly, Luca added the condition for each split to be greater than 0, while reviewing the PR.

if not all(0 < _f <= 1 for _f in splits):
        raise ValueError("Each Split should be a float with each value in [0,1].")

robmarkcole · 2024-06-26T20:48:46Z

I have a single dataset and typically random split it. However I also sometimes want to just test on it, so test weighting is 100%

deependujha · 2024-06-26T21:05:25Z

Okay, I was thinking of just updating all(0 **<=** _f <= 1 for _f in splits) will do the work, but, I also need to make some changes internally.

I'll try fixing it as soon as possible. Btw, if you've used train_test_split and any issues encountered, plz mention it here in the same thread. It'll be easier to club them and fix them at once.

robmarkcole added bug Something isn't working help wanted Extra attention is needed labels Jun 25, 2024

deependujha mentioned this issue Jun 27, 2024

Fix: error while splitting dataset with splits=[0.1, 0.2, 0.7] and support split of 0.0 #187

Merged

4 tasks

tchaton closed this as completed in #187 Jun 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train_test_split doesn't support split of 0.0 #182

train_test_split doesn't support split of 0.0 #182

robmarkcole commented Jun 25, 2024

deependujha commented Jun 26, 2024 •

edited

Loading

deependujha commented Jun 26, 2024

robmarkcole commented Jun 26, 2024

deependujha commented Jun 26, 2024

train_test_split doesn't support split of 0.0 #182

train_test_split doesn't support split of 0.0 #182

Comments

robmarkcole commented Jun 25, 2024

🐛 Bug

To Reproduce

Expected behavior

Environment

deependujha commented Jun 26, 2024 • edited Loading

deependujha commented Jun 26, 2024

robmarkcole commented Jun 26, 2024

deependujha commented Jun 26, 2024

deependujha commented Jun 26, 2024 •

edited

Loading