Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inconsistency in the dataset loading approach #6

Open
PCTsai opened this issue Apr 15, 2024 · 0 comments
Open

Inconsistency in the dataset loading approach #6

PCTsai opened this issue Apr 15, 2024 · 0 comments

Comments

@PCTsai
Copy link

PCTsai commented Apr 15, 2024

Hi @AndrzejCodahead,

I've been using the scripts in fish-identification/helper/classification/CreateDatasetAndTrain.ipynb to train my own dataset. This process involves two Python scripts: auto_train_triplet.py and dataset_creator_by_coco.py.

I noticed an inconsistency in the dataset loading approach. Specifically, the script auto_train_triplet.py utilizes the class FishialDatasetFoOnlineCutting to load the train and validate datasets, whereas train.py from the classification module employs FishialDataset. It appears that both classes are essential yet they are used differently. Could you please provide more detailed insights into the training flow and clarify this discrepancy?

Furthermore, if FishialDatasetFoOnlineCutting is indeed the correct class for my purposes, I understand that I need to create a voxel dataset using the fiftyone Python package. However, the output from dataset_creator_by_coco.py are JSON files, not a voxel dataset. Should I be saving my data in voxel dataset format instead?

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant