-
-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error during exporting of split dataset (with null class values). #32
Comments
This happens when exporting to YOLOv5. |
Running the example notebook on Google Colab works fine, for information. My dataset was imported as YOLOv5, will try now to import as COCO and see if there is a difference. |
I suspect that this is also caused by your images without annotations, which creates null values. If it works on Colab it may be because Colab has a different version of pandas. I see this error message above "TypeError: can't multiply sequence by non-int of type 'float'" pandas sometimes changes the datatype of a column which causes errors like this. |
So I imported the dataset using ImportYOLOv5 successfully, then exported to COCO, also successfully. The problem is that when I import the newly exported dataset, errors start to occur such as:
I think these problems stem from the NAN class. Maybe a better solution would be to not consider it a class rather ignore the images that don't contain labels. |
I have to add that I tried the Google Colab notebook with your sample dataset, not mine. |
In terms of machine learning principles, I think it is valid to have images without labels to train a model. So I will try to get it to work and add more test cases. But in the mean time, you can filter out the the rows without labels like this after you import it. dataset.df = dataset.df[dataset.df.cat_id.notnull()] |
Appreciate the concern and thanks for the tip. Will try it out and let you know if sth goes wrong. In the mean time, should I close this issue or leave it open? |
You can leave it open and thank you for reporting it. There is a lot of diversity in how people create and save datasets so it is helpful to see other peoples examples. |
The text was updated successfully, but these errors were encountered: