-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
A few images have incomplete annotations. #21
Comments
It sesms, that most images (I only checked SHSY5Y) have missing annotations: For example:
Further, many annotations appear fragmented and inaccurate. I wonder, if that is due to the rasteration (vector -> pixel) during the annotation process? Example: Are the vector masks available somewhere? |
Hi @ivan-ea, |
Hi @StefanBaar, I believe that the broken visualizations are due to the software you use to make the masks. LIVECell is annotated using polygon annotations and stored in COCO-annotation format, so these fragmented masks are not coming from the raw annotations. When converting polygons to masks, the main challenge will be thin structures like the ones you show and your rendering looks like it needs some tweaking. Cell 117 seems to be missing a protrusion on the bottom left though. |
Hi @RickardSjogren,
I apologize, but I don't think this is correct. It appears, that in your dataset, the annotations are stored as RLE and not as polygons. This means each annotation is stored as pixel mask and not as a list of coordinates. The renders, I have produced above, are as true to the data (provided in the dataset) as possible. This is how I did it in python:
when looking at the data of the first annotation ( annos[0] ), we get the following output:
which looks like RLE and not polygon. I also confirmed the content of the raw json file, in which I could not find any polygon data. Am I doing something wrong here? If the polygon data is contained somewhere within the json file and if you have some time to spare, could you please elaborate on how to retrieve the polygon data? Could you provide the polygon data? Thank you very much for your time. |
@StefanBaar The annotation you provided I think is a polygon. you can refer to COCO data format explain. When |
ok, I got it. In the case of this dataset, I am curious ... what is the intended method to convert the polygon data into pixel masks? Further, I don't really understand why one would use polygons (basically a set of straight lines) to annotate images of round objects. |
Thanks for looking into this @StefanBaar . It sure seems as For the models trained in the paper we used Detectron2, which has its own parsers for COCO-datasets. Under the hood, they us Regarding the choice of polygons. This is the standard way of annotating instances in most fields. There are certainly some drawbacks depending on the vertex-density you use and so on. Even though pixelmasks are more precise, they are far more time consuming to annotate. This is something we have experimented quite a bit with and polygons provide in most cases sufficient precision while being much more budget-friendly. |
Hi, thank you for making such an interesting dataset publicly available!
If I'm not mistaken, I think there are a few images with incomplete annotations.
In other words, the json entry only contains the coordinates of the segmentation
for 1 or 2 cells, while the image shows clearly many more cells. Example in the
image below (id = 150535). Files affected in the table.
The list may not be complete, as the threshold I used was 10 segments per image,
but I'm confident some of the images with around 20+ segmented cells are all ok.
The number of images affected seems to be very small, so excluding these images
should solve any issue. Nevertheless, I think this information could be relevant
for people that are interested in using the dataset for their DL models.
The text was updated successfully, but these errors were encountered: