You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have finetuned the grounding dino model on a custom dataset for binary class object detection. I have finetuned the model on cropped images (512x512). During the inference time using image_demo.py, the model almost correctly classifies the classes along with localization for similar cropped images.
However, when inference is performed on original size image (which in my case is 5464x3640), the performance is very bad.
I believe a sliding window inference feature would help in this case and it would be of great help if someone can help me to modify the image_demo.py to perform sliding window approach.
Currently sliding_window approach can be performed using large_image_demo.py but it can only handle faster_rcnn variant architectures and not Grounding Dino.
The text was updated successfully, but these errors were encountered:
I have finetuned the grounding dino model on a custom dataset for binary class object detection. I have finetuned the model on cropped images (512x512). During the inference time using image_demo.py, the model almost correctly classifies the classes along with localization for similar cropped images.
However, when inference is performed on original size image (which in my case is 5464x3640), the performance is very bad.
I believe a sliding window inference feature would help in this case and it would be of great help if someone can help me to modify the image_demo.py to perform sliding window approach.
Currently sliding_window approach can be performed using large_image_demo.py but it can only handle faster_rcnn variant architectures and not Grounding Dino.
The text was updated successfully, but these errors were encountered: