Sliding window approach feature in image_demo.py for finetuning Grounding Dino #12290

Open
shvardhan1994 opened this issue Jan 13, 2025 · 0 comments
shvardhan1994 commented Jan 13, 2025

I have finetuned the Grounding DINO model on a custom dataset for binary-class object detection, using cropped images (512x512). At inference time with image_demo.py, the model classifies and localizes objects almost correctly on similar 512x512 crops.
However, when inference is run on an original-size image (5464x3640 in my case), the performance is very bad.
I believe a sliding-window inference feature would help here, and I would appreciate help modifying image_demo.py to support it.
Currently, sliding-window inference is available through large_image_demo.py, but it only handles Faster R-CNN variant architectures, not Grounding DINO.
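
To make the request concrete, here is a minimal sketch of the tiling and merging logic I have in mind. It is not taken from image_demo.py or large_image_demo.py. The names `tile_origins`, `nms`, `sliding_window_detect` and the `detect_tile` callback are hypothetical, and the 128-pixel overlap and 0.5 IoU threshold are assumed values. `detect_tile` stands in for "crop this window and run the finetuned model on it"; it receives the window as `(x0, y0, x1, y1)` and is assumed to return boxes in tile-local coordinates.

```python
from typing import Callable, List, Tuple

# (x1, y1, x2, y2, score, label); this tuple layout is an assumption for the sketch.
Box = Tuple[float, float, float, float, float, int]
Window = Tuple[int, int, int, int]


def tile_origins(size: int, tile: int, stride: int) -> List[int]:
    """Start offsets of tiles of length `tile` that cover [0, size)."""
    if size <= tile:
        return [0]
    origins = list(range(0, size - tile + 1, stride))
    if origins[-1] != size - tile:
        origins.append(size - tile)  # last tile sits flush with the border
    return origins


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], iou_thr: float) -> List[Box]:
    """Greedy per-class NMS: keep the highest-scoring box of each overlapping group."""
    kept: List[Box] = []
    for box in sorted(boxes, key=lambda b: b[4], reverse=True):
        if all(k[5] != box[5] or iou(k, box) <= iou_thr for k in kept):
            kept.append(box)
    return kept


def sliding_window_detect(
    width: int,
    height: int,
    detect_tile: Callable[[Window], List[Box]],  # hypothetical model wrapper
    tile: int = 512,       # matches the crop size used for finetuning
    overlap: int = 128,    # assumed value, tune for the object size
    iou_thr: float = 0.5,  # assumed value
) -> List[Box]:
    """Run `detect_tile` on overlapping windows and merge results in image coordinates."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must be in [0, tile)")
    stride = tile - overlap
    merged: List[Box] = []
    for y0 in tile_origins(height, tile, stride):
        for x0 in tile_origins(width, tile, stride):
            window = (x0, y0, min(x0 + tile, width), min(y0 + tile, height))
            for x1, y1, x2, y2, score, label in detect_tile(window):
                # Shift tile-local boxes back into full-image coordinates.
                merged.append((x1 + x0, y1 + y0, x2 + x0, y2 + y0, score, label))
    # Objects in the overlap regions are detected more than once; NMS removes duplicates.
    return nms(merged, iou_thr)
```

With these assumed defaults, a 5464x3640 image is covered by 14 x 10 = 140 tiles of 512x512, so every crop the model sees has the same size it was finetuned on. Objects cut by a tile border may still produce partial boxes that NMS does not merge, so the overlap should probably be at least as large as the biggest expected object.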
