Add Owlv2; Add option to use owlvit / owlv2's image preprocessing procedure (the roi_align in current implementation processes images differently and results in distribution shifts and subpar performance) #23

xuanlinli17 · 2024-03-07T08:35:42Z

Factor in a bug found by Fix bug with non square images #20
Add Owlv2
Add an option (no_roi_align) in the OwlPredictor class of nanoowl/owl_predictor.py to use the original owlvit / owlv2 implementation's image preprocessing procedure. From my debugging, I find that the current nanoowl repo preprocesses images differently from the original owlvit / owlv2 implementations due to the use of "roi_align". This causes the larger models (base-16, large-14) to perform subpar compared to the original owlvit / owlv2. Passing in no_roi_align=True fixes this issue.
Add an option (nms_threshold) to use non-maximal suppression bounding box filtering
Fix cv2 "image no write permission" error in drawing utils

…rget size

ssmmoo1 and others added 6 commits March 1, 2024 11:57

fix image dimensions and single thresholds

995f6d2

fix drawing read only error

66f5636

initialize owlv2 code

531d255

add owlv2 inference and add no-roi-align option

bd0a813

add nms and folder inference examples

4536288

rm unnecessary files and readme changes

814afd3

xuanlinli17 added 2 commits March 18, 2024 21:54

fix bug of image preprocess resizing when orig image size > resize ta…

ead0ef0

…rget size

add class-based nms

71f2e19

Provide feedback