Hi, thank you for the great work!
I would like to know if this model supports region-level image segmentation. Specifically, can it take user-provided spatial prompts such as points or bounding boxes to segment objects, similar to how SAM (Segment Anything Model) works?
For example, given a text instruction like:
"Please segment the object based on the referring points/bbox."
would the model be able to use the spatial hints (points or bounding boxes) together with the text to produce the desired segmentation?
If not currently supported, are there any plans to add such functionality in the future?
Thank you!