Refinement Module #154

Open
ahm-nq opened this issue Dec 30, 2024 · 4 comments

@ahm-nq

ahm-nq commented Dec 30, 2024

First of all, thank you for your incredible work on BiRefNet and for making it publicly available—it’s truly inspiring to see such dedication and innovation in the field.

I’ve been fine-tuning BiRefNet on a car segmentation dataset using the General_244.pth checkpoint and the same configuration provided in the original GitHub repository. However, after 164 epochs of fine-tuning, the results I’m achieving don’t seem to match the quality of the pretrained model.

While exploring the configuration file, I noticed an option to choose a Refinement Module, which is set to an empty string by default. Could this be a potential reason why my results aren’t as refined as those of the pretrained model?
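
For reference, here is roughly what that option looks like. This is paraphrased from config.py from memory, so the exact candidate names may differ:

```python
class Config:
    def __init__(self):
        # Paraphrased from this repo's config.py; exact candidates may differ.
        # The default '' means no refinement module is attached to the model.
        self.refine = ['', 'itself', 'RefUNet', 'Refiner'][0]
```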

Additionally, I couldn’t find explicit information in the paper about whether Refinement Modules were used during the training of the pretrained BiRefNet. Did you incorporate Refinement Modules to achieve the results shared in the repository? Any insights or guidance would be immensely helpful.

Thank you for your time and for creating such a remarkable contribution to segmentation research!

Let me know if you would like to see some sample results.

@ZhengPeng7
Owner

Hi, thanks for your interest in looking deeper into my code.

Some refinement blocks can help, but the relevant code is quite messy, and the GPU memory cost could be extremely high. For those reasons, they were not used in my implementation.

I tested some refiners with SwinB as the backbone to keep GPU memory costs lower during training. In a relatively fair comparison, the variant with a refiner does bring some improvement, as the screenshot below shows.

[Screenshot, 2025-01-02 19:56: comparison of results with and without a refiner]

@ahm-nq
Author

ahm-nq commented Jan 3, 2025

OK, let me add some context: I am fine-tuning your model to also remove the background visible through the car's window regions. I have attached a comparison of the original legacy model's results and the fine-tuned version's; the fine-tuned results are somewhat degraded.

Brief settings:
backbone (bb) is frozen
loss recipe: 30 × BCE + 0.5 × IoU + 30 × SSIM (sketched in code below)
augmentations: ['flip', 'enhance', 'rotate', 'crop']
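
For clarity, here is roughly how I set those loss weights. This is a sketch modeled on the loss-weight dict in config.py; the key names are from memory and may not match the file exactly:

```python
class Config:
    def __init__(self):
        # Sketch of my loss weighting, modeled on the lambdas dict in this
        # repo's config.py; key names are from memory and may not match exactly.
        self.lambdas_pix_last = {
            'bce': 30,    # binary cross-entropy
            'iou': 0.5,   # IoU loss
            'ssim': 30,   # SSIM loss
        }
```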

Original image:
[image]

Background removed using the legacy weights:
[image]

Results I obtained after fine-tuning the legacy weights:
[image]

Could introducing this new task of removing the background through the windows be affecting what the model had already learned? I have tried multiple fixes, but the fine-tuned results are still not as good as the original's, even though the fine-tuned model does remove the background through the windows.

@ZhengPeng7
Owner

First, freezing the backbone did not work well in my earlier experiments, so I would not recommend it.
Besides, you can try training from scratch if you have hundreds of images.
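
If you want to try unfreezing it, a minimal PyTorch sketch would look like the following; the `models.birefnet` import path and the `bb` submodule name are taken from this repo's layout and may differ in your copy:

```python
import torch
from models.birefnet import BiRefNet  # repo-local import; path may differ

model = BiRefNet()

# Unfreeze the backbone (the `bb` submodule in this repo's model code)
# before fine-tuning, so its weights can adapt to the new task.
for p in model.bb.parameters():
    p.requires_grad = True

# Rebuild the optimizer over all trainable parameters afterwards.
optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=1e-5
)
```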

@ZhengPeng7
Owner

Oh, I forgot one important thing -- I updated the default training to FP16. However, all the existing weights were trained in FP32 unless stated otherwise. I'm not entirely sure, but this may have some influence when fine-tuning from the previous weights. You can simply comment out the --use_accelerate flag in train.sh to avoid FP16 training.
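
If in doubt, a quick way to check the precision of a checkpoint before fine-tuning is to inspect its parameter dtypes. A minimal sketch, assuming the file is a plain state dict (some checkpoints wrap the weights under a 'state_dict' key):

```python
import torch

# Load the checkpoint on CPU and collect the dtypes of its tensors.
state = torch.load('General_244.pth', map_location='cpu')
if isinstance(state, dict) and 'state_dict' in state:
    state = state['state_dict']  # unwrap, if the weights are nested
print({v.dtype for v in state.values() if torch.is_tensor(v)})
# FP32-trained legacy weights should report {torch.float32}.
```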
