performance gap with pytorch 0.4.1: mIoU 79.99 #67

Wangzhuoying0716 · 2019-10-28T06:11:58Z

Sorry to bother you! I tried to implement the training process using the seg_hrnet_w48_train_512x1024_sgd_lr1e-2_wd5e-4_bs_12_epoch484.yaml. But it came out that the validation mIoU for single-scale and no-flip is 79.99, almost 1% lower than your result shown(80.9). I wonder if this is normal variation or I did something wrong ?
Here is the training log:
https://pan.baidu.com/s/1utKUVuBEjDBtfgOk7A-5sQ 【passward: jckv】
Thank you!

Hussainflr · 2019-11-02T14:34:35Z

Hi, @Wangzhuoying0716 would you mind sharing your training settings,
I have been trying to train it, but it always gets stuck at this point

I don't know where I'm doing something wrong.

my training settings:
Pytorch 0.4.1
python 3.7
OpenCV 3.4.2.17
rest is according to requirements.txt

Wangzhuoying0716 · 2019-11-03T04:35:19Z

@Hussainflr If there is no error message but just gets stuck, I guess it's because the memory is too full to keep on? You can check the memory usage when it is stuck to see whether it's almost full.
As for me, I just used 4xTitan Xp GPU cards and the setting in seg_hrnet_w48_train_512x1024_sgd_lr1e-2_wd5e-4_bs_12_epoch484.yaml.

Wangzhuoying0716 · 2019-11-03T04:37:10Z

@Hussainflr I also used
Pytorch 0.4.1 and python 3.7

sunke123 · 2019-12-11T03:29:57Z

@Wangzhuoying0716
Sorry for late reply.
I checked your log. When training on cityscapes, we use the "class balance" in the loss function.
You can change the config and set the class balance as True.

whiteinblue · 2020-01-09T04:11:12Z

Hi @sunke123, LOSS.CLASS_BALANCE defined in lib/config/default.py, but is there any place to use it?
and i found, the loss: criterion = CrossEntropy(ignore_label=config.TRAIN.IGNORE_LABEL,
weight=train_dataset.class_weights), use class weighs, which doesn't matter with CLASS_BALANCE param

sunke123 · 2020-01-09T05:48:20Z

@whiteinblue
Class weights are used to train the model on Cityscapes. If you don't want to use it, you can change it in the yaml.

whiteinblue · 2020-01-10T01:23:17Z

@sunke123 thank you for your reply， and i confuse, it's where that the param LOSS.CLASS_BALANCE used ?

XuShoweR · 2020-01-10T07:47:18Z

Hi, @Wangzhuoying0716 would you mind sharing your training settings,
I have been trying to train it, but it always gets stuck at this point

I don't know where I'm doing something wrong.

my training settings:
Pytorch 0.4.1
python 3.7
OpenCV 3.4.2.17
rest is according to requirements.txt

Hello, i met same problem, have you solved it?

sunke123 · 2020-01-23T03:57:38Z

@whiteinblue Class_balance is used in the loss function.

purse1996 · 2020-12-01T07:39:04Z

So where that the param LOSS.CLASS_BALANCE used? I read the code, it seems that no matter what is CLASS_BALANCE param, the balance loss is used in the cityscape dataset.

verymadmatt mentioned this issue Jan 23, 2020

performance gap of HRNetV2+OCR on cityscape val set using default config #91

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance gap with pytorch 0.4.1: mIoU 79.99 #67

performance gap with pytorch 0.4.1: mIoU 79.99 #67

Wangzhuoying0716 commented Oct 28, 2019

Hussainflr commented Nov 2, 2019 •

edited

Loading

Wangzhuoying0716 commented Nov 3, 2019

Wangzhuoying0716 commented Nov 3, 2019

sunke123 commented Dec 11, 2019

whiteinblue commented Jan 9, 2020

sunke123 commented Jan 9, 2020

whiteinblue commented Jan 10, 2020

XuShoweR commented Jan 10, 2020

sunke123 commented Jan 23, 2020

purse1996 commented Dec 1, 2020

performance gap with pytorch 0.4.1: mIoU 79.99 #67

performance gap with pytorch 0.4.1: mIoU 79.99 #67

Comments

Wangzhuoying0716 commented Oct 28, 2019

Hussainflr commented Nov 2, 2019 • edited Loading

Wangzhuoying0716 commented Nov 3, 2019

Wangzhuoying0716 commented Nov 3, 2019

sunke123 commented Dec 11, 2019

whiteinblue commented Jan 9, 2020

sunke123 commented Jan 9, 2020

whiteinblue commented Jan 10, 2020

XuShoweR commented Jan 10, 2020

sunke123 commented Jan 23, 2020

purse1996 commented Dec 1, 2020

Hussainflr commented Nov 2, 2019 •

edited

Loading