Intiallizing the weight matrix #3

vanrajbrown · 2019-02-05T12:11:02Z

How are you initializing weight matrix in multiclass_classification_gpu.py line number 83. Although when avoiding vanishing and exploding gradient, we need to use np.random.randn(size_l, size_l-1). Can you please explain the numbers you are using - tf.truncated_normal([11,11,3,96], stddev=0.01).
Thanks

vanrajbrown · 2019-02-05T13:16:25Z

Okay, I got the reason as to why are you using these number, these are from AlexNet Architecture. But do you have any write up written on this, as to why are you using what you are on each step. That would be so helpful, I have lot more questions, will it be fine if I post them under the issues page?

MuhammedBuyukkinaci · 2019-02-10T14:38:35Z

Sorry for being late. I tried to implement AlexNet Paper. The hyperparameters used in the model are from that paper. You are highly welcome to ask your questions here. I will try to answer them.

vanrajbrown · 2019-02-20T11:22:44Z

Thanks, I Figured out the Alex net Architecture. I have another question, why the value of Bias constant varies either 0 or 1. How do you decide which layer bias value as 1 or 0 ?

MuhammedBuyukkinaci · 2019-03-20T16:45:48Z

I just got this information from CS224 of Andrej Karpathy on youtube.com .

MuhammedBuyukkinaci closed this as completed Jun 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Intiallizing the weight matrix #3

Intiallizing the weight matrix #3

vanrajbrown commented Feb 5, 2019

vanrajbrown commented Feb 5, 2019

MuhammedBuyukkinaci commented Feb 10, 2019

vanrajbrown commented Feb 20, 2019

MuhammedBuyukkinaci commented Mar 20, 2019

Intiallizing the weight matrix #3

Intiallizing the weight matrix #3

Comments

vanrajbrown commented Feb 5, 2019

vanrajbrown commented Feb 5, 2019

MuhammedBuyukkinaci commented Feb 10, 2019

vanrajbrown commented Feb 20, 2019

MuhammedBuyukkinaci commented Mar 20, 2019