andrewpeterson / Amp / issues / #133 - Why biases are excluded from l2_regularization term? — Bitbucket

Issue #133 new

Alireza Khorshidi created an issue 2017-01-17

Why l2_regularization = tf.reduce_sum(tf.square(W_fc)) and not l2_regularization =tf.reduce_sum(tf.square(W_fc))+tf.reduce_sum(tf.square(b_fc)) here?

Comments (2)

Zachary Ulissi
My sense from the literature / tutorials / etc was that biases are usually excluded from regularization.

https://stats.stackexchange.com/questions/153605/no-regularisation-term-for-bias-unit-in-neural-network

On a related note, I'm planning to add another flag for type of regularization (L1 or L2)
- 2017-01-17T17:57:15+00:00
Alireza Khorshidi reporter
Okey, but it seems to be added in all layers other than input, like here or here.
- 2017-01-17T18:08:00+00:00
Log in to comment

Assignee: Zachary Ulissi

Type: proposal

Priority: minor

Status: new

Votes: 0

Watchers: 2

Jira: the preferred issue tracker for Bitbucket. Join the team!