Huber loss / np.where

June 12, 2019 · Less than 1 minute read

Huber loss:

$$
L_{δ}(y, f(x)) = \begin{cases} \frac{1}{2} (y - f(x))^{2} & \text{for } |y - f(x)| \leq δ, \\ δ\, |y - f(x)| - \frac{1}{2} δ^{2} & \text{otherwise.} \end{cases}
$$

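Usually $δ = 1$.
Huber loss is less sensitive to outliers (those $y$ such that $| y - f (x) | > δ$) than the MSE, because it grows only linearly, not quadratically, for large errors.
It also converges faster and more precisely than the mean absolute error: in the quadratic region the gradient shrinks with the error, whereas the MAE gradient keeps a constant magnitude all the way to the minimum.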
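
Differentiating the definition with respect to the prediction $f(x)$ makes both comparisons concrete. This is a short derivation rather than part of the original note, and the residual symbol $r = y - f(x)$ is introduced here only for brevity:

$$
\frac{\partial L_{δ}}{\partial f(x)} = \begin{cases} -r & \text{for } |r| \leq δ, \\ -δ \operatorname{sign}(r) & \text{otherwise.} \end{cases}
$$

The gradient magnitude never exceeds $δ$, so one outlier cannot dominate an update the way it can under the MSE, whose gradient grows linearly with $|r|$. Near the minimum the gradient goes to zero with $r$, unlike the MAE gradient, which has magnitude 1 for every $r \neq 0$.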

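A Python implementation using TensorFlow:

import tensorflow as tf

def huber_fn(y_true, y_pred):
    # Assumes delta == 1
    error = y_true - y_pred

    # Element-wise mask: True where the quadratic branch applies.
    # At |error| == 1 both branches equal 0.5, so < and <= give the same loss.
    is_small_error = tf.abs(error) < 1

    squared_loss = tf.square(error) / 2  # 0.5 * error^2
    linear_loss = tf.abs(error) - 0.5    # delta * |error| - 0.5 * delta^2 with delta == 1

    # Per element: squared_loss where the mask is True, linear_loss elsewhere
    return tf.where(is_small_error, squared_loss, linear_loss)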
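
The same loss can be sketched in NumPy with np.where, this time with δ as a parameter instead of fixed at 1. The function name `huber_np` and the argument name `delta` are illustrative choices, not taken from any library.

```python
import numpy as np

def huber_np(y_true, y_pred, delta=1.0):
    """Element-wise Huber loss (hypothetical helper, not a library function)."""
    error = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    abs_error = np.abs(error)

    squared_loss = 0.5 * error ** 2                     # used when |error| <= delta
    linear_loss = delta * abs_error - 0.5 * delta ** 2  # used otherwise

    # Select the branch per element, as in the piecewise definition
    return np.where(abs_error <= delta, squared_loss, linear_loss)

# Errors of magnitude 0.5, 1.0 and 3.0 with the default delta == 1
losses = huber_np([0.0, 0.0, 0.0], [0.5, 1.0, 3.0])
print(losses)  # values: 0.125, 0.5, 2.5
```

Both branches are computed for every element and np.where only selects between them, which is wasteful but keeps the code vectorized.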

If all the arrays are 1-D, np.where(condition, X, Y) is equivalent to:

[x if c else y for (c, x, y) in zip(condition, X, Y)]
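
A quick check of that equivalence on small 1-D arrays. The array values are made up for illustration.

```python
import numpy as np

condition = np.array([True, False, True, False])
X = np.array([1, 2, 3, 4])
Y = np.array([10, 20, 30, 40])

# Vectorized selection: take from X where condition is True, else from Y
vectorized = np.where(condition, X, Y)

# The same selection written as an explicit element-by-element loop
elementwise = [x if c else y for (c, x, y) in zip(condition, X, Y)]

print(vectorized.tolist())            # [1, 20, 3, 40]
print([int(v) for v in elementwise])  # [1, 20, 3, 40]
```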
