
15 Loss Functions

Loss functions are curried and have the following type:
(target-fn? . -> . expectant-fn?)
where a target-fn? accepts a tensor?, then a theta?, and returns a tensor?. An expectant-fn? accepts xs and ys, two tensors representing a subset of the dataset, and returns an objective-fn?.

These are defined as follows:
  • target-fn? : (-> tensor? (-> theta? tensor?))

  • expectant-fn? : (-> tensor? tensor? objective-fn?)

  • objective-fn? : (-> theta? tensor?)

The tensor returned by an objective-fn? must have rank 1, and its tlen must equal the number of elements in xs.
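To make the three-layer currying concrete, here is a minimal Python sketch of the same structure, using nested lists as rank-2 tensors. The make_loss helper is illustrative only, not part of malt:

```python
def make_loss(per_example_loss):
    """Curried loss builder mirroring malt's shape:
    loss : target-fn? -> expectant-fn?
    expectant : xs, ys -> objective-fn?
    objective : theta -> rank-1 list of per-example losses
    """
    def loss(target):
        def expectant(xs, ys):
            def objective(theta):
                pred_ys = target(xs)(theta)
                # One loss value per element of xs, matching the rank-1 result.
                return [per_example_loss(p, y) for p, y in zip(pred_ys, ys)]
            return objective
        return expectant
    return loss
```

With a squared-error per-example loss and a toy scaling target, calling the result with a theta yields one loss per example, as the rank-1 requirement above describes.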

The following loss functions are available in malt.

procedure

(((l2-loss target) xs ys) θ)  tensor?

  target : (-> tensor? (-> theta? tensor?))
  xs : tensor?
  ys : tensor?
  θ : theta?
Implements the sum-of-squared-errors (SSE) loss function.
(let ((pred-ys ((target xs) θ)))
  (sum
    (sqr
      (- ys pred-ys))))
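The same arithmetic, sketched in Python for clarity, with plain lists standing in for tensors (helper names are illustrative, not malt's API; malt's sum and - broadcast over the batch automatically):

```python
def l2_per_example(pred_row, y_row):
    # Sum of squared differences for one example: sum((ys - pred-ys)^2).
    return sum((y - p) ** 2 for p, y in zip(pred_row, y_row))

def l2_loss_batch(pred_ys, ys):
    # One SSE value per example, matching the rank-1 objective result.
    return [l2_per_example(p, y) for p, y in zip(pred_ys, ys)]
```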

procedure

(((cross-entropy-loss target) xs ys) θ)  tensor?

  target : (-> tensor? (-> theta? tensor?))
  xs : tensor?
  ys : tensor?
  θ : theta?
Implements the cross-entropy loss function.
(let ((pred-ys ((target xs) θ))
      (num-classes (ref (reverse (shape ys)) 0)))
  (* -1
    (/ (dot-product ys (log pred-ys))
       num-classes)))
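The per-example computation can be sketched in Python as follows, with a plain list standing in for one row of ys (the helper name is illustrative, not malt's API; num_classes corresponds to the last element of the shape of ys, as in the Racket code above):

```python
import math

def cross_entropy_per_example(pred_row, y_row):
    # -(ys . log(pred-ys)) / num-classes for one example.
    num_classes = len(y_row)
    dot = sum(y * math.log(p) for p, y in zip(pred_row, y_row))
    return -dot / num_classes
```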

procedure

(((kl-loss target) xs ys) θ)  tensor?

  target : (-> tensor? (-> theta? tensor?))
  xs : tensor?
  ys : tensor?
  θ : theta?
Implements the KL-divergence loss function.
(let ((pred-ys ((target xs) θ)))
  (sum (* pred-ys (log (/ pred-ys ys)))))
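For one example, the divergence term can be sketched in Python like this (plain lists stand in for rank-1 tensors; the helper name is illustrative, not malt's API). Note the direction: per the formula above, this measures the divergence of pred-ys from ys:

```python
import math

def kl_per_example(pred_row, y_row):
    # sum over classes of pred-ys * log(pred-ys / ys).
    return sum(p * math.log(p / y) for p, y in zip(pred_row, y_row))
```

The result is zero when the two distributions coincide and positive otherwise.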