The Multi-Class SVM Loss measures how badly the model got things wrong:
Loss = max(0, score_j - score_correct + 1)
If the prediction is already correct enough (inside the margin), the loss stays at zero.
The minimum possible loss is 0 โ perfect classification.
If all class scores are roughly equal, the loss will be (number of classes - 1)
โ a useful debug check.
Using the mean instead of sum doesnโt change the behavior; it just scales it.