top of page

SoftMax and Multi-class classification

  • 작성자 사진: Shin Yoonah, Yoonah
    Shin Yoonah, Yoonah
  • 2022년 7월 29일
  • 1분 분량

최종 수정일: 2022년 8월 8일

Review the argmax function

argmax: returns the index corresponding to the largest value in a sequence of numbers

Largest value in Z = 100

Corresponding index = 0

Therefore the argmax function will return zero


Multi-class

Logistic regression can be used for two classes, but how about if we wanted to solve the following three class problem

Instead of using one plane to classify the data, we will use one plane for each class

  1. three equations representing three classes

2. Use the graph to represent equations

- nodes are representing the different components of X

- add nodes for each output Z


The edges represent the different learnable parameters with subscript indicating the

dimensions


This is the plane where Z equals zero

the line is where the decision plane intersects with the plane Z equals zero


Overlay the sample images, line split the class

If the input is in the blue region, the value of zero corresponding to the equation zero is the largest

=> Therefore, blue region has a higher value than the other regions


It works same in the other regions (red and yellow)


*Use the planes to classify this unknown point*

Calculate the output of the plane => apply the argmax function

result = class one (blue)


SoftMax

Since the actual distances, that is dot products for each input vector with the parameters connected to the probability



Similar to logistic regression

=> the process of classification is similar but we use the output of the probability


Other method

Sometimes SoftMax is not the best option for multi-class classification

  1. One - vs - rest

  2. One - vs - One

These methods are used for support vector machine


Copyright Coursera All rights reserved

Comments


bottom of page