Object Detection

최종 수정일: 2022년 8월 25일

Outline

- Sliding Windows

- Bounding Box

- Bounding Box Pipeline

- Score

Image classification predicts the class of an object in an image

Classification and object localization

--> Locate the presence of an object and indicate the location with a bounding box and their classes

Sliding Window

= algorithm

- If we want to detect a dog, we consider a fixed window size

- If chosen property, the dog will occupy most of the window

Essentially a sub image that we would like to classify as a dog

The other sub images - classified as background

(Image that does not contain the dog)

*Process

Start in one region in the image, classify that sub-image
Then shift the window and classify the next sub-image
Repeat the process -- when the object occupies with of the window, it will be classified

Problems of Sliding Windows

Overlapping Boxes: object detects often output many overlapping detections
Object Sizes: have the issue of object sizes, where the same object can come in different sizes/Solution: reshaping the image
Overlapping Objects: this may pose issues to the sliding windows

Bounding Box

Bounding box = a rectangular box that can be determined with the lower-right corner of the rectangle with coordinates y=0 and x=0

Y and X are not the same as the classification labels y and the image x