E N D
OBJECT DETECTION HOG AND SIFT ALGORITHM
HOG ALGORITHM • HOG stands for histogram of oriented gradients. • The hog descriptor focuses on structure or shape of the object. • It uses magnitude as well as direction of the gradient to compute the features. • It generates histogram by using magnitude and direction of the gradient.
Here we calculating gradient magnitude and direction, to calculate pixels intensity we need • X direction=|40-70|=30 • Y direction=|20-70|=50 • By these values we are calculating magnitude and direction of the gradient • By using magnitude and direction we calculate feature vectors
Before getting the hog feature and after concatenating feature vectors we are supposed to do normalize. • Suppose we have taken 150*300 pixels and multiply with 2 to increase the brightness and divided by 2 to decrease the brightness, then you cant compare two images without normalization bec’z the pixels intensity will be changed. • But if you normalize the feature vectors it is easy to compare
• For hog features giving human template and giving output for convolving with human model Then it will predict whether it is human or not.
SIFT ALGORITHM • SIFT stands for scalar invariant feature transform and was first presented in 2004, by D.Lowe, University of British Columbia. • It is a way to describe a local area in an image. • In this whole image is reduced to set of points. • SIFT is invariance to image scale and rotation. • This algorithm is patented, so this algorithm is included in the Non-free module in OpenCV.
Major advantages of SIFT are • Distinctiveness: individual features can be matched to a large database of objects • Quantity: many features can be generated for even small objects • Efficiency: close to real-time performance • Extensibility: can easily be extended to a wide range of different feature types, with each adding robustness.
The scale space of an image is a function L(x,y,σ) that is produced from the convolution of a Gaussian kernel(Blurring) at different scales with the input image. • Within an octave, images are progressively blurred using the Gaussian Blur operator. • Mathematically, “blurring” is referred to as the convolution of the Gaussian operator and the image. • Gaussian blur has a particular expression or “operator” that is applied to each pixel. What results is the blurred image. gaussian blur Gaussian blur operator
In difference of gaussian kernel(DOG) we use those blurred images to generate another set of images • These dog images are used to find interesting points in the image. • These process is done for all images in the gausian pyramid. • After that we stack those different images on top of the each other and basically your looking extreme points. • In these locally distinct which stand out and those are your key points.