Yumi's Blog

Part 2 Object Detection using YOLOv2 on Pascal VOC2012 - input and output encoding

Screen Shot 2018-12-26 at 3.21.01 PM The screenshot of Andrew Ng's YOLO lecture

This is the second blog post of Object Detection with YOLO blog series. This blog discusses the YOLO's input and output encoding. I will use PASCAL VOC2012 data.

This blog assumes that the readers have watched Andrew Ng's YOLO lectures on youtube. Specifically, the following 5 videos. Each of these videos are (of course free and) about 10 minutes and in total it takes less than 45 minitues. So please watch through all the videos.

Part 5 Object Detection using RCNN on Pascal VOC2012 - inference

bbx_example

This is the last article of the blog series for Object Detection with R-CNN.

If you are reading this blog, congratulations for getting this far. Now you are ready to experiment the performance of your RCNN classifier. I will use my own image to see whether the classifier can detect my face.

Part 4 Object Detection with Pascal VOC2012 - CNN feature extraction

This is part of the blog series for Object Detection with R-CNN.

Screen Shot 2018-11-23 at 2.43.29 PM Cited from VGG in TensorFlow.

In this blog, we are now ready to train our classifier for object detection. We will use a large pre-trained CNN to extract a fixed-length feature vector from each region, and then create artificial neural networks that mapps the feature vector to the object class. We will focus on detecting a person.

Part 3 Object Detection using RCNN on Pascal VOC2012 - Selective Search

This is part of the blog series for Object Detection with R-CNN.

In this blog, we will review the selective sarch algorithm. The selective search is one of the most successful category-independent region proposal algorithms, and R-CNN also uses selective search to find region proposal.

J.R.R. Uijlings et al take a hierarchical grouping algorithm to form the basis of selective search, and first apply fast segmentation method of Felzenszwalb and Huttenlocher

Part 2 Object Detection using RCNN on Pascal VOC2012 - R-CNN overview

Screen Shot 2018-11-18 at 4.58.16 PM Cited from Rich feature hierarchies for accurate object detection and semantic segmentation paper

This is the second blog post of "Object Detection with R-CNN" series.

In this blog, I will review Rich feature hierarchies for accurate object detection and semantic segmentation paper to understand Regions with CNN features (R-CNN) method. R-CNN is a successful object detection algorithm that can return class label of objects and their bounding boxes for a given image. The work is published in 2013 and there have been many faster algorithms for the object detection algorithm (e.g., fast R-CNN, faster R-CNN and Yolo). But nevertheless, the implementation of the R-CNN is simple, and serves as a powerful bench mark for various object detection tasks. So for that reason, this blog will review the R-CNN algorithm.

Implement Local Binary Pattern Descriptor from scratch

LBP operated image

My goal of this blog post is to understand Local Binary Pattern (LBP) texture operator. It is a simple yet very powerful algorithm to understand image. With the features created by the LBP texture operator, we can tell the texture of the objects in image; The features can, for example, separate images of carpet from image of blicks. That sounds very cool and I want to understand LBP texutre operator better.

Low and High pass filtering on images using FFT

Frequency analysis

In this blog post, I will use np.fft.fft2 to experiment low pass filters and high pass filters.

**Low Pass Filtering** A low pass filter is the basis for most smoothing methods. An image is smoothed by decreasing the disparity between pixel values by averaging nearby pixels (see Smoothing an Image for more information).