Optical Flow


Overview

Large motions remain a challenge for current optical flow algorithms. Traditionally, large motions are addressed using multi-resolution representations like Gaussian pyramids. To deal with large displacements, many pyramid levels are needed and, if the object is small, it may not be visible at the highest pyramid levels. To address this we describe images using a channel representation (CR) and replace the standard brightness constancy assumption with a new descriptor constancy assumption, where the descriptor is the channel representation element at each pixel. Loosely, CRs can be thought of as over-segmenting the scene into layers based on some image feature and representing an image as a collection of local distributions, one at each pixel. If the appearance of a foreground object differs from the background then its descriptor will be different and it will be represented in different layers. We can thus smooth the layers of the CR without mixing together the foreground and background. Thus, we can apply pyramid techniques to the CR to capture large motions without completely losing small objects or blurring across motion boundaries. On the MPI-Sintel benchmark, we are more accurate than the baseline for fast motions and near motion boundaries.



Faculty


Graduate Students


Collaborators


References

  • Laura Sevilla-Lara, Deqing Sun, Erik G. Learned-Miller, Michael J. Black.
    Optical Flow Estimation with Channel Constancy
    European Conference on Computer Vision (ECCV), 2014. (To appear)
    [pdf]