We introduce a novel Deep Network architecture that implements the full feature point handling pipeline, that is, detection, orientation estimation, and feature description. While previous works have successfully tackled each one of these problems individually, we show how to learn to do all three in a unified manner while preserving end-to-end differentiability. We then demonstrate that our Deep pipeline outperforms state-of-the-art methods on a number of benchmark datasets, without the need of retraining.
This teaser video shows feature matching results with our integrated LIFT pipeline and SIFT, for selected sequences of all three datasets, Strecha, DTU, and Webcam. Our results are significantly better overall compared to SIFT. Note that, in our experiments, SIFT still gives results that are on par with the state-of-the-art when evaluated as a whole pipeline. Please see the paper for details.
Click the following link for the supplementary appendix for implementation details.
Dataset and Codes
Datasets used in the paper.
Codes for LIFT with learned modules.