Slides:

(Wedesday 2023-09-20)
By Marc Pollefeys and Siyu Tang. Slides from Luc Van Gool, Andreas Geiger, Ioannis Gkioulekas, and many others.

What a human sees: looking at it, you automatically start recognizing objects, imagine the place, etc.

What a computer sees: a matrix of numbers, encoding the amount of light in that location (or for the 3 colors…)
The goal of computer vision is to give computers (super) human-level perception (in a general setting we are very far from that, but in some narrow tasks cv algorithms can be already better than human level)
The goals more concretely: automatic understanding of images and video
Vision for measurement examples:
Mars rovers need to map their surroundings to be able to choose their next moves (bc information exchange is ≥ 8 mins, so a certain autonomy is needed) → stereo camera system

3D surface models of buildings from videos or set of photos (see prof. Pollefeys’ PhD thesis: https://people.inf.ethz.ch/marc.pollefeys/pubs/PollefeysPhD.pdf) (image on the right)
3D models from photo collections from the internet, using better feature algorithms


Vision for perception, interpretation:

What does it mean, to see? (asked in the very influential book: Vision by David Marr)
Related disciplines: