Mirrors are often studied for camera calibration because they provide a symmetric relationship for an object, which guarantees synchronization across multiple views. However, the reflection matrices of the mirrors are sometimes difficult to compute. This thesis addresses camera calibration and shape recovery from a two-mirror system that generates five views of an object. It first studies the similarity between circular motion and the motion formed by the five views in the two-mirror system. It shows that the motion formed by the five views can be regarded as two circular motions, so computing the reflection matrices of the mirrors can be avoided. The thesis then addresses the most important problem: recovering the vanishing line of the rotation plane and the imaged circular points from two unknown equal angles via metric rectification. Once these are known, the imaged rotation axis and the vanishing point of the X-axis can readily be recovered from the imaged circular points. Unlike the state-of-the-art algorithm, this thesis avoids computing the vanishing point of the X-axis first, because doing so causes accumulated error when recovering the imaged rotation axis. This information is sufficient to compute the camera intrinsics, which is the main objective of this thesis. Finally, a 3D visual hull model of the object can be reconstructed once the projection matrices of all views have been computed. This thesis uses a short video instead of static snapshots, so the 3D visual hull models reconstructed from the individual frames can be assembled, following the object's motion sequence, into a 3D animation. Compared with 2D video, this animation can help improve the accuracy of action recognition. In general, action recognition from 2D video distinguishes actions according to the side of the person captured in the video and cannot handle sides that do not appear in it. The database must therefore store videos of each human action from every direction, which causes redundancy.
The 3D animation addresses this problem because the reconstructed model can be viewed from every direction, so only one 3D animation per human action needs to be stored in the database. The experimental results show that the more frames are used, the smaller the error in the camera intrinsics, and the reconstructed 3D model demonstrates the feasibility of the approach.
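The link between a two-mirror system and circular motion rests on a general geometric fact: reflecting across two planes that meet at an angle θ is the same as rotating by 2θ about their line of intersection. The following is a minimal numerical sketch of that fact only, not the calibration method of the thesis. The mirror angle, the choice of the z-axis as the intersection line, and all function names are assumptions made for illustration.

```python
import math

def reflection(normal):
    """3x3 reflection matrix for a plane through the origin with the given normal."""
    norm = math.sqrt(sum(c * c for c in normal))
    u = [c / norm for c in normal]
    return [[(1.0 if i == j else 0.0) - 2.0 * u[i] * u[j] for j in range(3)]
            for i in range(3)]

def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def rotation_z(angle):
    """Rotation by `angle` radians about the z-axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

# Hypothetical angle between the two mirrors (an assumption for this sketch).
theta = math.radians(72.0)

# Both mirror planes contain the z-axis, so it is their line of intersection.
n1 = (0.0, 1.0, 0.0)                           # mirror 1: the plane y = 0
n2 = (-math.sin(theta), math.cos(theta), 0.0)  # mirror 2: mirror 1 rotated by theta about z

M1 = reflection(n1)
M2 = reflection(n2)

# Reflecting in mirror 1 and then in mirror 2 gives a rotation by 2 * theta about z.
composed = matmul(M2, M1)
expected = rotation_z(2.0 * theta)

max_diff = max(abs(composed[i][j] - expected[i][j])
               for i in range(3) for j in range(3))
print("largest entry-wise difference:", max_diff)
```

Because a doubly reflected view is related to the original by a pure rotation about the mirrors' intersection line, views that differ by an even number of reflections behave like views taken under circular motion about that line. This is why the individual reflection matrices need not be estimated.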
| Date of Award | 5 Jun 2015 |
| Supervisor | Pong Chi YUEN (Supervisor) |
- Computer vision
- Optical pattern recognition
- Computer animation