First, you do not want to use gluLookAt. gluLookAt rotates the camera, but the physical screen the user is looking at does not rotate. gluLookAt would only be correct if the screen rotated along with the camera so that the screen normal kept pointing at the user. The perspective distortion of the off-axis projection takes care of all the "turning" we need.
What you need to get right in your model is the position of the screen inside the viewing frustum. Consider the following image. The red dots are the borders of the screen. What you want to achieve is that these positions stay fixed in 3D world coordinates (WCS), since the physical screen in the real world (hopefully) does not move either. I think this is the key insight for virtual reality and stereoscopy: the screen is something like a window into the virtual world, and to align the real world with the virtual one you have to align the frustum with that window.

To do this, you need to determine the position of the screen in the Kinect coordinate system. Assuming the Kinect sits on top of the screen, that +y points down, and that the unit you are working in is millimeters, I would expect these coordinates to be something along the lines of (±300, 200, 0) and (±300, 500, 0).
For the far plane there are two options. You can either keep a fixed distance from the camera to the far plane, which means the far plane moves back when the user moves back, possibly clipping objects you want to draw. Or you can keep the far plane at a fixed position in WCS, as shown in the image. I find the latter more useful. For the near plane, a fixed distance from the camera is fine.
The inputs are the 3D screen positions wcsPtTopLeftScreen and wcsPtBottomRightScreen, the tracked head position wcsPtHead, the z value of the far plane wcsZFar (all in WCS), and the z value of the near plane camZNear (in camera coordinates). We need to compute the frustum parameters in camera coordinates:
camPtTopLeftScreen = wcsPtTopLeftScreen - wcsPtHead;
camPtTopLeftNear = camPtTopLeftScreen / camPtTopLeftScreen.z * camZNear;
and analogously for the bottom-right point. Also:
camZFar = wcsZFar - wcsPtHead.z

Now the only remaining problem is that the Kinect and OpenGL use different coordinate systems. In the Kinect CS, +y points down and +z points from the user towards the Kinect. In OpenGL, +y points up and +z points towards the viewer. This means we have to multiply y and z by -1:
glFrustum(camPtTopLeftNear.x, camPtBottomRightNear.x, -camPtBottomRightNear.y, -camPtTopLeftNear.y, camZNear, camZFar);
If you want a better explanation that also covers stereoscopy, watch this video; I found it insightful and well made.
A quick demo; you may need to adjust wcsWidth, pxWidth and wcsPtHead.z.
#include <glm/glm.hpp>
#include <glm/ext.hpp>
#include <glut.h>
#include <functional>
#include <cstdlib> // for exit()

float heightFromWidth;
glm::vec3 camPtTopLeftNear, camPtBottomRightNear;
float camZNear, camZFar;
glm::vec3 wcsPtHead(0, 0, -700);

void moveCameraXY(int pxPosX, int pxPosY)
{
    // Width of the screen in mm and in pixels.
    float wcsWidth = 520.0f;
    float pxWidth = 1920.0f;
    float wcsHeight = heightFromWidth * wcsWidth;
    float pxHeight = heightFromWidth * pxWidth;
    float wcsFromPx = wcsWidth / pxWidth;

    glm::vec3 wcsPtTopLeftScreen(-wcsWidth / 2.f, -wcsHeight / 2.f, 0);
    glm::vec3 wcsPtBottomRightScreen(wcsWidth / 2.f, wcsHeight / 2.f, 0);
    wcsPtHead = glm::vec3(wcsFromPx * float(pxPosX - pxWidth / 2),
                          wcsFromPx * float(pxPosY - pxHeight * 0.5f),
                          wcsPtHead.z);
    camZNear = 1.0f;
    float wcsZFar = 500;

    glm::vec3 camPtTopLeftScreen = wcsPtTopLeftScreen - wcsPtHead;
    camPtTopLeftNear = camZNear / camPtTopLeftScreen.z * camPtTopLeftScreen;
    glm::vec3 camPtBottomRightScreen = wcsPtBottomRightScreen - wcsPtHead;
    camPtBottomRightNear = camPtBottomRightScreen / camPtBottomRightScreen.z * camZNear;
    camZFar = wcsZFar - wcsPtHead.z;

    glutPostRedisplay();
}

void moveCameraZ(int button, int state, int x, int y)
{
    // No mouse wheel in GLUT. :(
    if ((button == 0) || (button == 2))
    {
        if (state == GLUT_DOWN)
            return;
        wcsPtHead.z += (button == 0 ? -1 : 1) * 100;
        glutPostRedisplay();
    }
}

void reshape(int w, int h)
{
    heightFromWidth = float(h) / float(w);
    glViewport(0, 0, w, h);
}

void drawObject(std::function<void(GLdouble)> drawSolid,
                std::function<void(GLdouble)> drawWireframe, GLdouble size)
{
    glPushAttrib(GL_ALL_ATTRIB_BITS);
    glEnable(GL_COLOR);
    glDisable(GL_LIGHTING);
    glColor4f(1, 1, 1, 1);
    drawSolid(size);
    glColor4f(0.8, 0.8, 0.8, 1);
    glDisable(GL_DEPTH_TEST);
    glLineWidth(1);
    drawWireframe(size);
    glColor4f(0, 0, 0, 1);
    glEnable(GL_DEPTH_TEST);
    glLineWidth(3);
    drawWireframe(size);
    glPopAttrib();
}

void display(void)
{
    glPushAttrib(GL_ALL_ATTRIB_BITS);
    glClear(GL_COLOR_BUFFER_BIT | GL_DEPTH_BUFFER_BIT);
    glEnable(GL_DEPTH_TEST);

    // In the Kinect CS, +y points down, +z points from the user towards the Kinect.
    // In OpenGL, +y points up, +z points towards the viewer.
    glm::mat4 mvpCube;
    mvpCube = glm::frustum(camPtTopLeftNear.x, camPtBottomRightNear.x,
                           -camPtBottomRightNear.y, -camPtTopLeftNear.y,
                           camZNear, camZFar);
    mvpCube = glm::scale(mvpCube, glm::vec3(1, -1, -1));
    mvpCube = glm::translate(mvpCube, -wcsPtHead);

    glMatrixMode(GL_MODELVIEW);
    glLoadMatrixf(glm::value_ptr(mvpCube));
    drawObject(glutSolidCube, glutWireCube, 140);

    glm::mat4 mvpTeapot = glm::translate(mvpCube, glm::vec3(100, 0, 200));
    mvpTeapot = glm::scale(mvpTeapot, glm::vec3(1, -1, -1)); // teapots are in OpenGL coordinates
    glLoadMatrixf(glm::value_ptr(mvpTeapot));
    glColor4f(1, 1, 1, 1);
    drawObject(glutSolidTeapot, glutWireTeapot, 50);

    glFlush();
    glPopAttrib();
}

void leave(unsigned char, int, int)
{
    exit(0);
}

int main(int argc, char **argv)
{
    glutInit(&argc, argv);
    glutCreateWindow("glut test");
    glutDisplayFunc(display);
    glutReshapeFunc(reshape);
    moveCameraXY(0, 0);
    glutPassiveMotionFunc(moveCameraXY);
    glutMouseFunc(moveCameraZ);
    glutKeyboardFunc(leave);
    glutFullScreen();
    glutMainLoop();
    return 0;
}
The following images should be viewed from a distance equal to 135% of their width on your screen (70 cm on my 52 cm wide screen in fullscreen).
