Automatic Tongue Tracking in X-Ray Images
Abstract
X-ray imaging is an effective technique for capturing the continuous motion of the vocal tract during speech, and the active appearance model (AAM) is a useful tool for analyzing X-ray images. For the task of tongue tracking in X-ray images, however, the accuracy of AAM fitting is insufficient. AAM aims to minimize the residual error between the model appearance and the input image, and it often fails to converge accurately to the true landmarks. To improve tracking accuracy, we propose a fitting method that integrates the constrained local model (CLM) into AAM. We first combine the objective functions of AAM and CLM into a single objective function. We then project out the texture variation and derive a gradient-based method to optimize this objective. The method thus exploits not only the shape prior and global texture but also the local texture around each landmark. Experiments demonstrate that the proposed method significantly reduces the fitting error. We also show that realistic 3D tongue animation can be created from the tongue tracking results on the X-ray images.
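The idea of a single objective that mixes a global, project-out texture term with a local landmark term can be illustrated with a minimal sketch. This is not the formulation used in this work: the function names, the `clm_weight` parameter, the use of local detector peaks for the CLM term, and the assumption of an orthonormal texture basis are all hypothetical choices made for illustration.

```python
import numpy as np


def project_out(residual, texture_basis):
    """Remove the part of `residual` that lies in the span of the texture basis.

    Assumption: the columns of `texture_basis` are orthonormal.
    """
    return residual - texture_basis @ (texture_basis.T @ residual)


def combined_cost(sampled_texture, mean_texture, texture_basis,
                  landmarks, local_peaks, clm_weight=1.0):
    """Toy combined objective (hypothetical, for illustration only).

    AAM-like term: squared norm of the texture residual after projecting
    out the texture variation (global appearance).
    CLM-like term: squared distance between each landmark and the peak
    position proposed by its local detector (local appearance).
    """
    aam_residual = project_out(sampled_texture - mean_texture, texture_basis)
    aam_term = float(aam_residual @ aam_residual)
    clm_offsets = landmarks - local_peaks
    clm_term = float(np.sum(clm_offsets ** 2))
    return aam_term + clm_weight * clm_term
```

In a full fitting method, a cost of this kind would be minimized over the shape parameters with a gradient-based update; the sketch only evaluates the cost for given inputs.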