Abstract

Example-based mesh deformation techniques produce natural and realistic shapes by learning the space of deformations from examples. However, skeleton-based methods cannot manipulate a global mesh structure naturally, whereas the mesh-based approaches based on a translational control do not allow the user to edit a local mesh structure intuitively. This paper presents an example-driven mesh editing framework that achieves both global and local pose manipulations. The proposed system is built with a surface deformation method based on a two-step linear optimization technique and achieves direct manipulations of a model surface using translational and rotational controls. With the translational control, the user can create a model in natural poses easily. The rotational control can adjust the local pose intuitively by bending and twisting. We encode example deformations with a rotation-invariant mesh representation which handles large rotations in examples. To incorporate example deformations, we infer a pose from the handle translations/rotations and perform pose space interpolation, thereby avoiding involved nonlinear optimization. With the two-step linear approach combined with the proposed multiresolution deformation method, we can edit models at interactive rates without losing important deformation effects such as muscle bulging.

1. Introduction

Editing and animation of character models is an important task in computer graphics. The user, in general, demands the editing procedure to be interactive and intuitive. However, the ability to pose a model naturally and realistically is equally important for a character modeling system. Example-based techniques are a promising approach to improve the realism of interactive editing techniques. Pose space deformation (PSD) [1] is a skeleton-based approach which animates the model using skeletons and then corrects the resulting surface to conform to example shapes. In PSD, corrective displacements, the differences between a skinning surface and examples, are interpolated in pose space formed by joint angles. These displacements are then added to the skinning surface. While this method can model deformation effects such as muscle bulging, editing the pose of the model naturally is difficult and tedious as the number of joints increases.

In contrast, the mesh-based inverse kinematics (MeshIK) technique [2, 3] can pose a model meaningfully by just translating a few vertices of the mesh. Instead of interpolating corrective displacements, MeshIK interpolates local deformations called deformation gradients. It automatically determines interpolation weights from the user-specified position constraints through nonlinear optimization. Because of this iterative process, it is time consuming for this method to edit a large model. Moreover, while MeshIK can edit the global pose naturally, it is counterintuitive to manipulate the local pose via translation; bending and twisting are more intuitive in this case.

In this paper, we present an example-driven mesh deformation technique that allows us not only to edit a global pose naturally but also to adjust a local pose precisely. Our system provides both translational and rotational controls to deform a model. By translating handle vertices, the user can produce a model in natural poses easily. By applying rotations to handle triangles, the user can bend and twist the model as if manipulating a joint. To this end, we formulate the problem by inferring a pose from handle translations/rotations and using a two-step linear surface optimization technique that is based on a linear rotation-invariant mesh representation. The benefits of this formulation are as follows: (1) it is straightforward to incorporate translational and rotational constraints; (2) it achieves rotation invariance and thus is applicable to examples that involves large deformations (rotations and bending); (3) it avoids time-consuming nonlinear optimization. In combination with the proposed multiresolution approach, we can edit high-resolution meshes at interactive rates while retaining important deformation effects such as muscle bulging.

The rest of the paper is organized as follows. In Section 2, we briefly review-related work. We provide an overview of our system in Section 3. We then describe our basic deformation framework in Section 4. In Section 5, we explain our PSD technique. Section 6 shows our multiresolution method. We demonstrate our method and show our results in Section 7. Finally, we conclude our work in Section 8.

Pose Space Deformation
Pose space deformation (PSD) [1] is an example-based deformation method that corrects the artifacts of skeletal subspace deformation (SSD) and models subtle surface deformation, such as muscle bulging. This method is thought of as a combination of SSD and shape interpolation (morphing). In PSD, an initial mesh is first computed by SSD. This is then corrected by adding corrective displacements. These displacements, regarded as differences between SSD and examples, are interpolated such that they can properly correct the SSD surface in an arbitrary pose. Kry et al. [4] compressed corrective displacements using PCA to make them more compact. Kurihara and Miyata [5] introduced weighted pose space deformation to achieve PSD from a limited number of examples. Weber et al. [6] showed that the use of differential surface deformation in the PSD framework improves the quality of the result. They represented corrections with rotations and integrated them by solving a Poisson system. Wang et al. [7] introduced a similar method to [6] using a regression model.

Incorporating Real Dataset
To create realistic animation, some methods [811] use range scanned data as examples. Allen et al. [8] captured 3D shapes of body parts in different poses and animated them using a k-nearest neighbor approach. Anguelov et al. [9] and Hasler et al. [10] developed statistical human models that can create new human models with different body shapes and poses.

Linear Surface Deformation
Linear differential surface deformation methods (see, e.g., [1214] and a well-organized survey [15]) deform a mesh by manipulating handle points or triangles, which provide position or transformation constraints, while preserving original details. The problem with these methods is the deformation of the model must be known beforehand to obtain the deformed vertex positions. Therefore, explicit methods [14, 16, 17] propagate user-specified transformations to other regions of the mesh. The problem of this approach is the translation-insensitivity: this approach cannot avoid shearing distortions when pure translations are applied, because there is no change in orientation. On the other hand, implicit methods [13] linearly approximate transformations. This approach fails when handles are moved largely, because, in 3D, the approximation is only valid for small rotation angles. In 2D, however, we can completely determine transformations linearly. Igarashi et al. [18] exploits this fact and proposed a 2D deformation method that works for large translations, based on a two-step as-rigid-as possible deformation framework. Two-step optimization methods in 3D, based on a linear rotation-invariant mesh representation [10, 12, 19], optimize transformations first and then optimize positions. Because these methods optimize rotations and positions separately, they are also translation insensitive.

Nonlinear Surface Deformation
In contrast, nonlinear methods solve these problems by iteratively optimizing both positions and transformations and thus work for large deformations at the cost of additional computations [2022]. The main criteria for optimizing vertex positions are rigidity, smoothness, and user-specified position constraints. The optimization problem is solved either using the Gauss-Newton method [23] or through a local/global approach [22, 24]. Kilian et al. [25], in contrast, introduced an interesting approach that formulates the optimization as a path problem instead of deforming the reference shape. To improve performance, recent methods [23, 24, 2629] incorporate skeleton/space deformation to form a subspace. These methods are, therefore, scalable in mesh size, that is, the computational cost does not depend on the number of vertices. Consequently, they can edit a large model in real time. Our method also utilizes a subspace method, but we not only increase the speed but also compress example deformations.

Mesh-Based Inverse Kinematics
The mesh-based inverse kinematics (MeshIK) system edits a mesh by manipulating handle points while constraining deformation in example space [2]. Although this method can pose a mesh meaningfully only by moving a few handles, it is expensive to edit a large mesh due to the nonlinear optimization it solves. The computational performance of MeshIK can be improved with the use of the reduced deformable model [3], but it loses deformation effects, for example, muscle bulging. In addition, because the feature vector used in MeshIK is not rotation invariant, there is the risk of producing erroneous results if the orientations and poses of examples differ significantly, as shown in Figure 4.

Deformation Transfer
Deformation transfer [30] is another related class of techniques that literally transfers deformation of one model to another to change the pose. In fact, as pointed out by Botsch et al. [31], this method solves exactly the same Poisson system as the deformation method of [14]. Semantic deformation transfer [32] allows the transfer to drastically different characters that move in unique ways. Like our method, they interpolate a rotation-invariant mesh representation, but the interpolation weights come from the projection of a new model to the example space. In contrast, our method manipulates a surface with handle translations/rotations and computes interpolation weights from them.

Our method is most similar to the variational PSD proposed by Weber et al. [6] and MeshIK proposed by Sumner et al. [2] in the sense that it formulates an example-based deformation technique using differential surface representation. However, we use the rotation-invariant mesh representation. Thus, our method can handle large rotations robustly. In addition, we allow the user to specify both translational and rotational constraints to deform a model. This enables us to edit both global and local poses of the model.

3. Overview

Our goal is to develop a mesh editing system that can manipulate a pose of the model both globally and locally. An overview of our method is depicted in Figure 1. We provide a translational control to achieve global pose editing (Figure 1(a)) and a rotational control for local pose editing (Figure 1(b)).

Our method is based on pose space interpolation and a two-step surface optimization method. The workflow is depicted in Figure 1(c). Prior to deformation, examples are encoded and stored as the rotation-invariant mesh representation. To achieve deformation, the user selects a set of vertices or triangles called handles. If a certain region on the mesh should be kept fixed, the user specifies the fixed region (blue), which is not affected by the examples. From handle translations or rotations, we infer a pose of a model (Section 5.1). When editing with the translational control, the user specifies a coordinate frame and handle vertices on the mesh. The pose is evaluated as relative positions of handles in the local coordinate frame. When editing with the rotational control, the user places a pair of handle triangles called virtual joint. The pose is evaluated as relative rotations between them. Once we have the pose, we then interpolate example deformations in pose space (Section 5.2). The deformed vertex coordinates are obtained using a two-step optimization approach. First, we optimize transformations (Section 4.2.1). Next, we optimize vertex positions from the resulting transformations. Finally, we perform multiresolution deformation to edit high resolution meshes (Section 6).

3.1. Notations

Inputs to our system are the base mesh and example meshes. The base mesh is in the rest pose, and the example meshes are provided in the form of additional geometries (the connectivity is shared with all the examples). Each mesh consists of vertices and triangles. We denote the indices of a vertex and a triangle as and , respectively. We denote the index of an edge between triangle and by , where is the number of triangle pairs. The vertex position is defined by , and it is concatenated into an matrix, . Also, we denote vertices by after deformation.

4. Basic Deformation Framework

Our system is based on the pose space deformation framework that is thought of as a combination of surface deformation and shape interpolation. To integrate these two effectively, we use a two-step optimization method based on a rotation-invariant mesh representation similar to those proposed in [12, 19]. This formulation makes it straightforward to incorporate both translational and rotational constraints. We refer to our rotation-invariant representation as rotation-invariant encoding that is computed from a local deformation called the deformation gradient. In this section, we first describe derivations of these representations. Next, we explain the reconstruction process of vertex positions from these representations. Finally, we introduce surface manipulation tools: handle-based surface deformation and shape interpolation.

4.1. Basic Shape Representation
4.1.1. Deformation Gradient

Consider triangle with its three vertices before and after deformation: and . Given edge vectors before and after deformation, and , we can approximate the deformation gradient by where is the pseudoinverse of edge vectors before deformation. Thus, the computation of deformation gradients can be achieved linearly from vertex positions after deformation . Using a deformation gradient operator , which contains the pseudoinverses of the edge vectors before deformation, we can compute by

4.1.2. Rotation-Invariant Encoding

The deformation gradient is translation invariant but not rotation invariant, which means that it is affected by global rotations. To address this issue, we first decompose the deformation gradient into rotation and scale/shear components by polar decomposition, that is, (Figure 2). Note that the scale/shear component is already rotation invariant, but is not. To achieve rotation invariance, we compute the relative rotation from the absolute rotations of two adjacent triangles, and , as follows: We call and rotation-invariant encoding because they encode changes in surface details and are invariant under the global rotation.

4.2. Reconstruction

To reconstruct vertex positions from rotation-invariant encodings, we solve two sparse linear systems. We use the penalty method [33] to enforce rotation/position constraints. Although the use of the penalty method can only provide approximate constraint satisfaction, it can produce sufficiently close results to the exact method by setting the penalty weight sufficiently high [33]. On the other hand, if we use small values for penalty weights, the deformed mesh does not necessarily conform to handle positions. This flexibility is useful in our pose space surface deformation framework such that the user can adjust deformations to satisfy position constraints exactly or to conform to example deformations. Using a small weight, we can avoid local distortions around handles, caused by the deviation of the pose from the interpolation range.

4.2.1. Rotation Optimization

To compute the new absolute rotation from the relative encoding , we rewrite (3) as follows: By gathering this equation for all the triangle pairs in the mesh, we can write this with a sparse linear system in the following form: where is a identity matrix and the size of is . When achieving surface deformation, rotation constraints are applied to the set of triangles that include . Thus, the absolute rotations are optimized such that they satisfy both (5) and the rotation constraints. In general, there is no exact solution for this, and thus we need a least-squares approximation, that is, (5) becomes . Using the penalty method [33], the problem is formulated as follows: where is the weight of rotation constraints; is a matrix of rotation constraints containing transposed target rotations at to rows if , and otherwise 0; is a diagonal matrix containing 1 at to rows if , and otherwise 0. With , we can choose actual rotations that can be constrained to target rotations as follows: where is a identity matrix. Note that must be constructed and factorized once for every frame, which is somewhat costly for a large mesh.

Having obtained , we perform matrix orthonormalization using SVD to factor out undesired shears. We then compute the new deformation gradient by . Note that we do not allow the user to use scale constraints in our method. Thus, comes only from the result of shape interpolation (Section 4.3.2). For a detail-preserving deformation, is omitted (Section 4.3.1).

4.2.2. Vertex Optimization

Once the deformation gradients are obtained, we can reconstruct vertex coordinates by solving (2). Equation (2) is solved in a weighted least-square sense: where is a diagonal weight matrix containing the areas of the triangles [15]. As indicated by Botsch et al. [31], is equivalent to the Laplace Bertrami operator , which can be computed directly from the cotangent weights [34]. In contrast to , is factorized once for all the frames, which requires only back substitution and is, therefore, very efficient. When a set of position constraints, , is provided, this is solved via the penalty method, which leads to the following system: where is the weight for position constraints; is a matrix containing target positions at row if , and otherwise 0; is a diagonal matrix having 1 at row if , and otherwise 0. Like , we can choose vertices that should be constrained to the target positions using as follows:

4.3. Surface Manipulation Tools

In this section, we introduce two basic surface manipulation tools for formulating our pose space deformation.

4.3.1. Handle-Based Surface Deformation

To deform a surface, we specify handle vertices/triangles on the mesh and apply translations/rotations to them (Figure 3). In addition, if a certain region on the mesh should be kept fixed, the user specifies the fixed region. To obtain the deformed vertices , we solve sparse linear systems (6) and (8). Note that, because we optimize rotations and vertices separately, rotation constraints and translation constraints should be compatible with each other, otherwise, it produces distortions.

4.3.2. Shape Interpolation

The advantage of performing interpolation using the rotation-invariant encoding is that it is robust to large rotations. However, when simply interpolating the components of , it leads to the artifacts which can be seen in the linear blend skinning. One practical way to correctly interpolating rotations is to map them to using the matrix logarithm, interpolate linearly in , and map back to using the matrix exponential [2, 7, 10, 17, 32]. To achieve this, we use Rodrigues’ formula and convert into an axis angle representation (rotation vector) which is a compact form of log , however, can be interpolated directly. Note that, when linearly interpolating matrix logarithms, the interpolation path is not the shortest path, which introduces error. This error is minimal when the rotation is small, that is, the rotation is close to the identity. Therefore, we encode deformation gradients and rotation-invariant encodings relative to the rest pose [6, 32]. As a result, when we encode the rest pose, is the identity for all triangle pairs.

We concatenate into a matrix, . Likewise, a symmetric matrix is flattened to and concatenated into a matrix, . Interpolation of the rotation-invariant encodings of meshes is achieved with a simple weighted combination. The interpolated values and are obtained from the following: where is the weight for example , which satisfies . After converting and to matrix form, the deformed vertex positions are obtained by solving (6) and (8). We constrain one of the rotations to an identity matrix when solving (6) and fix one of the positions when solving (8).

Interpolation Properties
The rotation-invariant encoding is robust-to-extreme orientation differences of input models [12, 19, 32]. To show this, we compared the interpolation results using rotation-invariant encodings and using deformation gradients (Figure 4). Given two meshes, and , which are in extremely different orientations, we create the intermediate steps between them. Let , , and , be rotation-invariant encodings of two meshes. Then, we can perform interpolation with and . The interpolation of deformation gradients is performed in the same manner except that the rotations are represented in absolute world coordinates. If triangle orientations differ by more than 180 degrees, there is ambiguity in interpolating deformation gradients. Because of this ambiguity, the result using deformation gradients is distorted around the foot, as shown in Figure 4. Shape interpolation using rotation-invariant encodings does not suffer from this ambiguity and produces a natural result.

5. Pose Space Surface Manipulation

Now, we are ready to formulate our pose space surface manipulation technique by unifying the handle-based surface deformation and shape interpolation methods described in the previous section. To integrate these two, rotation-invariant encodings must be altered according to handle movements, that is, a technique to associate handle translations/rotations with interpolation weights must be developed.

5.1. Pose Estimation

To obtain interpolation weights, the original MeshIK [2] solves nonlinear optimization to determine the weights from handle positions. We, in turn, compute interpolation weights from the handle translations/rotations based on pose space interpolation. This allows us to avoid involved nonlinear optimization and to incorporate both translational/rotational controls.

To evaluate the pose of the model, we need the measure that is rotation invariant. To this end, we compute relative positions and rotations from handles (Figure 5).

When editing with the translational control, the user places a coordinate frame and handle vertices on the mesh. The frame is placed on a triangle that exists around the trunk of the model and is used to form a local coordinate to compute relative positions of handles. This way, we can eliminate the effect of global rotations and can evaluate the pose successfully. In addition, our system allows the user to manipulate global orientation and position of the model by rotating and translating the frame.

When editing with the rotational control, the user places a pair of handle triangles called virtual joint. The virtual joint consists of parent, which is placed on the proximal segment, and child, which is placed on the distal segment. The user applies rotations to the child triangle whereas the parent triangle is not manipulated. To apply rotations, the user places a frame on a triangle that exists near the actual joint center in order to use its centroid as a rotation center. By rotating the frame, we can achieve twisting easily. By dragging the child triangle around the frame, bending is achieved. The pose is evaluated as a relative rotation between the virtual joint, which is represented as Euler angles.

5.2. Pose Space Deformation

Next, we compute the interpolation weights from the pose using a scattered data interpolation method. The interpolation weights are obtained by evaluating the pose computed from handle translations/rotations, in the pose space formed by example poses. We would like to compute the interpolation weights such that they satisfy the following criteria.(1)At an example point, the weight for that example must be one, and all other weights must be zero.(2)The weights must add up to 1.(3)The weight must change continuously to produce smooth animation.(4)The absolute value of the weight must be small to avoid excessive exaggeration of deformation.

For this task, we use -nearest-neighbors interpolation [8] because it satisfies all of the above criteria and returns the weights within the range [0–1]. This method chooses the closest example points and assigns weights based on the proximity (distance). All other example points are assigned a weight of zero. Before normalization, the weights for each examples are computed as This is then normalized to sum to one. is the distance to the th example and the is the distance to the nearest point. We use in this paper.

Let be the index of handle vertex or virtual joint. Then, the pose is defined by a vector, , that contains relative positions or relative rotations. The distance between an arbitrary pose and example is defined as follows: where is a pose of example .

After computing interpolation weights, we use (10) to obtain new rotation-invariant encodings. The new vertices are calculated by solving (6) and (8). When editing with translational controls, we constrain the rotation of triangle where the frame is placed, to the frame orientation relative to the rest pose. When editing with rotational controls, at least one vertex must be fixed to provide position constraints.

6. Multiresolution Deformation

By incorporating examples, our method can edit the model naturally and capture deformation effects such as muscle bulging. However, solving two large linear systems is still relatively time consuming, mainly due to the rotation optimization, which requires performing factorization for every frame. In addition, it requires a relatively large memory space ( for and for ). To make our representation more compact and to speed up the computation, we use a multiresolution approach.

Our observation is that, if we layer a surface with a coarse mesh and fine details, then deformation induced by pose changes mainly affects the underlying coarse mesh and has little effect on the fine details. In fact, Weber et al. [6] reported that deformation behaviors of a character tend to have a low-frequency nature. They then used this fact to make example deformations more compact using a method similar to the geometry compression proposed in [35]. We, in turn, take a multiresolution approach using coarse meshes to compress example deformations. We compute rotation-invariant encodings from coarse meshes and perform PSD with them. We then add original fine details to the deformed coarse mesh to obtain a dense output model.

Inspired by Sumner et al. [23], we employ linear blend skinning to decouple a dense mesh into a coarse mesh and fine-details. Here, we represent the vertex position of the coarse mesh by , an affine transformation by , and a translation vector by . The deformed vertex is obtained as follows: where is the number of vertices in the coarse mesh, and is a weight for vertex represents the extent to which is influenced by .

To compute the weight , we use harmonic interpolation [16] as opposed the Euclid distance-based weighting method [23]. This eliminates distortions caused by assigning large weights to the vertices that are geodesically distant. As for the boundary condition, vertex is assigned the value , and all other vertices in the coarse mesh are assigned the value . If the harmonic value is , then we replace it with 0 so that we can reduce memory space. The values are then normalized such that their sum is 1.

To construct a coarse mesh, we use a variant of the quadratic mesh simplification method (Qslim) [36]. Because Qslim only considers a static shape, the resulting coarse mesh is not suitable for deformation. For example, if we apply Qslim to a character model with its knee extended, then we would lose vertices around the knee. This loss would be a problem when we want to bend the knee. To solve this problem, we modify Qslim slightly by computing the error metric using all the examples. This method is similar to that proposed in [37], but we do not move the vertices during edge collapse; thus, we can easily specify boundary conditions of harmonic interpolation.

Incorporating the above technique into the pose space surface manipulation framework is straightforward. We prepare coarse examples and store their rotation-invariant encodings. We also store harmonic weights and fine details that are the vectors emanating from the vertex of a coarse mesh to that of a dense mesh, . We then perform PSD using those values. Once the result using coarse meshes is obtained, fine details are added using (13). Unlike [23], who place them on the dense mesh, we place handles directly on the coarse mesh. However, we found that the manipulation of handles on a coarse mesh of around 1k triangles retains sufficient freedom for interactive mesh editing.

7. Results

We implemented the pose space surface manipulation technique on an Intel Core2 Quad Q9400 2.66 GHz machine. We used the sparse Cholesky solver provided with the CHOLMOD [38] library. We found that the range worked well for all the examples presented in this paper. In contrast, we provide a wider range of .

Interactive Editing
In the accompanying video, we show use cases of the pose space surface manipulation framework for interactive editing of mesh models. Figure 6 demonstrates how our method edits a model using both translational and rotational control. By translating the handles on the feet, we can pose the horse (1k triangles) naturally (Figure 6(a)). Once the overall pose is determined, the user manipulates the part of the model with the rotational control (Figure 6(b)). The blue region is frozen so that deformations remain local.

Using our translational editing method, we can create animation sequences. As shown in Figure 7, we can create walking elephant (30k triangles) from 6 coarse examples (1k triangles) by translating handles on the feet. We can also pose the performer (dense: 20k triangles, coarse: 1k triangles) in a running sequence by moving the handles on the feet (Figure 8).

Figure 9 demonstrates the applicability of our method to highly nonrigid objects such as a cloth. We used 14 examples (1k triangles) created by a simulation. We fixed the top edge of a model and placed a frame on the top-left corner. By just dragging the handle on the bottom right corner, we can animate the cloth realistically.

Figure 10 demonstrates the ability of our method to accomplish twisting. We placed handle triangles on the upper arm and shoulder blade. We manipulate the frame placed on the triangle which is near the joint center. The rotational control is more intuitive for controlling shoulder movements than the translational control. Even for these challenging cases involving twisting and bending, our method can produce natural results.

Handle Locations and Sizes
In Figure 11, we evaluated the sensitivity of our method to the locations of triangles. Our method works when the handle triangle is distally or proximally placed in the segment (Figure 11(b) and 11(c)). Even when the handle is placed on the region where it is deformed by muscle bulging, pose space interpolation was successfully achieved (Figure 11(d)). Although we recommend placing the triangles in the middle of the segments, our method works robustly as long as child and parent triangles are placed on different segments. Our method works even when the triangle is placed far from the virtual joint. However, it is somewhat difficult for the user to manipulate the triangle in this case; because joints exist between the triangle and the virtual joint, the distance between the triangle and the virtual joint changes during dragging and the model will deform unexpectedly. In this case, the translational control is more suitable and easy to use. In addition, the size of the triangle is also important. If the size of the triangle is so small and its orientation changes drastically with high-frequency deformations, then the pose space interpolation will probably introduce large approximation errors. For the examples presented in this paper, however, pose space interpolation is successfully achieved because we manipulate relatively large triangles that is on a coarse mesh with around 1k triangles.

Comparisons
In Figure 12, we compare our method using rotational controls with the transformation propagation approach of Zayer et al. [16]. As expected, our method is more realistic where the elbow bends sharply, and the biceps bulges naturally, whereas the transformation propagation method just deforms the model smoothly. In Figure 13, we compared our method with the material modulation approach [17]. Although material modulation exploits the examples, it is difficult to control details precisely because it only computes a weight per vertex from all examples and does not account for pose changes. The result of our method looks nearly identical to the ground truth. We also quantitatively compared our method with [17]. We measured the error defined by the mean distance between the deformed model and the ground truth, which was normalized by the bounding box diagonal. Our method is more accurate than [17] where the error of our method is approximately 1.5% of the bounding box diagonal whereas that of [17] is 5.8%.

Next, we compared our method using translational controls with MeshIK [2]. When the pose is in the example space, our method is on par with MeshIK (compare Figure 14 of this paper and Figure 5 of [2]). As with MeshIK, our method fails when the pose is outside of the example space (Figure 14(a)). The advantage of our method is the applicability to the examples with large bending (Figure 15); this is, important when editing models that have tails, tentacles, and so forth. Since MeshIK uses deformation gradient (Def Grad) as its basic representation, which is defined in the absolute world coordinate, it results in the discontinuity artifact (Figure 15 top right). Our method based on a rotation-invariant mesh representation naturally deforms the model using the examples that involve large rotations and bending (Figure 15 bottom). The downside of our method is the lack of extrapolation capability due to the use of the distance-based interpolation method, which violates the smoothness around handles (Figure 14(c)). This can be avoided by incorporating an adequate number of example poses with extreme poses, since our method accepts examples with large bending. Another way to solve this problem is to use a small value for such that the deformed model conforms to example deformations by not obeying handle positions exactly (Figure 16).

Memory Consumptions
Storing rotation-invariant encodings requires a relatively large amount of memory. To solve this problem, we propose a multiresolution approach. In Figure 17, we show the morphing results of Horse in several mesh resolutions. With only 1k triangles, we can generate a result that is nearly identical to that of the original resolution (17k triangles). Using our multiresolution technique, the memory consumptions in this case can be reduced to at least 6% of the original without degrading the quality of the output.

Performance
In Table 1, we show the performance results. The most time consuming processes in our method are the rotation optimization and the subsequent matrix orthonormalization. Because the matrix is constructed from relative rotations computed after interpolation, the matrix must be factorized for every frame. For the rotation optimization of 17k mesh, construction and factorization of , and back substitution take 0.21 sec, 0.25 sec, and 0.027 sec, respectively. The matrix orthonormalization of absolute rotations using SVD also takes a relatively long time, which is about 0.5 sec for a 17k mesh. However, the vertex optimization is done very efficiently because the construction and factorization of is required once for all the frames, and we execute only back substitution during runtime. To summarize, without the multiresolution method, our method requires approximately 1 second to edit a 17k mesh.

Using the multiresolution method, we can edit a mesh in an interactive rate. Our method requires approximately 0.1 sec to edit a 17k mesh and 0.2 sec to edit a 140k mesh. This is a significant speedup over the original MeshIK that requires approximately 1 sec to solve a single Gauss-Newton iteration to edit an 80k mesh [2].

Limitations
Our method has limitations that need to be overcome. Although the use of the multiresolution method helps achieving interactive rates, fine details remain static. It is, therefore, difficult for our method to model complex deformation, for example, facial expressions generated by muscle contractions. Our method is probably suitable for large- and medium-scale deformation, and it is rather unfit for modeling small scale surface-detail deformations. Also, because we assign one interpolation weight for each example, our method do not accept the user to place many handles, for example, more than 10 or 20 handles. We believe that our global-to-local editing strategy based on translational/rotational controls works well for interactive posing of the model without using many handles, which can create a new shape in a wide range of poses. However, in the case where simultaneous manipulations of a large number of handles are required, for example, deforming the model using motion-capture data points, our method would fail to approximate deformations. This problem could be alleviated by using the weighted pose space deformation framework [5] to compute weights for each vertex instead of each example. Finally, our method lacks the extrapolation capability, which causes shear distortions when the pose is outside of the interpolation range. The ways to solve this problem may be to use a small weight for position constraints or to use examples with extreme poses.

8. Conclusion

Pose space surface manipulation is a novel example-based mesh editing technique that can deform the model naturally and that is able to adjust a pose locally. We provide both translational and rotational constraints to achieve direct manipulation of a surface to pose the model. Our method is reasonably efficient because, by inferring a pose from handles and by performing pose space interpolation, we are able to solve the problem with two linear systems, which avoids involved nonlinear optimization. With this two-step linear approach combined with the multiresolution deformation method, we achieve interactive rates without losing important deformation effects such as muscle bulging. The performance of our method would be further improved by a GPU implementation [11, 39]. It would be interesting to extend our method to edit other surface representations, such as multicomponent objects.

Acknowledgment

The authors would like to thank Brett Allen for the Arm and Shoulder datasets, Robert W. Sumner for the Elephant, Lion, Horse, and Cloth models, and Daniel Vlasic for the Performer model.

Supplementary Materials

The video shows use cases of the pose space surface manipulation framework for interactive editing of mesh models. Using the translational and rotational constraints, the pose space manipulation framework achieves both global and local pose manipulations.

  1. Supplementary Vedio