Derivation of Optimal Motion Index

3.1 Intention Deduction

3.1.2 Derivation of Optimal Motion Index

From the motion generation process discussed above, we can take the intention de-duction process as that of finding proper motion index M I. To find the optimal M I among all M I candidates, we introduce first the process for M I evaluation, shown in Fig. 3.5(a). This process evaluates the fitness of the M I candidates de-rived from the demonstrated motion, based on a reasoning that proper M I should lead to a generated motion very similar to the human demonstrated motion, which includes all the delicate motions. In Fig. 3.5(a), from the demonstrated motions, we select one demonstrated motion as the validating motion and the rest as the training motions. We will discuss the selection of validating and training motions later. For an M I candidate derived from the validating motion, the motion gen-eration module, described above, generates motions based on the training motions and the environmental state corresponding to the validating motion; the generated motions, with their lengths set to be equal to that of the validating motion, are then compared with the validating motion via the motion comparison module, yielding the differences between them (marked as errors). Because the operator may per-form the demonstrations in different speeds and possibly with different orders for the events involved, the corresponding delicate motions are likely to be with various sampling rates, or to appear in different portions of the demonstrated trajectories.

To tackle this, our strategy is to let each of the delicate motions of the validating motion be compared with every portion of the training motion, accompanied by altering sampling rates, showing in Fig. 3.5(b). Through this comparison process, the generated motion, whose delicate motions lead to the minimum difference when compared with those of the validating motion, is determined as the output and sent to the motion comparison module for the following comparison. As a high search complexity is expected, we come up with an approach analogous to that of dynamic time warping (DTW) in execution [56]. Details of this strategy will be explained in next section.

We go on with the process for M I generation, shown in Fig. 3.6. In Fig. 3.6, among all the demonstrated motions, one demonstrated motion is first selected as

Select

Figure 6: Selection of validating and training motions from demonstrated motions.

Validating motion MI Generation

Figure 3.6: Process for M I generation.

: multiple I/O

Figure 7:

The complete process for deriving the optimal motion index (MI).

Figure 3.7: Process for optimal M I derivation.

the validating motion, denoted as Q_V, and the rest as the training motions, Q_T, for each sequence of the process. The process will be repeated until each of the demonstrated motions serves as the validating motion once. In next step, the M I generator will locate all possible M I candidates from Q_V. Because the proposed approach does not constrain the human operator to perform the task with certain motion speed or motion type, and also allows the order of the events to be altered during demonstration, there is in fact not a priori knowledge for the selection of M I.

The criterion for M I generation is thus to let M I candidate correspond to every portion of Q_V with a duration longer than 0.3 second, as human cannot cognize an event until it happens 0.3 second later [59]. It can be expected that there will be a huge number of M I candidates. That is why we employ the method of dynamic programming for the search of the optimal M I.

With the M I evaluation process in Fig. 3.5(b) and M I generation process in Fig. 3.6, Fig. 3.7 shows the entire process for optimal M I derivation. For the

of them serves as the validating motion once. Via the M I generation process, M I candidates along with the validating and training motions are sent into the M I evaluation process to determine which M I candidate leads to the minimum error, identified as an optimal M I candidate. As each validating motion corresponds to one optimal M I candidate, the outputs of the outer dotted block are the optimal M I candidates for each of them. Finally, the optimal M I is determined to be the one with the minimum error among all optimal M I candidates.

3.1.3 Implementation of Intention Deduction

For mathematical formulation of this optimal M I derivation process, we start with the description of M I for a given validating motion Q_V, denoted as M I_V:

M I_V = {d_V₁, d_V₂, .., d_V_N} (3.4) with

d_V_j = {n_V_j, l_V_j, s_V_j} (3.5) where d_V_j indexes the delicate motion D_V_j with n_V_j, l_V_j, and s_V_j the starting time, end time, and number of the operated object. According to M I_V, Q_V can then be expressed as the combination of a series of delicate and move motions:

Q_V = {M_V₁, D_V₁, M_V₂, D_V₂, ..., D_V_N, M_V_{N +1}} (3.6) On the other hand, with the same M I_V, the generated motion Qⁱ_G for each training motion Qⁱ_T can be formulated as

Qⁱ_G = {M_Gⁱ₁, Dⁱ_G₁, M_Gⁱ₂, Dⁱ_G₂, ..., Dⁱ_G_N, M_Gⁱ_{N +1}} (3.7) where Dⁱ_G_j and M_Gⁱ_j are its delicate and move motion, respectively. D_Gⁱ_j can be determined via the M I evaluation process above, of which the minimization between D_Gⁱ_j and D_V_j is dealt with a DTW-like method:

D_Gⁱ_j = similar(Qⁱ_T, D_V_j) (3.8)

In the similar function, the training motion Qⁱ_T is transformed to match the environment of the validating motion according to the possible operated object s_V_j,

and generated delicate motion Dⁱ_G

j is searched from the transformed motion to be similar to DVj as close as possible. Therefore, the search can use a DTW-like method to minimize the difference between D_Gⁱ

j and D_V_j [56]. Details of this method will be explained in next section.

After the generated delicate motions are generated, M_Gⁱ

j determined by func-tion M_G, which utilizes the cubic polynomial to smoothly connect the two delicate motions, D_Gⁱ_j−1 and Dⁱ_G_j:

M_Gⁱ_j = M_G(D_Gⁱ_j−1, D_Gⁱ_j) (3.9)

To determine the optimal motion index M I_V^∗, Q_V will be compared with all Q_G generated according to every M I_V. Because we are looking for an M I_V that may induce all the necessary delicate motions, M I_V^∗ should not induce too much deviation between the delicate motions for Q_V and Q_G, and consequently between the move motions for them. By taking E_max as the maximum difference between the delicate and move motions for Q_V and those Q_G generated for all the training motions corresponding to some M I_V, we determine M I_V^∗, among all M I_V, to be the one that leads to the smallest E_max:

M I_V^∗ = arg min Here, E_D computes the difference between the respective delicate motions for Q_V and those QG, and EM that for the move motions, with MV as a function which outputs the move motion part between two delicate motions of the validating motion, D_V_a and DV_b. Because each demonstrated motion serves as the validating motion once, the final optimal motion index M I^∗∗ for all demonstrated motions will be further

E_max, demoted as E^∗. As the length L_V for each Q_V may not be the same, E^∗ needs to be normalized before the comparison. M I^∗∗ is then formulated as

M I^∗∗= arg min

M I_V^∗ E^∗/L_V (3.14)

The search for M I^∗∗ is of high complexity, as exhibited in Eqs. (3.10)-(3.14) above.

As an attempt to enhance search efficiency, we employ the method of dynamic programming [60] and let the computation of E^∗ in Eq. (3.14) be expressed into a recursive formulation:

E^∗ = min

d_Vk E_R(D_V_k) + E_M(D_V_k, D_V_{N +1}) (3.15) with

E_R(D_V_k) = min

d_Vk−1(E_R(D_V_k−1) + E_M(D_V_k−1, D_V_k)) + E_D(D_V_k) (3.16) where E_R(D_V_k) stands for the minimum difference between the motions from the first move motion to a given delicate motion; d_V_k and d_V_k−1, described in Eq. (3.5), index the delicate motions D_V_k and D_V_k−1; and 1 ≤ k ≤ N . Because the number of delicate motions is not known in advance, N and k are not specific numbers. Also note that, the first move motion is generated between D_V₀ and D_V₁, and the last one between D_V_N and D_V_{N +1}, with D_V₀ and D_V_{N +1} taken as the first and last point of the trajectory, respectively. In Eq. (3.15), E^∗ is derived as the minimum one for all E_R(D_V_k) with E_R(D_V_k) computed recursively via Eq. (3.16). With Eqs. (3.15) and (3.16), dynamic programming can take advantage of the table generated for E_R(D_V_k) to simplify the computation in deriving E^∗.

Based on the discussions above, the algorithm for intention derivation algorithm is formulated in Algorithm 1. Time complexity for this optimal M I derivation pro-cess is related to the number (R) and length (L_V) of the demonstrated motions and the number (S) of objects involved in the task. Here, the lengths of the demon-strated motions are assumed to be close. In Eqs. (3.15) and (3.16), the generation of the table for E_R(D_V_k) takes up most of the time consumed. The table has O(L_V²·S) elements, and each element deals with the complexity of the order of O(R · L_V³· S).

During the entire process, the table needs to be generated R times. The final time complexity is thus computed to be in the order of O(R²· L_V⁵· S²).

The divide-and-conquer method [60] may also be an alternate to solve Eq.

(3.14). However, because our proposed approach takes every portion of the trajec-tory of the validation demonstration as the candidate for a possible delicate motion, it is not that straightforward to divide the trajectory properly. Consequently, the search for the optimal solution may demand a large number of divisions, leading to high computational load.

Algorithm 1 Find the intention of the task through R times of demonstrations Input: the demonstrated trajectories Q_i (1 ≤ i ≤ R) for the R times of

demonstra-tions

Output: the optimal M I^∗∗

1: for i = 1 to n do

2: Select Q_i among the R recorded trajectories as the validating motion Q_V and the rest as the training demonstrations Q_T

3: Apply the method of dynamic programming, based on Eq. (3.10), to deter-mine the optimal M I^∗ for Q_V

4: end for

5: Utilize Eq. (3.14) to determine the optimal M I^∗∗ for the demonstrator among those M I^∗ for the R validating motions

6: return M I^∗∗

在文檔中基於人類示範之意圖推論於工具操作任務之應用 (頁 23-28)