
2.4 Secure SVM Outsourcing with Random Linear Transformation

2.4.2 Building the Secure Kernel Matrix from Data Perturbed by Random Linear Transformation

Having shown that the service provider may be given the secure kernel matrix for solving the RSVM, we now show how to enable the service provider to build the secure kernel matrix itself from data perturbed by a random linear transformation. A random linear transformation does not preserve the dot products or distances between training instances and hence provides stronger privacy preservation. The data owner can therefore outsource the SVM by sending the linearly transformed training instances to the service provider, which builds a secure kernel matrix without learning the actual content of the training data. The secure kernel matrix built by the service provider is exactly the same as the one built from the original training data.

The data owner sends the perturbed training instances, together with the perturbed random vectors of the reduced set, to the service provider for computing secure kernel matrices. A secure kernel matrix $K$ is computed from the training instances and the random vectors of the reduced set by $K_{i,j} = k(x_i, r_j)$, $i = 1, \ldots, m$, $j = 1, \ldots, \bar{m}$. Our objective is to let the service provider compute the same $K$ from the perturbed training instances and random vectors, while the perturbation scheme must avoid the security weakness of geometric perturbation schemes, i.e., the dot products and Euclidean distances among training instances should not be preserved. The perturbation scheme only needs to preserve the kernel evaluations between a training instance and a random vector. We use a random linear transformation-based perturbation that allows the dot product of two differently transformed instances to be computed [57]. Note that the attribute vectors $x_i$ of the training instances are usually considered sensitive, whereas the class labels $y_i$ usually are not.

Let $M$ be a nonsingular $n \times n$ matrix composed of random values. We perturb the instances of the training dataset by a random linear transformation $L : \mathbb{R}^n \to \mathbb{R}^n$, with the matrix $M$ as the random linear operator. All training instances are perturbed by the random linear transformation¹

$$c_i = L(x_i) = M x_i, \quad i = 1, \ldots, m. \tag{2.6}$$
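The footnote to (2.6) observes that $M$ need not be held in memory as a whole. A minimal sketch of that column-wise accumulation follows. It assumes, hypothetically, that each column of $M$ can be regenerated on demand from a seeded pseudo-random generator; the names `column_of_M`, `transform_columnwise`, and `full_matrix` and the seeding scheme are illustrative and not taken from the source, and the sketch does not check that the resulting $M$ is nonsingular.

```python
import random

def column_of_M(seed, i, n):
    """Regenerate the i-th column of M from a seed (hypothetical scheme)."""
    rng = random.Random(seed * 1_000_003 + i)
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]

def transform_columnwise(x, seed):
    """Compute M x = x_1 M[:,1] + ... + x_n M[:,n], one column at a time."""
    n = len(x)
    c = [0.0] * n
    for i, x_i in enumerate(x):
        col = column_of_M(seed, i, n)  # only this one column is materialized
        for row in range(n):
            c[row] += x_i * col[row]
    return c

def full_matrix(seed, n):
    """Materialize all of M, for checking the column-wise result only."""
    cols = [column_of_M(seed, i, n) for i in range(n)]
    return [[cols[i][row] for i in range(n)] for row in range(n)]

x = [1.0, -2.0, 0.5]
c = transform_columnwise(x, seed=42)

# Reference result from the fully materialized matrix.
M = full_matrix(42, 3)
c_ref = [sum(M[row][i] * x[i] for i in range(3)) for row in range(3)]
```

The column-wise result `c` agrees with `c_ref` up to floating-point rounding, while the working set stays at a single column of length $n$.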

Unlike geometric perturbation, the random linear transformation does not preserve the Euclidean distances and dot products between training instances, since the vector space is randomly transformed. Hence the security weakness of rotational or translational transformations does not exist in data perturbed by a random linear transformation. The random vectors $r_j$, $j = 1, \ldots, \bar{m}$, of the reduced set are perturbed by a second random linear transformation $L' : \mathbb{R}^n \to \mathbb{R}^n$ with $(M^T)^{-1}$ as the random linear operator:

$$s_j = L'(r_j) = (M^T)^{-1} r_j, \quad j = 1, \ldots, \bar{m}. \tag{2.7}$$

The perturbed training instances $c_i$, $i = 1, \ldots, m$, and the perturbed random vectors of the reduced set $s_j$, $j = 1, \ldots, \bar{m}$, are then sent to the service provider for building secure kernel matrices.
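The perturbation steps (2.6) and (2.7) on the data owner's side can be sketched as follows. This is an illustration under stated assumptions, not the author's implementation: the helper names, the small integer entries of $M$, the retry loop used to obtain a nonsingular matrix, and the toy data are all hypothetical, and exact rational arithmetic (`fractions.Fraction`) is used only so that the results carry no rounding error.

```python
from fractions import Fraction
import random

def transpose(A):
    return [list(col) for col in zip(*A)]

def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def inverse(A):
    """Gauss-Jordan inverse in exact rational arithmetic; ValueError if singular."""
    n = len(A)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(A)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def random_nonsingular(n, rng):
    """Draw random integer matrices until one is invertible (assumed scheme)."""
    while True:
        M = [[rng.randint(-9, 9) for _ in range(n)] for _ in range(n)]
        try:
            inverse(M)
            return M
        except ValueError:
            pass

rng = random.Random(0)
X = [[1, 2, 0], [3, -1, 2], [0, 4, 1]]   # toy training instances x_i
R = [[2, 1, -1], [-1, 1, 3]]             # toy reduced-set vectors r_j

M = random_nonsingular(3, rng)
M_inv_T = inverse(transpose(M))          # (M^T)^{-1}

C = [matvec(M, x) for x in X]            # c_i = M x_i           (2.6)
S = [matvec(M_inv_T, r) for r in R]      # s_j = (M^T)^{-1} r_j  (2.7)

# Dot products among training instances, before and after perturbation.
# For a randomly drawn M these generally differ.
before_after = [(dot(X[i], X[k]), dot(C[i], C[k]))
                for i in range(3) for k in range(i + 1, 3)]
```

Only `C`, `S`, and the labels would leave the data owner; `M` stays private.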

The dot product between an instance $x_i$ and a random vector $r_j$ can be equivalently computed from the dot product of $c_i$ and $s_j$:

$$c_i^T s_j = (M x_i)^T (M^T)^{-1} r_j = x_i^T M^T (M^T)^{-1} r_j = x_i^T I r_j = x_i^T r_j.$$

Therefore, for dot product-based kernel functions, including the linear kernel $k(x_i, r_j) = x_i \cdot r_j$, the polynomial kernel $k(x_i, r_j) = (g\, x_i \cdot r_j + r)^d$, and the neural network kernel $k(x_i, r_j) = \tanh(g\, x_i \cdot r_j + r)$, where the scalars $g$, $r$, and $d$ are kernel parameters, the kernel evaluations between an instance and a random vector can be equivalently derived from the perturbed training instances and random vectors.
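As a small worked instance of this identity, with values chosen purely for illustration (the matrix, the vectors, and the kernel parameters below are assumptions, not data from the source), take $n = 2$:

```latex
M = \begin{pmatrix} 2 & 1 \\ 0 & 3 \end{pmatrix}, \qquad
(M^T)^{-1} = \begin{pmatrix} \tfrac{1}{2} & 0 \\ -\tfrac{1}{6} & \tfrac{1}{3} \end{pmatrix}, \qquad
x_i = \begin{pmatrix} 1 \\ 2 \end{pmatrix}, \qquad
r_j = \begin{pmatrix} 2 \\ 1 \end{pmatrix}.

c_i = M x_i = \begin{pmatrix} 4 \\ 6 \end{pmatrix}, \qquad
s_j = (M^T)^{-1} r_j = \begin{pmatrix} 1 \\ 0 \end{pmatrix}.

c_i^T s_j = 4 \cdot 1 + 6 \cdot 0 = 4 = 1 \cdot 2 + 2 \cdot 1 = x_i^T r_j.

% Polynomial kernel with assumed parameters g = 1, r = 1, d = 2:
k(x_i, r_j) = (x_i \cdot r_j + 1)^2 = (c_i^T s_j + 1)^2 = 25.
```

The service provider sees only $c_i = (4, 6)^T$ and $s_j = (1, 0)^T$, yet arrives at the same kernel value as a computation on the original pair.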

For the Gaussian kernel $k(x_i, r_j) = \exp(-g \|x_i - r_j\|^2)$, which is based on the Euclidean distance, a slight modification is needed: before applying the transformation, two dimensions are appended to the original instances $x_i \in \mathbb{R}^n$, giving $x'_i = (x_{i,1}, x_{i,2}, \ldots, x_{i,n}, 1, -\tfrac{1}{2}\|x_i\|^2)^T$. The random vectors $r_j$ of the reduced set are likewise extended by two dimensions as $r'_j = (r_{j,1}, r_{j,2}, \ldots, r_{j,n}, -\tfrac{1}{2}\|r_j\|^2, 1)^T$. The corresponding random matrix for the random linear transformation is then a nonsingular $(n + 2) \times (n + 2)$ matrix $M$.

¹ It is not necessary to hold the whole matrix $M$ in main memory. The computation can be decomposed as $M x = x_1 M_{:,1} + \cdots + x_n M_{:,n}$, where $M_{:,i}$ is the $i$-th column of $M$.

Similarly, the data are perturbed by $c_i = M x'_i$, and the random vectors are perturbed by $s_j = (M^T)^{-1} r'_j$. The squared Euclidean distance between $x_i$ and $r_j$ in the Gaussian kernel can be equivalently computed from $c_i$ and $s_j$ by

$$-2 c_i^T s_j = -2 x_i'^T M^T (M^T)^{-1} r'_j = -2 x_i'^T I r'_j = -2 x_i'^T r'_j = \|x_i\|^2 - 2 x_i^T r_j + \|r_j\|^2 = \|x_i - r_j\|^2.$$

Therefore, for common kernel functions based on dot product or Euclidean distance, the kernel evaluations between an instance in the training dataset and an instance in the reduced set can be equivalently computed from their perturbed versions.
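The two-dimension augmentation for the Gaussian kernel can be checked with a short sketch. Everything beyond the formulas for $x'_i$, $r'_j$, $c_i$, and $s_j$ is an assumption made for illustration: the helper names, the integer-valued random $M$, the toy data, and the value of $g$. Exact rational arithmetic is used so that the distance identity holds without rounding.

```python
from fractions import Fraction
import math
import random

def transpose(A):
    return [list(col) for col in zip(*A)]

def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def inverse(A):
    """Gauss-Jordan inverse in exact rational arithmetic; ValueError if singular."""
    n = len(A)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(A)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def augment_instance(x):
    """x' = (x_1, ..., x_n, 1, -||x||^2 / 2)."""
    return list(x) + [1, Fraction(-dot(x, x), 2)]

def augment_reduced(r):
    """r' = (r_1, ..., r_n, -||r||^2 / 2, 1)."""
    return list(r) + [Fraction(-dot(r, r), 2), 1]

def gaussian_from_perturbed(c, s, g):
    """Gaussian kernel value using only the perturbed pair (c, s)."""
    sq_dist = -2 * dot(c, s)   # equals ||x - r||^2 by the derivation above
    return math.exp(-g * float(sq_dist))

X = [[1, 2], [3, -1]]          # toy training instances, n = 2
R = [[2, 1], [-1, 1]]          # toy reduced-set vectors

rng = random.Random(1)
while True:                    # draw a nonsingular (n+2) x (n+2) matrix
    M = [[rng.randint(-9, 9) for _ in range(4)] for _ in range(4)]
    try:
        M_inv_T = inverse(transpose(M))
        break
    except ValueError:
        pass

C = [matvec(M, augment_instance(x)) for x in X]
S = [matvec(M_inv_T, augment_reduced(r)) for r in R]

g = 0.5
K = [[gaussian_from_perturbed(c, s, g) for s in S] for c in C]
```

Each entry of `K` matches $\exp(-g\|x_i - r_j\|^2)$ computed from the original vectors, although the code that builds `K` touches only `C` and `S`.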

Lemma 4 The RSVM problem (2.4) with dot product-based and Euclidean distance-based kernel functions, constructed from the training instances $(x_i, y_i)$, $i = 1, \ldots, m$, and the random vectors $r_j$, $j = 1, \ldots, \bar{m}$, can be equivalently obtained from the random linear transformation-perturbed training instances $c_i$, $i = 1, \ldots, m$, and random vectors $s_j$, $j = 1, \ldots, \bar{m}$, along with the labels $y_i$, $i = 1, \ldots, m$.

Proof 5 The dot product $c_i^T s_j$ of any $(c_i, s_j)$ pair equals the dot product of the corresponding $(x_i, r_j)$ pair or, with the two added dimensions, $-2 c_i^T s_j$ equals their squared Euclidean distance. Hence, for dot product-based or Euclidean distance-based kernel functions, the value of $k(x_i, r_j)$ can be equivalently computed from the $c_i$'s and $s_j$'s.

The secure kernel matrix $K$ composed of $K_{i,j} = k(x_i, r_j)$, $i = 1, \ldots, m$, $j = 1, \ldots, \bar{m}$, can therefore be equivalently obtained from $c_i$, $i = 1, \ldots, m$, and $s_j$, $j = 1, \ldots, \bar{m}$.

Together with the labels $y_i$, $i = 1, \ldots, m$ (and the cost parameter $C$), exactly the same RSVM problem (2.4) can be built.
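From the service provider's point of view, the argument of the proof amounts to filling $K$ entry by entry from the received vectors. The sketch below illustrates this for a polynomial kernel. The perturbed values are hardcoded from a hypothetical owner-side matrix $M$ with rows $(2, 1)$ and $(0, 3)$, which the provider never sees; the function names and the kernel parameters are likewise assumptions made for the example.

```python
from fractions import Fraction

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def build_kernel_matrix(rows, cols, kernel_of_dot):
    """K[i][j] = kernel_of_dot(rows[i] . cols[j])."""
    return [[kernel_of_dot(dot(u, v)) for v in cols] for u in rows]

def poly(g=1, coef0=1, d=2):
    """Polynomial kernel (g * t + coef0)^d as a function of the dot product t."""
    return lambda t: (g * t + coef0) ** d

# What the service provider receives (hardcoded for this toy example):
C = [[4, 6], [5, -3]]                                # c_i = M x_i
S = [[1, 0], [Fraction(-1, 2), Fraction(1, 2)]]      # s_j = (M^T)^{-1} r_j
K_secure = build_kernel_matrix(C, S, poly())

# What the data owner would compute from the original data, for comparison:
X = [[1, 2], [3, -1]]
R = [[2, 1], [-1, 1]]
K_original = build_kernel_matrix(X, R, poly())
```

Both matrices come out as `[[25, 4], [36, 9]]`, so an RSVM solver run on `K_secure` with the labels works on the same problem as one run on `K_original`.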

By Lemma 4, since the same RSVM problem can be built from the perturbed data, the same solutions $v$ and $b$ of the decision function (2.5) are obtained. The data owner can therefore outsource training in a privacy-preserving way by perturbing the data and the reduced set with (2.6) and (2.7) and then sending the perturbed data, the labels, and the perturbed reduced set to the service provider. Because the service provider can derive the same secure kernel matrix $K$ of (2.4) from the perturbed $c_i$'s and $s_j$'s, the RSVM optimization problem derived from the perturbed data is the same as the one derived from the original data, and the service provider obtains the same solutions $v_j$, $j = 1, \ldots, \bar{m}$, and $b$ of the decision function (2.5) to send back to the data owner. Then the
