• 沒有找到結果。

© Deng Cai, College of Computer Science, Zhejiang University

N/A
N/A
Protected

Academic year: 2021

Share "© Deng Cai, College of Computer Science, Zhejiang University"

Copied!
23
0
0

加載中.... (立即查看全文)

全文

(1)

© Deng Cai, College of Computer Science, Zhejiang University 

So Far…

It’s time for 

 Unsupervised learning

 We are only given inputs

 Goal: find “interesting patterns”

 Discovering clusters

– Clustering

 Discovering latent factors

– Dimensionality reduction

– Topic modeling

– Matrix factorization

(2)

Deng Cai (蔡登)

College of Computer Science Zhejiang University

[email protected]

2

Matrix Factorization

(3)

© Deng Cai, College of Computer Science, Zhejiang University 

What Is Matrix Factorization?

Is this factorization unique?

 Every column of  and every row of  are normalized

Does this factorization always exist?

∈ , ∈

Σ ∈ ΣΣ

Σ

(4)

© Deng Cai, College of Computer Science, Zhejiang University 

Why Matrix Factorization?

⋮ ⋮ ⋮

⋮ ⋮

⋮ ⋮ ⋮

Each column vector of  can be represented as a linear combination  of column vectors of 

Each column vector of  can be regarded as a low dimensional 

representation of corresponding column vector of 

(5)

© Deng Cai, College of Computer Science, Zhejiang University 

Relation to Dimensionality Reduction

If there is a matrix  which satisfies:

In DR, we learn the transformation matrix In MF, we learn the basis matrix

, , ⋯ , , ⋯

∈ , ∈

(6)

© Deng Cai, College of Computer Science, Zhejiang University 

Relation to Topic Modeling

Terms Documents

| ,

∑ ,

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

Latent  Concepts

Terms Documents

TRADE

econom

imports

trade

| | |

(7)

© Deng Cai, College of Computer Science, Zhejiang University 

Relation to Topic Modeling

| | |

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

| ,

∑ ,

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

(8)

© Deng Cai, College of Computer Science, Zhejiang University 

Algorithms

Singular Value Decomposition

Nonnegative Matrix Factorization

Sparse Coding

(9)

© Deng Cai, College of Computer Science, Zhejiang University 

Singular Value Decomposition (SVD)

For an arbitrary matrix  ∈ there exists a  factorization as follows: 

Σ where

∈ , ∈ , ,

diagonal matrix Σ ∈ If 

∈ , ∈ , ,

Σ diag , , ⋯ ⋯ 0

(10)

© Deng Cai, College of Computer Science, Zhejiang University 

SVD: Low‐rank Approximation

SVD can be used to compute optimal low‐rank approximations.

Approximation problem: 

Solution via SVD

C. Eckart, G. Young, The approximation of a matrix by another of lower rank. Psychometrika, 1, 211‐218, 1936.

set small singular  values to zero

(11)

© Deng Cai, College of Computer Science, Zhejiang University 

Low rank approximation by SVD

(12)

© Deng Cai, College of Computer Science, Zhejiang University 

Relation to PCA

Given an SVD of X, the following two relations hold:

Σ ΣV V Σ Σ V

Σ Σ ΣΣ

(13)

© Deng Cai, College of Computer Science, Zhejiang University 

Latent Semantic Analysis (Indexing)

The Latent Semantic Analysis via SVD can be summarized as  follows:

Document similarity

, Σ , Σ

...

LSA term vectors

... LSA document vectors

=

< >

...

terms

...

documents

(14)

© Deng Cai, College of Computer Science, Zhejiang University 

Latent Semantic Analysis

Latent semantic space: illustrating example

(15)

© Deng Cai, College of Computer Science, Zhejiang University 

Relation to Topic Modeling

| | |

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

| ,

∑ ,

| ⋯ |

⋮ ⋱ ⋮

| ⋯ |

(16)

© Deng Cai, College of Computer Science, Zhejiang University 

Nonnegative Matrix Factorization

Low rank assumption (k hidden factors)

Nonnegative assumption

(17)

© Deng Cai, College of Computer Science, Zhejiang University 

Non‐negative Matrix Factorization

Two cost functions

 Euclidean distance

 Divergence

(18)

© Deng Cai, College of Computer Science, Zhejiang University 

Optimization Problems

Minimize  with respect to  and  ,  subject to the constraints  .

Minimize  with respect to  and  , 

subject to the constraints  .

(19)

© Deng Cai, College of Computer Science, Zhejiang University 

NMF Optimization (Euclidean Distance)

min , . . 0, 0

tr tr

2 2 0

tr 2tr tr

Γ, same size as Φ, same size as

tr Γ tr Φ

Γ

2 2 Φ 0

(20)

© Deng Cai, College of Computer Science, Zhejiang University 

Multiplicative Update Rules

The Euclidean distance  is nonincreasing under the update rules

The Euclidean distance is invariant under these updates if

and only if and are at a stationary point of the distance.

(21)

© Deng Cai, College of Computer Science, Zhejiang University 

NMF vs PLSA

≅ , 0, 0

| ∑ log

, 1

| |

max log

log log

max 1

, log | |

(22)

© Deng Cai, College of Computer Science, Zhejiang University 

Sparse Coding

• Represent input vectors approximately as a weighted linear  combination of a small number of “basis vectors.”

minimize ,

subject to    Σ , , ∀ 1, … , .

(23)

© Deng Cai, College of Computer Science, Zhejiang University 

Matrix Factorization: Summary

∈ , ∈

Low rank assumption (k hidden factors)

 SVD

Nonnegative assumption

 NMF

Sparseness assumption

 Sparse Coding

參考文獻

相關文件

A finite group is nilpotent if and only if it’s a direct product of Sylow

If that circle is formed into a square so that the circumference of the original circle and the perimeter of the square are exactly the same, the sides of a pyramid constructed on

Study the following statements. Put a “T” in the box if the statement is true and a “F” if the statement is false. Only alcohol is used to fill the bulb of a thermometer. An

Wang, Solving pseudomonotone variational inequalities and pseudocon- vex optimization problems using the projection neural network, IEEE Transactions on Neural Networks 17

A subgroup N which is open in the norm topology by Theorem 3.1.3 is a group of norms N L/K L ∗ of a finite abelian extension L/K.. Then N is open in the norm topology if and only if

From these characterizations, we particularly obtain that a continuously differentiable function defined in an open interval is SOC-monotone (SOC-convex) of order n ≥ 3 if and only

Define instead the imaginary.. potential, magnetic field, lattice…) Dirac-BdG Hamiltonian:. with small, and matrix

• A way of ensuring that charge confinement does occurs is if there is a global symmetry which under which quarks (heavy charges) are charged and the gluons and other light fields