• 沒有找到結果。

Matrix Factorization with Domain Knowledge and Behavioral Patternsfor Intent Modeling

N/A
N/A
Protected

Academic year: 2022

Share "Matrix Factorization with Domain Knowledge and Behavioral Patternsfor Intent Modeling"

Copied!
1
0
0

加載中.... (立即查看全文)

全文

(1)

Matrix Factorization with Domain Knowledge and Behavioral Patterns for Intent Modeling

Yun-Nung (Vivian) Chen, Ming Sun, and Alexander I. Rudnicky

The Task

• In a smart-phone intelligent assistant setting (e.g. requesting an app), the feature-enriched MF-SLU can handle users’ open domain intents by

returning relevant apps that provide desired functionality either locally

available or by suggesting installation of suitable apps and doing so in an

unsupervised way.

• The framework can extend to

incorporate personal behavior history for improving a system’s ability to

assist users pursuing personalized multi-app activities.

• The effectiveness of the feature- enriched MF-SLU model can be

shown in different domains, indicating good generality and providing a

reasonable direction for the future work.

 Motivations

o An typical SDS has two main challenges:

1) Predefined ontology: the domain ontology is required to support the corresponding functions

2) Language ambiguity: same utterance may infer different intents during different situations

o Structured knowledge resources are available (e.g. Freebase, Wikipedia, FrameNet) and may provide semantic information

o Users’ behavioral patterns may help disambiguate the current intents o Hidden semantics help infer the relation between different features

Feature-Enriched MF-SLU: Spoken Language Understanding by Matrix Factorization

• Data: speech data collected from the users with the intents from 13

frequently accessed domains in Google Play (WER = 19.8%)

 MF learns a set of well-ranked intents per utterance.

 Lexical Matrix

• Main idea: utilize manual written app description because it

should describe the app’s functionality

• Main idea: retrieve the apps

that are more likely to support users’ requests for self-training

 The feature-enriched MF-SLU can benefit from both hidden information modeled by MF and enriched semantics including structured knowledge and behavioral patterns to improve Intent prediction.

Experiments

Conclusions

1

Lexical Intent (App)

photo tell check camera IM

take this photo

tell vivian this is me in the lab

CAMERA Train IM

Dialogue

check my grades on website send an email to professor

CHROME EMAIL

send null camera

.85 CAMERA

IM

email

1

1

1 1

1

1 .70

chrome

1

1 1

1

1 1

chrome email 1

1

1

1

.95

.80 .55

User Utterance Intended

App

Reasoning with MF

Test Dialogue

take a photo of this send it to alice

Behavioral

1

Enriched Semantics communication

.90 1

1

Utterance 1 i would like to contact alex

Lexical Intent (App)

… …

contact email message Gmail Outlook Skype

Test

.90

Reasoning with MF

Train

… your email, calendar, contacts…

… check and send emails, msgs … Outlook

Gmail

IR for app candidates

App Desc

Self-Train Utterance

Test

Utterance

1 1

1

1

1

1

1

1 1

1

1 .90 .85 .97 .95

Semantics Enrichment Utterance 1 i would like to contact alex

1

1

 Enriched Semantics Matrix  Intent Matrix

• Main idea: slot types and word embeddings help imply semantics for expanding domain knowledge

Feature Matrix ASR Transcripts

LM / MLR MF-SLU LM / MLR MF-SLU

Single-Turn Word Observation 25.1 29.2 (+16.2%) 26.1 30.4 (+16.4%) + Type-Enriched Semantics 31.5 32.2 (+2.1%) 32.9 34.0 (+3.4%) Multi-Turn Word Observation 52.1 52.7 (+1.2%) 55.5 55.4 (-0.2%)

+ Behavioral Patterns 53.9 55.7 (+3.3%) 56.6 57.7 (+1.9%)

 Approaches: Feature-Enriched MF-SLU

o Enrich semantics with the structured knowledge or behavioral patterns for improving intent prediction

o Unify the human written knowledge and automatically inferred information in a matrix and predict user intents in the mean time

 Results

o Feature-enriched MF-SLU benefits from hidden information and rich features, and then outperforms the baselines for both single-turn

requests and multi-turn interactions.

send to vivian

v.s.

Email? Message?

Communication

previous turn

Challenge: language ambiguity 1) User preference

2) App-level contexts

• Modeling Implicit Feedback:

1

𝑓

+

𝑓

𝑓

𝑢

𝑥

Matrix Factorization

• Objective:

Experiment 1: Single-Turn Request

Experiment 2: Multi-Turn Interaction

• Entity Type from Structured Knowledge (e.g. Wikipedia/Freebase)

Q: play lady gaga’s bad romance

… is an American singer, songwriter, and actress. … is a song by American singer …

Chen et al., "Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken Language Understanding," in Proc. of ICMI, 2015.

Data Available at http://AppDialogue.com/

Chen and Rudnicky, "Dynamically Supporting Unexplored Domains in Conversational Interactions by Enriching Semantics with Neural Word Embeddings," in Proc. of SLT, 2014.

參考文獻

相關文件

A spoken language understanding (SLU) component requires the domain ontology to decode utterances into semantic forms, which contain core content (a set of slots and slot-fillers)

Matrix Factorization is an effective method for recommender systems (e.g., Netflix Prize and KDD Cup 2011).. But training

Matrix factorization (MF) and its extensions are now widely used in recommender systems.. In this talk I will briefly discuss three research works related to

include domain knowledge by specific kernel design (e.g. train a generative model for feature extraction, and use the extracted feature in SVM to get discriminative power).

This paper proposes a holistic multi-domain, multi-task (i.e. slot filling, domain and intent detection) modeling approach to estimate complete semantic frames for all user

Matrix Factorization with Domain Knowledge and Behavioral Patterns for Intent Modeling.. Yun-Nung (Vivian) Chen, Ming Sun,

Below we use a principle model, which contains three sets of information, low- level spoken features (word observation matrix), high-level semantic features (enriched semantics

Finally, we train the SLU model by learning latent feature vectors for utterances and slot can- didates through MF techniques. Combining with a knowledge graph propagation model