• 沒有找到結果。

Learning Spoken Language Representations with Neural Lattice Language ModelingChao-Wei HuangYun-Nung(Vivian) Chen

N/A
N/A
Protected

Academic year: 2022

Share "Learning Spoken Language Representations with Neural Lattice Language ModelingChao-Wei HuangYun-Nung(Vivian) Chen"

Copied!
11
0
0

加載中.... (立即查看全文)

全文

(1)

NTUMIULAB

Learning Spoken Language Representations with Neural Lattice Language Modeling

Chao-Wei Huang Yun-Nung (Vivian) Chen

National Taiwan University

[email protected] [email protected]

Code available at https://github.com/MiuLab/Lattice-ELMo

(2)

NTUMIULAB • The idea of LM pretraining is adopted on lattices

• We introduce a lattice language modeling objective

• A 2-stage framework is proposed for learning contextualized representations of lattices efficiently

Highlights

(3)

NTUMIULAB

Intuitive way for SLU: pipelined approach

Task: Spoken Language Understanding

ASR NLU

• ASR errors affects downstream tasks

We can preserve uncertainty using ASR lattices

(4)

NTUMIULAB

Preserve uncertainty using ASR lattices

• Lattices:

directed acyclic graphs which encode several ASR hypotheses

(5)

NTUMIULAB

Preserve uncertainty using ASR lattices

Using lattices helps

LatticeRNN

LM pre-training helps

ELMo

Can we combine them together?

(6)

NTUMIULAB • Use LatticeLSTM to encode nodes of a lattice

• Ask the model to predict the

outgoing transitions(words) given a node’s representation

• When the lattice has only one hypothesis, this reduces to

normal language modeling

Lattice language modeling

(7)

NTUMIULAB

• So now we can pre-train a LatticeELMo!

Lattice language modeling

• However, LatticeLSTM runs prohibitively slow

• Observation: sequential text is actually a lattice with only one hypothesis

=> normal LM pretraining is also lattice LM pretraining

We can do pre-training in two stages!

(8)

NTUMIULAB

LatticeLSTM

LSTM LSTM LSTM

What a day

Linear

a day <EOS>

the, 1.0 0.8

0.2

Linear

0.9 1.0 1.0

0.1

1.0 1.0

the, 1.0

LatticeLSTM Max pooling

classification

Training Target Task Classifier Stage 1: Pre-Training on

Sequential Texts

Stage 2: Pre-Training on Lattices

LatticeLSTM

Two-stage pre-training

(9)

NTUMIULAB 100 96.8

72.18

81.48

91.6 91.89

60.54

67.35 94.99

91.98

61.65

68.52

91.69 93.43

61.29

69.95

95.84 95.37

62.88

72.04 95.97

93.29

61.23

67.9

ATIS SNIPS SWDA MRDA

Manual + ELMo 1-best 1-best + ELMo LatticeLSTM Proposed BERT-base

Results

(10)

NTUMIULAB • We extend the sequential LM objective to a lattice language modeling objective

• We propose a 2-stage framework for learning contextualized representations of lattices efficiently

• Experiments on various SLU tasks show that our proposed framework provides consistent improvements

Conclusion

(11)

NTUMIULAB

Thanks for listening!

Code available at https://github.com/MiuLab/Lattice-ELMo

[email protected] [email protected] Chao-Wei Huang Yun-Nung (Vivian) Chen

參考文獻

相關文件

I started to learn programming language (C/C++) and established my personal website, and I also did in-depth research on the mathematical problems for my high school thesis because

My paper on key term extraction was accepted for the Workshop on Spoken Language Technologies (SLT 2010, Student Best Paper Award), and as an undergraduate I

This paper addresses the above issues by proposing an architecture using end-to-end memory networks to model knowledge carryover in multi-turn conver- sations, where utterances

Matrix Factorization with Domain Knowledge and Behavioral Patterns for Intent Modeling.. Yun-Nung (Vivian) Chen, Ming Sun,

In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages

Finally, we train the SLU model by learning latent feature vectors for utterances and slot can- didates through MF techniques. Combining with a knowledge graph propagation model

 Students are introduced to the writing task - a short story which includes the sentence “I feel rich.” They are provided with the opportunity to connect their learning

S3: And the products were the lipase fatty acid…no, no, fatty acid and glycerol and the enzyme remained unchanged. S1: Our enzyme was amylase and our substrate