summary - 對於祈使機械控制命令的理性預期模型設計

Chapter 4 Implementation 35

4.4 summary

In summary, we introduce the hardware and software environment in our design We introduce the architecture of Open Agent Architecture (OAA),and how we use the structure of OAA to design and implement REM in mentioned environment.

Figure 4.4:Facilitator with One Manager

Figure 4.5: Facilitator with Two Managers

Chapter 5 Analysis

In this chapter, we would introduce the actions of REM in four case. In Section 5.2, we discuss the effect of REM in the dialogue manager with different approach.

5.1 Case Study

According the task we defined before, we list several situations about the missing information of the task. we would continue using the notation C_x, b_xand D_xto composed a task.

We denote the verbs to b_xi and nouns to C_xbased on our model. We list all possible cases.

I. Task with C_x II. Task without Cx

III. Task with b_x IV. Task without b_x

There is a table as show (Table 5.1)

We follow the definition of degree of complexity allocated to each task.[17] A simple task includes only one b_xi, a complex task requires more than one b_xi. On the other hand, the task what remembered the previous information is complex, as referential relations.[22][23]

We analysis four cases. In our proposed model, we can handle more than one b_xi. In following case, we just discuss the situation with one b_xi.

In fact, we can break a task with more than one bxiinto several sub-task with one bxi. On

Table 5.1: Case Study NP

I II

VP III Case 1 Case 2 IV Case 3 Case 4

the other side, the command sending to devices is depended on bxi not previous device information. So, there is no need to analysis overly complex tasks in this paper.

The situation of lack of D_xis similar with lack of C_x. So, we only discuss the lack of C_x.

5.1.1 Case 1

In case 1, our system receives the sentence with C_xand b_xi. In principles, the actions of system are independent because the independence of behaviour manager and component manager. So there is a new b_xi , the system registers the b_xiin dataset and update the device ID of b_xi. And there is a new C_x, the system registers the C_xin dataset and update the device ID of b_xi.

As mentioned above, we analysis four situation as show (Table 5.2)

Existing bxi and Existing Cx It's a general situation with complete sentence. All bxi and Cx

in user command existed in dataset. Because of this, our system acts normally.

Existing b_xi and new C_x If there is only C_x not in the dataset, our system still registers the C_x and update device ID in existed b_xi. Next,our system updates the device ID of b_xi. In doing so, our system can judge the logic of bxi next time.

New bxi and Existing Cx If there is only b_xi not in the dataset, our system still registers the b_xi and update device ID in b_xi. But, no need to register the C_x. The situation is very similar to the previous.

Table 5.2: Case 1

I C_xin NPs

Existing C_x New C_x

III b_xi in VPs Existing b_xi No register No update Register C_x Update de-vice ID of b_xi

New bxi Register bxi Update de-vice ID of b_xi

Register Cx and bxi

Table 5.3: Example of Case 1 Turn on the TV in the bedroom [Turn on] is an existing bxi , [TV] is an existing Cx

No register No update [Turn on] is an new bxi , [TV] is an existing Cx

Update device ID in [Turn on]

[Turn on] is an existing b_xi, [TV] is an new C_x Register [TV]

Update device ID in [Turn on]

[Turn on] is an new b_xi , [TV] is an new C_x Register [Turn on] and [TV]

Update device ID in [Turn on]

New b_xi and new C_x Even the sentence includes b_xi and C_x, the b_xiand C_xare not in dataset.

Our system not only registers bxiand Cx in dataset but also update the device ID in bxi. In doing so, our system can distinguish the b_xi and C_xnext time.

We take a t for a example. 5.3

5.1.2 Case 2

In this case,the dominated factor is the C_x. The factor C_xwould dominate the action of our system. If there is a new C_x, the system have to register the C_x. On the other hand, the system loads the b_xi when the C_xis existed in dataset. In fact, We define a component not only deviceC_xbut also description of device D_x, including location and features. We use C_xand D_x to describe a complete component.

The point of case 2 is system how to handle the situation that there is not any b_xi in user command but C_x. All decision of case 2 are up to C_x. We take a sentence for a example. 5.5

Table 5.4: Case 2

II C_xin NPs

Existing C_x New C_x

III b_xi in VPs NULL Load b_xi according to user file Register C_x NULL Load b_xi according to user file Register C_x

Table 5.5: Example of Case 2 TV in the bedroom

[TV] is an existing C_x Load b_xi according to user preference [TV] is an new C_x Register [TV]

And there is a table as show (Table 5.4)

5.1.3 Case 3

In this case, we pay attention on b_xi -- lack of device type, only behaviour, as show (Table 5.6). The user's input includes only behaviour bxi. we can expect the bxidominates the flow of our system.

There are two possible action. First, user gives a b_xiwhich the system can't realize. Our system registers the bxi without device ID,because there's no Cx in user's input. Second, bxi

form user's sentence exists in dataset. Our system can load the existed C_x from device ID of b_xi according to the behaviour mapping table.

We take a sentence for a example. 5.7

Table 5.6: Case 3

I C_xin NPs

NULL NULL

III bxi in VPs Existing b_xi Load device ID from b_xi Load device ID from b_xi New b_xi Register b_xi,but no device ID Register b_xi,but no device ID

Table 5.7: Example of Case 3 Turn it on in the bedroom

[Turn on] is an existing bxi Load device ID from [Turn on]

[Turn on] is an new b_xi Register [Turn on]

5.1.4 Case 4

There is a situation we do not handle. A sentence without any b_xi and C_xis not a legal command. Because lack of information can not be a complete command,we preclude this situation.

5.2 Discussion

We would discuss the dialogue systems with REM or without REM. As we mentioned Section ??, the REM would fill the missing information in a task. The dialogue manager with REM would receive the task with C_x, b_xand D_x, if REM could find the solution. The dialogue manager without REM would query database or ask user for missing information.

5.2.1 Finite-sate approach with REM

The dialogue manager in finite-state approach needs the a series of ordered words or phrases to be input. The dialogue manager with FSM should check the elements in task in ordered. If the input is not intact, the dialogue manager may stuck.

The REM would deal with the missing information, the dialogue manager could receive the task with full information. The dialogue manager with REM could decrease the size of dialogue and work well.

5.2.2 Frame-based approach with REM

The dialogue manager with frame-based approach fills the designed slots. The element in a task which we determined would be filled in the designed slots. The dialogue manager with frame-based approach would ask users for missing information.

The sub-models in REM could guess the missing information. The dialogue manager could receive the result from REM to fill the slots, not to ask users for missing information.

Chapter 6 Conclusion

The REM could decrease the complex process querying data of dialogue manager about missing information. The dialogue manager does not ask user after the REM.

The basic concept of REM is finite state machine with a series of judging. Because the form of the task is fixed, we apply the finite state machine in determine mechanism even the finite state machine with shortage in scalability. The sub-models deal with the missing information and the unknown element in a task.

If missing information happens, the REM works based on the information of task, as device or behavior. The REM would choose different action based on the device or behavior in the task. With behavior mapping table or component mapping table, the REM could find the corresponding solution to the task with missing information.

By the determined task, we design the REM with distributed agents for missing

information. The form of a task would assist REM to diagnose the state of a task, sub-models in REM could find the solution.

The Rational Exception Model (REM) would help the dialogue manager to analysis the state of dialogue. The sub-models in REM would query the missing information and determine the logic of behavior for corresponding device with distributed agents. The dialogue manager does not query the missing information with mentioned approached.

Chapter 7 Future Work

Future work in our research is to refine the mapping table with more efficient approach.

We choose the simple approach, weight column in mapping table, to infer the solution.

For this purpose, we could query database on-line and determine the relationship between the keyword we searched and the result. For example,we could search the keyword "turn on", the result would be the short sentence "turn on the radio". The short sentence included the corresponding device "radio" for the keyword "turn on". We can refine our mapping table by determining the relationship between "turn on" and "radio".

References

[1] D. Traum and S. Larsson, ''The information state approach to dialogue management,'' in Current and New Directions in Discourse and Dialogue, 2003, pp. 325--353.

[2] A. Cheyer and D. Martin, ''The open agent architecture,'' Journal of Autonomous Agents and Multi-Agent Systems, vol. 4, no. 1, pp. 143--148, March 2001, oAA.

[3] M. F. McTear, ''Spoken dialogue technology: enabling the conversational user interface,'' ACM Comput. Surv., vol. 34, pp. 90--169, March 2002. [Online]. Available:

http://doi.acm.org/10.1145/505282.505285

[4] M. M. University and M. F. Mctear, ''Modelling spoken dialogues with state transition diagrams: experiences with the cslu toolkit,'' in Proc 5th International Conference on Spoken Language Processing, 1998, pp. 1223--1226.

[5] B. Hansen, D. G. Novick, and S. Sutton, ''Systematic design of spoken prompts,'' in Proceedings of the SIGCHI conference on Human factors in computing systems: common ground, ser. CHI '96. New York, NY, USA: ACM, 1996, pp. 157--164. [Online].

Available: http://doi.acm.org/10.1145/238386.238466

[6] H. Aust, M. Oerder, F. Seide, and V. Steinbiss, ''The philips automatic train timetable information system,'' Speech Commun., vol. 17, pp. 249--262, November 1995. [Online].

Available: http://portal.acm.org/citation.cfm?id=219030.219079

[7] D. S.-H. C. Chin-Han Tsai, ''A study on speech dialogue system and dialogue strategy,'' July 2005.

[8] M. Guo, Y. Liu, and J. Malec, ''A new q-learning algorithm based on the metropolis criterion,'' EEE Trans Syst Man Cybern B Cybern, vol. 34, pp. 2140--3, 2004. [Online].

Available:

http://www.biomedsearch.com/nih/new-Q-learning-algorithm-based/15503510.html

[9] E. Levin, R. Pieraccini, and W. Eckert, ''Using markov decision process for learning dialogue strategies,'' in Proc. ICASSP, 1998, pp. 201--204.

[10] H. machine-dialog Corpora, W. Eckert, E. N�th, H. Niemann, and E.-G.

Schukat-Talamazzini, ''Real users behave weird - experiences made collecting large human-machine-dialog corpora,'' 1995.

[11] S. Sutton, R. Cole, J. D. Villiers, J. Schalkwyk, P. Vermeulen, M. Macon, Y. Yan,

E. Kaiser, B. Rundle, K. Shobaki, P. Hosom, A. Kain, Johan, J. Wouters, D. Massaro, and M. Cohen, ''Universal speech tools: The cslu toolkit,'' in In Proceedings of the

International Conference on Spoken Language Processing (ICSLP, 1998, pp.

3221--3224.

[12] M. F. Mctear, ''Using the cslu toolkit for practicals in spoken dialogue technology,'' in University College London, 1999, pp. 1--7.

[13] C. L. Liu, Elements of discrete mathematics, 1977.

[14] J. Chu-Carroll, ''Mimic: an adaptive mixed initiative spoken dialogue system for

information queries,'' in Proceedings of the sixth conference on Applied natural language processing, ser. ANLC '00. Stroudsburg, PA, USA: Association for Computational Linguistics, 2000, pp. 97--104. [Online]. Available:

http://dx.doi.org/10.3115/974147.974161

[15] J. Chu-Carroll and M. K. Brown, ''An evidential model for tracking initiative in collaborative dialogue interactions,'' User Modeling and User-Adapted Interaction, vol. 8, pp. 215--254, February 1998. [Online]. Available:

http://portal.acm.org/citation.cfm?id=598279.598319

[16] J. G. Amores, G. Pérez, and P. Manchón, ''Mimus: a multimodal and multilingual

dialogue system for the home domain,'' in Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, ser. ACL '07. Morristown, NJ, USA: Association for Computational Linguistics, 2007, pp. 1--4. [Online]. Available:

http://portal.acm.org/citation.cfm?id=1557769.1557771

[17] P. Manchón, C. del Solar, G. Amores, and G. Pérez, ''Multimodal interaction analysis in a smart house,'' in Proceedings of the 9th international conference on Multimodal

interfaces, ser. ICMI '07. New York, NY, USA: ACM, 2007, pp. 327--334. [Online].

Available: http://doi.acm.org/10.1145/1322192.1322249

[18] T. Project., Talk and Look: Linguistic Tools for Ambient Linguistic Knowledg, 2007.

[Online]. Available: www.talk-project.org

[19] P. M. Portillo, G. P. García, and G. A. Carredano, ''Multimodal fusion: a new hybrid strategy for dialogue systems,'' in Proceedings of the 8th international conference on Multimodal interfaces, ser. ICMI '06. New York, NY, USA: ACM, 2006, pp. 357--363.

[Online]. Available: http://doi.acm.org/10.1145/1180995.1181061

[20] N. Chomsky, Syntactic structures, 1957.

[21] A. C. G.-L. L. David L. Martin, ''Development tools for the open agent architecture,'' vol.

PAAM 96. SRI AI center, April 1996.

[22] L. Ahrenberg, A. J�nsson, and N. Dahlb�ck, ''Discourse representation and discourse management for a natural language dialogue system,'' in In Proceedings of the Second Nordic Conference on Text Comprehension in Man and Machine, Taby, 1990.

[23] J. Hawkins, ''Definiteness and indefiniteness: A study in reference and grammaticality prediction,'' 1978.

在文檔中對於祈使機械控制命令的理性預期模型設計 (頁 47-0)