Future Research Directions - Conclusions and Future Research Directions

4. Conclusions and Future Research Directions

4.2. Future Research Directions

This research only built a simple prototype of the XML and ontology benchmark workload model of heterogeneous information integration. It still needs more effort to expand its capabilities. We expect this work to continue and evolve in the future. Future research directions include:

 Enhancing the ontology query model. The development of an ontological standard presents many opportunities and challenges. New reasoning tasks may arise in the future.

Retrieval (instances of a concept) and realization (most specific class of instance) may not be sufficient. In order to make the ontology query model more comprehensive, further study to keep track of ontology progression is needed.

 Improving the complexity factors of the XML query model. The complexity factors we analyze in the XML query model are still too rough. Each query type can be analyzed more carefully to refine the query model.

 Implementing various data distributions. In this research, only uniform distribution is implemented. It cannot evaluate performance under different distributions.

Implementation of diverse data distributions will become a user requirement.

 Applying the workload model to other applications. Ontology and XML are complementary technologies, and there are other applications that can apply. In this research, we assume the heterogeneous information integration system is used on Intranets, such as enterprise information integration (EII), electronic business (EB), and enterprise application integration (EAI). There are other applications between enterprises that may need to integrate heterogeneous information, such as business-to-business integration (B2Bi), collaborative commerce (C-Commerce), and electronic commerce

(EC). We can modify the workload model of this research to create other benchmarks that are based on XML and ontology with different characteristics.

References

1. Andersen, B. (2001). What is an ontology. Retrieved February 5, 2004, from http://www.ontologyworks.com/docs/what-is-ontology.pdf

2. Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., & Raghavan, S. (2001).

Searching the Web. ACM Transactions on Internet Technology, 1(1), 2–43.

3. Böhme, T., & Rahm, E. (2001). XMach-1: A Benchmark for XML Data Management. Proceedings of German database conference BTW2001, Oldenburg, Germany, 264-273.

4. Böhme, T., & Rahm, E. (2003). Multi-User Evaluation of XML Data Management Systems with XMach-1. Lecture Notes in Computer Science (LNCS), 2590, 148-159.

5. Bos, B. (1997). The XML Datamodel. Retrieved January 30, 2004, from http://www.w3.org/XML/Datamodel.html

6. Beech, D., Malhotra, A., & Rys, M. (1999). A Formal Data Model and Algebra for XML. W3C XML Query working group note.

7. Bray, T., Paoli, J., Sperberg-McQueen, C. M., & Maler, E. (2000). Extensible Markup Language (XML) 1.0 (Second Edition). Retrieved January 30, 2004, fromhttp://www.w3.org/TR/REC-xml

8. Baader, F., Horrocks, I., & Sattler, U. (2003). Description logics as ontology languages for the semantic web. In Dieter Hutter and Werner Stephan (Ed.), Festschrift in honor of Jörg Siekmann, Lecture Notes in Artificial Intelligence.

Springer.

9. Chamberlin, D., Fankhauser, P., Marchiori, M., & Robie, J. (2003). XML Query

Requirements. Retrieved January 8, 2004, from

http://www.w3.org/TR/xquery-requirements/

10. Cui,Z.,Jones,D.,& O’Brien,P.(2001).Issues in ontology-based information integration. Proceedings of IJCAI-01 Workshop on E-Business & the Intelligent Web.

11. Elhaik, Q., Rousset, M-C, & Ycart., B. (1998). Generating Random Benchmarks for Description Logics. ProceedingsofDL’98.

12. Fernández, M., Malhotra, A., Marsh, J., Nagy, M., & Walsh, N. (2003). XQuery 1.0 and XPath 2.0 Data Model. Retrieved January 30, 2004, from http://www.w3.org/TR/xpath-datamodel/

13. Gray, J. (1993). The Benchmark Handbook (2nd ed.) . Morgan Kaufmann, San

Mateo, CA, Retrieved January 8, 2004, from

http://www.benchmarkresources.com/handbook/index.asp

14. Gruber, T. R. (1993). Towards Principles for the Design of Ontologies Used for Knowledge Sharing. International Workshop on Formal Ontology, Padova, Italy.

15. Guo, Y., Heflin, J., & Pan, Z. (2003). Benchmarking DAML+OIL Repositories.

Proceedings of the 2nd International Semantic Web Conference, LNCS, 2870, 613-627.

16. Gómez-Pérez, A. (1994). Some Ideas and Examples to Evaluate Ontologies.

Technical Report KSL-94-65, Knowedge Systems Laboratory, Stanford University.

17. Gruninger, M., & Fox, M. S. (1995). Methodology for the design and evaluation of ontologies. Proceedings of IJCAI'95 Workshop on Basic Ontological Issues in Knowledge Sharing.

18. Horrocks, I. (2002). DAML+OIL: A Reason-able Web Ontology Language, Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology, 2-13.

19. Horrocks, I. (2002). DAML+OIL and Description Logic Reasoning. Retrieved

February 12, 2004, from

http://www.dcs.shef.ac.uk/~angus/daml-oil-workshop/presentations/horrocks.pdf 20. Horrocks, I., & Patel-Schneider, P. (1998). DL systems comparison. Proceedings

ofDL’98.

21. Heflin, J. (2003). OWL Web Ontology Language Use Cases and Requirements.

Retrieved February 5, 2004, fromhttp://www.w3.org/TR/webont-req/

22. Jiang, H., Lu, H., Wang, W., & Yu, J. X. (2002). Path Materialization Revisited:

An Efficient Storage Model for XML Data. The 13th Australasian Database Conference (ADC 2002), Melbourne, Australia, 85-94.

23. Li, Y. G., Bressan, S., Dobbie, G., Lacroix, Z., Lee, M. L., Nambiar, U., &

Wadhwa, B. (2001). XOO7: applying OO7 benchmark to XML query processing tool. Proceedings of the tenth international conference on Information and knowledge management (CIKM), Atlanta, Georgia, USA, 167-174.

24. Lehti, P. (2001). Design and implementation of a data manipulation processor for an xml query processor. Technical University of Darmstadt, Darmstadt, Germany, Diplomarbeit.

25. Maier, A., Aguado, J., Bernaras, A., Laresgoiti, I., Pedinaci, C., Pena, N., &

Smithers, T. (2003). Integration with Ontologies. Wissensmanagement 2003, 21-24.

26. Manolescu, I., Florescu, D., & Kossmann, D. (2001). Answering XML Queries over Heterogeneous Data Sources. Proceedings of the 27^th VLDB Conference, Roma, Italy.

27. Nambiar, U., Lacroix, Z., Bressan, S., Lee, M. L., & Li, Y. G. (2002). Efficient XML Data Management: An Analysis. Proceedings of the 3rd International Conference on Electronic Commerce and Web Technologies (ECWeb), Aix en

Provence, France, 87-98.

28. Nambiar, U., Lacroix, Z., Bressan, S., Lee, M. L., & Li, Y. G. (2002). Current Approaches to XML Management. IEEE Internet Computing Journal, 6(4), 43-51.

29. Noy, N. F., & McGuinness, D. L. (2001). Ontology Development 101: A Guide to Creating Your First Ontology. Stanford Knowledge Systems Laboratory Technical Report KSL-01-05 and Stanford Medical Informatics Technical Report SMI-2001-0880.

30. Noy, N. F., & Musen, M. A. (2002). Evaluating Ontology-Mapping Tools:

Requirements and Experience. Proceedings of the OntoWeb-SIG3 Workshop EON 2002 at EKAW 2002, Siguenza, Spain, 1-14.

31. Omelayenko B. (2002). Ontology-Mediated Business Integration. Proceedings of the 13-th EKAW 2002 Conference, Siguenza, Spain, LNAI 2473, 264-269.

32. Rys, M. (2002). Proposal for an xml data modification language. Microsoft Corp., Redmond, WA, Proposal.

33. Schmidt, A., Waas, F., Manegold, S., & Kersten, M. (2003). A Look Back on the XML Benchmark Project. Lecture Notes in Computer Science (LNCS), 2818, 263-278.

34. Schmidt, A. R., Waas, F., Kersten, M. L., Florescu, D., Manolescu, I., Carey, M.

J., & Busse, R. (2001). The XML Benchmark Project. Technical Report INS-R0103, CWI, Amsterdam, The Netherlands.

35. Schmidt, A., Waas, F., Kersten, M., Florescu, D., Carey, M. J., Manolescu, I., &

Busse, R. (2001). Why and how to benchmark XML databases. ACM SIGMOD Record, 30(3), 27-32.

36. Schmidt, A. R., Waas, F., Kersten, M. L., Carey, M. J., Manolescu, I., & Busse, R. (2002). XMark: A Benchmark for XML Data Management. Proceedings of the International Conference on Very Large Data Bases (VLDB), Hong Kong, China, 974-985.

37. Sengupta, A., & Mohan, S. (2003). Formal and conceptual models for XML structures - the past, present and future. Retrieved January 30, 2004, from http://www.indiana.edu/~isdept/research/papers/tr137-1.pdf

38. Stevens, R., Goble, C.A., & Bechhofer, S. (2000). Ontology-based Knowledge Representation for Bioinformatics. Briefings in Bioinformatics, 1(4), 398-414.

39. Suarez-Figueroa, M. C., & Gomez-Perez, A. (2003). Results of Taxonomic Evaluation of RDF(S) and DAML+OIL ontologies using RDF(S) and DAML+OIL Validation Tools and Ontology Platforms import services.

Proceedings of the 2nd International Workshop on Evaluation of Ontology-based Tools (EON2003), Sanibel Island, Florida, USA.

40. Staab, S., Schnurr, H. P., Studer, R., & Sure, Y. (2001). Knowledge Processes and Ontologies. IEEE Intelligent Systems, 16(1), 26-34.

41. Simov, K., & Jordanov, S. (2002). BOR: a pragmatic DAML+OIL reasoner.

On-To-Knowledge deliverable D-40, OntoText Lab.

42. Sullivan, D. (2003). Search Engine Sizes. Retrieved May 6, 2004 from http://searchenginewatch.com/reports/article.php/2156481

43. Tempich, C., & Volz, R. (2003). Towards a benchmark for Semantic Web reasoners - an analysis of the DAML ontology library. Proceedings of the 2nd International Workshop on Evaluation of Ontology-based Tools (EON2003), Sanibel Island, Florida, USA.

44. Uschold, M., King, M., Moralee, S., & Zorgios, Y. (1998). The Enterprise Ontology. The Knowledge Engineering Review, 13(1), 31-89.

45. Uschold, M., & Gruninger, M. (1996). Ontologies: principles, methods and applications. Knowledge Engineering Review, 11(2), 122-147.

46. Weißenberg, N., & Gartmann, R. (2003). Ontology Architecture for Semantic Geo Services for Olympia 2008. In: Bernard, L., A. Sliwinski and C. Senkler (Eds). Münsteraner GI-Tage, Münster. IfGIprints 18. 267-283.

47. Weinberger,H.,Te’eni,D.,& Frank,A.J.(2003).Ontologies of Organizational Memory as a Basis for Evaluation. 11th ECIS'03 European Conference on Information Systems, Naples, Italy.

48. Wache, H., Vögele, T., Visser, U., Stuckenschmidt, H., Schuster, G., Neumann, H., & Hübner, S. (2001). Ontology-Based Integration of Information - A Survey of Existing Approaches. Proceedings of the IJCAI-01 Workshop: Ontologies and Information Sharing, 108-117.

49. Weiss, S. (1997). Glossary for Information Retrieval. Retrieve February 22, 2004, fromhttp://www.cs.jhu.edu/~weiss/glossary.html

50. YoshiKawa, M., & Amagasa, T. (2001). XRel: A path-based approach to storage and retrieval of XML documents using relational databases. ACM Transactions on Internet Technology, 1(1), 110-141.

赴國外研究心得報告

Internet has changed the way business conducted between companies worldwide. Firms are now used to exchange business information electronically over Internet. Since the mid-1990s, wave after wave of web technology standards emerge to support the electronic business information exchange. Standards like Extensible Markup Language (XML), Internet Electronic Data Exchange (I-EDI), RosettaNet

, ebXML

, Web Ontology Language (OWL), and Semantic Web (SW) surge and sweep electronic commerce worldwide (W3C 2006) (RosettaNet 2006) (ebXML 2006) (OWL 2004).

These standards impact on contemporary corporations in many aspects. These standards are proposed to provide a uniform way of business information exchange mechanisms.

Semantic not syntactic integration emerges to be the issue that hinders the plan and progress of business-to-business integration electronic commerce (B2Bi EC), which in turn causes time, cost, and reinvention every time there is a change in the public process, there is a change in the standard, and there is a change in the partnership.

The traditional method to tackle the issue can be divided into the programming (ad hoc) approach and the mapping table (syntactic) method. The programming approach solves the problem in a one to one fashion but the result easily becomes the unmanageable

“spaghetti” chaos. The mapping table seems to be an easy and convenientapproach.

However, it only deals with the specific data values not the data definition. An exponentially growing number of trading partners emerge in B2Bi EC. Programming is no longer an effective and flexible way. Mapping table is too primitive and inadequate.

The new complexity of data semantic in the business information exchange makes both approaches even harder to tackle the problem (Stojanovic et al, 2002) (Trastour et al, 2003). We believe that Internet growth makes B2Bi climb to a higher level of exchange, that is, the exchange of business meanings and business constraints. A knowledge-intensive and system-to-system semantic integration model and method is in need.

RosettaNet is a consortium of major computer and consumer electronics, electronic components, semiconductor manufacturing, telecommunications and logistics companies working to create and implement industry-wide, electronic commerce and business process standards. RosettaNet is a subsidiary ofGS1 US, formerly the Uniform Code Council, Inc. (UCC).

ebXML is a worldwide project initiated and driven by the Organization for the Advancement of Structured Information Standards (OASIS) and the United Nations Centre for Trade Facilitation and Electronic Business (UN/CEFACT). ebXML is to map out a common framework to enable interoperable electronic commerce and business expressed in XML.

2. RESEARCH ISSUE

Business-to-business integration is to exchange business information between different firms and interoperate the public processes over Internet. The traditional ways of trading include telephone, fax, and email. These approaches introduce faults, redundancies, and wastes. Electronic data interchange is a 1990s and transaction-based approach. However, the change of EDI specification is neither on line or real time. EDI lacks the ability to quickly respond to business changes and suffers from the scalability in the presence of an exponentially growing number of users. Internet EDI is the next stage of B2Bi development. And new B2Bi standards have been proposed based on XML. They indeed provide a more on line and real time method than traditional EDI. However, companies still struggle with the difficulty of heterogeneity and interoperability in the exchange and execution of processes and protocols. In essence, an enhanced approach needs to provide the technology compatibility and the knowledge representation.

Electronic commerce within and across national boundaries is universal. Most firms if not all have problems in one way or another with business process integration and business model interoperability. On both methodological and pragmatic levels, due to increasing diversity in web pages, web services, data sources, and programming languages in all countries, developing an analysis framework of cross national B2Bi resolution is important at international, national, and intra-national levels. This study will develop an analysis framework and a method to explore the way integration and interoperability over schema and semantics can be achieved in B2Bi EC. The dynamics of Internet and intelligence of XML and ontology interplay with inter-organizational context, making it a base for exploring the model and method.

Various approaches have been proposed to study B2Bi issues. However, they lack the process perspective and the semantic representation. Their interoperability is based on adhocracy. Much is needed in the systematic and methodological enhancement. This research intends to tackle the inadequacy of B2Bi standard implementation in forms. An ontology-assisted analysis framework is created to reconcile and represent the conflicts and correspondences in the B2Bi EC issue. Based on the literature, in general, B2Bi framework has three fundamental layers to deal with (Cut et al, 2002) (Falkovych et al, 2003) (Gasevic et al, 2004). They are the communication layer, the content layer, and the process layer. These layers represent the important mechanism and management in B2Bi such as the coupling among partners, the autonomy, and the security. In essence, they mean the specifications of the message formats, the transport protocol, the procedure, and the security mechanism.

3. RESEARCH METHOD

3.1 Analysis-driven Ontology Modeling

The research structure depicted in Figure 1 is the analysis framework we present in the paper to illustrate the model and the method to be developed and deployed in the B2Bi EC standards implementation. The framework is made up of the Unified Modeling Language, the Extensible Markup Language, and the Ontology technologies. Business process interoperability and business data integration are considered the antecedents to B2Bi strategies. More in the framework, a set of analysis procedures are proposed. We analyze the cross national business partners’ data schema and process model. We examine the electronic commerce standard in the aspect of data semantics and process semantics. A set of heuristics and rules will be created to represent the above analyzed processmodelsand dataschemain form ofsyntax and semantics.Thepartners’and the standards’ontologieswillbeseparately developed using therulesand the heuristics.We will merge these ontologies in order to reconcile their conflicts and correspondences. The resulting merged ontologies are tested by the prototype system.

In the end, we hope there is an evolution step to be undertaken to reuse the resulting ontologies. The trading partners can share the domain knowledge in the future standard implementation. The following subsections describe the procedures of the analysis framework and are divided into Step A through Step D. Step A develops the domain ontology of the firm and of the trading partners. Step B creates the domain ontology of the standards. Step C focuses on the ontology knowledge representation for the firm and for the trading partners. Step D creates the ontology knowledge representation of the standards.

[Insert Figure 1 here]

3.2 Step A –Firm PublicProcessOntology,“as-is” A.to analyzethecurrentbusinessprocess,“as-is”

If we want to analyze the current process, in general, we initiate a meeting. The meeting participants include the process owners and the process users. Through interviewing users, we discover detailed information about the current processes. The detail information contains the process goal, the process flow, the process user role, the process input, the process output and others. This information should be minuted. According to the meeting minutes, we draw the UML diagrams. If we understand the current processes more, we can represent the process as in UML without losing its semantics.

A.1 to design the use case diagram

Before we draw a use case diagram, we have to gather data. We analyze the process actors, the process preconditions, and the process flow to fill out an analysis form. Take the purchase order (PO) as an example. There should be two actors in the purchase order process: buyer and seller. Before the buyer orders something, the seller makes a request for a quote document from the seller first. Then, if the buyer accepts the quote, he sends a purchase order to the seller. When the seller receives the purchase order, the seller confirms the order. This scenario is the common and simple one.

A.2 to design the sequence diagram

In a sequence diagram, we try to discover all messages that are exchanged in a business process and in the purchase order. It can be extracted from the use case diagram and the meeting minutes. In the purchase order example, the PO Request is the first message to be sent from the buyer to the seller. When the seller receives the order request, the seller should check the inventory to determine whether the firm can fulfill that purchase order or not. Then the PO Confirmation is the next message to be sent from the seller to the buyer.

A.3 to design the activity diagram

An activity diagram can show the flow from one activity to another activity. It can represent the detailed process flow. We should find the information from discussion at the meetings so as to develop the activity diagram. We need to discover the detailed actions in the flow, the initial state, and the final state. We then continue the PO example and finish the activity diagram. In this example, we have three actions: request a purchase order, check inventory for this order, and confirm this purchase order.

A.4 to design the class diagram

We try to extract a generic class construct from the use case diagram, the sequence diagram, and the activity diagram. Again, we move on with the PO example. First, we work on the use case diagram. We discover four components: the two actors and the two use cases. We take the two major elements in the use case diagram, Actor and Use Case, to form the two classes: Actor and Activity. Next, we extract the class Message from the sequence diagram, because the sequence diagram describes the message flow and the order flow between the objects. Then, we work on the activity diagram which consists of several actions as described above. The class Action can be extracted.

3.3 Step B –Standard PublicProcessOntology,“to-be” B. to develop the EC-standard-compliant business process

We use four UML diagrams to perform the work such as the use case diagram, the sequence diagram, the activity diagram, and the class diagram. They are utilized to model an EC-standard-compliant business process. The mapping methods between the four diagrams are the same as in Step A. The difference between Step A and Step B is the source of analysis. Step A focuses on the firm existing and current public processes. We have to collect and examine them through interviews and observations. We model the standard processes from B2Bi EC standard specifications at Step B. Some B2B standards have the concept of process, but some do not. If they do not, we should discuss this issue with the trading partners in order to develop a new standard process specification based on the B2Bi EC recommendation. Of course, some B2B standards have adopted UML diagrams to present their standard processes in the specification. We can directly use them.

B.1 to design the use case diagram

We develop the use case diagram based on the B2B standard specification. A B2B standard specification often describes the process purpose and the process definition in the statements. We search and extract the basic components for a use case from the

在文檔中本體論和資料模式輔助之資訊整合與績效評估工作量模型研究(II) (頁 29-88)