IEEE Internet Computing, forthcoming in 1999

Technologies for a Web Object Model*

Frank Manola
Object Services and Consulting, Inc.
151 Tremont Street #22R
Boston, MA 02111
fmanola@objs.com
http://www.objs.com/

1. Introduction

The World Wide Web is becoming an increasingly important factor in planning for general distributed computing environments, for example, to support external access to enterprise systems and information (e.g., by customers, suppliers, and partners), and to support internal enterprise operations. Organizations perceive a number of advantages in using the Web in enterprise computing, a particular advantage being that it provides an information representation which

supports interlinking of all kinds of content
is easy for end-users to access
supports easy content creation using widely-available tools

However, as organizations have attempted to employ the Web in increasingly-sophisticated applications, these applications have begun to overlap in complexity the sorts of distributed applications for which distributed object architectures such as OMG's CORBA, and its surrounding Object Management Architecture (OMA) [omg97] were originally developed. Since the Web was not originally designed to support such applications, Web application development efforts increasingly run into limitations of the basic Web infrastructure.

If the Web is to be used as the basis of complex enterprise applications, it must provide generic capabilities similar to those provided by the OMA (although these may need to be adapted to the more open, flexible nature of the Web, and specific requirements of Web applications). This involves such things as providing higher-level services (such as enhanced query and transaction support) and their composition in the Web. However, the basic data structuring capabilities provided by the Web must also be addressed, since the ability to define and apply powerful generic services in the Web, and the ability to generally use the Web to support complex applications, depends crucially on the ability of the Web's underlying data structuring facilities to support these complex applications and services.

A fundamental direction of efforts to address the limitations of current Web data structuring technology has been attempts to integrate aspects of object technology with the basic infrastructure of the Web. This paper describes a number of new Web technologies currently being developed that further this integration. These technologies provide the basis for integrating data and behavior in the Web, effectively providing the basis of a "Web object model", and allowing the construction of object-like facilities to address the requirements of more powerful Web applications.

2. Increasing the Structuring Power of the Web

The basic data structure of the Web today consists of hyperlinked HTML documents. It is generally recognized that HTML is too simple a data structure to support complex applications [bos97], e.g.:

applications that require the Web client to function as the front-end to enterprise applications or mediate between multiple heterogeneous databases,
applications that require more flexibility in distributing processing load between Web servers and clients, and
applications that require the Web client to present different views of the same data to different users or in which intelligent Web agents need to tailor information discovery to the needs of individual users.

This is primarily because HTML tags deal primarily with the presentation aspects of document structure; they do not identify the essential meaning of the various units of data in documents in a way that supports the association of appropriate processing with them.

A fundamental direction of efforts to address HTML limitations has been attempts to integrate aspects of object technology with the basic infrastructure of the Web. There are a number of reasons for the interest in integrating Web and object technologies. For example:

The Web can already be viewed as a simple form of distributed object system in which HTML pages are considered as objects, having identity provided by URLs, and methods defined by, or that are invoked via, HTTP servers. The basic resemblance of the Web to a simple object system has created a natural interest in seeing how far the resemblance can be further developed.
Object technology is seen as a particularly-convenient way of adding functionality (e.g., behavior) to the Web, both by adding the behavior provided by objects to the static content of HTML, and by allowing Web clients and servers, through distributed object technology, to access other computing resources. For example:

Java can be added to Web pages and, once downloaded to the client, can then execute. In some cases, the client objects then interact with server objects, possibly using a different protocol, e.g., OMG's IIOP or Java's RMI.
Web pages can be treated as objects or collections of objects for use by other code. Dynamic HTML developments by Microsoft and Netscape, and current World Wide Web Consortium (W3C) work on a Document Object Model (see below), allow the contents of an HTML document to be treated as a collection of programmable objects. Client-side code can then access these objects and manipulate them dynamically (e.g., causing immediate changes in the document displayed to the user).

However, until recently, such efforts toward integrating object technology have been based on HTML, with its basic structuring limitations, and much effort has been expended in attempting to deal with these limitations.

A number of new Web technologies are being developed to address the limitations of current Web data structuring technology. In addition to their individual contributions towards improving the functionality of the Web, these Web technologies can be understood as enhancing the capabilities of the Web in supporting an object model [Man98a,b]. This is based on the observation that key components of any object model are:

data structures that can represent object state
ways to associate behavior (object methods) with the object state
ways for the object methods to access and operate on that state

Applying this idea to the Web, Web pages can be considered as state, and objects can be constructed by enhancing those pages with additional "metadata" that allows the pages to be considered as objects in some object model. In particular, it is desirable to enhance Web pages with "metadata" consisting of programs that act as object methods with respect to the "state" represented by the Web page.

Thinking in this way, the "Web object model" can be improved by providing:

a richer base representation than HTML, in order to better represent "object state" (in particular, provide better support for semantic identification of fields, rather than simply supporting the presentation aspects of text)
an API to this state, so that programs can readily access it (without complex parsing)
an enhanced ability to define relationships between this state and specified behavior (e.g., pieces of code) that can serve as object methods, to provide an enhanced ability to compose objects from both existing and new Web resources

At the same time, the openness of the Web compared to conventional programming language object models needs to be preserved, due to the distinct requirements of the Web environment for openness and scalability. The following sections briefly describe a number of Web technologies that contribute to this integration of Web and object technologies.

3. The Extensible Markup Language (XML)

The Web community is increasingly targeting W3C's Extensible Markup Language (XML) as its next-generation data representation. XML defines a data format for structured document interchange on the Web, and is a simple subset of SGML (Standard Generalized Markup Language). Unlike HTML, which defines a fixed set of tags, XML allows the definition of customized markup languages with application-specific tags, e.g., <AUTHOR> or <QTY-ON-HAND>, for representing information in particular application domains such as chemistry, electronics, or general business. Using such tags to delimit individual data items or groups of data (called elements in XML) allows the data to be easily identified by programs, and hence provides the basis for programs to do useful processing with the data in Web resources. [kr97] provides a useful overview of the potential benefits of using XML in Web-related applications. Numerous articles in the trade press have described various potential applications of XML. In addition, a number of texts on XML have also appeared [xmltext1,xmltext2]. These can be consulted for further details on XML.

The XML definition [xmllang] provides specifications for both XML documents, and XML Document Type Definitions (DTDs). An XML document need not have a DTD, but it must be at least well-formed. This means that it must follow a number of simple rules to ensure that it can be parsed properly without a DTD, e.g., an element must generally have both start and end tags, and elements must properly nest inside each other. Applications can readily use well-formed XML documents for data interchange. The following is an example of a well-formed XML document describing a publication:

<?xml

Version="1.0"?>

<PUBLICATION>

  <TITLE>Why I am

Overworked</TITLE>

  <AUTHOR role="author">

<FIRSTNAME>Fred</FIRSTNAME>

<LASTNAME>Smith</LASTNAME>

     <COMPANY>Jones and

Associates</COMPANY>

  </AUTHOR>

  <ABSTRACT>This is the

text of the abstract</ABSTRACT>

</PUBLICATION>

A valid document is well-formed, has a DTD, and the document structure conforms to the DTD. A DTD is a formal definition of a particular type of document. The DTD specifies which element tags can appear in the document, which attributes are associated with each element, and the permitted structure of the elements (e.g., <COMPANY> may only appear inside <AUTHOR>). The following is a DTD for the above document:

<?xml Version="1.0"?>

<!DOCTYPE

PUBLICATION [

<!ELEMENT PUBLICATION

(TITLE,AUTHOR+,ABSTRACT*)>

<!ELEMENT AUTHOR (FIRSTNAME, LASTNAME,

(UNIVERSITY | COMPANY)?)>

<!ATTLIST AUTHOR role (author|techwriter)

"author">

<!ELEMENT FIRSTNAME (#PCDATA)>

<!ELEMENT

LASTNAME (#PCDATA)>

<!ELEMENT UNIVERSITY (#PCDATA)>

<!ELEMENT

COMPANY (#PCDATA)>

<!ELEMENT ABSTRACT

(#PCDATA)>

]>

The linking of resources with their DTDs is similar to the association of a database record with its schema type, and to the association of an object with its type or class definition. DTDs are particularly useful in supporting document creation and editing.

An XML document need not be structured as a single piece of text, stored a single file. Instead, the document can be composed of separate pieces, called entities. The entities may be in separate files identified by URLs (external entities). Internal entities may also be used within a document to define units of text that are reused in several places within a document. XML also supports unparsed entities to represent non-XML resources (e.g., images) that are to form part of an XML document. Each unparsed entity has a NOTATION declaration that identifies the entity's representation, or a processor that can be called by the XML application to process the entity.

XML is an approved W3C recommendation. Additional capabilities for XML are also under development (but not yet approved by W3C). Unlike HTML, XML per se provides no facilities for defining the presentation aspects of documents (e.g., whether certain text should be in a specific size or color). Instead, the presentation aspects of XML documents are intended to be described using separate stylesheets. As a result, stylesheets play a much more important role in XML than they do in HTML. The Extensible Stylesheet Language (XSL) [xsl] defines stylesheet capabilities for XML documents. XSL capabilities are resemble in some respects those of the ISO Document Style Semantics and Specification Language (DSSSL) [iso96] used in formatting SGML documents. XML linking capabilities are described in the XML Linking Language (XLink) [xll] and XML Pointer Language (XPointer) [xptr] specifications. These linking capabilities are much more powerful than those of HTML, providing support for both bidirectional and multi-way links, as well as links to a span of text within the same or other documents. An XML namespace facility [namespace] is also under development. This allows the definition of prefixes associated with URI-identified collections of names (namespaces). These prefixes can be used with element (tag) names to prevent name clashes when developing documents that mix elements from different namespaces.

XML has considerable industry support. For example, Microsoft has built some XML support into Internet Explorer 4 (and further support into IE 5), made available several XML-based tools [msxml], and contributed a number of proposals to W3C on XML extensions and applications; Netscape has made similar contributions. A number of industry groups have defined SGML DTDs for their documents (e.g., the U.S. Defense Department, which requires much of its documentation to be submitted according to SGML DTDs). In many cases these could be either used with XML directly or converted in a straightforward fashion. Work is already underway to define XML-based data exchange formats in both the chemical and healthcare communities, as well as on other applications of XML.

XML does not itself address all of the Web's data structuring requirements, but it does provide a solid data representation in terms of which higher-level capabilities can be defined. In particular, it provides the basis for associating application-specific processing with particular elements, and hence can form the basis of an "object state" representation.

4. Web Metadata and Data Type Facilities

XML, as a richer representation technique for Web information, is an important technology in making the Web an improved basis for enhanced applications of all kinds. However, additional levels of data modeling support are also required to support these applications. One important area of development for such higher-level modeling support has been work on improved technologies for representing Web metadata, i.e., data available via the Web that describes or helps interpret other resources located either on the Web or elsewhere (e.g., in databases or repositories of hard copy documents).

Some of this work has involved attempts to define abstract metadata models, or the basic principles of such models. For example, the Dublin Core defines a minimum standard vocabulary (e.g., TITLE, CREATOR, SUBJECT) for describing documents to support applications such as search and automatic indexing. The work also describes a number of general requirements that need to be supported in any general metadata model, such as the need to support multiple levels of metadata (metadata about metadata), and multiple levels of granularity. The Warwick Framework defines a metadata container architecture that describes how multiple, separately-managed metadata sets should be defined, managed, and associated with the resources they describe. The W3C has done considerable work on metadata mechanisms for describing various characteristics of Web resources. Relevant W3C metadata technologies include the Platform for Internet Content Selection (PICS) and its generalization, the Resource Description Framework (RDF) Web metadata models. PICS defines a set of technologies for defining machine-readable descriptions of rating systems for labeling Internet content, generating content labels according to those rating systems, associating labels with specific Internet resources, and distributing the labels to Web clients as the resources are accessed. All these efforts have been discussed in a recent paper by Ora Lassila [las98].

The RDF combines extensions of the PICS technology with other work on metadata models to support such applications as resource discovery by search engines, cataloging, knowledge sharing and exchange by intelligent software agents, and electronic commerce. RDF [rdflang] defines both a data model for representing RDF metadata and an XML-based syntax for expressing and transporting the metadata. The basis of RDF is a model for representing named properties and their values which form logical assertions about Web resources identified by URLs. The model is based on propositional logic, plus certain modalities. RDF properties can represent both attributes of resources and relationships between resources. RDF also allows the reification of properties, so that individual properties and assertions can themselves be described by properties. Assertions may be associated with the resource they describe in several ways, e.g., embedded in the resource, external to the resource but supplied with the resource in the same retrieval transaction, or retrieved from a separate source. RDF also supports the specification of standard vocabularies (but does not impose one).

XML can also be interpreted as representing Web information in the form of properties (identified by tags) and their values (the contents of the tagged elements), and hence an XML document can also be interpreted as a set of logical assertions. This assertion-based interpretation of Web-represented information, the ability to make assertions about those assertions, and the ability to use URLs (or URIs) as a universal identifier mechanism, creates the basis for a formal model of the Web [ber97], and facilitates the use of more "intelligence" in processing Web information.

Another important area of activity in supporting expanded applications for XML is work on improved facilities for defining data type information for XML. DTDs currently provide only limited support for defining what would be recognized as data types in an object model, or schemas in a database. For example, it is not currently possible to directly specify that a particular XML element is to contain an integer value. A number of proposals have been made for alternative schema and data type facilities for XML, including:

XML-Data [xmldata], a W3C submission defining a mechanism for describing DTD information and "schemas" directly in XML.
Document Content Description for XML [dcd], a W3C submission from Microsoft and IBM defining another XML structural schema facility (it defines a subset of XML-Data as RDF metadata).

In addition, W3C's RDF activity has defined RDF Schemas [rdfschemas] as a type system for RDF. None of these is currently an official W3C specification. However, the W3C plans further work on improvements in schema and data typing facilities for XML.

5. Behavior and Behavior Attachment Mechanisms

Given the ability provided by XML to define application-specific elements in Web documents, an obvious thing to want to do is to associate specific behavior with those elements when processing those documents. A number of Web technologies are being developed to provide this capability.

5.1 The Document Object Model

W3C's Document Object Model (DOM) [dom] is a key technology for providing programs with access to Web data. DOM defines an object-oriented API for HTML and XML documents. The DOM represents a document as a hierarchy of objects of classes such as Element, Attribute, Text, etc., that closely models the actual structure of the document. For example, an element in the document is represented by an Element object in the DOM; another element contained in that element would be represented as a child Element object; text contained in an element is represented as a child Text object; and so on. Through this API, scripts or programs can access and manipulate individual parts of a document's content (including all markup and, in later DOM levels, DTDs) without having to parse the document. By operating on the collection of objects representing the Web page, scripts or programs can change characteristics of page elements, or even replace existing elements with new ones, causing direct changes in the document's presentation. As a result, DOM allows the implementation of dynamic content on the client, rather than forcing all such implementation to be done on the server (with the associated need for additional client-server message traffic). In addition to using DOM itself, a Web client could provide a document's DOM interface to external applications, allowing them to access the document via the client.

DOM is based on Dynamic HTML facilities defined by Microsoft and Netscape. DOM Level 1 extends these capabilities to, for example, allow creation "from scratch" of entire Web documents in memory by creating the appropriate objects. However, DOM does not yet implement all the Dynamic HTML facilities currently available (for example, an event mechanism is not yet defined). These and other capabilities will be defined in later DOM levels. However, by providing an API to document contents, DOM provides the foundation for integrating a document's data with processing code.

5.2 Behavior Representation and Association

One way of associating behavior with Web pages (or with elements in them) would be to simply write application programs that retrieve the appropriate pages, access their content (possibly through DOM interfaces), and perform the appropriate processing. However, the Web already provides more direct means of associating behavior with HTML pages, in such forms as ActiveX controls, Java applets, and scripts in various scripting languages. Possibly the most straightforward approach is to embed either the behavior itself, or a pointer to it, in the page. In HTML, scripts can be embedded using a SCRIPT element, while applets can be embedded using an APPLET (or OBJECT) element. SCRIPT elements can be used to associate collections of scripts with a page, thus forming something akin to an object class. Separately linked style sheets also provide a way of associating sets of behaviors with HTML documents. These HTML mechanisms deal not only with associating behavior with pages as a whole, but also with associating behavior with specific parts of Web pages. For example, a given script can be associated with a particular element to provide behavior for that specific element (e.g., to define what happens when the user clicks on it). Stylesheets also provide mechanisms for associating formatting behavior with specific elements (or possibly all elements satisfying specific criteria).

These mechanisms for associating behavior with HTML pages involve the use of special HTML elements, e.g., OBJECT and SCRIPT, with pre-defined content types and processing behaviors. XML currently does not mandate any particular way of incorporating behavior; e.g., it does not specify such pre-defined elements or behaviors. However, the ability to associate behavior with particular parts of XML pages is particularly important, since each XML element type potentially represents distinct semantics, and hence may need to be associated with behavior specific to those semantics.

XML does provide the ability to define elements that could contain scripts or other behavior representations, or pointers to them. These elements could then be processed (interpreted) by separate interpreters (much as Web browsers refer HTML SCRIPT and OBJECT elements to specific interpreters). For example, one approach to representing behavior in XML and associating it with an XML page would be to simply define an unparsed entity to hold the behavior representation (e.g., the script or applet) as part of the page's content. For example, a NOTATION definition could be specified to associate the notation name JavaScript with a processing engine (interpreter) for JavaScript, identified by a URL. One or more entities could then be defined as having the JavaScript notation to hold whatever scripts are needed. The XML application (e.g., the Web client) could then call the JavaScript interpreter in order to deal with those elements (assuming some additional infrastructure to define when the scripts were to be called, etc.). However, this approach defines a rather tight coupling between the XML and the associated behavior.

XML-related technology is also being developed to support more flexible ways of associating behavior with XML pages or parts of pages. A particularly popular approach is based on the use of stylesheets and related techniques. Stylesheets already define some forms of behavior (e.g., what happens during a mouseover), and they allow for such behavior to be associated with specific elements. In addition, they provide flexibility (since different stylesheets can be applied to the same document) and modularity (since the behavior is defined separately from the page itself, and hence can be applied to multiple pages) in associating behavior with Web pages.

One approach to using stylesheets in providing additional behavior is to generalize the types of results that stylesheet formatting can produce, to allow the inclusion of behavior in the resulting page. A step in this direction is a W3C submission from Hewlett-Packard called Spice [spice]. Spice is a combination of ideas from DSSSL, Cascading Style Sheets (CSS), and JavaScript, designed to make it simple to apply style and behavior to XML documents (it can also be applied to HTML). Spice uses CSS-like style rules to associate flow object classes that define formatting tasks with specific elements to be formatted. However, Spice supports not only the predefined set of CSS flow objects, but also downloadable sets of extended flow objects which can be written in Spice, Java, or ActiveX. These flow objects can exploit the full capabilities of the Document Object Model in processing the document contents. To further control behavior, event handlers can be written to script flow objects. This allows the document to be dynamically altered after it has been loaded. (Spice differs from XSL primarily in using CSS syntax for style rules and properties.)

A Netscape submission to W3C describes another stylesheet-related approach called action sheets [actionsheets]. Action sheets provide a mechanism for defining the script-encoded behavior of document elements in a reusable package, separate from the structural definition of a document. In the same way that external stylesheet rules can associate presentation properties with specific XML elements, external action sheet rules can associate arbitrary event handlers with specific XML elements or classes of elements. An action sheet contains a set of productions (rules) somewhat similar in form to XSL template (formatting) rules. Simplifying somewhat, a rule contains a selector (pattern) which defines the document elements to which the rule applies, and an action, which specifies a script to be run for a given action (e.g., an event such as onClick). Action sheets would be associated with XML documents in the same way as stylesheets. Microsoft has defined a somewhat related technology in Internet Explorer 5.0, called DHTML Behaviors [deB98], which allows a scriptlet (see below) to be associated with a particular element using a CSS stylesheet.

Another interesting development in the area of associating behavior with XML is Microsoft's Scriptlets technology [deB98]. Scriptlets allow COM components to be written in a combination of XML and a scripting language. These components can be used in the same way as any other COM component, e.g., they can be used by COM clients such as Microsoft Office or embedded in Web pages like ActiveX components. A scriptlet is defined by a file with a .sct extension, which contains script code and XML markup (using a specialized tag set) that defines the methods, their parameters, and the properties to be exposed by the component. A special DLL acts as a runtime engine to interpret and execute the XML definition of the scriptlet. It also acts as a broker between clients and the scriptlets. Netscape has defined a similar technology for constructing JavaBeans using XML markup and scripts called JavaScript Beans [Nic98]. Microsoft has also defined "DHTML Behaviors", a mechanism which allows scriptlets to be associated as behaviors with specific document elements using CSS stylesheets. Such scriptlets can also expose custom events to the page, access the containing page's DHTML (or DOM) objects, and receive event notifications.

These and other technologies for associating behavior with XML provide a further integration of Web and object technology, by allowing the construction of object-like aggregates of data and behavior. In particular, Scriptlets and JavaScript Beans show that Web technologies may be used to construct objects not only in an extended notion of "object model", but in conventional programming language object models (COM and Java, respectively) using specialized XML markup and scripting as the representation for the object state, methods, and interfaces. With the appropriate interpreters, this same general approach could also be used to directly construct objects in other object models, such as CORBA IDL. While much work needs to be done on mechanisms for associating behavior with XML, numerous options exist, and the existing technologies show a great deal of promise.

6. Interface and Messaging Technologies

Web technologies are not only being developed to support the state and behavioral aspects of objects. Web technologies are also being developed to support the interfacing and messaging aspects of distributed object systems.

The use of XML to represent object interfaces (and, in fact, complete objects) in Microsoft's Scriptlets has already been mentioned. Technologies such as DataChannel's WebBroker[webbroker] represent attempts to build a complete Web-native distributed object computing model, based on the use of XML and HTTP. WebBroker defines DTDs for XML documents that represent method call and return messages between software component objects. A calling component sends an objectMethodRequest to another component, and receives an objectMethodResponse in return. WebBroker also uses XML to represent interface definitions for these objects. In WebBroker, software components become URL-addressable HTTP resources. The Web client contains a Java applet which acts as a client-side broker for remote requests generated by local Java applets. This applet generates XML request messages from these requests and sends them to a server using the HTTP POST method. Request messages include a callback URL to identify the client. A Java servlet on the server formats the XML request into a call to the appropriate server resource. When the response is ready, it is formatted into an XML response message and sent back to the client using an HTTP POST method to the callback URL. The client also contains an httpd server (a local HTTP server). This client-side server accepts the response and passes it back to the requesting Java applet. WebBroker handles both COM+ and CORBA objects, and has been submitted to the W3C.

DataChannel notes several potential advantages to using XML in a distributed object architecture. For one thing, by using XML, metadata describing object interfaces can be defined as a collection of interlinked XML documents available on a Web repository server, eliminating an unnecessary distinction between this metadata and other information. This would also allow the repository server to provide this information in a single round trip, as opposed to the multiple calls needed to access it using current interfaces. Using XML could also reduce the amount of code needed in lightweight Web clients to handle object messaging, since they will probably be able to process XML already, eliminating the need for extra code to support DCOM or CORBA syntax.

UserLand Software has developed a similar technology called XML-RPC for using XML messages and the HTTP POST method as the basis of remote procedure calls, as part of its Frontier 5 Web content development and management environment. In addition, Microsoft is developing a related protocol, called the Simple Object Access Protocol (SOAP), together with UserLand Software and DevelopMentor. Development of a single XML-based RPC protocol could create the basis of a widely-available "universal ORB" capable of interacting with objects in a wide range of different object models. WebMethods, Inc.'s Web Interface Definition Language (WIDL) [widl, KR97] defines a somewhat similar approach, using XML to define object-like interfaces to Web servers. These interfaces can then be accessed by remote systems using HTTP messages, and provides the structure necessary for generating client code in languages such as Java, C/C++, COBOL, and Visual Basic.

W3C's HTTP-NG project [httpng] is pursuing an approach that is in some sense the opposite of the above technologies. Instead of building a distributed object system on top of the Web, HTTP-NG involves building a distributed object system under the Web, and then converting the current Web to an application of that distributed object system. HTTP-NG represents a longer-term solution to the Web's expansion to include more general distributed applications, based on the idea that layering these applications on top of HTTP will result in problems due to unnecessary performance costs, and lack of functionality and generality.

By basing the Web on a generic distributed object system, the HTTP-NG project hopes to enable distributed applications to use this distributed object system directly. The goal is for the generic distributed object system to be simple, yet rich enough to meet the semantic and performance requirements of CORBA, DCOM, and Java RMI (without, however, unifying their object models). The project has defined:

a message-oriented transport that works well for bursts of transactions, and is well-suited for Web usage
a lightweight object-oriented RPC protocol which has the flexibility and efficiency required by the Web
a characterization of the Web in terms of a set of object interfaces defined in an IDL.

Success of HTTP-NG would mean the insertion of object technology at the heart of the Web, enabling it to not only support object-RPC-like applications more efficiently, but also higher-level integrations of Web and object technologies.

7. Conclusions

A more complete integration of Web and object technologies would provide the basis of both powerful capabilities for integrating all kinds of data and information, together with a wide variety of enhanced services, within a distributed architecture that is both widely-available, and easy-to-use and extend. This paper has described a number of technologies that potentially constitute important contributions to this integration of Web and object technologies. These technologies, as well as others, are the subjects of very active ongoing development efforts. In addition, since in many cases these technologies are being developed in coordination with Web standardization efforts, they are more likely to see widespread use. By building on these technologies, it should be possible to expand the already widespread use of the Web to include a new generation of Web applications.

References

[actionsheets] V. Apparao, et. al., "Action Sheets: A Modular Way of Defining Behavior for XML and HTML", W3C Note, World Wide Web Consortium, 1998; http://www.w3.org/TR/NOTE-AS.

[ber97] T. Berners-Lee, Metadata Architecture, 1997. <http://www.w3.org/DesignIssues/Metadata>.

[bos97] J. Bosak, "XML, Java, and the Future of the Web" <http://sunsite.unc.edu/pub/sun-info/standards/xml/why/xmlapps.htm>, 1997.

[dcd] T. Bray, C. Frankston, and A. Malhotra, "Document Content Description for XML", W3C Note, World Wide Web Consortium, 1998; <http://www.w3.org/TR/NOTE-dcd>

[deB98] M. De Bruijn, "Internet Explorer 5.0--for Intranets Only?", WEBBuilder, 3(9), Sept. 1998, 25-28.

[dom] L. Wood, et. al., "Document Object Model (DOM) Level 1 Specification", W3C Proposed Recommendation, World Wide Web Consortium, 1998; http://www.w3.org/TR/WD-DOM/.

[httpng] H. J. Nielsen, "Hypertext Transfer Protocol - Next Generation", Overview page, August, 1998. <http://www.w3.org/Protocols/HTTP-NG/>.

[iso96] International Standard ISO/IEC 10179:1996(E), Information Technology-Processing Languages-Standard Generalized Markup Language (SGML), 1986.

[kr97] R. Khare and A. Rifkin, "XML: A Door to Automated Web Applications", IEEE Internet Computing, 1(4), July-August 1997, 78-87.

[las98] O. Lassila, "Web Metadata: A Matter of Semantics", IEEE Internet Computing, 2(4), July-August 1998, 30-37.

[Man98a] F. Manola, "Towards a Web Object Model", Technical Report, Object Services and Consulting, Inc., <http://www.objs.com/OSA/wom.htm>, 1998.

[Man98b] F. Manola, "Some Web Object Model Construction Technologies", Technical Report, Object Services and Consulting, Inc., <http://www.objs.com/OSA/wom-II.htm>, 1998.

[msxml] <http://www.microsoft.com/xml/>

[namespace] T. Bray, D. Hollander, and A. Layman, "Namespaces in XML", W3C Working Draft, World Wide Web Consortium, 1998; <http://www.w3.org/TR/WD-xml-names>.

[Nic98] D. Nickerson, Official Netscape JavaBeans Developer's Guide, Ventana Communications Group, 1998.

[omg97] Object Management Group, A Discussion of the Object Management Architecture, June, 1997; http://www.omg.org/library/omaindx.htm.

[rdflang] O. Lassila and R. R. Swick, "Resource Description Framework (RDF) Model and Syntax", W3C Working Draft, World Wide Web Consortium, 1998; http://www.w3.org/TR/WD-rdf-syntax/.

[rdfschemas] D. Brickley, R. V. Guha, and A. Layman, "Resource Description Framework (RDF) Schemas", W3C Working Draft, World Wide Web Consortium, 1998; http://www.w3.org/TR/WD-rdf-schema/.

[spice] R. Stevahn, "Adding Style and Behavior to XML with a dash of Spice", W3C Note, World Wide Web Consortium, 1998; http://www.w3.org/pub/WWW/TR/NOTE-spice.

[webbroker] J. Tigue and J. Lavinder, "WebBroker: Distributed Object Communication on the Web", W3C Note, World Wide Web Consortium, 1998; http://www.w3.org/TR/1998/NOTE-webbroker.

[widl] P. Merrick and C. Allen, "Web Interface Definition Language (WIDL)", W3C Note, World Wide Web Consortium, 1997; http://www.w3.org/TR/NOTE-widl.

[xmllang] T. Bray, J. Paoli, and C. M. Sperberg-McQueen, "Extensible Markup Language (XML) 1.0", W3C Recommendation, World Wide Web Consortium, 1998; http://www.w3.org/TR/REC-xml.

[xll] E. Maler and S. DeRose, "XML Linking Language (XLink)", W3C Working Draft, World Wide Web Consortium, 1998; http://www.w3.org/TR/WD-xlink.

[xptr] E. Maler and S. DeRose, "XML Pointer Language (XPointer)", W3C Working Draft, World Wide Web Consortium, 1998; http://www.w3.org/TR/WD-xptr.

[xsl] J. Clark and S. Deach, "Extensible Stylesheet Language (XSL)", W3C Working Draft, World Wide Web Consortium, 1998; http://www.w3.org/TR/WD-xsl.

[xmldata] A. Layman, et. al., "XML-Data", W3C Note, World Wide Web Consortium, 1998; http://www.w3.org/TR/1998/NOTE-XML-data.

[xmltext1] D. Megginson, Structuring XML Documents, Prentice Hall, 1998.

[xmltext2] E. R. Harold, XML: Extensible Markup Language, IDG Books, 1998.

Acknowledgements: The author would like to acknowledge the contributions of the OBJS team and the participants in the xml-dev email list to the ideas contained in this paper.

* This research is sponsored by the Defense Advanced Research Projects Agency and managed by the U.S. Army Research Laboratory under contract DAAL01-95-C-0112. The views and conclusions contained in this document are those of the author and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the Defense Advanced Research Projects Agency, U.S. Army Research Laboratory, or the United States Government.

Sidebar: What is an Object Model?

The term object model is used in two different senses in discussions of object technology. In one sense, object model refers to the collection of concepts used to describe the generic characteristics objects in a particular object-oriented language or specification, and corresponds closely to the use of the term data model in "the relational data model". In this sense, we speak of "the OMG object model" or "the Java object model", and describe what kinds of inheritance they support, whether they support garbage collection, etc. This is a model of objects in terms of more primitive concepts. This is in contrast to the use of object model to describe the collection of object classes defined to model a particular system or application, a usage common in object analysis and design. The Document Object Model is an object model in the latter sense. This is a model of something else (documents in this case) in objects. This dual usage is unfortunate, but is common in the literature.

Sidebar: A Web Object Model Framework

The Web technologies described here, together with others, provide the basis of a framework for constructing actual Web objects from Web data and code resources. This framework considers these technologies as contributing parts of an object model, and is based on the idea that an "object" in a conventional object model is basically a piece of state with some attached (or associated) programs (methods). In many object model implementations, this idea is exactly reflected in the physical structure of the objects. For example, a Smalltalk object consists of a set of state variables (data), together with a pointer to a class object containing the object's methods. The structure is roughly:

    Object (state)

Class object

  +---------------+              +-------------+
  | class pointer |------------->| Class data  |
  +---------------+              +-------------+
  | variable 1    |              | method 1    |
  | variable 2    |              | method 2    |
  |   ...         |              |   ...       |
  | variable n    |              | method m    |
  +---------------+              +-------------+

C++ implementations use similar structures. These structures are determined by the programming language implementation, and are created as necessary to represent the program and its data. The state is a collection of programming language variables, which are operated on by the methods. The class methods define the way the state is interpreted, and hence is a form of metadata for the state, making the link between the object and its class a metadata link.

Extending this idea to the Web, Web pages (or smaller units of Web data) can be considered as state, and objects constructed by enhancing those pages with additional "metadata" that allows the pages to be considered as objects in some object model. In particular, we want to enhance Web pages with programs that act as methods with respect to the "state" represented by the Web page. The resulting structure would, at a minimum, conceptually be something like:

                       +----------+
           +---------->| method 1 |
           |           +----------+  
+-------+  +           
|  Web  |--+              ...
|  page |--+
+-------+  |           +----------+
           +---------->| method n |
                       +----------+

The methods could be physically embedded in the page, referenced by embedded or separate pointers (URLs), or associated with the page in some other way, e.g., using stylesheets or some similar technology (there are already a number of mechanisms used in the Web to integrate code (behavior) with Web pages, as described in the text). Unlike a programming language object model, in which the methods are tightly coupled to the state, a Web object model would ideally support a looser coupling of methods and state, so that the information represented by a Web page could be reused for different processing requirements.

In this framework, Web technologies can play the following roles:

XML provides a good representation for object state, providing application-specific tagged data elements, nested structures, and powerful linking facilities. In supporting an object model, XML pages (like HTML pages) can also be used as containers for embedded objects and object methods (e.g., Java applets). In addition, the element tags can be used as hooks to associate specific behavior with the elements.
DOM provides an API for XML documents used as object state, and hence provides a mechanism for integrating object state and associated code.
the various behavioral mechanisms described can be used for representing object methods and associating them with XML-based state; these methods can access the state via the DOM-defined interfaces.
RDF and further Web typing facilities can be used to define both higher-level metadata and schema/type information, and to define relationships between documents containing state and separate documents containing object methods (concepts from PICS and RDF can also potentially support automatic access to these methods when the state is accessed).

In addition:

the interface technologies described provide the basis for defining fully-encapsulated objects (as illustrated by Scriptlets).
object messaging can be supported using either HTTP itself, as illustrated by WebBroker, or by HTTP-NG (if this becomes approved technology).

The technical reports from which this paper is drawn [Man98a,b] explore these ideas more fully, and also provide more complete descriptions of these and other technologies which could be used in implementing the framework.