|
||
|
SGML establishes requirements for the digital interchange of technical publication text. Data prepared in conformance to SGML will facilitate the automated storage, retrieval, interchange, and processing of technical documents from heterogeneous data sources.
The CALS SGML standard defines both a methodology and a high level computer language for document representation. It provides a coherent and unambiguous grammar and syntax for describing whatever a user chooses to identify within a document regardless of the type of document or the nature of the document's text and provides a formal markup procedure, also independent of system and output environments for this purpose. The definition of the document's structure or content in terms of "elements", their "attributes", "entities", and other components is a called "Document Type Definition (DTD)". A DTD defines the structure or content of a specific class of documents. "SGML markup" (or an "SGML document instance") consists of unformatted text with inserted SGML "tags" which correspond to the elements and attributes of the DTD. These tags identify elements of the text (e.g., titles, paragraphs, tables, footnotes) defined in the document's DTD. The "marked up" document (or SGML document instance) can then be "parsed" using special software to determine if the document's tagging conforms to the DTD. In order to format an SGML source file, associated formatting information must be provided. This associated formatting information must define formatting characteristics such as a page model, font and family characteristics, point size, indenting, etc. In addition, these formatting characteristics must be responsive to certain SGML tags. For example, a "paragraph" tag may trigger a change in the line leading or a "chapter title" tag may trigger "bolding" and "center" functions. The associated formatting information are provided in the form of an "Output Specification" (OS) DTD. The OS DTD defines a finite set of formatting characteristics used to rigorously describe the composition processing functions to be performed with respect to the tags of a SGML source file. A Formatting Output Specification Instance (FOSI) is an instance of the OS DTD. The FOSI defines values for the formatting characteristics defined in the OS DTD for every SGML element used in the document DTD, taking into account every context in which the SGML element has a unique formatting requirement. In example, a title of a TM chapter is formatted differently than a title of a TM subparagraph. The objective of the FOSI is to rigorously define the format style of the document to be produced from the SGML tagged source file, as required by the appropriate functional specification. A FOSI should be developed for each DTD to describe all default formatting characteristics necessary to compose and publish a document authored according to that DTD. The FOSI should be delivered with the SGML source tagged file. Since all FOSIs will be written with respect to the standard OS (paper medium), vendors will be able to develop software that can accept and process FOSIs and interface with the publishing software. MIL-M-28001B provides a mechanism which enables an electronic review and comment capability for SGML tagged source files. This capability allows reviewers located in diverse environments to make and exchange comments electronically on multiple copies of a document file over a network. The comments may then be sorted, processed, and incorporated into the document by the file "owner". The mechanism for electronic review of SGML tagged source files consists of certain SGML constructs which are incorporated into a DTD for a given document type. These SGML constructs have been defined as generally as possible to take into account the many kinds of reviews: internal contractor reviews, Government reviews, contractor/Government reviews, specification reviews, etc. Plans for future extensions of electronic review include both a CALS graphics comment capability using SGML tagged for the comments, and a capability to link SGML text and CALS graphics files for related changes. Efforts will also be made to develop a more precise addressing mechanism for indicating the location within document elements of a proposed change. Partial document delivery is used to transmit source SGML tagged source data either as an interim deliverable or as an update package containing data for a document that has been previously delivered. Its purpose is to minimize the retransmission of unchanged data or to indicate incomplete data. Partial document delivery is not intended to address the issues of page integrity or fidelity, nor is it intended to include specific change pages. The intent of this methodology is to allow the delivery of certain portions of a source document such that the receiving system can identify the location of the information in the original document and perform the appropriate addition, deletion, or replacement operations. Both the manner in which this is accomplished and the effect of the change on composition depends on the receiving system.
The NATO CALS Organisation is investigating the requirements for the development and maintenance of a NATO CALS SGML Library (NCSL). It is envisioned that a NATO CALS SGML Registrar will administer the NATO CALS SGML Registry (NCSR), a central registry office where DTDs and FOSIs will be registered. The NCSR will maintain a NATO CALS SGML Library which will be an on-line database containing all SGML elements and attributes that have been defined, with cross references to DTDs and governing military specifications. The NATO CALS SGML Registry will require adherence to basic guidelines for acceptance of SGML tags/attributes. Some of these guidelines are:
Currently, there are no tests for vendor products claiming conformance to MIL-M-28001. Such MIL-M-28001 product conformance testing will depend upon the product's function. For instance, conformance testing of SGML parsers entails the correct interpretation of ISO 8879. Conformance testing of "auto -taggers" or "authoring stations" would be limited to determining the parsability of the instances generated, and again would involve correct ISO 8879 interpretation. With few exceptions, there is no disagreement regarding the correct interpretation of ISO 8879. The vendor community is aware of the evolving nature of MIL-M-28001. Some vendors are waiting until the standard is finalized, while other vendors are undertaking full implementations at the present time. A large vendor community is represented on the Electronic Publishing Committee. For the CALS environment, vendors supporting MIL-M-28001 should not "hard-code" their systems to process only a single DTD or FOSI. Certainly, most users will be processing a variety of technical publications which must conform to multiple DTDs and will require a system that can be configured to adapt to new and changing requirements as they arise. Currently there are various types of SGML software products on the market. These include:
|
|
|
||