Precedent documents – Increase value & reduce costs

Author:

Date: 26 October 2006

Presentation by Peter Meyer at the Sinch Precedents Automation Conference in Sydney on 26 October, 2006.

In this presentation, Peter Meyer discusses the limitations and costs associated with the use of word processing tools to maintain precedent documents (templates). It explains how those problems could be overcome through the use of structured XML content management methodologies that are used increasinly in other content management areas. A new XML schema standard being developed by the OASIS eContracts Technical Committee may provide the basis for similar approaches to be used for the maintenance of precedents documents in law firms and other similar organizations.

Precedent documents – Increase value & reduce costs

Introduction

Lawyers create documents and publish them to their clients and other interest holders. At one level, law firms could be viewed as document factories. In this context, precedents are of critical importance.

Lawyers work predominantly in word processing software. Once WordPerfect was supreme. Today, most lawyers use Microsoft Word to create and publish documents.

In most firms, precedents are prepared and managed using the prevailing word processing technology, currently Word. This has obvious attractions. A lawyer using the corresponding software can pick up a precedent, copy it and start editing it to create a working document.

Unfortunately, this model also creates quite a few problems. This presentation examines some of these problems, considers some of the possible improvements that could be made and provides a brief insight into some of the technologies that might be involved.

What are precedents?

Precedents are documents or components of documents we store for re-use in future transactions. Some common characteristics of precedents include:

• The are usually derived from real transaction documents but a lot of work is required to create really useful precedents.
• Thus, there may be different classes or quality levels of precedents, according to how flexible and trustworthy they are regarded by their users.
• Precedents may be complete documents that a lawyer will take and modify to a particular set of instructions. In other words, a starting point.
• They may be complete documents that normally are not modified except by the addition of specific transaction data such as names, addresses, money amounts, dates etc. This data may be added manually or it may be inserted in an automated variables substitution process.
• Precedents may be complex documents with alternative or optional components that are inserted or suppressed according to conditions set in a user interface in a document assembly programme or by values extracted from a database. Conditional processing may be very complex so that software applications can be used by non lawyers can answer questions and provide information to create documents.
• Precedents may be components (clauses) that lawyers can locate from a clause library and insert into new documents or they may be accessed only through a document assembly programs.

Precedents are essential to the efficient operation of a law firm. They enable less experienced practitioners to draw upon the skill and knowledge of more experienced practitioners. They enable the firm to provide services rapidly and cost effectively. They are a critical component in reducing risk.

How are precedents managed today?

Precedents are a lot of work

Precedents development involves a very large amount of work. There is work to ensure that the drafting is accurate, clear and consistent with firm drafting styles. There is work providing annotations or explanations to users. There is work making sure the formatting is consistent with firm styles. There is a lot of work adding metadata so that practitioners can find and use the precedents. There is work adding codes and other markup needed by automated processing systems. There is testing and checking and re-working as the law and clients change.

Precedents development and maintenance involves a lot of time and expense with input from practitioners and dedicated precedents managers.

Precedent documents

Whole documents are almost invariably maintained in the format of the current editing software. The documents are frequently stored in a document management or content management system to facilitate access control and retrieval. Precedents may be discovered through text search or through metadata associated with the precedents.

Variable codes and conditional logic

If precedents are accessed through document assembly software to automate variables substitution, codes or other markup must be added to suit the particular processing software. Most applications define their own system of codes and markup although some may work with codes created by competitor products. Some may use XML based on proprietary schema.

Clause libraries

Larger firms in particular may maintain large clause libraries. As discrete objects, clauses in a clause library present special problems. Since the clause can find its way into many different documents and at different levels within those documents, it can create problems if the user has to re-format them each time they are used so that automatic numbering and display style is correct.

For this reason, some clause libraries hold blobs of plain text that is then formatted after insertion in a working document.

Problems

Use of word processing formats

The use of word processing formats presents many problems:

• Word processing formats store display style information with the content. If you want to change document styles, it is usually necessary to edit every document to make the changes, unless only simple style property changes are involved. On a large database, this can be a massive, costly exercise. Styles in the documents are never completely consistent so attempts to automate the process rarely achieve the expected results. Inconsistency arises from human error or from drift over time as tastes or file formats change. New documents may not be the same as older documents in the database so that a database can hold many different styles and file formats, making automated changes almost impossible.
• The use of word processing formats results in massive duplication of content in precedent systems. It is very difficult to store a clause once and have it re-used in multiple documents. Many document families, particularly in the finance area replicate boilerplate clauses. Often maintenance involves changing the same wording in many places.
• Word processing formats cause particular problems in clause library systems. How do you format a clause that may be inserted at level 1, level 2 or level 3 of a numbered hierarchy? If formatting is removed, how can it be added reliably after the clause is inserted into the document. The choice is to either manually change styles or provide macros to assist. Either way, the process is very inefficient for the practitioner because it involves unnecessary work.
• Periodically, software vendors release new versions of their software and change the file formats. Built in conversion filters don't always work perfectly. Moving documents forward to the new software version can be costly and disruptive.
• If documents are not migrated, they may be lost because they are either incompatible with the new software or they don't display correctly and fall into disuse. Over time, a lot of valuable knowledge can be lost or effort wasted re-creating it.
• It is difficult to create different outputs for word processing documents, eg, on the web, particularly if it is desired to add links between different documents. Manual processing is almost always required.

What is the ongoing cost of maintenance as a result of these problems? What costs are imposed through lost practitioner time?

Proprietary codes for document assembly systems

Every document assembly application defines its own system for codes and other document markup needed to drive conditional processing. There are no standards, although, over time, de facto standards may emerge. However, these are unstable.

The use of proprietary markup creates both a massive barrier to the adoption of a new system that uses different markup and a strong tie to the existing system. There is a great deal of work in applying this markup. It makes acquiring a new system or switching very expensive.

What is the cost of staying with an outdated or inefficient system because the cost of change seems too high?

Blobs

If precedents are stored as whole documents, how does a practitioner find and use a single clause from that document. Using word processing file formats, there is no convenient way to store metadata on each clause to facilitate retrieval. Users must rely on search. The reality is, to extract an individual clause, they must trawl through large documents to work out what is in them and then manually cut, copy, paste and re-format the content.

How much wasted practitioner time does this create? How do you monitor usage of this kind of precedent content and how do you charge for it?

Wouldn't it be nice if …

Each and every one of these problems can be overcome. The solution is fundamentally the same for all:

• What if you could completely change the layout of your documents created from precedents as often as you wished without having to touch a single precedent?
• What if every document created from a precedent was formatted exactly to firm styles? Practitioners would not need to worry about creating contents listings or cover pages or running macros to do so.
• What if you could adopt new editing software that uses a new file format, again without having to touch a single precedent?
• Wouldn't you like to be able to publish precedent content in print and web formats completely automatically? Web content could be linked to related and explanatory materials without manual intervention.
• What if there was no distinction between clauses in the clause library and those in complete documents? Practitioners could search clauses based on metadata and content regardless where they occur. Once found, an object could be copied out and inserted into a new document without manual processing or formatting by the practitioner.
• What if you could create boilerplate clauses and share them in dozens of ready to use documents without duplication?
• Wouldn't you like to be able to much more easily introduce or change document assembly software because it would not be necessary to change the markup in existing precedents to suit.
• Finally, what if these things could be done without changing the software used by the lawyers? They could keep on using their existing word processing software, if they so wish.

What would be the savings and other benefits if these processes were standard procedure?

If it is achievable, why is it not commonplace?

There is no doubt that all the “wouldn't it be nice” scenarios could become reality. However, it would require a substantial change to existing approaches in the legal technology area. It is true that some document assembly system vendors already use the sort of technology that makes this possible. Fundamentally, the reason it is not widespread is that legal documents have some very distinctive characteristics. There are very few systems available that can readily achieve these goals and there are no applicable standards. Where would a law firm start if it wanted to completely change its precedents storage model? It would have to build almost everything from scratch.

Lessons from other content domains

What do other publishers do?

In many other industries, the equivalent problems have been or are being addressed.

Commercial publishers, particularly those involved in legal publishing adopted structured content management strategies 10 or more years ago to enable them to more effectively meet the demands of users for content in different formats and with richer features.

Many legislative drafting agencies (Parliamentary Counsel Offices) have adopted structured content management strategies for the preparation, management and publishing of legislation. Three Australian offices have done so, despite the absence of formal standards. New Zealand is doing so at present. It is occurring increasingly in other countries. The use of structured content management strategies enables the Parliamentary Counsel Offices to maintain legislation indefinitely, independently of particular editing and publishing tools. It also enables them to automate publishing systems and create high quality print and web publications.

For many years, organizations involved in producing very large volumes of complex documentation for equipment, particularly in aero-space and the military, have used structured content for effective content management and publishing. More recently, this trend has begun to filter down to very many organizations that produce product and user assistance documentation for equipment, business processes and software. This is being aided by the development of improved standards for structured content, particularly the Darwin Information Typing Architecture (DITA) released as an OASIS standard in May 2004. It is available at: www.oasis-open.org/specs/index.php#ditav1.0.

The DITA standard seeks to achieve for product documentation many of the benefits sought for legal precedent documents described above. However, there are some important differences between product documentation and legal documentation.

What is structured content?

Structured content is content that is described in a detailed way using metadata. When applied to narrative content such as precedents, structured content generally refers to the use of XML tagging to describe the content and the relationship between individual components.

XML is a meta language used to build domain specific markup languages or grammars. These languages are described by a schema. Common forms of schema include Document Type Definition (DTD) and XML Schema. There are very many XML languages for different kinds of documents.

Commonly available schema can be divided into several different XML markup models:

(a) Rich, presentation based schema such as the Microsoft Office Open XML format and OASIS OpenDocument;
(b) Web presentation schema such as XHTML 1.0;
(c) Generic structural markup schema such as DocBook, DITA, TEI, Elkera BNML and possibly XHTML 2.0 that can be applied to a wide range of documents.

These XML languages are each designed to do very different things and are very different in the way they work. It is critical to be aware that representing something as “XML” does not, on its own, mean anything. The presentation oriented XML languages such as Microsoft Office Open XML in Word, OpenDocument or XHTML are of limited value to long term precedents maintenance. They do not provide a reliable representation of the important objects within a legal document, such as a clause, section or even a paragraph. They do not effectively separate presentation from content. They are satisfactory for the production of documents with arbitrary layout. They are only marginally better than conventional word processing formats for the reliable automatic production of large numbers of documents that conform to standard layouts.

The schema most suited to achieving the goals described above are generic structural schema. These schema can completely describe documents and their components so that no formatting information needs to be stored. It is then possible to automatically transform and style generic structural XML content into any other output. Structured schema provide the most robust model for long term data storage and reliable automation of content re-use and publishing processes. The encapsulation of semantic objects within structured schema enables metadata to be reliably associated with content objects at any level to support greatly improved information retrieval.

Some recent developments

For around the past four years, the OASIS LegalXML, eContracts Technical Committee has been developing an XML schema for contract documents. Fundamentally, the purpose of this schema is to address the problems and provide the “what if” benefits described earlier.

The eContracts schema is a generic structural XML schema. It is the first standard schema designed specifically for contract narrative markup. Although it is specifically designed for contracts, it is straight forward to adapt it to most other legal documentation.

The eContracts TC is currently finalising the documentation or specification for its contracts schema. It is possible that the TC will release its committee specification for public review and comment before the end of 2006.

The eContracts schema has the potential to change the landscape of precedents management. It provides a platform on which law firms, beginning with the larger firms, can build to achieve all the “what if” scenarios described earlier.

Materials for the eContracts TC are available at: www.oasis-open.org/committees/tc_home.php?wg_abbrev=legalxml-econtracts.

Benefits

Ultimately, it is up to the legal community to decide to build on the work of the OASIS LegalXML eContracts TC. Should it do so, it could achieve all the “wouldn't it be nice” scenarios described earlier. The result would include:

• substantially lower costs of maintenance for precedents databases over long periods;
• improved access to precedents and associated explanatory materials by lawyers, leading to reduced risks of error;
• reduced costs of precedent usage by practitioners through more efficient access;
• improved document assembly systems that could be based on the standard legal documentation schema;
• greater flexibility to choose document assembly tools, including to use different tools for different jobs.

Page Options

 

  Print this page

 

  PDF Version

 

  Email this page

         Updated: 10-12-2006