| 
 | Xerces 1.2.0 | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: INNER | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Object | +--org.apache.xml.serialize.BaseMarkupSerializer
Base class for a serializer supporting both DOM and SAX pretty serializing of XML/HTML/XHTML documents. Derives classes perform the method-specific serializing, this class provides the common serializing mechanisms.
 The serializer must be initialized with the proper writer and
 output format before it can be used by calling #init.
 The serializer can be reused any number of times, but cannot
 be used concurrently by two threads.
 
If an output stream is used, the encoding is taken from the output format (defaults to UTF-8). If a writer is used, make sure the writer uses the same encoding (if applies) as specified in the output format.
 The serializer supports both DOM and SAX. DOM serializing is done
 by calling serialize(org.w3c.dom.Element) and SAX serializing is done by firing
 SAX events and using the serializer as a document handler.
 This also applies to derived class.
 
 If an I/O exception occurs while serializing, the serializer
 will not throw an exception directly, but only throw it
 at the end of serializing (either DOM or SAX's DocumentHandler.endDocument().
 
For elements that are not specified as whitespace preserving, the serializer will potentially break long text lines at space boundaries, indent lines, and serialize elements on separate lines. Line terminators will be regarded as spaces, and spaces at beginning of line will be stripped.
When indenting, the serializer is capable of detecting seemingly element content, and serializing these elements indented on separate lines. An element is serialized indented when it is the first or last child of an element, or immediate following or preceding another element.
Serializer, 
DOMSerializer| Field Summary | |
| protected  java.lang.String | _docTypePublicIdThe system identifier of the document type, if known. | 
| protected  java.lang.String | _docTypeSystemIdThe system identifier of the document type, if known. | 
| protected  OutputFormat | _formatThe output format associated with this serializer. | 
| protected  boolean | _indentingTrue if indenting printer. | 
| protected  java.util.Hashtable | _prefixesAssociation between namespace URIs (keys) and prefixes (values). | 
| protected  Printer | _printerThe printer used for printing text parts. | 
| protected  boolean | _startedIf the document has been started (header serialized), this flag is set to true so it's not started twice. | 
| Constructor Summary | |
| protected  | BaseMarkupSerializer(OutputFormat format)Protected constructor can only be used by derived class. | 
| Method Summary | |
|  ContentHandler | asContentHandler()Return a ContentHandlerinterface into this serializer. | 
|  DocumentHandler | asDocumentHandler()Return a DocumentHandlerinterface into this serializer. | 
|  DOMSerializer | asDOMSerializer()Return a DOMSerializerinterface into this serializer. | 
|  void | attributeDecl(java.lang.String eName,
              java.lang.String aName,
              java.lang.String type,
              java.lang.String valueDefault,
              java.lang.String value)Report an attribute type declaration. | 
|  void | characters(char[] chars,
           int start,
           int length)Receive notification of character data. | 
| protected  void | characters(java.lang.String text)Called to print the text contents in the prevailing element format. | 
|  void | comment(char[] chars,
        int start,
        int length)Report an XML comment anywhere in the document. | 
|  void | comment(java.lang.String text) | 
| protected  org.apache.xml.serialize.ElementState | content()Must be called by a method about to print any type of content. | 
|  void | elementDecl(java.lang.String name,
            java.lang.String model)Report an element type declaration. | 
|  void | endCDATA()Report the end of a CDATA section. | 
|  void | endDocument()Called at the end of the document to wrap it up. | 
|  void | endDTD()Report the end of DTD declarations. | 
|  void | endEntity(java.lang.String name)Report the end of an entity. | 
|  void | endNonEscaping() | 
|  void | endPrefixMapping(java.lang.String prefix)End the scope of a prefix-URI mapping. | 
|  void | endPreserving() | 
| protected  org.apache.xml.serialize.ElementState | enterElementState(java.lang.String namespaceURI,
                  java.lang.String localName,
                  java.lang.String rawName,
                  boolean preserveSpace)Enter a new element state for the specified element. | 
|  void | externalEntityDecl(java.lang.String name,
                   java.lang.String publicId,
                   java.lang.String systemId)Report a parsed external entity declaration. | 
| protected  org.apache.xml.serialize.ElementState | getElementState()Return the state of the current element. | 
| protected abstract  java.lang.String | getEntityRef(char ch)Returns the suitable entity reference for this character value, or null if no such entity exists. | 
| protected  java.lang.String | getPrefix(java.lang.String namespaceURI)Returns the namespace prefix for the specified URI. | 
|  void | ignorableWhitespace(char[] chars,
                    int start,
                    int length)Receive notification of ignorable whitespace in element content. | 
|  void | internalEntityDecl(java.lang.String name,
                   java.lang.String value)Report an internal entity declaration. | 
| protected  boolean | isDocumentState()Returns true if in the state of the document. | 
| protected  org.apache.xml.serialize.ElementState | leaveElementState()Leave the current element state and return to the state of the parent element. | 
|  void | notationDecl(java.lang.String name,
             java.lang.String publicId,
             java.lang.String systemId)Receive notification of a notation declaration event. | 
| protected  void | prepare() | 
| protected  void | printDoctypeURL(java.lang.String url)Print a document type public or system identifier URL. | 
| protected  void | printEscaped(char ch) | 
| protected  void | printEscaped(java.lang.String source)Escapes a string so it may be printed as text content or attribute value. | 
| protected  void | printText(char[] chars,
          int start,
          int length,
          boolean preserveSpace,
          boolean unescaped)Called to print additional text with whitespace handling. | 
| protected  void | printText(java.lang.String text,
          boolean preserveSpace,
          boolean unescaped) | 
|  void | processingInstruction(java.lang.String target,
                      java.lang.String code)Receive notification of a processing instruction. | 
|  boolean | reset() | 
|  void | serialize(Document doc)Serializes the DOM document using the previously specified writer and output format. | 
|  void | serialize(DocumentFragment frag)Serializes the DOM document fragmnt using the previously specified writer and output format. | 
|  void | serialize(Element elem)Serializes the DOM element using the previously specified writer and output format. | 
| protected abstract  void | serializeElement(Element elem)Called to serializee the DOM element. | 
| protected  void | serializeNode(Node node)Serialize the DOM node. | 
| protected  void | serializePreRoot()Comments and PIs cannot be serialized before the root element, because the root element serializes the document type, which generally comes first. | 
|  void | setDocumentLocator(Locator locator)Receive an object for locating the origin of SAX document events. | 
|  void | setOutputByteStream(java.io.OutputStream output)Specifies an output stream to which the document should be serialized. | 
|  void | setOutputCharStream(java.io.Writer writer)Specifies a writer to which the document should be serialized. | 
|  void | setOutputFormat(OutputFormat format)Specifies an output format for this serializer. | 
|  void | skippedEntity(java.lang.String name)Receive notification of a skipped entity. | 
|  void | startCDATA()Report the start of a CDATA section. | 
|  void | startDocument()Receive notification of the beginning of a document. | 
|  void | startDTD(java.lang.String name,
         java.lang.String publicId,
         java.lang.String systemId)Report the start of DTD declarations, if any. | 
|  void | startEntity(java.lang.String name)Report the beginning of an entity in content. | 
|  void | startNonEscaping() | 
|  void | startPrefixMapping(java.lang.String prefix,
                   java.lang.String uri)Begin the scope of a prefix-URI Namespace mapping. | 
|  void | startPreserving() | 
|  void | unparsedEntityDecl(java.lang.String name,
                   java.lang.String publicId,
                   java.lang.String systemId,
                   java.lang.String notationName)Receive notification of an unparsed entity declaration event. | 
| Methods inherited from class java.lang.Object | 
| clone, 
equals, 
finalize, 
getClass, 
hashCode, 
notify, 
notifyAll, 
toString, 
wait, 
wait, 
wait | 
| Field Detail | 
protected boolean _started
protected java.util.Hashtable _prefixes
protected java.lang.String _docTypePublicId
protected java.lang.String _docTypeSystemId
protected OutputFormat _format
protected Printer _printer
protected boolean _indenting
| Constructor Detail | 
protected BaseMarkupSerializer(OutputFormat format)
#init.| Method Detail | 
public DocumentHandler asDocumentHandler()
                                  throws java.io.IOException
DocumentHandler interface into this serializer.
 If the serializer does not support the DocumentHandler
 interface, it should return null.
public ContentHandler asContentHandler()
                                throws java.io.IOException
ContentHandler interface into this serializer.
 If the serializer does not support the ContentHandler
 interface, it should return null.
public DOMSerializer asDOMSerializer()
                              throws java.io.IOException
DOMSerializer interface into this serializer.
 If the serializer does not support the DOMSerializer
 interface, it should return null.public void setOutputByteStream(java.io.OutputStream output)
public void setOutputCharStream(java.io.Writer writer)
public void setOutputFormat(OutputFormat format)
format - The output format to usepublic boolean reset()
protected void prepare()
                throws java.io.IOException
public void serialize(Element elem)
               throws java.io.IOException
elem - The element to serialize
public void serialize(DocumentFragment frag)
               throws java.io.IOException
elem - The element to serialize
public void serialize(Document doc)
               throws java.io.IOException
doc - The document to serialize
public void startDocument()
                   throws SAXException
The SAX parser will invoke this method only once, before any
 other methods in this interface or in DTDHandler (except for setDocumentLocator).
ContentHandler.endDocument()
public void characters(char[] chars,
                       int start,
                       int length)
The Parser will call this method to report each chunk of character data. SAX parsers may return all contiguous character data in a single chunk, or they may split it into several chunks; however, all of the characters in any single event must come from the same external entity so that the Locator provides useful information.
The application must not attempt to read from the array outside of the specified range.
Note that some parsers will report whitespace in element
 content using the ignorableWhitespace
 method rather than this one (validating parsers must 
 do so).
ch - The characters from the XML document.start - The start position in the array.length - The number of characters to read from the array.ContentHandler.ignorableWhitespace(char[], int, int), 
Locator
public void ignorableWhitespace(char[] chars,
                                int start,
                                int length)
Validating Parsers must use this method to report each chunk of whitespace in element content (see the W3C XML 1.0 recommendation, section 2.10): non-validating parsers may also use this method if they are capable of parsing and using content models.
SAX parsers may return all contiguous whitespace in a single chunk, or they may split it into several chunks; however, all of the characters in any single event must come from the same external entity, so that the Locator provides useful information.
The application must not attempt to read from the array outside of the specified range.
ch - The characters from the XML document.start - The start position in the array.length - The number of characters to read from the array.ContentHandler.characters(char[], int, int)
public void processingInstruction(java.lang.String target,
                                  java.lang.String code)
The Parser will invoke this method once for each processing instruction found: note that processing instructions may occur before or after the main document element.
A SAX parser must never report an XML declaration (XML 1.0, section 2.8) or a text declaration (XML 1.0, section 4.3.1) using this method.
target - The processing instruction target.data - The processing instruction data, or null if
        none was supplied.  The data does not include any
        whitespace separating it from the target.
public void comment(char[] chars,
                    int start,
                    int length)
This callback will be used for comments inside or outside the document element, including comments in the external DTD subset (if read).
ch - An array holding the characters in the comment.start - The starting position in the array.length - The number of characters to use from the array.public void comment(java.lang.String text)
public void startCDATA()
The contents of the CDATA section will be reported through
 the regular characters event.
LexicalHandler.endCDATA()public void endCDATA()
LexicalHandler.startCDATA()public void startNonEscaping()
public void endNonEscaping()
public void startPreserving()
public void endPreserving()
public void endDocument()
                 throws SAXException
public void startEntity(java.lang.String name)
NOTE: entity references in attribute values -- and the start and end of the document entity -- are never reported.
The start and end of the external DTD subset are reported using the pseudo-name "[dtd]". All other events must be properly nested within start/end entity events.
Note that skipped entities will be reported through the
 skippedEntity
 event, which is part of the ContentHandler interface.
name - The name of the entity.  If it is a parameter
        entity, the name will begin with '%'.LexicalHandler.endEntity(java.lang.String), 
DeclHandler.internalEntityDecl(java.lang.String, java.lang.String), 
DeclHandler.externalEntityDecl(java.lang.String, java.lang.String, java.lang.String)public void endEntity(java.lang.String name)
name - The name of the entity that is ending.LexicalHandler.startEntity(java.lang.String)public void setDocumentLocator(Locator locator)
SAX parsers are strongly encouraged (though not absolutely required) to supply a locator: if it does so, it must supply the locator to the application by invoking this method before invoking any of the other methods in the ContentHandler interface.
The locator allows the application to determine the end position of any document-related event, even if the parser is not reporting an error. Typically, the application will use this information for reporting its own errors (such as character content that does not match an application's business rules). The information returned by the locator is probably not sufficient for use with a search engine.
Note that the locator will return correct information only during the invocation of the events in this interface. The application should not attempt to use it at any other time.
locator - An object that can return the location of
                any SAX document event.Locator
public void skippedEntity(java.lang.String name)
                   throws SAXException
The Parser will invoke this method once for each entity
 skipped.  Non-validating processors may skip entities if they
 have not seen the declarations (because, for example, the
 entity was declared in an external DTD subset).  All processors
 may skip external entities, depending on the values of the
 http://xml.org/sax/features/external-general-entities
 and the
 http://xml.org/sax/features/external-parameter-entities
 properties.
name - The name of the skipped entity.  If it is a 
        parameter entity, the name will begin with '%', and if
        it is the external DTD subset, it will be the string
        "[dtd]".
public void startPrefixMapping(java.lang.String prefix,
                               java.lang.String uri)
                        throws SAXException
The information from this event is not necessary for
 normal Namespace processing: the SAX XML reader will 
 automatically replace prefixes for element and attribute
 names when the http://xml.org/sax/features/namespaces
 feature is true (the default).
There are cases, however, when applications need to use prefixes in character data or in attribute values, where they cannot safely be expanded automatically; the start/endPrefixMapping event supplies the information to the application to expand prefixes in those contexts itself, if necessary.
Note that start/endPrefixMapping events are not
 guaranteed to be properly nested relative to each-other:
 all startPrefixMapping events will occur before the
 corresponding startElement event, 
 and all endPrefixMapping
 events will occur after the corresponding endElement event, but their order is not otherwise 
 guaranteed.
There should never be start/endPrefixMapping events for the "xml" prefix, since it is predeclared and immutable.
prefix - The Namespace prefix being declared.uri - The Namespace URI the prefix is mapped to.ContentHandler.endPrefixMapping(java.lang.String), 
ContentHandler.startElement(java.lang.String, java.lang.String, java.lang.String, org.xml.sax.Attributes)
public void endPrefixMapping(java.lang.String prefix)
                      throws SAXException
See startPrefixMapping for 
 details.  This event will always occur after the corresponding 
 endElement event, but the order of 
 endPrefixMapping events is not otherwise
 guaranteed.
prefix - The prefix that was being mapping.ContentHandler.startPrefixMapping(java.lang.String, java.lang.String), 
ContentHandler.endElement(java.lang.String, java.lang.String, java.lang.String)
public void startDTD(java.lang.String name,
                     java.lang.String publicId,
                     java.lang.String systemId)
Any declarations are assumed to be in the internal subset
 unless otherwise indicated by a startEntity
 event.
Note that the start/endDTD events will appear within the start/endDocument events from ContentHandler and before the first startElement event.
name - The document type name.publicId - The declared public identifier for the
        external DTD subset, or null if none was declared.systemId - The declared system identifier for the
        external DTD subset, or null if none was declared.LexicalHandler.endDTD(), 
LexicalHandler.startEntity(java.lang.String)public void endDTD()
LexicalHandler.startDTD(java.lang.String, java.lang.String, java.lang.String)
public void elementDecl(java.lang.String name,
                        java.lang.String model)
The content model will consist of the string "EMPTY", the string "ANY", or a parenthesised group, optionally followed by an occurrence indicator. The model will be normalized so that all whitespace is removed,and will include the enclosing parentheses.
name - The element type name.model - The content model as a normalized string.
public void attributeDecl(java.lang.String eName,
                          java.lang.String aName,
                          java.lang.String type,
                          java.lang.String valueDefault,
                          java.lang.String value)
Only the effective (first) declaration for an attribute will be reported. The type will be one of the strings "CDATA", "ID", "IDREF", "IDREFS", "NMTOKEN", "NMTOKENS", "ENTITY", "ENTITIES", or "NOTATION", or a parenthesized token group with the separator "|" and all whitespace removed.
eName - The name of the associated element.aName - The name of the attribute.type - A string representing the attribute type.valueDefault - A string representing the attribute default
        ("#IMPLIED", "#REQUIRED", or "#FIXED") or null if
        none of these applies.value - A string representing the attribute's default value,
        or null if there is none.
public void internalEntityDecl(java.lang.String name,
                               java.lang.String value)
Only the effective (first) declaration for each entity will be reported.
name - The name of the entity.  If it is a parameter
        entity, the name will begin with '%'.value - The replacement text of the entity.DeclHandler.externalEntityDecl(java.lang.String, java.lang.String, java.lang.String), 
DTDHandler.unparsedEntityDecl(java.lang.String, java.lang.String, java.lang.String, java.lang.String)
public void externalEntityDecl(java.lang.String name,
                               java.lang.String publicId,
                               java.lang.String systemId)
Only the effective (first) declaration for each entity will be reported.
name - The name of the entity.  If it is a parameter
        entity, the name will begin with '%'.publicId - The declared public identifier of the entity, or
        null if none was declared.systemId - The declared system identifier of the entity.DeclHandler.internalEntityDecl(java.lang.String, java.lang.String), 
DTDHandler.unparsedEntityDecl(java.lang.String, java.lang.String, java.lang.String, java.lang.String)
public void unparsedEntityDecl(java.lang.String name,
                               java.lang.String publicId,
                               java.lang.String systemId,
                               java.lang.String notationName)
Note that the notation name corresponds to a notation
 reported by the notationDecl event.  
 It is up to the application to record the entity for later 
 reference, if necessary.
If the system identifier is a URL, the parser must resolve it fully before passing it to the application.
name - The unparsed entity's name.publicId - The entity's public identifier, or null if none
        was given.systemId - The entity's system identifier.notation - name The name of the associated notation.DTDHandler.notationDecl(java.lang.String, java.lang.String, java.lang.String), 
AttributeList
public void notationDecl(java.lang.String name,
                         java.lang.String publicId,
                         java.lang.String systemId)
It is up to the application to record the notation for later reference, if necessary.
At least one of publicId and systemId must be non-null. If a system identifier is present, and it is a URL, the SAX parser must resolve it fully before passing it to the application through this event.
There is no guarantee that the notation declaration will be reported before any unparsed entities that use it.
name - The notation name.publicId - The notation's public identifier, or null if
        none was given.systemId - The notation's system identifier, or null if
        none was given.DTDHandler.unparsedEntityDecl(java.lang.String, java.lang.String, java.lang.String, java.lang.String), 
AttributeListprotected void serializeNode(Node node)
serializeElement(org.w3c.dom.Element).node - The node to serializeserializeElement(org.w3c.dom.Element)protected org.apache.xml.serialize.ElementState content()
protected void characters(java.lang.String text)
text - The text to printunescaped - True is should print unescapedprotected abstract java.lang.String getEntityRef(char ch)
ch - Character valueprotected abstract void serializeElement(Element elem)
elem - The element to serializeprotected void serializePreRoot()
protected final void printText(char[] chars,
                               int start,
                               int length,
                               boolean preserveSpace,
                               boolean unescaped)
#printText(String) with a call to #breakLine
 for each new line. If spaces are not preserved, the text is
 broken at space boundaries if longer than the line width;
 Multiple spaces are printed as such, but spaces at beginning
 of line are removed.text - The text to printpreserveSpace - Space preserving flagunescaped - Print unescaped
protected final void printText(java.lang.String text,
                               boolean preserveSpace,
                               boolean unescaped)
protected void printDoctypeURL(java.lang.String url)
printText(char[], int, int, boolean, boolean).url - The document type url to printprotected void printEscaped(char ch)
protected void printEscaped(java.lang.String source)
source - The string to escapeprotected org.apache.xml.serialize.ElementState getElementState()
protected org.apache.xml.serialize.ElementState enterElementState(java.lang.String namespaceURI,
                                                                  java.lang.String localName,
                                                                  java.lang.String rawName,
                                                                  boolean preserveSpace)
protected org.apache.xml.serialize.ElementState leaveElementState()
protected boolean isDocumentState()
protected java.lang.String getPrefix(java.lang.String namespaceURI)
namespaceURI - The namespace URI| 
 | Xerces 1.2.0 | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: INNER | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||