| |
TclDOM Reference
TclDOM is a Tcl language binding for the DOM -
the Document Object Model. DOM provides a view of a XML (or HTML) document as a tree
structure.
Currently, TclDOM only supports XML documents. It provides a convenience function to
parse XML documents into a DOM structure,
allows DOM structures to be built in-memory from scratch and
finally provides a method to output XML formatted text
from a DOM structure.
This version of TclDOM implements DOM Level 1.
The DOM Level 1 specification should be read in conjunction with this reference manual,
as it explains the meaning and purpose of the various interfaces. This manual is not a
tutorial on how to use the DOM.
Table Of Contents
- Packages and Namespaces
- Tokens
- DOM Commands
Packages and Namespaces
The TclDOM package defines the dom package and also a namespace using that name.
Tokens
The TclDOM package uses "tokens" as identifiers for nodes within the document tree.
This technique has been used to allow alternate implementations of TclDOM to be efficient,
while retaining compatibility with the pure-Tcl implementation.
The format of the token itself as well as the data structure referred to by the token are not
public and an application should not rely on these. Instead, an application should use the accessor
methods provided by the package.
DOM Commands
Each Interface in the DOM specification is implemented with a Tcl command in
the dom namespace. These commands are described below:
dom::DOMImplementation method ?args ...?
- This command provides implementation-specific features not explicitly defined in the DOM
specification. Methods include:
dom::DOMImplementation hasFeature feature
- Provides a test for existence of a feature. Returns "1" if a feature is implemented,
"0" otherwise.
dom::DOMImplementation create ?arrayName?
- Creates the root node of a new DOM document. The return value is a token referring to the root node
of the newly created document.
With no argument this command creates a data structure within the dom namespace.
An arrayName argument may be supplied which the package will use to store
its data structures in. Future implementations of TclDOM may choose to ignore this argument.
dom::DOMImplementation destroy token
- This method frees all data structures associated with a DOM document. The
token
argument must refer to a valid token for any node in the document tree.
dom::DOMImplementation parse data ?options ...?
- This method parses XML formatted text given by the
data argument
and constructs a DOM tree for the document. The return result
is the token of the root node of the newly created document.
This method requires an event-based XML parser to be loaded to perform the parsing operation.
The dom package itself does not include a XML parser. Support for the use of
two Tcl event-based parsers is provided: TclXML and
TclExpat. Any other Tcl event-based XML parser
implementing the TclXML API may also be used.
By default the dom package will automatically attempt to
load either of the supported parsers (using the package require Tcl command).
It will first attempt to load TclExpat and if that fails it will then attempt to load TclXML.
The command fails if neither parser can be loaded. This behaviour can be overridden by the
-parser option.
Valid configuration options are:
-parser {}|expat|tcl
- This option specifies which XML parser to use to parse the XML data. If an empty string is
given then the default behaviour described above is used. The value "expat" specifies that
the TclExpat package must be used. The value "tcl" specifies that the TclXML package
must be used. If an explicit value is given and that parser cannot be loaded then the command
will fail, despite the fact that the other parser may be available.
-progresscommand script
- This option specifies a Tcl command to be invoked from time to time while the DOM tree
is being constructed. The script will be invoked after a certain number of element start tags
have been processed, given by the
-chunksize option.
-chunksize integer
- This option specifies how many element start tags to process before invoking the
script given by the
-progresscommand option.
dom::DOMImplementation serialize token ?options ...?
- This method returns the XML formatted text corresponding to the node given by
token . The text is guaranteed to be a well-formed XML document.
Valid configuration options are:
-newline elementlist
- This option specifies a list of element types for which newline characters will be added
before and after the start and end tags for those elements upon serialization.
White space is significant in XML, so the dom package never adds extra white space
for purposes of "pretty-printing" the XML source document. On some platforms, such as
VMS, this can actually cause serious problems due to line length limitations. This option provides
the convenience of adding newlines to certain nominated element types for formatting the source
into lines.
Examples:
Suppose the following DOM document is constructed:
set doc [::dom::DOMImplementation create]
set top [::dom::document createElement $doc Root]
set node [::dom::document createElement $top First]
::dom::document createElement $node Second
::dom::document createElement $top First
Without the -newline option the serialized document would be:
::dom::DOMImplementation serialize $doc
<?xml version="1.0"?>
<!DOCTYPE Root>
<Root><First><Second/></First><First/></Root>
With the -newline option the serialized document would be:
::dom::DOMImplementation serialize $doc -newline First
<?xml version="1.0"?>
<!DOCTYPE Root>
<Root>
<First>
<Second/>
</First>
<First/>
</Root>
dom::DOMImplementation trim token
- This method removes any node containing only white space from the document tree of the node given by
token .
dom::document method token ?arguments...?
- This command implements the Document interface in the DOM specification. The most important
aspect of this command are its factory methods for creating nodes.
The methods accepted by this command are as follows:
cget token option
- This method returns the value of the given configuration option.
configure token option value
- This method sets the value of the given configuration option.
Valid configuration options are:
-doctype token
- Specifies the token of the Document Type Declaration node.
This is a read-only option. Use the factory method to create a Document Type Declaration node.
-implementation token
- Specifies the token of the document's implementation.
-documentElement token
- Specifies the token of the document's document element node. A document may only have
one document element, but may have other types of children.
This is a read-only option. Use the factory method to create a document element node.
dom::document createElement token type
- This method creates an element node as a child of the given node specified by
token .
token must be a node of type element, document or documentFragment.
The new, child element is added as the last child of token's list of children.
The new element's type is given by the type argument. The new element
is created with an empty attribute list.
dom::document createDocumentFragment token
- This method creates a documentFragment node as a child of the given node specified by
token .
token must be a node of type element, document or documentFragment.
The new, child element is added as the last child of token's list of children.
dom::document createTextNode token text
- This method creates a textNode node as a child of the given node specified by
token .
token must be a node of type element, document or documentFragment.
The new, child element is added as the last child of token's list of children.
The new element is created with its value set to text .
dom::document createComment token data
- This method creates a comment node as a child of the given node specified by
token .
token must be a node of type element, document or documentFragment.
The new, child element is added as the last child of token's list of children.
The new element is created with its value set to data .
dom::document createCDATASection token text
- This method creates a CDATASection node as a child of the given node specified by
token .
token must be a node of type element, document or documentFragment.
The new, child element is added as the last child of token's list of children.
The new element is created with its value set to text .
dom::document createProcessingInstruction token target data
- This method creates a processingInstruction node as a child of the given node specified by
token .
token must be a node of type element, document or documentFragment.
The new, child element is added as the last child of token's list of children.
The new element is created with its name set to target and its
value set to data .
dom::document createAttribute token name
- This method creates an attribute node as a child of the given node specified by
token .
token must be a node of type element.
The new attribute is created with its name set to name and an empty value.
NB. This method is included for completeness with respect to the DOM specification.
The preferred method for setting element attributes is to use the element
command.
dom::document createEntity token
- Not currently implemented.
dom::document createEntityReference token
- Not currently implemented.
dom::document createDocTypeDecl token name extid dtd entities notations
- This method creates a Document Type Declaration node as a child of the given node specified by
token .
token must be a node of type document.
name is the element type of the document element. If the document already has
a document element then this name must match with that element type.
extid is an external identifier to include in the document type declaration.
dtd is an internal DTD subset to incldue in the document type declaration.
This is specified as XML text.
entities and notations are included for completeness
with the DOM specification, but are not currently implemented.
Non-standard: This method is not a standard method as specified by the
DOM Recommendation.
dom::document getElementsByTagName token name
- This method searches the node given by the argument
token for
child elements with a type matching the argument name .
The tokens for all those elements which match are returned as a Tcl list.
The search only occurs on the immediate children of token , not any
descendants.
dom::node method token ?arguments...?
- This command implements generic functions for DOM nodes.
The methods accepted by this command are as follows:
dom::node cget token option
- This method returns the value of the given configuration option for the node given by
token .
dom::node configure token option value
- This method sets the value of the given configuration option for the node given by
token .
Valid configuration options are as follows:
-nodeName
- Returns the node name. This is a read-only option.
The DOM specification gives the meaning of names for different types of nodes.
For example, the nodeName option of an element node is the element type.
-nodeType
- Returns the type of the node given by
token .
This is a read-only option.
-parentNode
- Returns the parent node of the node given by
token .
This is a read-only option.
-childNodes
- Returns a Tcl variable which contains a list of the children of the node
given by
token . The variable contains the "live"
list of children.
This is a read-only option.
-firstChild
- Returns the first child node of the node given by
token .
This is a read-only option.
-lastChild
- Returns the last child node of the node given by
token .
This is a read-only option.
-previousSibling
- Returns the parent's child node which appears before this node. If this child is
the first child of its parent then returns an empty string.
This is a read-only option.
-nextSibling
- Returns the parent's child node which appears after this node. If this child is
the last child of its parent then returns an empty string.
This is a read-only option.
-attributes
- Returns a Tcl array variable which contains the attribute list for an element node.
If the node is not an element type node then returns an empty string.
The indices of the array are attribute names, and the values of the array elements
are their corresponding attribute values.
This is a read-only option.
-nodeValue data
- Specifies the value of a node.
The DOM specification gives the meaning of values for different types of nodes.
For example, the nodeValue option of a textNode node is the node's text.
dom::node insertBefore token newchild ?refchild?
- This method removes the node given by
newchild from its parent.
If no refchild argument is given then newchild
is appended to token 's list of children.
If the refchild argument is given then this method adds
newchild as a child of token .
The new child node is positioned before the node refchild in
token 's list of children.
Returns an empty string.
dom::node replaceChild token newchild oldchild
- This method removes the node given by
newchild from its parent.
It then also removes the node given by oldchild from token .
newchild is then added as a child of token
in oldchild 's original position in the list of children.
The method returns the token oldchild , which will now have no parent.
dom::node removeChild token oldchild
- This method removes the node given by
oldchild from token .
The method returns the token oldchild , which will now have no parent.
dom::node appendChild token newchild
- This method appends the node given by
oldchild to the end of the list
of children for node token .
dom::node hasChildNodes token
- Returns "1" if the given node has any child nodes, "0" otherwise.
dom::node clodeNode token ?deep?
- This method makes a copy the node given by
token .
If the argument deep is not specified or has the value "0"
then only the node itself is copied, not its children. If the argument deep
has the value "1" then token 's children are also copied
recursively.
This method returns the token of the newly created node. This new node will have no parent.
dom::node children token
- This is a convenience method which returns the list of child nodes for the given node.
This is not a standard DOM method for this interface
dom::node parent token
- This is a convenience method which returns the parent node for the given node.
This is not a standard DOM method for this interface
dom::element method token ?arguments...?
- This command provides functions for element type nodes.
Valid methods for this command are as follows:
dom::element cget token option
- This method returns the current setting of configuration options for an element.
See the configure method for the list of valid configuration
options.
dom::element configure token option value
- This method sets configuration options for an element. Note that element type nodes
only have read-only options.
Valid configuration options are as follows:
-tagName name
- The tag name, or element type, of this element.
-empty boolean
- Sets whether this element was specified as an empty element when the document was parsed.
That is, XML empty element syntax such as
<Empty/> was used.
This option has no effect upon output (serialization) of the XML document.
Empty element syntax is automatically used where appropriate.
dom::element getAttribute token name
- This method returns the attribute value of the attribute given by
name .
If the attribute does not exist, then an empty string is returned.
dom::element setAttribute token name value
- This method sets the attribute value for the attribute given by
name .
If the attribute already exists then its value is replaced, otherwise the attribute is created.
dom::element removeAttribute token name
- This method deletes the attribute given by
name . If the attribute
does not exist then the method has no effect.
dom::element getAttributeNode token name
dom::element setAttributeNode token name
dom::element removeAttributeNode token name
- Interfaces to attributeNodes are not implemented.
dom::element getElementsByTagName token name
- This method searches the node given by the argument
token for
child elements with a type matching the argument name .
The tokens for all those elements which match are returned as a Tcl list.
The search only occurs on the immediate children of token , not any
descendants.
dom::element normalize token
- This method recursively coalesces textNodes within the children of the given node.
textNodes which are adjacent in the DOM tree cannot be distinguished in the serialized XML document.
dom::processinginstruction method token ?arguments...?
- This command provides functions for processingInstruction type nodes.
Valid methods for this command are as follows:
dom::processinginstruction cget token option
- This method returns the current setting of configuration options for a processing instruction.
dom::processinginstruction configure token option value
- This method sets the configuration options for a processing instruction.
Valid configuration options are as follows:
-target name
- This option sets the target of the processing instruction.
This is a read-only configuration option.
-data string
- This option sets the data of the processing instruction.
|
|