site/tutorials/tdmlTutorial.tdml.xml

<?xml version="1.0" encoding="ASCII"?> <?xml-stylesheet type="text/xsl" href="DFDLTutorialStylesheet.xsl"?>  <tdml:testSuite suiteName="A TDML Tutorial" description="Illustration of how to test DFDL schemas, and also how to report a bug using TDML." xmlns:tdml="http://www.ibm.com/xmlns/dfdl/testData" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xml="http://www.w3.org/XML/1998/namespace" xmlns:dfdl="http://www.ogf.org/dfdl/dfdl-1.0/" xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:ex="http://example.com" xmlns:gpf="http://www.ibm.com/dfdl/GeneralPurposeFormat" xmlns:daf="urn:ogf:dfdl:2013:imp:daffodil.apache.org:2018:ext" xmlns="http://www.w3.org/1999/xhtml" defaultRoundTrip="true"> <tdml:tutorial xml:space="preserve"> This file is an example of a self-contained test described in a TDML file. These are easily run using the Daffodil command line interface (CLI). A TDML file is actually a test suite of tests, and we include two here. One is a parser test case, the other an unparser test case. The file root element is tdml:testSuite which contains namespace prefix definitions (several are generally needed), and other important attributes: <dl> <dt>defaultRoundTrip</dt> <dd>by default a parse test will be run "round trip" meaning both a parse and unparse will be done. If false or omitted a parse test will only parse, an unparse test will only unparse, though this may be controlled on a test-by-test basis with the roundTrip attribute of parserTestCase and unparserTestCase elements.</dd> <dt>defaultConfig</dt> <dd>gives the name of a defined configuration, which can bind external variables, or set tunable parameters. This can also be controlled on a test-by-test basis with the config attribute of the parserTestCase and unparserTestCase elements.</dd> </dl> We begin below by defining a DFDL schema directly in our TDML file. Use defineSchema to include a DFDL schema directly inside the TDML file. You can alternatively put the DFDL schema in a separate file if you prefer. The target namespace of these named defineSchemas will be http://example.com which is bound to the prefix "ex" above. Each defineSchema has a name, so that one TDML file can contain tests which reference different DFDL schemas. To embed a schema inside the TDML you don't include the xs:schema element from the schema file, nor do you need to wrap the top-level DFDL annotation objects with xs:annotation and xs:appinfo. In other words, inside a defineSchema you can directly put: dfdl:defineFormat, dfdl:defineEscapeSchema, dfdl:format (for the default format), xs:element, xs:simpleType, xs:complexType, xs:group, xs:import, or xs:include. It is common for a TDML file to contain an embedded schema which includes or imports other DFDL schemas that are in files. Our embedded DFDL schema begins with a named format definition - notice no surrounding xs:annotation nor xs:appinfo We reference a useful starting point format definition provided to the DFDL community by IBM. (It is built into the Daffodil software.) </tdml:tutorial> <tdml:defineSchema name="s1" elementFormDefault="unqualified"> <xs:include schemaLocation="/org/apache/daffodil/xsd/DFDLGeneralFormat.dfdl.xsd"/> <dfdl:defineFormat name="myDefaults"> <dfdl:format lengthKind="implicit" representation="text" encoding="ASCII" initiator="" terminator="" separator="" ref="gpf:GeneralPurposeFormat" /> </dfdl:defineFormat>  <dfdl:format ref="ex:myDefaults" />  <xs:import namespace="http://www.ibm.com/dfdl/GeneralPurposeFormat" schemaLocation="/IBMdefined/GeneralPurposeFormat.xsd" />  <xs:element name="myTestRoot" type="xs:dateTime" dfdl:calendarPattern="MM.dd.yyyy 'at' HH:mm:ssZZZZZ" dfdl:calendarPatternKind="explicit" dfdl:lengthKind="delimited" dfdl:terminator="%NL;" /> </tdml:defineSchema> <tdml:tutorial xml:space="preserve"> <h1>Parser Test Cases</h1> Here is a test case that exercises the above schema. A single TDML file can contain many test cases like the one below. You must give the name of the model (aka the schema), that can be the name of a schema defined immediately in this file like above, or a file name. You must also give the name of the root element that the test will use. Because the tdml:testSuite element has defaultRoundTrip="true" this "parser" test case will actually test parsing and unparsing, but as it is a parser test case, it begins with parsing the data to an infoset, then unparses it and checks that it gets back the original data. Except, that it won't get back the exact original data. The Time Zone notation won't be reproduced exactly. What is unparsed is equivalent to the original, but not identical. So this pass requires that we do a second parse pass, and verify that we get the infoset back that we expected. That is, what was unparsed can be reparsed back to the same infoset. So this test specifies the roundTrip="twoPass" attribute which overrides the default behavior of the suite. </tdml:tutorial> <tdml:parserTestCase name="dateTimeTest" root="myTestRoot" model="s1" description="Test of date/time. Runs round trip (parse and unparse) because that is the default for this test suite." roundTrip="twoPass"> <tdml:tutorial xml:space="preserve"> The data for your test is given by the tdml:document element, which optionally contains tdml:documentPart elements. A tdml:documentPart can be of type "text", "bytes" or "bits". Notice specifically the use of the CDATA bracketing of the data in the TDML file. This insures that no unintended whitespace gets inserted around your data. DFDL Character Entities can be used in textual data. (See the %LF; in there), if the attribute 'replaceDFDLEntities', a boolean, is true. There are ways to do binary data in hexadecimal, and even bit by bit, and even mixtures of text and binary data. For this example we'll just look at a simple textual data document. </tdml:tutorial> <tdml:document> <tdml:documentPart type="text" replaceDFDLEntities="true"><![CDATA[04.02.2013 at 14:00:56 GMT-05:00%LF;]]></tdml:documentPart> </tdml:document> <tdml:tutorial xml:space="preserve"> The infoset element gives the expected infoset, expressed as an XML fragment. </tdml:tutorial> <tdml:infoset> <tdml:dfdlInfoset>  <ex:myTestRoot>2013-04-02T14:00:56-05:00</ex:myTestRoot> </tdml:dfdlInfoset> </tdml:infoset>  </tdml:parserTestCase> <tdml:tutorial xml:space="preserve"> <h1>Unparser Test Cases</h1> This is how unparser tests work. Note the use of the roundTrip="false" attribute to override the defaultRoundTrip="true" that was set on the tdml:testSuite element. So this test will unparse only, and not attempt to use the same schema to parse the data. </tdml:tutorial> <tdml:unparserTestCase name="unparseDateTimeTest" root="myTestRoot" model="s1" description="date time issue, unparser" roundTrip="false"> <tdml:tutorial xml:space="preserve"> For an unparser test, the incoming data is the infoset element. The data for your test is what is expected as output of the unparse. The infoset element gives the expected infoset, expressed as an XML fragment. In an unparser test, it is normally written first, with the data document second. The order doesn't actually matter. </tdml:tutorial> <tdml:infoset> <tdml:dfdlInfoset> <ex:myTestRoot>2013-04-02T14:00:56-05:00</ex:myTestRoot> </tdml:dfdlInfoset> </tdml:infoset> <tdml:document> <tdml:documentPart type="text" replaceDFDLEntities="true"><![CDATA[04.02.2013 at 14:00:56-05:00%CR;%LF;]]></tdml:documentPart> </tdml:document>  </tdml:unparserTestCase>  </tdml:testSuite>

site/tutorials/tdmlTutorial.tdml.xml (162 lines of code) (raw):