XMLC To-Do List

Note: this list needs to be reviewed; some items may already be done or may no longer be needed.

Misc

  • Need to figure out ways to improve compatibility of already-compiled objects between XMLC releases.
  • Setting of the standalone header in XML files is not handled correctly. org.enhydra.xml.io.XMLFormatter.writeXMLHeader() sets standalone if there is no system or public id. However, it's possible to have a document that has no system or public id, with the DTD in the internal subset, that still references external entities. This is demonstrated by: tests/org/enhydra/xml/xmlc/xml/input/bigentity/BigEntityTest.vxml which had to be modified for the tests. It was changed to use the Document.getStandalone(), but XMLC doesn't set this option in generated code.
  • Can't handle recompilation when pages are on a different drive than the JDK, because the parameters passed to java.exe and javac.exe begin with a backslash rather than a drive letter. This is because Recompiler.getMetaData() is calling getPath().
  • DocumentType entity nodes are not built in XML documents.
  • Disable tidy from deleting empty span tags.
  • Invalid characters in an encoding should be mapped by the formatter to a character entity reference, not allowed to fall through to the Java encoding, which then outputs them as `?'. Currently there is a workaround for Emoji Shift_JIS characters, but a general solution is needed.
  • -urlsetting no longer takes precedence over -urlmapping.
  • getElementById doesn't work for XML
  • Add command line option to set documentFormat.
  • Detect attempts to parse XHTML with the HTML parser.
  • Create getText for all PCDATA.
  • Create a cleaner way to manage DOMFormatter (DOMWriter???)
  • Add option to expand entity references on formatting.
  • XHTML support.
  • Finish tests based on the OASIS XML parse conformance tests
  • Upgrade Tidy parser.
  • Create LazyDOM version of WML DOM.
  • Straighten out the relationship between the DOM factory and documents created by the W3C DOMImplementation.
  • Add test for compiling file with a schema.
  • Include WML 1.2 and 1.3 DTDs.
  • LazyHTMLDocument should consider templates when doing getHead(), getBody(), etc.
  • JAXP support for DOMParser object.
  • Need to do performance analysis of Lazy DOM.
  • Recompilation needs to be aware of SSI.
  • Need to do performance analysis of compile.
  • Review use of Hashtable; convert some to HashMap and some to HashSet.
  • URL rewriting only works when the XMLC document object is passed to the formatter. Passing the contained document or any other node results in no URL rewriting. This restriction is caused by the inability to get back to the XMLObject from the DOM.
  • Should ignorable whitespace be preserved by XMLC?
  • HTML formatter should preserve DOCTYPE declaration.
  • When using the metadata API, the source file for HTML can't contain file: and the XML files must contain file: (per PM).
  • Create WML Lazy DOM.
  • It should be possible to compile a file just specifying a metadata file.
  • It appears that disabling entity reference expansion for XML parsing causes character entities to become entity reference nodes, which is painful.
  • Need options to control entity reference expansion.
  • The DOMFormatter supports formatting HTML as XHTML and XHTML as HTML, however the preformatted text of the arguments will output the wrong arguments. Also, how should XHTML character entities be handled?
  • Add Ant task definition to XMLC
  • Compiling with -dom xerces creates a discarded element
            $elem1 = document.getDocumentElement();
            document.createElement("HTML");
    
  • Need to define URL mapping order and behavior, including how it works with XML files.
  • Add more setText*() methods. Maybe check for text child. <H1>, <P>.
  • Should generated classes be serializable?
  • Need to carry forward information about which attributes have URLs for runtime substitution.
  • Should W3C HTML 4.0 header be included in HTML output?
  • Generated function to create object may be too large. (Can't reproduce yet.)
  • Missing DOM support that must be implemented:
      - notationDecl - Have notation, but it's not writable.
      - unparsedEntityDecl
      - No way to create DocumentType stuff:
          - creating document types.
          - ElementDecl.
          - attlistDecl
          - internalEntityDecl - All by Entity.
          - externalEntityDecl
          - unparsedEntityDecl
          - parameterEntity
          - notationDecl
  • One issue is when entity references are expanded, particularly in things like internal DTDs. Should entities be expanded in the XMLC object? XP expands them.
  • Mark Tierney <mark@sics.se> wants a way for all access methods to return Element: The error-checking I'm talking about is that when compiling the XMLC-generated file, we get an error if the wanted id(s) are not present in the XML file. Yes, it sounds better to check the return types as well, but unfortunately, that is not possible when we're using more than one DTD.
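
As a rough illustration of the item above about invalid characters in an encoding, the following sketch shows one way a formatter could replace characters that the output encoding cannot represent with numeric character references instead of letting the Java encoder emit `?'. This is not XMLC code: the class name CharRefEscaper and its escape() method are hypothetical, and only standard java.nio.charset calls are used. It does not escape markup characters such as & or <; that is assumed to happen elsewhere.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of XMLC: maps unencodable characters
// to numeric character references before the text reaches the encoder.
public final class CharRefEscaper {
    private CharRefEscaper() {}

    /** Replace every code point the charset cannot encode with &#N;. */
    public static String escape(String text, Charset charset) {
        CharsetEncoder encoder = charset.newEncoder();
        StringBuilder out = new StringBuilder(text.length());
        int i = 0;
        while (i < text.length()) {
            int codePoint = text.codePointAt(i);
            int len = Character.charCount(codePoint);
            // Test the whole code point so a surrogate pair is judged as a unit.
            if (encoder.canEncode(text.subSequence(i, i + len))) {
                out.appendCodePoint(codePoint);
            } else {
                out.append("&#").append(codePoint).append(';');
            }
            i += len;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+00E9 is not representable in US-ASCII, so it becomes a reference.
        System.out.println(escape("caf\u00e9", StandardCharsets.US_ASCII));
    }
}
```

Checking each code point with CharsetEncoder.canEncode() keeps the approach independent of any particular encoding, which is the general solution the item asks for; whether it is fast enough for the formatter's inner loop would need measuring.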

    Bugs

    Possible Enhancements and Missing Functionality