document to HTML to Text. The charset information is first encoded
in XIF and then that information is used when interpretting the unicode
for output.
Added support to output character entity information which should address
bug 4709
git-svn-id: svn://10.0.0.236/trunk@29162 18797224-902f-48f8-a5cc-f745e15eee43