#1
|
|||
|
|||
Save Word doc in unicode html (utf-8)
When I save as html, Word saves it with the doctype of windows-1258 and all my unicode symbols get shifted into entity codes (making them completely unreadable in a html editor). I can't find any option to save docs in unicode.
Does such an option exist any more (it used to in earlier Word versions). Thanks. |
#2
|
||||
|
||||
In the Word Options dialog box (Office button | Word Options), click the Advanced category. At the bottom, click the Web Options button, and look at the Encoding tab.
__________________
Stefan Blom Microsoft Word MVP Microsoft 365 apps for business Windows 11 Professional |
#3
|
|||
|
|||
Thanks--that option is well hidden.
Now if there were only a way to suppress all of the Word-specific coding that gets generated! |
#4
|
||||
|
||||
To get rid of Word-specific HTML, you will have to save in the "Web Page, Filtered" format.
__________________
Stefan Blom Microsoft Word MVP Microsoft 365 apps for business Windows 11 Professional |
#5
|
|||
|
|||
Yes, but this still produces extraneous markup and a lot of Word-specific tags, but it's much better.
Back to my first question: when I tried saving the unicode doctype utf-8 as default for all documents, it seems that Word doesn't do that; its default seems to be windows-1258 encoding. At least when I change the encoding and check the "Always use..." option, Word seems to revert to its built-in default. New documents saved as html are in windows encoding. Can the change be made the permanent default? --Thanks. |
#6
|
||||
|
||||
Your observation is correct. "Always save Web pages in the default encoding" means to use something other than what is selected in the Web Options dialog box. I don't know how to actually change the default encoding in Word, unfortunately.
__________________
Stefan Blom Microsoft Word MVP Microsoft 365 apps for business Windows 11 Professional |
Thread Tools | |
Display Modes | |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Word Doesn't Convert Non-Unicode Characters As Expected | fitchkd25 | Word | 1 | 08-17-2012 12:24 AM |
Word - Sharepoint and HTML | stuartadair | Word | 0 | 07-22-2010 05:09 AM |
Can you actually write HTML and CSS in a word document and send it as an html page | jackaroo | Word | 0 | 07-12-2010 07:49 AM |
Word Doc Copied from HTML | Pat801 | Word | 2 | 03-30-2010 01:37 PM |
ANSI-ost to unicode-ost | jeff13 | Outlook | 0 | 01-07-2010 11:48 AM |