Microsoft Office Forums

  #1  
09-25-2011, 08:48 PM
rybrns (Novice), Windows XP, Office 2007
Join Date: Sep 2011
Posts: 9
Save Word doc in Unicode HTML (UTF-8)

When I save as HTML, Word saves the file with the windows-1258 charset and converts all my Unicode symbols to entity codes (making them completely unreadable in an HTML editor). I can't find any option to save documents as Unicode.

Does such an option still exist? (It did in earlier Word versions.)

Thanks.
  #2  
09-25-2011, 10:51 PM
Stefan Blom (Expert), Windows 7 64bit, Office 2010 32bit
Join Date: Aug 2011
Posts: 1,936

In the Word Options dialog box (Office button | Word Options), click the Advanced category. At the bottom, click the Web Options button, and look at the Encoding tab.
__________________
Stefan Blom
Microsoft Word MVP
  #3  
09-26-2011, 10:41 AM
rybrns (Novice), Windows XP, Office 2007
Join Date: Sep 2011
Posts: 9

Thanks--that option is well hidden.

Now if there were only a way to suppress all of the Word-specific coding that gets generated!
  #4  
09-26-2011, 10:59 AM
Stefan Blom (Expert), Windows 7 64bit, Office 2010 32bit
Join Date: Aug 2011
Posts: 1,936

To get rid of Word-specific HTML, you will have to save in the "Web Page, Filtered" format.
__________________
Stefan Blom
Microsoft Word MVP
  #5  
09-26-2011, 11:29 AM
rybrns (Novice), Windows XP, Office 2007
Join Date: Sep 2011
Posts: 9

Yes, that's much better, but it still produces extraneous markup and a lot of Word-specific tags.
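
For the leftover markup, one option is to post-process the saved file outside Word. Below is a minimal Python sketch, not a complete cleaner. The patterns are assumptions about markup Word commonly emits (conditional comments, `o:`/`v:`/`w:`/`m:` tags, `class=Mso...` attributes, `mso-*` style declarations), and the function name `strip_word_markup` is made up for illustration.

```python
import re

# Assumed patterns for typical Word-generated markup; not an exhaustive list.
_CONDITIONAL = re.compile(r"<!--\[if[^\]]*\]>.*?<!\[endif\]-->", re.S | re.I)
_OFFICE_TAG = re.compile(r"</?[ovwm]:[^>]*>", re.I)  # e.g. <o:p>, <v:shape>
_CLASS_MSO = re.compile(
    r"""\s+class\s*=\s*(?:"Mso[^"]*"|'Mso[^']*'|Mso[^\s>]*)""", re.I
)
_STYLE_ATTR = re.compile(r"""(\s+style\s*=\s*)(["'])(.*?)\2""", re.S | re.I)


def _clean_style(match):
    """Drop mso-* declarations from one quoted style attribute."""
    declarations = [d.strip() for d in match.group(3).split(";")]
    kept = [d for d in declarations if d and not d.lower().startswith("mso-")]
    if not kept:
        return ""  # nothing left, so remove the whole attribute
    quote = match.group(2)
    return f"{match.group(1)}{quote}{'; '.join(kept)}{quote}"


def strip_word_markup(html_text):
    """Remove common Word-specific markup from an HTML string."""
    text = _CONDITIONAL.sub("", html_text)
    text = _OFFICE_TAG.sub("", text)
    text = _CLASS_MSO.sub("", text)
    return _STYLE_ATTR.sub(_clean_style, text)
```

Regular expressions are fragile on real-world HTML (for example, a `;` inside a `url(...)` value would be split incorrectly), so check the result before relying on it.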

Back to my first question: when I tried setting Unicode (UTF-8) as the default encoding for all documents, Word doesn't seem to honor it; its default seems to be windows-1258. At least when I change the encoding and check the "Always use..." option, Word seems to revert to its built-in default. New documents saved as HTML still use the Windows encoding.

Can the change be made the permanent default? Thanks.
  #6  
09-26-2011, 02:18 PM
Stefan Blom (Expert), Windows 7 64bit, Office 2010 32bit
Join Date: Aug 2011
Posts: 1,936

Your observation is correct. Checking "Always save Web pages in the default encoding" tells Word to use its default encoding instead of the one selected in the Web Options dialog box. Unfortunately, I don't know how to change that default encoding in Word.
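
A possible workaround outside Word is to re-encode the saved file afterwards. The following is a minimal Python sketch under stated assumptions: the file was saved as windows-1258 (Python's `cp1258` codec), the charset is declared in a `charset=...` attribute, and the function name `to_utf8_html` is made up for illustration. It rewrites the first charset declaration and expands character references into real characters, leaving `&amp;`, `&lt;`, `&gt;`, `&quot;`, `&apos;`, `&nbsp;` and ASCII numeric references untouched so the markup itself is not altered.

```python
import re
from html import unescape
from html.entities import html5

# References that stay escaped because expanding them could change the markup.
_KEEP = {"amp", "lt", "gt", "quot", "apos", "nbsp"}
_REFERENCE = re.compile(r"&(#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")
_CHARSET = re.compile(r"""(charset\s*=\s*["']?)[\w.:-]+""", re.IGNORECASE)


def _expand(match):
    """Turn one character reference into the character it stands for."""
    body = match.group(1)
    if body.startswith("#"):
        digits = body[1:]
        code = int(digits[1:], 16) if digits[0] in "xX" else int(digits)
        # ASCII references (such as &#60;) are left as they are.
        return match.group(0) if code < 128 else unescape(match.group(0))
    if body in _KEEP:
        return match.group(0)
    # Unknown entity names are returned unchanged.
    return html5.get(body + ";", match.group(0))


def to_utf8_html(raw, source_encoding="cp1258"):
    """Decode Word's HTML bytes and return them re-encoded as UTF-8."""
    text = raw.decode(source_encoding, errors="replace")
    text = _CHARSET.sub(r"\g<1>utf-8", text, count=1)
    text = _REFERENCE.sub(_expand, text)
    return text.encode("utf-8")


# Example use (file names are placeholders):
# from pathlib import Path
# Path("page-utf8.htm").write_bytes(to_utf8_html(Path("page.htm").read_bytes()))
```

This only changes the file after Word has written it; it does not alter Word's own default.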
__________________
Stefan Blom
Microsoft Word MVP