<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>pnotepad.org forums &#187; Topic: Code Page AND Character Set?</title>
		<link>http://pnotepad.org/forums/topic/611</link>
		<description>Programmer&#039;s Notepad Forums</description>
		<language>en-US</language>
		<pubDate>Thu, 09 Feb 2012 06:58:00 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.3</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://pnotepad.org/forums/search.php</link>
		</textInput>
		<atom:link href="http://pnotepad.org/forums/rss/topic/611" rel="self" type="application/rss+xml" />

		<item>
                        <title>Code Page AND Character Set? (horus)</title>
			<link>http://pnotepad.org/forums/topic/611#post-7293</link>
			<pubDate>Fri, 13 Aug 2010 05:58:40 +0000</pubDate>
			<dc:creator>horus</dc:creator>
			<guid isPermaLink="false">7293@http://pnotepad.org/forums/</guid>
			<description>&#60;p&#62;Sorry, but I'm another user who is also very confused by this and I don't really understand the subtlety.  I've read the sentence &#34;The character set chooses the character set to be used for displaying the actual text ...&#34; ten times but still can't get a picture of it :p&#60;/p&#62;
&#60;p&#62;For me, and in general, the terms &#34;code page&#34;, &#34;character set&#34; and &#34;encoding&#34; are synonyms.  Having both options can lead to meaningless combinations.  Let's take an example:&#60;/p&#62;
&#60;p&#62;Big5 (by the way, I've no idea why it's called *simple* Chinese, but that's another discussion) does not include Hebrew characters.  You can check that by looking up all the supported characters in the Big5 table:&#60;br /&#62;
&#60;a href=&#34;http://www.khngai.com/chinese/charmap/tblbig.php?page=0&#34; rel=&#34;nofollow&#34;&#62;http://www.khngai.com/chinese/charmap/tblbig.php?page=0&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;So, if I choose&#60;br /&#62;
 Code Page: Simple Chinese Big5 &#38;amp; Character Set: Hebrew&#60;br /&#62;
This combination is meaningless.&#60;/p&#62;
&#60;p&#62;Some other combinations don't quite make sense either, e.g.&#60;br /&#62;
 Code Page: Simple Chinese Big5 &#38;amp; Character Set: Shift-JIS&#60;br /&#62;
or&#60;br /&#62;
 Code Page: Simple Chinese Big5 &#38;amp; Character Set: Greek&#60;br /&#62;
because Big5 doesn't contain accented Greek characters.  So the Greek word for &#34;colour&#34;, χρώμα (phonetically &#34;chroma&#34;), can't be represented, as the character ώ isn't a member of Big5.&#60;/p&#62;
&#60;p&#62;Could I suggest that &#34;Code page&#34; and &#34;Character set&#34; be merged into one option?
&#60;/p&#62;</description>
		</item>
		<item>
                        <title>Code Page AND Character Set? (rc-flitzer)</title>
			<link>http://pnotepad.org/forums/topic/611#post-2219</link>
			<pubDate>Mon, 13 Jul 2009 14:19:22 +0000</pubDate>
			<dc:creator>rc-flitzer</dc:creator>
			<guid isPermaLink="false">2219@http://pnotepad.org/forums/</guid>
			<description>&#60;p&#62;Whatever names Scintilla uses, it would be a good idea to use the correct terms or to simplify the process of choosing an encoding.&#60;/p&#62;
&#60;p&#62;Stijn asked which value he/she should use for &#38;quot;character set&#38;quot; when using UTF-8 encoding. From Simon's explanation I think it doesn't matter. But why not grey out the option when using UTF-8?&#60;/p&#62;
&#60;p&#62;The terms code page, encoding and character set are somewhat mixed up. I would prefer using the term &#38;quot;encoding&#38;quot; instead of &#38;quot;code page&#38;quot;. The term &#38;quot;character set&#38;quot; is okay, though it's unnecessary when using Unicode (UTF-7, -8, -16, -32, or UCS-2, which is restricted to the first 65,536 code points).&#60;/p&#62;
&#60;p&#62;BTW:&#60;br /&#62;
There is &#60;strong&#62;still&#60;/strong&#62; no option to set the encoding when saving a file, or otherwise to change the encoding &#60;strong&#62;without&#60;/strong&#62; changing the file's content.
&#60;/p&#62;</description>
		</item>
		<item>
                        <title>Code Page AND Character Set? (simon)</title>
			<link>http://pnotepad.org/forums/topic/611#post-2202</link>
			<pubDate>Thu, 09 Jul 2009 08:54:46 +0000</pubDate>
			<dc:creator>simon</dc:creator>
			<guid isPermaLink="false">2202@http://pnotepad.org/forums/</guid>
			<description>&#60;p&#62;The code page is used by Scintilla to work out how to handle the bytes representing the characters in the document, which allows correct support of multi-byte characters. UTF-8 is set automatically when a document in any Unicode format is used; otherwise the value from the defaults page is used.&#60;/p&#62;
&#60;p&#62;The character set chooses the character set to be used for displaying the actual text (as opposed to just understanding character boundaries). When not using Unicode, this defines which characters map to which byte values.
&#60;/p&#62;</description>
		</item>
		<item>
                        <title>Code Page AND Character Set? (Stijn)</title>
			<link>http://pnotepad.org/forums/topic/611#post-2201</link>
			<pubDate>Wed, 08 Jul 2009 21:50:59 +0000</pubDate>
			<dc:creator>Stijn</dc:creator>
			<guid isPermaLink="false">2201@http://pnotepad.org/forums/</guid>
			<description>&#60;p&#62;Nobody able to answer my question?
&#60;/p&#62;</description>
		</item>
		<item>
                        <title>Code Page AND Character Set? (Stijn)</title>
			<link>http://pnotepad.org/forums/topic/611#post-2190</link>
			<pubDate>Wed, 01 Jul 2009 14:36:27 +0000</pubDate>
			<dc:creator>Stijn</dc:creator>
			<guid isPermaLink="false">2190@http://pnotepad.org/forums/</guid>
			<description>&#60;p&#62;Just installed this program, and I was quite surprised when I saw I have to pick a Code Page AND a Character Set.&#60;br /&#62;
As I understand it, both of those are encodings but with some differences.&#60;/p&#62;
&#60;p&#62;So why do I have to pick one of each?&#60;br /&#62;
If I set the Code Page to UTF-8, then what should the Character Set be?
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>

