<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Character sets: latin1 vs. ascii</title>
	<atom:link href="http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/feed" rel="self" type="application/rss+xml" />
	<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii</link>
	<description>Blog by Shlomi Noach</description>
	<lastBuildDate>Fri, 12 Mar 2010 13:29:20 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: shlomi</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2578</link>
		<dc:creator>shlomi</dc:creator>
		<pubDate>Thu, 09 Jul 2009 04:43:58 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2578</guid>
		<description>Hi Brian,

Somehow I&#039;m not surprised. You guys take the good stuff and throw away the rest!

Shlomi</description>
		<content:encoded><![CDATA[<p>Hi Brian,</p>
<p>Somehow I&#8217;m not surprised. You guys take the good stuff and throw away the rest!</p>
<p>Shlomi</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian Aker</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2570</link>
		<dc:creator>Brian Aker</dc:creator>
		<pubDate>Wed, 08 Jul 2009 15:38:19 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2570</guid>
		<description>Hi!

In Drizzle we made utf8 the default and optimized around it (the default collatin utf8_general_ci). For anything else? Just use binary.

Cheers,
   -Brian</description>
		<content:encoded><![CDATA[<p>Hi!</p>
<p>In Drizzle we made utf8 the default and optimized around it (the default collatin utf8_general_ci). For anything else? Just use binary.</p>
<p>Cheers,<br />
   -Brian</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mchl</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2569</link>
		<dc:creator>Mchl</dc:creator>
		<pubDate>Wed, 08 Jul 2009 12:07:57 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2569</guid>
		<description>Yeah. I forgot how VARCHAR behaves in MEMORY for a moment. 
It gets tricky indeed ;)

Personally I use case insensitive collations more often (for user supplied data at least).</description>
		<content:encoded><![CDATA[<p>Yeah. I forgot how VARCHAR behaves in MEMORY for a moment.<br />
It gets tricky indeed <img src='http://code.openark.org/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>Personally I use case insensitive collations more often (for user supplied data at least).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: shlomi</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2567</link>
		<dc:creator>shlomi</dc:creator>
		<pubDate>Wed, 08 Jul 2009 10:34:23 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2567</guid>
		<description>Mchl,

Just as another example, we can define a VARCHAR, utf8 column on a MEMORY table.
I wasn&#039;t asking for fixed width - but MySQL/MEMORY made it so.

Regards</description>
		<content:encoded><![CDATA[<p>Mchl,</p>
<p>Just as another example, we can define a VARCHAR, utf8 column on a MEMORY table.<br />
I wasn&#8217;t asking for fixed width &#8211; but MySQL/MEMORY made it so.</p>
<p>Regards</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: shlomi</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2565</link>
		<dc:creator>shlomi</dc:creator>
		<pubDate>Wed, 08 Jul 2009 10:09:59 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2565</guid>
		<description>hartmut,

Thanks, I think we both agree here. 
I saw need to mention that because the misconception that utf8 columns will always require only as much storage as needed - is widespread.
So the notion of &quot;you asked for a fixed size column&quot; is not clear to some.

I hope this clarifies.
Regards</description>
		<content:encoded><![CDATA[<p>hartmut,</p>
<p>Thanks, I think we both agree here.<br />
I saw need to mention that because the misconception that utf8 columns will always require only as much storage as needed &#8211; is widespread.<br />
So the notion of &#8220;you asked for a fixed size column&#8221; is not clear to some.</p>
<p>I hope this clarifies.<br />
Regards</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: hartmut</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2564</link>
		<dc:creator>hartmut</dc:creator>
		<pubDate>Wed, 08 Jul 2009 09:47:10 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2564</guid>
		<description>&gt; For example, if you have CHAR(10) CHARSET utf8, then each such value will take exactly 30 bytes, regardless of content

well, you asked for a fixed size column, so you got a fixed size column, and as it is fixed size it needs to be big enough to store 10 3 byte utf8 sequences up front</description>
		<content:encoded><![CDATA[<p>&gt; For example, if you have CHAR(10) CHARSET utf8, then each such value will take exactly 30 bytes, regardless of content</p>
<p>well, you asked for a fixed size column, so you got a fixed size column, and as it is fixed size it needs to be big enough to store 10 3 byte utf8 sequences up front</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: shlomi</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2563</link>
		<dc:creator>shlomi</dc:creator>
		<pubDate>Wed, 08 Jul 2009 08:38:44 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2563</guid>
		<description>Thanks for the correction; I&#039;ve updated the text.

I have the opinion that collations should be case sensitive by default; this makes for faster comparisons.

utf8 encodes ASCII as single character - true; by MySQL and its engines do not necessarily follow. For example, if you have CHAR(10) CHARSET utf8, then each such value will take exactly 30 bytes, regardless of content. See also: &lt;a href=&quot;http://code.openark.org/blog/mysql/mysqls-character-sets-and-collations-demystified&quot; rel=&quot;nofollow&quot;&gt;MySQL’s character sets and collations demystified&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>Thanks for the correction; I&#8217;ve updated the text.</p>
<p>I have the opinion that collations should be case sensitive by default; this makes for faster comparisons.</p>
<p>utf8 encodes ASCII as single character &#8211; true; by MySQL and its engines do not necessarily follow. For example, if you have CHAR(10) CHARSET utf8, then each such value will take exactly 30 bytes, regardless of content. See also: <a href="http://code.openark.org/blog/mysql/mysqls-character-sets-and-collations-demystified" rel="nofollow">MySQL’s character sets and collations demystified</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mchl</title>
		<link>http://code.openark.org/blog/mysql/character-sets-latin1-vs-ascii/comment-page-1#comment-2562</link>
		<dc:creator>Mchl</dc:creator>
		<pubDate>Wed, 08 Jul 2009 08:16:05 +0000</pubDate>
		<guid isPermaLink="false">http://code.openark.org/blog/?p=828#comment-2562</guid>
		<description>Latin1 covers Western European languages. Central Europe is covered by Latin2 CP. ;)

I agree though, utf8 should be introduced as a default encoding, and utf8_general_ci as default collation. AFAIK utf8 stores ASCII characters as single byte values.</description>
		<content:encoded><![CDATA[<p>Latin1 covers Western European languages. Central Europe is covered by Latin2 CP. <img src='http://code.openark.org/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>I agree though, utf8 should be introduced as a default encoding, and utf8_general_ci as default collation. AFAIK utf8 stores ASCII characters as single byte values.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
