article

Action executed in 0.000

The Atilde Entity and Unicode

Apparently my RDF feed doesn't validate while my XHTML web pages do. Specifically i'm referring to the Paris Combo "Motifs" article and the Atilde entity, the capital A with a tilde over it. Why, and what should i do about it?

XHTML 1.1 includes XHTML Character Entities 1.0 (xhtml-text-1.mod, nb. no 11) which is originally part of Modularization of XHTML. XHTML Character Entities includes Latin 1 (xhtml-lat1.ent) for XHTML where the entity is defined.

RDF Site Summary (RSS) 1.0 is an application of W3C's Resource Description Framework (RDF), maintained by the RSS-DEV Working Group. It does not include this entity sett. On this page page, there's recommendation to insert a DOCTYPE to explicitly include it.

<?xml version="1.0"?>

<!DOCTYPE rdf:RDF [
<!ENTITY % HTMLlat1 PUBLIC
   "-//W3C//ENTITIES Latin 1 for XHTML//EN"
   "http://www.w3.org/TR/xhtml1/DTD/xhtml-lat1.ent">
%HTMLlat1;
]>

<rdf:RDF
 xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
 xmlns="http://purl.org/rss/1.0/"
 xmlns:dc="http://purl.org/dc/elements/1.1/"
 xmlns:ag="http://purl.org/rss/1.0/modules/aggregation/"
 xmlns:admin="http://webns.net/mvcb/"
 xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" 
>
...
<rdf:RDF>

Why don't i like this? There's no need to, or at least there should be no need to. As an author, i write &Atilde; because i have to. Eventually document (text) editors will become intelligent enough to understand the Unicode character set and the UTF-8 encoding of it. (On a side note, i prefer the UTF-16 or UCS-2 encoding.) Our documents, as we store them in files or databases, will not have &Atilde; but instead have the encoding of it, just like every other character in the document. There's nothing special about these letters. So the problem really lies with text editors.

Besides, there are too many glyphs too ASCII escape like this.

Browers already understand UTF-8 and if they have the satisfactory fonts, they will correctly render all international characters. The only characters that need escaping are <, &, and in CDATA sections > also.

Now i just need to figure out how to write Unicode with my favorite text editor, Kate.

Comments

VIM

tags:
parent post: The Atilde Entity and Unicode
notify me: yes

Kate

Kate natively supports UTF-8. All i need to do to "type in" international characters is open the KCharSelect application.
parent post: The Atilde Entity and Unicode
notify me: yes

Post a Comment

* indicates a required field
anonymous (If you want to identify yourself, please sign in first.)
required This field is required.

Max size is 2 MB, aspect ratio 3:4 width:height
required This field is required.
Please include a short description.
required This field is required.

480 characters remaining.
is public

(Use this field if you have to. 3000 characters remaining.)
1 dime, 1 nickel, 1 quarter + 100

Trackback URL

http://derocher.org/~brian//trackback.php?ParentId=1278

form