Home / Pete's stuff / Not computer games / Browser symbol tests

Been Framed?

If this page is displayed in a frame, which should happen only when the Geocities/Yahoo server puts them there and not because some other site is displaying my content, please reload this page to bust out of them.


 

Page Purpose

This page was built to test several web browsers for proper display of some foreign-language, purely symbolic and punctuation (em dash, en dash, horizontal ellipsis) glyphs needed by my other web pages. I'm trying to see what other users would see, so I'm making only the changes available to an HTML editor, i.e. no server directives (which I can't do on Geocities/Yahoo anyway, right?). The symbol/character/glyph/entity terminology I'm using is probably wrong, but it's all rather confusing to somebody like me who's just trying to make things "look right."

Currently directly testing with Microsoft Internet Explorer 6.0 (6.0.2800.1106 & 1106CO) and Netscape Communicator 7.01. Browsers running under Windows 2000 (Version 5.0, Build 2195: Service Pack 4) (IE 6.0.2800.1106CO), Windows XP Pro (Version 5.1 Build 2600) (IE 6.0.2800.1106) and Windows ME (Netscape 7.01 and IE 6.0). Language preferences set to en-us, en, en-gb, then fr. No special internationalization done for browsers, e.g. loading fonts or setting different Windows input locales. Attempting to use only Symbol and Arial fonts by name. Base language of en-US set as HTML attribute. Character set is specified with XML and META elements. Note: the user's switching to another one in IE 4.0 or Netscape 4.72 (approximate earliest version) doesn't work if one is set in the HTML. If there was any significant difference in appearance between default character set (ISO-8859-1) and UTF-8, it is noted below.

Some of the entity references (things like &agr;) are supposedly SGML and may not work in HMTL. These were found on technical publishing sites along with references to ISO-8879. But they're worth a shot…

Font forcing is bad. Font forcing is evil. But I did it here, because when I first started doing this in 2001, it was sometimes the most universal solution available. "Symbol" should be available on every non-Greek computer that has a reasonable hope of rendering a lowercase alpha. "Arial" on the other hand, is just there to be annoying, and it's one of the few short font names that I know that should be on both PCs and Macs. Symbol is the one I'm really interested in. Therefore, when I'm specifically asking for the Arial font, the fact that what comes out is incongruous compared to the default font nearby will pass without comment. It would be noticeable if you were using font forcing within a word. I'll say it's not recommended for my intended use (properly showing occasional non-English glyphs in English text), especially since the entity reference usually works well.

The "done with" rows in the tables below contain some extra space characters that aren't in the implementation rows above them. These spaces are just there to make the contents of the table cells "line break" better. You may notice some strange situations though, such as a line break between "&" and "#x1234;" even though there is a convenient space just before the "&". Sigh… Also, the implementations are all in lowercase now as required by XHTML 1.0.

Windows has an Accessory called Character Map that allows you to directly insert strange characters into documents, or tells you the keystrokes so you can do it yourself. Unfortunately, using this for some characters doesn't work too well, especially if the symbol isn't supported in other fonts, and it will break a SGML-based validator. So, I'm not using Character Map at all, in favor of the HTML named and numbered entities. The Character Map stuff also often breaks under the UTF-8 character set.

Updates

Useful sites elsewhere: [External Links following]

 

Switching Character Sets

The recommended thing to do with "web pages" is to set a character set in a META element near the top of the HEAD section. In fact, that declaration should appear before even the TITLE tag, in case there's something strange in the TITLE text. However, there are 2 problems with doing that:

  1. Some web browsers don't actually change the character set if it's set in the HTML element, then the user sets it to something different. Apparently, the META setting overwrites what the browser is doing, and nothing changes on the screen. Problem seen with Netscape 4.72 and IE 4, probably IE 5 as well.
  2. If the character set in the META element is UTF-8, IE 4 will crash when you try to invoke an internal link on the same page, e.g. <a href="#on_this_page">jump to somewhere else on this page</a>. This may be only when the page is local, i.e. on your computer's hard drive, but since people can save HTML source from websites rather easily, the problem can occur anywhere even if it doesn't on the web server.

My decision: Since IE 4 is obsolete, and I can't get away with not putting a character set on a page anymore (validation for XHTML 1.0 requires it), it's been ISO-8859-1 for me for quite a while. All my web pages that might have a problem with displaying strange symbols should link to this symbol test page. While I'll try to provide a reasonably complete discussion of the problem, I've had to pick my way of solving it already.

Interested readers (all of you who haven't already left, I hope) can still try to change their "character encoding" or "font" to see if it makes a difference. In Netscape and Internet Explorer, the Character Encoding/Character Set/Font setting is in the View menu. You should be able to leave that setting where it is throughout my site, but I'll warn you that you may see some strange behavior regardless.
First, bad old Netscape 4.7 put boxes between each character that isn't normal text size, e.g. in headings, while loading pages that are set to display in some other character set than the default, then they suddenly disappeared when the page was loaded. Visually annoying, but only cosmetic and temporary. You can get around this by setting your browser back to its original setting when you're done with my pages.
Second, you may get unpredictable things when you visit web pages elsewhere that have done bizarre things with specifying the character set used by that page. I apologize for the confusion this may cause, but here's hoping it's only temporary. When everybody's way of viewing web text content has proper internationalization (i18n)/Unicode/UTF-8/ISO-10646 support, all this will be a bad memory.

 

Foreign Glyph Tests

The following tests are for foreign language glyphs used on other web pages on this site. The languages include Greek, French, Spanish, Portuguese, romanized Japanese, and Polish.

lowercase alpha
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase alpha (Greek, APL) α &agr; &b.alpha; à a α ⍺ α α α α, ⍺ ⍺, 𝛂 𝛂 a *
α
*
done with: &alpha; &agr; &b.alpha; <span xml:lang="el" lang="el"> &#224; &#97; </span> &#945; &#9082; &#x03B1; &#x03b1; &#x3B1; &#x3b1;, &#x237A; &#x237a;, &#x1D6C2; &#x1d6c2; <font face="Symbol"> &#97; </font> *<PRE> &alpha; <PRE>*
IE 5 results: CORRECT SYMBOL DISPLAYED FOLLOWED BY &agr; &b.alpha; lowercase a grave, lowercase a DISPLAYED CORRECT SYMBOL DISPLAYED FOLLOWED BY BOX SYMBOL CORRECT SYMBOL DISPLAYED 4 TIMES, 2 BOX SYMBOLS DISPLAYED, 2 BOX SYMBOLS DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED BETWEEN BLANK LINES
IE 6 results: CORRECT SYMBOL DISPLAYED FOLLOWED BY &agr; &b.alpha; lowercase a grave, lowercase a DISPLAYED CORRECT SYMBOL DISPLAYED FOLLOWED BY TALL BOX SYMBOL CORRECT SYMBOL DISPLAYED 4 TIMES, 2 TALL BOX SYMBOLS DISPLAYED, 2 BOX SYMBOLS DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED BETWEEN BLANK LINES
Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED FOLLOWED BY &agr; &b.alpha; lowercase a grave, lowercase a DISPLAYED CORRECT SYMBOL DISPLAYED FOLLOWED BY ? CORRECT SYMBOL DISPLAYED 4 TIMES, ? ? DISPLAYED [, NOT TESTED] a DISPLAYED CORRECT SYMBOL DISPLAYED BETWEEN BLANK LINES

&agr; and &b.alpha; are SGML and probably won't work in HTML.
Switching to UTF-8 makes Netscape 4.7 marginally better. The "Unicode in decimal" column becomes "CORRECT SYMBOL FOLLOWED BY ? DISPLAYED", and the "Unicode in hexadecimal" column becomes "CORRECT SYMBOL DISPLAYED 4 TIMES, 2 BOX SYMBOLS DISPLAYED" (that one just like IE5!).
Sorry, Netscape 4.7 and other bad browser users, I had to code to the standard.
IE 6.0 now has two different box symbols that appear when it can't display a Unicode character. One is noticeably taller than another, so they're called TALL BOX SYMBOL and BOX SYMBOL in the table above. I have no idea if there's any real difference between them, or exactly what made them start showing up for me on 2005-10-25.
The PRE thing is there just as a curiosity; it was the ONLY way to get IE 4 to show the correct symbol, but it has too many disadvantages.

WHERE USED: (in all cases to refer to my twice-removed alternate history timeline)

CONCLUSION:
The named entity (for readability) or the #x3B1 numerical entity in hexadecimal (preferred) is the way to go. It even appears reasonable in the window title, when placed in the title attribute within the head section of a page.


lowercase a tilde
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase a tilde (Portuguese) ã ã ã ã ã ã ã ã
done with: &atilde; <span xml:lang="pt" lang="pt"> &#227;</span> &#227; &#x00E3; &#x00e3; &#xE3; &#xe3; <font face="Arial"> &#227; </font> a&#x303;
IE 5 & 6, Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED

There used to be a second "lowercase a tilde" character in the "language-declared" implementation, inserted from the Windows (95 and later) Character Map Accessory. It's Alt+0227 in most fonts. While this has been used by others, it is a BAD idea! First, it breaks badly under UTF-8. The tested browser's parsers usually took the inserted character plus the next two characters, and treated them together as a box symbol, or just rendered a box. Netscape 6 showed a "?". Yuck! Second, it breaks the W3C XHTML Validator, as the inserted character is invalid in the default character set, and causes the entire file to be rejected!
The "a&#x303;" construct is using the combining tilde accent from Unicode.
In Netscape 6 and 7, the combining accent isn't as low as it appears on the other representations.

WHERE USED:

CONCLUSION:
For lowercase a tilde, use the named entity. I really should check if it works properly inside a language declaration though…


lowercase e acute
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase e acute (French, Portuguese, the Pokemon game) é é é é é é é é é é é
done with: &eacute; <span xml:lang="fr" lang="fr"> &#233; &eacute; </span> <span xml:lang="pt" lang="pt">&#233; &eacute; </span> &#233; &#x00E9; &#x00e9; &#xE9; &#xe9; <font face="Arial"> &#233; </font> e&#x301;
IE 5 & 6, Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED

The "e&#x301;" construct is using the combining acute accent from Unicode.
There used to be a second "lowercase e acute" character in the "other" column, inserted from the Windows 95 Character Map Accessory (Alt+0233 in most fonts). While this has been used by others, it is a BAD idea! First, it breaks badly under UTF-8. The tested browser parsers usually took the inserted character plus the next two characters, and treated them together as a box symbol, or just rendered a box. Netscape 6 showed a "?". Yuck! Second, it breaks the W3C XHTML Validator, as the inserted character is invalid in the default character set, and causes the entire file to be rejected!
Switching to UTF-8 makes both IE 4 and 5 marginally (and identically) worse. The inserted (from the Character Map accessory) "lowercase e acute" character (Alt+0233 in most fonts) is displayed erroneously with the end of the TD closing element as a box symbol followed by "td>". Strangely, the table cell and row appear to complete rendering correctly regardless. IE 6 is even stranger; it shows a Chinese glyph instead of the box.
Switching to UTF-8 made Netscape 4.7 show a box symbol instead of the inserted character; Netscape 6 showed a "?".
In Netscape 6 and 7, the combining accent isn't as low as it appears on the other representations.
&Eacute; (uppercase e acute) should be supported just as well as the lowercase one; see below for that one.

WHERE USED:

CONCLUSION:
For lowercase e acute, use the named entity. Seems to works properly inside a language declaration too. The numerical one might save a byte or two.


lowercase i acute
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase i acute (Portuguese) í í í í í í í í í
done with: &iacute; <span xml:lang="pt" lang="pt"> &#237; &iacute; </span> &#237; &#x00ED; &#x00ed; &#xED; &#xed; <font face="Arial"> &#237; </font> i&#x301;
IE 5 & 6, Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED TWICE CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED TWICE

The "i&#x301;" construct is using the combining acute accent from Unicode.
There used to be a second "lowercase i acute" character in the "other" column, inserted from the Windows 95 Character Map Accessory (Alt+0237 in most fonts). While this has been used by others, it is a BAD idea! First, it breaks badly under UTF-8. The tested browser parsers usually took the inserted character plus the next two characters, and treated them together as a box symbol, or just rendered a box. Netscape 6 showed a "?". Yuck! Second, it breaks the W3C XHTML Validator, as the inserted character is invalid in the default character set, and causes the entire file to be rejected!
Switching to UTF-8 makes IE 4, 5 and 6 marginally (and identically) worse. The inserted (from the Character Map accessory) "lowercase i acute" character (Alt+0237 in most fonts) is displayed erroneously with the end of the TD closing element as a box symbol followed by "td>". Strangely, the table cell and row appear to complete rendering correctly regardless.
Switching to UTF-8 makes Netscape 4.7 show a box symbol instead of the inserted character; Netscape 6 shows a "?".
In Netscape 6 and 7, the combining accent isn't as low as it appears on the other representations, in fact it doesn't replace the dot over the "i".

WHERE USED:

CONCLUSION:
For lowercase i acute, use the named entity. Seems to work properly inside a language declaration too.


lowercase o macron
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase o macron (romanized Japanese) ō ⦵ ō ō ō ō ō ō ō
done with: &omacr; &ohbar; <span xml:lang="ja" lang="ja"> &#333; </span> &#333; &#x014D; &#x014d; &#x14D; &#x14d; <font face="Arial"> &#333; </font> o&#x304;
IE 5 & 6, Netscape 6 & 7 results: &omacr; &ohbar; DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED

&omacr; and &ohbar are SGML and probably won't work in HTML.
The "o&#x304;" construct is using the combining macron from Unicode.
In Netscape 6 and 7, the combining accent isn't as low as it appears on the other representations.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
For lowercase o macron, use &#x14D;, preferably with the entire word inside a Japanese language declaration. Declare UTF-8 character set for the page if possible.


lowercase u macron
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase u macron (romanized Japanese) ū ū ū ū ū ū ū ū
done with: &umacr; <span xml:lang="ja" lang="ja"> &#363; </span> &#363; &#x016B; &#x016b; &#x16B; &#x16b; <font face="Arial"> &#363; </font> u&#x304;
IE 5 & 6, Netscape 6 & 7 results: &umacr; DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED

&umacr; is SGML and probably won't work in HTML.
The "u&#x304;" construct is using the combining macron from Unicode.
In Netscape 6 and 7, the combining accent isn't as low as it appears on the other representations.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
For lowercase u macron, use &#x16B;, preferably with the entire word inside a Japanese language declaration. Declare UTF-8 character set for the page if possible.


lowercase s acute
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase s acute (Polish) ś ś ś ś ś ś ś ś
done with: &sacute; <span xml:lang="pl" lang="pl"> &#347; </span> &#347; &#x015B; &#x015b; &#x15B; &#x15b; <font face="Arial"> &#347; </font> s&#x301;
IE 5 & 6, Netscape 6 & 7 results: &sacute; DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED

&sacute; is SGML and probably won't work in HTML.
The "s&#x301;" construct is using the combining acute accent from Unicode.
In Netscape 6 and 7, the combining accent isn't as low as it appears on the other representations.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
For lowercase s acute, use &#x15B;, preferably with the entire word inside a Polish language declaration. Declare UTF-8 character set for the page if possible.


lowercase e ogonek
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase, lowercase w/ & w/o leading zeroes Font forcing (describe) other (describe)
lowercase e ogonek (Polish) (none) ę ę ę ę ę
done with:   <span xml:lang="pl" lang="pl"> &#281; </span> &#281; &#x0119; &#x119; <font face="Arial"> &#281; </font> e&#x328;
IE 5 & 6, Netscape 6 & 7 results:   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED TWICE CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED

This character isn't in ISO-8859-1, you have to use ISO-8859-2 to get it "natively".
The "e&#x328;" construct is using the combining ogonek from Unicode.
In Netscape 6 and 7, the combining accent isn't centered like it appears on the other representations.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
For lowercase e ogonek, use &#x119;, preferably with the entire word inside a Polish language declaration. Declare UTF-8 character set for the page if possible.


I'm using a few more foreign-language symbols too, but these are believed in common enough usage elsewhere that they aren't tested in the exhaustive table form as used above.

WHAT & WHERE USED:


Symbol Tests

The following tests are for purely symbolic glyphs used on other web pages on my site. These include a square, triangle, circle, and arrows. There is no real "language" for them (those who say there's MathML can send me working examples). The 4 arrows are to indicate game controller directions, but are using formal mathematical symbols. The arrows, and the open circle, triangle, square and san-serif X, are all useful for PlayStation games. Several others symbols are only tested in a "short" form.

left arrow symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal Font forcing (describe) other (describe)
left arrow symbol (none) ¬ (none)
done with: &larr;   &#8592; &#x2190; <font face="Symbol"> &#172; </font>  
IE 5 & 6 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED  
Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED WRONG SYMBOL DISPLAYED  

Font forcing Symbol to the Unicode 8592 decimal is a BAD idea. In IE4 it produced an invisible character about 900 pixels wide!
Netscape 6 and 7 show something like an en dash with a short vertical bar dangling down from the right end. Whatever it is, it's not what I want.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
There's no universal solution, but the "standards" way to go is to use the named or numerical entity, not the Symbol font. Sorry, users of Netscape 4.7!


up arrow symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal Font forcing (describe) other (describe)
up arrow symbol (none) ­ (none)
done with: &uarr;   &#8593; &#x2191; <font face="Symbol"> &#173; </font>  
IE 5 & 6, Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED NOTHING DISPLAYED  

Font forcing Symbol to the Unicode 8593 decimal is a BAD idea. In IE4 it produced an invisible character about 900 pixels wide!
IE 5 & 6 and Netscape 6 & 7 all display nothing for the Symbol font. Not even a blank space, just nothing. Consistent but strange.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
There's no universal solution, but the "standards" way to go is to use the named or numerical entity, not the Symbol font. Sorry, users of Netscape 4.7!


right arrow symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal Font forcing (describe) other (describe)
right arrow symbol (none) ® (none)
done with: &rarr;   &#8594; &#x2192; <font face="Symbol"> &#174; </font>  
IE 5 & 6 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED  
Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED WRONG SYMBOL DISPLAYED  

Font forcing Symbol to the Unicode 8594 decimal is a BAD idea. In IE4 it produced an invisible character about 900 pixels wide!
Netscape 6 and 7 show the "R in a circle" (Registered symbol) for the Symbol font.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
There's no universal solution, but the "standards" way to go is to use the named or numerical entity, not the Symbol font. Sorry, users of Netscape 4.7!


down arrow symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal Font forcing (describe) other (describe)
down arrow symbol (none) ¯ (none)
done with: &darr;   &#8595; &#x2193; <font face="Symbol"> &#175; </font>  
IE 5 & 6 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED  
Netscape 6 & 7 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED WRONG SYMBOL DISPLAYED  

Font forcing Symbol to the Unicode 8595 decimal is a BAD idea. In IE4 it produced an invisible character about 900 pixels wide!
Netscape 6 and 7 show something like an en dash but very high in the dot matrix, not quite at the top. Whatever it is, it's not what I want.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
There's no universal solution, but the "standards" way to go is to use the named or numerical entity, not the Symbol font. Sorry, users of Netscape 4.7!


open point-up triangle symbol
description named entity (&thing;) language-declared character (<span xml:lang="xx" lang="xx">) Unicode in decimal Unicode in hexadecimal uppercase & lowercase Font forcing (describe) other (describe)
open point-up triangle symbol ▵ Δ &b.Delta; Δ Δ Δ △ △ D (none)
done with: &utri; &Delta; &b.Delta; <span xml:lang="el" lang="el"> &#916; </span> &#9651; &#x0394; &#x394; &#x25B3; &#x25b3; <font face="Symbol"> D </font>  
IE 6 under Windows XP Pro results: &utri;, CORRECT SYMBOL, &b.Delta; DISPLAYED CORRECT SYMBOL DISPLAYED BOX SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED TWICE, BOX SYMBOL DISPLAYED TWICE CORRECT SYMBOL DISPLAYED  
IE 5 & 6 (except XP Pro) results: &utri;, CORRECT SYMBOL, &b.Delta; DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED  
Netscape 6 & 7 results: &utri;, CORRECT SYMBOL, &b.Delta; DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES "D" DISPLAYED  

&utri; and &b.Delta; are SGML and probably won't work in HTML.
Font forcing Symbol to the Unicode 9651 decimal isn't helpful. In IE4 it produced a "?".
IE 6 under Windows XP Pro behaves differently than in Windows 2000! The Unicode cases display a box instead of the correct symbol!
Switching to UTF-8 made Netscape 4.7 only marginally better, and it wasn't a useful solution.
The geometric symbol (equilateral triangle) looks more like what I want than the Greek letter (isoceles triangle).

WHERE USED:

CONCLUSION:
There's no universal solution, but the "standards" way to go is to use the named or numerical entity, not the Symbol font. Sorry, users of Netscape 4.7! I avoided this one by creating my own graphic image of the PlayStation button that I was really trying to represent: the PSX triangle button


open circle symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal uppercase & lowercase Font forcing (describe) other (describe)
open circle symbol (none) ○ ○ ◯ ◯ O (none)
done with: &cir;   &#9675; &#x25CB; &#x25cb; &#x25EF; &#x25ef; <font face="Symbol"> O </font>  
IE 6 under Windows XP Pro results: &cir; DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED TWICE, BOX SYMBOL DISPLAYED TWICE CORRECT SYMBOL DISPLAYED  
IE 5 & 6 (except XP Pro), Netscape 6 & 7 results: &cir; DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES CORRECT SYMBOL DISPLAYED  

&cir; is SGML and probably won't work in HTML.
If all else fails, an acceptable glyph for this is an uppercase letter o, if the font being used isn't script, ornate or decorative. Lowercase is too small for my intended use. The 25EF is a larger version of the 25CB circle symbol.
IE 6 under XP Pro behaves differently than in 2000! The 25EF cases display a box instead of the correct symbol!
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
Looks like the universal solution for the open circle symbol is the uppercase letter o, font forced to Symbol (Arial comes out too narrow to look like a circle). Sorry everybody! I avoided this one by creating my own graphic image of the PlayStation button that I was really trying to represent: the PSX O button


open square symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal uppercase & lowercase Font forcing (describe) other (describe)
open square symbol (none) □ □ (none) (none)
done with: &squ;   &#9633; &#x25A1; &#x25a1;    
IE 6 under Windows XP Pro results: &squ; DISPLAYED   BOX SYMBOL DISPLAYED BOX SYMBOL DISPLAYED TWICE    
IE 5 & 6 (except XP Pro), Netscape 6 & 7 results: &squ; DISPLAYED   CORRECT SYMBOL DISPLAYED CORRECT SYMBOL DISPLAYED TWICE    

&squ; is SGML and probably won't work in HTML.
The successful results are actually an assumption. It could be that we're getting the box symbol indicating "browser can't display this", but it looks like an open square. There's no font forcing for this one, as I can't find a square in either Symbol or Arial; the box is definitely a rectangle there in the Windows Character Map.
IE 6 under XP Pro behaves differently than in 2000! The 3 Unicode cases display a noticeably squashed almost-square.
Switching to UTF-8 made Netscape 4.7 significantly better, but it wasn't a universal solution.

WHERE USED:

CONCLUSION:
No universal solution found for the open square symbol. I avoided this one by creating my own graphic image of the PlayStation button that I was really trying to represent: the PSX square button


san-serif X symbol
description named entity (&thing;) language-declared character Unicode in decimal Unicode in hexadecimal uppercase & lowercase, w/ & w/o leading zeroes Font forcing (describe) other (describe)
san-serif X symbol × (none) × ╳ × × × ×, ╳ (none) (none)
done with: &times;   &#215; &#9587; &#x00D7; &#x00d7; &#xD7; &#xd7;, &#x2573;    
IE 5 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL, BOX DISPLAYED CORRECT SYMBOL DISPLAYED 4 TIMES, BOX ONCE    
Netscape 6 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED TWICE CORRECT SYMBOL DISPLAYED 4 TIMES, BOX ONCE    
IE 6 and Netscape 7 results: CORRECT SYMBOL DISPLAYED   CORRECT SYMBOL DISPLAYED TWICE CORRECT SYMBOL DISPLAYED 4 TIMES, CORRECT SYMBOL DISPLAYED    

An acceptable glyph for this is a multiplication sign. Although my intended use is a representation of the X button on the PlayStation, going to the X in a circle symbol in the Symbol font (Alt+0196) was unacceptable since the other 3 buttons would then be missing an enclosing circle. All the non-symbolic fonts that I checked had a multiplication sign at 215, so there's no need for font forcing here. Surprisingly, there was an improvement here for both Netscape 6 to 7 and IE 5 to 6.
Switching to UTF-8 made Netscape 4.7 only marginally better, and it wasn't a useful solution.

WHERE USED:

CONCLUSION:
Use the entity reference for this one. But I stayed consistent and avoided this one too by creating my own graphic image of the PlayStation button that I was really trying to represent: the PSX X button


I'm using a few more symbols too, but these are believed in common enough usage elsewhere that they aren't tested in the exhaustive table form as used above. Or maybe I got lazy.

WHAT & WHERE USED:


Punctuation Glyph Tests

The following tests are for punctuation glyphs used on this and other web pages on this site. They are all valid for English, and incorrect implementations shouldn't occur.

WHAT & WHERE USED:


Language Specifiers

The international standard for specifying language is ISO 639. There are apparently two acceptable versions of this, though. ISO 639 uses two English lowercase letters, and ISO 639-2 uses three English lowercase letters. I don't expect current browsers to understand the 3-letter ones yet. A second group of 2 letters can be appended to the ISO 639 specifier (with a separating "-") to specify a particular dialect or implementation, e.g. "en-US". The second two letters can be uppercase or lowercase, but uppercase seems to be preferred. So, here are some specifiers that I found useful:

I've been specifying "en-US" in the headers of my HTML files, then using language specifiers for particular areas of the document with <span xml:lang="xx" lang="xx">blah</span>.

In general, language specifiers don't seem to be helping much with displaying what I want. But at least they don't hurt. So I'll continue to insert them where appropriate.


Overall Evaluation

Unsurprisingly, the current IE 6 and Netscape 7 are best for this set of test cases. The older versions (IE 5.5, 5.0, 4.0 and Netscape 4.7) have various failings. The PC's OS is also a factor, e.g. Windows NT 4.0 had better Unicode support and font renderings than Windows 95b even though both were using IE 4.0. You should use the most recent version of your preferred free web browser software, in the hope that bug fixes and improving standards support will always enhance the experience.

I strongly recommend upgrading any Netscape 4.7 install if at all possible to the most current non-beta Netscape release (8.0.4 as of 2005-10-25), because of the numerous problems fixed compared to 4.7. Or Firefox if you would rather go open-source. I once had a note that said "Netscape (4.7) needs to have user set View/Character Set to UTF-8 to work with the &#bignum; syntax", which is pretty good advice if it helps even a little and doesn't hurt.

The PRE construct was a fluke that I discovered, but I can't see a good use for it. The <PRE> construct only displays the desired character if you don't set font-family: monospace for PRE using CSS; but at least one CSS expert recommends that you do. If you do, you get useless boxes instead. Therefore, this little fluke in the browsers is even less useful than previously believed.

MATH tags seemed to only cause validator errors, and not render as anything different.

Declaring the language didn't seem to help at all.


[to Home Page]
[top of this page]
[up: Not Computer Games]

Counter says: [a number only available as graphics] hits on this page since initialization.
By Peter Karsanow.
The Home page has overall site and copyright information.


Hosted by www.Geocities.ws

1