############################################################################### # # Name: IBM PC cp437_DOSLatinUS to Unicode table # Unicode version: 15.0.0 # Table version: 1.0 # Table format: Format A # Date: 2025-04-26 # Author: Michael Walden # # Copyright (c) 2025 Michael Walden. All Rights reserved. # # General notes: # # This table contains data on IBM PC code page 437 DOSLatinUS. # The characters in positions 01..1F and 7F are mostly only able to # be viewed when placed directly into IBM PC memory-mapped video # graphics adapter RAM. Printing the characters on an IBM PC compatible # PC will mostly yield their control code functions and not print a # character. # # I am naming this mapping table "IBM PC cp437_DOSLatinUS to Unicode table" # as opposed to the earlier "cp437_DOSLatinUS to Unicode table" (in CP437.TXT # cited below) to distinguish it as the one most accurately representing the # IBM PC's character set in Unicode. Some IBM PC descendant computers and other # hardware have incorrectly represented the IBM PC's character set, having some # errors. This mapping table would need to be updated to apply exactly to them. # # I feel that the following mapping tables have been obsoleted by this one and # that they should no longer be used in any situation attempting to emulate the # IBM PC's character set. # # # # # Format: Three tab-separated columns # Column #1 is the IBM PC cp437_DOSLatinUS code (in hex as XX) # Column #2 is the Unicode code point (in hex as XXXX) # Column #3 is a remark symbol '#' followed by the Unicode UTF-8 # character followed by a space and then the character name. # The names were derived from UnicodeData.txt at: # # On the right are labels "CORRECT" and "INCORRECT". I am # including incorrect mappings from other people's earlier # work for historical purposes. This matter was related to # Microsoft Windows Glyph List 4 (WGL4) which intended to # support Code page 437, but missed some CORRECT characters # and implemented some INCORRECT characters instead. # Fortunately, WGL4 is mostly a thing of the past at this point # and fonts today cover more than the WGL4 recommended. If you # wish to have an INCORRECT mapping character appear in the # mapping here, shift the "#" remark symbol, on the left, from # an INCORRECT line (enabling it) to a CORRECT line (disabling # it). The "WIDELY SUPPORTED" statements relate to the # character's availability in fonts at the time I originally # did my work. Currently these statements are mostly obsolete # since all of my CORRECT characters (except 1D161 𝅘𝅥𝅯 MUSICAL # SYMBOL SIXTEENTH NOTE) are supported. One lingering issue is # that some of the NOT WIDELY SUPPORTED characters are rendered # in a different size from most of the other characters in some # fonts. In the future I may update hex 0D to have Unicode # code point hex 1D161 as the enabled mapping when it is widely # supported. # # The entries are in IBM PC cp437_DOSLatinUS order. # # Version history: # 1.0 - First public release. # ############################################################################### # # Control Characters from ANSI X3.4-1977 - For reference only # # 00 0000 #Null # 01 0001 #Start of Heading (CC) # 02 0002 #Start of Text (CC) # 03 0003 #End of Text (CC) # 04 0004 #End of Transmission (CC) # 05 0005 #Enquiry (CC) # 06 0006 #Acknowledge (CC) # 07 0007 #Bell (Audible Signal) # 08 0008 #Backspace (FE) # 09 0009 #Horizontal Tabulation (FE) # 0A 000A #Line Feed (FE) # 0B 000B #Vertical Tabulation (FE) # 0C 000C #Form Feed (FE) # 0D 000D #Carriage Return (FE) # 0E 000E #Shift Out # 0F 000F #Shift In # 10 0010 #Data Link Escape (CC) # 11 0011 #Device Control 1 # 12 0012 #Device Control 2 # 13 0013 #Device Control 3 # 14 0014 #Device Control 4 # 15 0015 #Negative Acknowledge (CC) # 16 0016 #Synchronous Idle (CC) # 17 0017 #End of Transmission Block (CC) # 18 0018 #Cancel # 19 0019 #End of Medium # 1A 001A #Substitute # 1B 001B #Escape # 1C 001C #File Separator (IS) # 1D 001D #Group Separator (IS) # 1E 001E #Record Separator (IS) # 1F 001F #Unit Separator (IS) # # 7F 007F #Delete # # (CC) Communication Control; (FE) Format Effector; (IS) Information Separator # ############################################################################### # #------------------------------------------------------------------------------ 00 2007 #  FIGURE SPACE CORRECT #00 0000 # = NULL INCORRECT #------------------------------------------------------------------------------ 01 263A #☺ WHITE SMILING FACE 02 263B #☻ BLACK SMILING FACE 03 2665 #♥ BLACK HEART SUIT 04 2666 #♦ BLACK DIAMOND SUIT 05 2663 #♣ BLACK CLUB SUIT 06 2660 #♠ BLACK SPADE SUIT 07 2022 #• BULLET 08 25D8 #◘ INVERSE BULLET 09 25CB #○ WHITE CIRCLE 0A 25D9 #◙ INVERSE WHITE CIRCLE 0B 2642 #♂ MALE SIGN 0C 2640 #♀ FEMALE SIGN #------------------------------------------------------------------------------ #0D 1D161 #𝅘𝅥𝅯 MUSICAL SYMBOL SIXTEENTH NOTE CORRECT - BUT TOO NEW FOR SUPPORT IN APPLICATIONS AND FONTS 0D 266A #♪ EIGHTH NOTE INCORRECT - BUT WIDELY SUPPORTED #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ 0E 266C #♬ BEAMED SIXTEENTH NOTES CORRECT #0E 266B #♫ BEAMED EIGHTH NOTES INCORRECT #------------------------------------------------------------------------------ 0F 263C #☼ WHITE SUN WITH RAYS #------------------------------------------------------------------------------ 10 25B6 #▶ BLACK RIGHT-POINTING TRIANGLE CORRECT - BUT NOT WIDELY SUPPORTED #10 25BA #► BLACK RIGHT-POINTING POINTER INCORRECT - BUT WIDELY SUPPORTED #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ 11 25C0 #◀ BLACK LEFT-POINTING TRIANGLE CORRECT - BUT NOT WIDELY SUPPORTED #11 25C4 #◄ BLACK LEFT-POINTING POINTER INCORRECT - BUT WIDELY SUPPORTED #------------------------------------------------------------------------------ 12 2195 #↕ UP DOWN ARROW 13 203C #‼ DOUBLE EXCLAMATION MARK 14 00B6 #¶ PILCROW SIGN 15 00A7 #§ SECTION SIGN 16 25AC #▬ BLACK RECTANGLE 17 21A8 #↨ UP DOWN ARROW WITH BASE 18 2191 #↑ UPWARDS ARROW 19 2193 #↓ DOWNWARDS ARROW 1A 2192 #→ RIGHTWARDS ARROW 1B 2190 #← LEFTWARDS ARROW #------------------------------------------------------------------------------ 1C 2319 #⌙ TURNED NOT SIGN CORRECT #1C 221F #∟ RIGHT ANGLE INCORRECT #------------------------------------------------------------------------------ 1D 2194 #↔ LEFT RIGHT ARROW 1E 25B2 #▲ BLACK UP-POINTING TRIANGLE 1F 25BC #▼ BLACK DOWN-POINTING TRIANGLE 20 0020 # SPACE 21 0021 #! EXCLAMATION MARK 22 0022 #" QUOTATION MARK 23 0023 ## NUMBER SIGN 24 0024 #$ DOLLAR SIGN 25 0025 #% PERCENT SIGN 26 0026 #& AMPERSAND #------------------------------------------------------------------------------ 27 2019 #’ RIGHT SINGLE QUOTATION MARK CORRECT #27 0027 #' APOSTROPHE INCORRECT #------------------------------------------------------------------------------ 28 0028 #( LEFT PARENTHESIS 29 0029 #) RIGHT PARENTHESIS 2A 002A #* ASTERISK 2B 002B #+ PLUS SIGN 2C 002C #, COMMA 2D 002D #- HYPHEN-MINUS 2E 002E #. FULL STOP 2F 002F #/ SOLIDUS 30 0030 #0 DIGIT ZERO 31 0031 #1 DIGIT ONE 32 0032 #2 DIGIT TWO 33 0033 #3 DIGIT THREE 34 0034 #4 DIGIT FOUR 35 0035 #5 DIGIT FIVE 36 0036 #6 DIGIT SIX 37 0037 #7 DIGIT SEVEN 38 0038 #8 DIGIT EIGHT 39 0039 #9 DIGIT NINE 3A 003A #: COLON 3B 003B #; SEMICOLON 3C 003C #< LESS-THAN SIGN 3D 003D #= EQUALS SIGN 3E 003E #> GREATER-THAN SIGN 3F 003F #? QUESTION MARK 40 0040 #@ COMMERCIAL AT 41 0041 #A LATIN CAPITAL LETTER A 42 0042 #B LATIN CAPITAL LETTER B 43 0043 #C LATIN CAPITAL LETTER C 44 0044 #D LATIN CAPITAL LETTER D 45 0045 #E LATIN CAPITAL LETTER E 46 0046 #F LATIN CAPITAL LETTER F 47 0047 #G LATIN CAPITAL LETTER G 48 0048 #H LATIN CAPITAL LETTER H 49 0049 #I LATIN CAPITAL LETTER I 4A 004A #J LATIN CAPITAL LETTER J 4B 004B #K LATIN CAPITAL LETTER K 4C 004C #L LATIN CAPITAL LETTER L 4D 004D #M LATIN CAPITAL LETTER M 4E 004E #N LATIN CAPITAL LETTER N 4F 004F #O LATIN CAPITAL LETTER O 50 0050 #P LATIN CAPITAL LETTER P 51 0051 #Q LATIN CAPITAL LETTER Q 52 0052 #R LATIN CAPITAL LETTER R 53 0053 #S LATIN CAPITAL LETTER S 54 0054 #T LATIN CAPITAL LETTER T 55 0055 #U LATIN CAPITAL LETTER U 56 0056 #V LATIN CAPITAL LETTER V 57 0057 #W LATIN CAPITAL LETTER W 58 0058 #X LATIN CAPITAL LETTER X 59 0059 #Y LATIN CAPITAL LETTER Y 5A 005A #Z LATIN CAPITAL LETTER Z 5B 005B #[ LEFT SQUARE BRACKET 5C 005C #\ REVERSE SOLIDUS 5D 005D #] RIGHT SQUARE BRACKET 5E 005E #^ CIRCUMFLEX ACCENT 5F 005F #_ LOW LINE 60 0060 #` GRAVE ACCENT 61 0061 #a LATIN SMALL LETTER A 62 0062 #b LATIN SMALL LETTER B 63 0063 #c LATIN SMALL LETTER C 64 0064 #d LATIN SMALL LETTER D 65 0065 #e LATIN SMALL LETTER E 66 0066 #f LATIN SMALL LETTER F 67 0067 #g LATIN SMALL LETTER G 68 0068 #h LATIN SMALL LETTER H 69 0069 #i LATIN SMALL LETTER I 6A 006A #j LATIN SMALL LETTER J 6B 006B #k LATIN SMALL LETTER K 6C 006C #l LATIN SMALL LETTER L 6D 006D #m LATIN SMALL LETTER M 6E 006E #n LATIN SMALL LETTER N 6F 006F #o LATIN SMALL LETTER O 70 0070 #p LATIN SMALL LETTER P 71 0071 #q LATIN SMALL LETTER Q 72 0072 #r LATIN SMALL LETTER R 73 0073 #s LATIN SMALL LETTER S 74 0074 #t LATIN SMALL LETTER T 75 0075 #u LATIN SMALL LETTER U 76 0076 #v LATIN SMALL LETTER V 77 0077 #w LATIN SMALL LETTER W 78 0078 #x LATIN SMALL LETTER X 79 0079 #y LATIN SMALL LETTER Y 7A 007A #z LATIN SMALL LETTER Z 7B 007B #{ LEFT CURLY BRACKET #------------------------------------------------------------------------------ 7C 00A6 #¦ BROKEN BAR CORRECT #7C 007C #| VERTICAL LINE INCORRECT #------------------------------------------------------------------------------ 7D 007D #} RIGHT CURLY BRACKET 7E 007E #~ TILDE #------------------------------------------------------------------------------ 7F 2302 #⌂ HOUSE CORRECT #7F 0394 #Δ GREEK CAPITAL LETTER DELTA INCORRECT #------------------------------------------------------------------------------ 80 00C7 #Ç LATIN CAPITAL LETTER C WITH CEDILLA 81 00FC #ü LATIN SMALL LETTER U WITH DIAERESIS 82 00E9 #é LATIN SMALL LETTER E WITH ACUTE 83 00E2 #â LATIN SMALL LETTER A WITH CIRCUMFLEX 84 00E4 #ä LATIN SMALL LETTER A WITH DIAERESIS 85 00E0 #à LATIN SMALL LETTER A WITH GRAVE 86 00E5 #å LATIN SMALL LETTER A WITH RING ABOVE 87 00E7 #ç LATIN SMALL LETTER C WITH CEDILLA 88 00EA #ê LATIN SMALL LETTER E WITH CIRCUMFLEX 89 00EB #ë LATIN SMALL LETTER E WITH DIAERESIS 8A 00E8 #è LATIN SMALL LETTER E WITH GRAVE 8B 00EF #ï LATIN SMALL LETTER I WITH DIAERESIS 8C 00EE #î LATIN SMALL LETTER I WITH CIRCUMFLEX 8D 00EC #ì LATIN SMALL LETTER I WITH GRAVE 8E 00C4 #Ä LATIN CAPITAL LETTER A WITH DIAERESIS 8F 00C5 #Å LATIN CAPITAL LETTER A WITH RING ABOVE 90 00C9 #É LATIN CAPITAL LETTER E WITH ACUTE 91 00E6 #æ LATIN SMALL LETTER AE 92 00C6 #Æ LATIN CAPITAL LETTER AE 93 00F4 #ô LATIN SMALL LETTER O WITH CIRCUMFLEX 94 00F6 #ö LATIN SMALL LETTER O WITH DIAERESIS 95 00F2 #ò LATIN SMALL LETTER O WITH GRAVE 96 00FB #û LATIN SMALL LETTER U WITH CIRCUMFLEX 97 00F9 #ù LATIN SMALL LETTER U WITH GRAVE 98 00FF #ÿ LATIN SMALL LETTER Y WITH DIAERESIS 99 00D6 #Ö LATIN CAPITAL LETTER O WITH DIAERESIS 9A 00DC #Ü LATIN CAPITAL LETTER U WITH DIAERESIS 9B 00A2 #¢ CENT SIGN 9C 00A3 #£ POUND SIGN 9D 00A5 #¥ YEN SIGN 9E 20A7 #₧ PESETA SIGN 9F 0192 #ƒ LATIN SMALL LETTER F WITH HOOK A0 00E1 #á LATIN SMALL LETTER A WITH ACUTE A1 00ED #í LATIN SMALL LETTER I WITH ACUTE A2 00F3 #ó LATIN SMALL LETTER O WITH ACUTE A3 00FA #ú LATIN SMALL LETTER U WITH ACUTE A4 00F1 #ñ LATIN SMALL LETTER N WITH TILDE A5 00D1 #Ñ LATIN CAPITAL LETTER N WITH TILDE A6 00AA #ª FEMININE ORDINAL INDICATOR A7 00BA #º MASCULINE ORDINAL INDICATOR A8 00BF #¿ INVERTED QUESTION MARK A9 2310 #⌐ REVERSED NOT SIGN AA 00AC #¬ NOT SIGN AB 00BD #½ VULGAR FRACTION ONE HALF AC 00BC #¼ VULGAR FRACTION ONE QUARTER AD 00A1 #¡ INVERTED EXCLAMATION MARK AE 00AB #« LEFT-POINTING DOUBLE ANGLE QUOTATION MARK AF 00BB #» RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK B0 2591 #░ LIGHT SHADE B1 2592 #▒ MEDIUM SHADE B2 2593 #▓ DARK SHADE B3 2502 #│ BOX DRAWINGS LIGHT VERTICAL B4 2524 #┤ BOX DRAWINGS LIGHT VERTICAL AND LEFT B5 2561 #╡ BOX DRAWINGS VERTICAL SINGLE AND LEFT DOUBLE B6 2562 #╢ BOX DRAWINGS VERTICAL DOUBLE AND LEFT SINGLE B7 2556 #╖ BOX DRAWINGS DOWN DOUBLE AND LEFT SINGLE B8 2555 #╕ BOX DRAWINGS DOWN SINGLE AND LEFT DOUBLE B9 2563 #╣ BOX DRAWINGS DOUBLE VERTICAL AND LEFT BA 2551 #║ BOX DRAWINGS DOUBLE VERTICAL BB 2557 #╗ BOX DRAWINGS DOUBLE DOWN AND LEFT BC 255D #╝ BOX DRAWINGS DOUBLE UP AND LEFT BD 255C #╜ BOX DRAWINGS UP DOUBLE AND LEFT SINGLE BE 255B #╛ BOX DRAWINGS UP SINGLE AND LEFT DOUBLE BF 2510 #┐ BOX DRAWINGS LIGHT DOWN AND LEFT C0 2514 #└ BOX DRAWINGS LIGHT UP AND RIGHT C1 2534 #┴ BOX DRAWINGS LIGHT UP AND HORIZONTAL C2 252C #┬ BOX DRAWINGS LIGHT DOWN AND HORIZONTAL C3 251C #├ BOX DRAWINGS LIGHT VERTICAL AND RIGHT C4 2500 #─ BOX DRAWINGS LIGHT HORIZONTAL C5 253C #┼ BOX DRAWINGS LIGHT VERTICAL AND HORIZONTAL C6 255E #╞ BOX DRAWINGS VERTICAL SINGLE AND RIGHT DOUBLE C7 255F #╟ BOX DRAWINGS VERTICAL DOUBLE AND RIGHT SINGLE C8 255A #╚ BOX DRAWINGS DOUBLE UP AND RIGHT C9 2554 #╔ BOX DRAWINGS DOUBLE DOWN AND RIGHT CA 2569 #╩ BOX DRAWINGS DOUBLE UP AND HORIZONTAL CB 2566 #╦ BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL CC 2560 #╠ BOX DRAWINGS DOUBLE VERTICAL AND RIGHT CD 2550 #═ BOX DRAWINGS DOUBLE HORIZONTAL CE 256C #╬ BOX DRAWINGS DOUBLE VERTICAL AND HORIZONTAL CF 2567 #╧ BOX DRAWINGS UP SINGLE AND HORIZONTAL DOUBLE D0 2568 #╨ BOX DRAWINGS UP DOUBLE AND HORIZONTAL SINGLE D1 2564 #╤ BOX DRAWINGS DOWN SINGLE AND HORIZONTAL DOUBLE D2 2565 #╥ BOX DRAWINGS DOWN DOUBLE AND HORIZONTAL SINGLE D3 2559 #╙ BOX DRAWINGS UP DOUBLE AND RIGHT SINGLE D4 2558 #╘ BOX DRAWINGS UP SINGLE AND RIGHT DOUBLE D5 2552 #╒ BOX DRAWINGS DOWN SINGLE AND RIGHT DOUBLE D6 2553 #╓ BOX DRAWINGS DOWN DOUBLE AND RIGHT SINGLE D7 256B #╫ BOX DRAWINGS VERTICAL DOUBLE AND HORIZONTAL SINGLE D8 256A #╪ BOX DRAWINGS VERTICAL SINGLE AND HORIZONTAL DOUBLE D9 2518 #┘ BOX DRAWINGS LIGHT UP AND LEFT DA 250C #┌ BOX DRAWINGS LIGHT DOWN AND RIGHT DB 2588 #█ FULL BLOCK DC 2584 #▄ LOWER HALF BLOCK DD 258C #▌ LEFT HALF BLOCK DE 2590 #▐ RIGHT HALF BLOCK DF 2580 #▀ UPPER HALF BLOCK #------------------------------------------------------------------------------ E0 03B1 #α GREEK SMALL LETTER ALPHA CORRECT #E0 221D #∝ PROPORTIONAL TO INCORRECT #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ E1 03B2 #β GREEK SMALL LETTER BETA CORRECT #E1 00DF #ß LATIN SMALL LETTER SHARP S INCORRECT #------------------------------------------------------------------------------ E2 0393 #Γ GREEK CAPITAL LETTER GAMMA E3 03C0 #π GREEK SMALL LETTER PI E4 03A3 #Σ GREEK CAPITAL LETTER SIGMA E5 03C3 #σ GREEK SMALL LETTER SIGMA #------------------------------------------------------------------------------ E6 03BC #μ GREEK SMALL LETTER MU CORRECT #E6 00B5 #µ MICRO SIGN INCORRECT #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ E7 03B3 #γ GREEK SMALL LETTER GAMMA CORRECT #E7 03C4 #τ GREEK SMALL LETTER TAU INCORRECT #------------------------------------------------------------------------------ E8 03A6 #Φ GREEK CAPITAL LETTER PHI #------------------------------------------------------------------------------ E9 03B8 #θ GREEK SMALL LETTER THETA CORRECT #E9 0398 #Θ GREEK CAPITAL LETTER THETA INCORRECT #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ EA 03A9 #Ω GREEK CAPITAL LETTER OMEGA CORRECT #EA 2126 #Ω OHM SIGN INCORRECT #------------------------------------------------------------------------------ EB 03B4 #δ GREEK SMALL LETTER DELTA EC 221E #∞ INFINITY #------------------------------------------------------------------------------ ED 2205 #∅ EMPTY SET CORRECT - BUT NOT WIDELY SUPPORTED #ED 03C6 #φ GREEK SMALL LETTER PHI INCORRECT - BUT WIDELY SUPPORTED #ED 00F8 #ø LATIN SMALL LETTER O WITH STROKE INCORRECT - BUT WIDELY SUPPORTED #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ EE 2208 #∈ ELEMENT OF CORRECT - BUT NOT WIDELY SUPPORTED #EE 03B5 #ε GREEK SMALL LETTER EPSILON INCORRECT - BUT WIDELY SUPPORTED #EE 0404 #Є CYRILLIC CAPITAL LETTER UKRAINIAN IE INCORRECT - BUT WIDELY SUPPORTED #------------------------------------------------------------------------------ EF 2229 #∩ INTERSECTION F0 2261 #≡ IDENTICAL TO F1 00B1 #± PLUS-MINUS SIGN F2 2265 #≥ GREATER-THAN OR EQUAL TO F3 2264 #≤ LESS-THAN OR EQUAL TO F4 2320 #⌠ TOP HALF INTEGRAL F5 2321 #⌡ BOTTOM HALF INTEGRAL F6 00F7 #÷ DIVISION SIGN F7 2248 #≈ ALMOST EQUAL TO F8 00B0 #° DEGREE SIGN #------------------------------------------------------------------------------ F9 2219 #∙ BULLET OPERATOR CORRECT #F9 25AA #▪ BLACK SMALL SQUARE INCORRECT - BUT LOOKS BETTER IN SOME FONTS #------------------------------------------------------------------------------ #------------------------------------------------------------------------------ FA 22C5 #⋅ DOT OPERATOR CORRECT #FA 00B7 #· MIDDLE DOT INCORRECT #------------------------------------------------------------------------------ FB 221A #√ SQUARE ROOT FC 207F #ⁿ SUPERSCRIPT LATIN SMALL LETTER N FD 00B2 #² SUPERSCRIPT TWO #------------------------------------------------------------------------------ FE 220E #∎ END OF PROOF CORRECT #FE 25A0 #■ BLACK SQUARE INCORRECT #------------------------------------------------------------------------------ FF 00A0 # NO-BREAK SPACE # ############################################################################### # ## END OF FILE ##