Guest
Here's my setup:
- My Regional and Language options are set to Arabic (Saudi Arabia) on both
the Regional Options tab and the Advanced tab
- My keyboard is set to Arabic Saudi Arabia (MS-101)
- My browser encoding is set to UTF-8 (I've also tried all available Arabic
encodings on the browser)
- I'm testing with a web page that displays the Unicode code points for
characters that you enter:
http://people.w3.org/rishida/scripts/uniview/conversion
Arabic characters change shape depending on where they are positioned in a
word. When I type in the characters sin, ya, and dal (English keyboard
equivalent 'sd]') the characters change shape on the display as they are
supposed to. When I hit the button to show the Unicode and UTF-8 equivalents,
I expect to see the hexadecimal code points 'FEB3 FEF3 62F', which
correspond to the 'shaped' versions of the characters I typed in. Instead I
get the hexadecimal code points '633 64A 62F', which correspond to the
stand-alone versions of those same characters.
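To make the difference concrete, here is a minimal Python sketch of the two
sequences described above. It is only an illustration, not code from my Web
application: the helper name code_points and the variable names are made up
for this example, and the 'shaped' string is simply the sequence I expected
to see.

```python
import unicodedata

# What the browser actually reports: the base Arabic letters.
typed = "\u0633\u064A\u062F"    # sin, ya, dal

# What I expected to see: codes from the Arabic Presentation Forms-B block.
shaped = "\uFEB3\uFEF3\u062F"


def code_points(text):
    """Return the code points of text as space-separated hex (hypothetical helper)."""
    return " ".join(f"{ord(ch):X}" for ch in text)


print(code_points(typed))     # 633 64A 62F
print(code_points(shaped))    # FEB3 FEF3 62F

# UTF-8 bytes of the base letters, i.e. what a UTF-8 page would submit.
print(typed.encode("utf-8").hex(" ").upper())    # D8 B3 D9 8A D8 AF

# Unicode treats the presentation forms as compatibility variants of the
# base letters, so NFKC normalization folds them back to the base codes.
print(unicodedata.normalize("NFKC", shaped) == typed)    # True
```

The last line suggests that the two sequences carry the same letters and
differ only in whether the positional form is encoded in the code point.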
I'm using this Unicode Converter page to illustrate the problem my Web
application is showing. Both behave the same way as far as converting the
Arabic to Unicode. This is affecting data entry to a mainframe database as
the shaping information is getting lost in the transfer.
What do I need to do to be able to type in Arabic data to a browser window
and have the correct translation to Unicode?