NYCPHP Meetup

NYPHP.org

[nycphp-talk] question about utf-8 or unicode?

Joel De Gan joel at tagword.com
Sun Sep 5 17:46:13 EDT 2004


Well..
The main issue is something like this:
http://historyagent.com/index.php?str=Русскийязык
Which the browser translates into:
http://historyagent.com/index.php?str=%D0%A0%D1%83%D1%81%D1%81%D0%BA%D0%B8%D0%B9%D1%8F%D0%B7%D1%8B%D0%BA
Which if piped through:
html_entity_decode(rawurldecode ($desc)) 
works fine.

However,
when passed through javascript escape I get: 
http://historyagent.com/index.php?str=%u0420%u0443%u0441%u0441%u043A%u0438%u0439%20%u044F%u0437%u044B%u043A
Which is having issues being translated in php

I think I am going to remove the javascript escaping..
-joel

On Sun, 2004-09-05 at 21:16, Daniel Convissor wrote:
> Hey Joel:
> 
> URL encoding is based on RFC 1738, which says the encoding consists 
> "of the character "%" followed by the two hexadecimal digits" 
> representing the octet for a "character within the US-ASCII coded 
> character set."  So, Unicode seems out of the question.
> 
> See you,
> 
> --Dan
-- 
joeldg - developer, Intercosmos media group.
http://lucifer.intercosmos.net




More information about the talk mailing list