NYCPHP Meetup

NYPHP.org

[nycphp-talk] question about utf-8 or unicode?

Joel De Gan joel at tagword.com
Sun Sep 5 18:07:29 EDT 2004


In case you are interested in what this is being used for.
the fully multi language History Agent I am working on:
Here are some samples
http://historyagent.com/joeldg/

-joel

On Sun, 2004-09-05 at 21:46, Joel De Gan wrote:
> Well..
> The main issue is something like this:
> http://historyagent.com/index.php?str=Русскийязык
> Which the browser translates into:
> http://historyagent.com/index.php?str=%D0%A0%D1%83%D1%81%D1%81%D0%BA%D0%B8%D0%B9%D1%8F%D0%B7%D1%8B%D0%BA
> Which if piped through:
> html_entity_decode(rawurldecode ($desc)) 
> works fine.
> 
> However,
> when passed through javascript escape I get: 
> http://historyagent.com/index.php?str=%u0420%u0443%u0441%u0441%u043A%u0438%u0439%20%u044F%u0437%u044B%u043A
> Which is having issues being translated in php
> 
> I think I am going to remove the javascript escaping..
> -joel
> 
> On Sun, 2004-09-05 at 21:16, Daniel Convissor wrote:
> > Hey Joel:
> > 
> > URL encoding is based on RFC 1738, which says the encoding consists 
> > "of the character "%" followed by the two hexadecimal digits" 
> > representing the octet for a "character within the US-ASCII coded 
> > character set."  So, Unicode seems out of the question.
> > 
> > See you,
> > 
> > --Dan
-- 
joeldg - developer, Intercosmos media group.
http://lucifer.intercosmos.net




More information about the talk mailing list