[ic] Call for testers

David Christensen david at endpoint.com
Fri Mar 13 13:09:37 UTC 2009


On Mar 13, 2009, at 4:29 AM, Peter wrote:

>> and if it's enabled, see any invalid UTF-8 bytes converted to ?
>> characters. That's simple, nonfatal at runtime, and yet gently encourages
>> developers to get their sources in the proper UTF-8 encoding.
>
> I'm fine with that, and that was the original proposal.  One problem,
> though, is that while I thought that the Encode module could do that,
> apparently it can only barf when decoding unicode input, so we would
> have to find another way to find the invalid chars and change them over.


There is a third param to Encode::decode which specifies the behavior
on invalid input: IIRC it can die, warn, ignore or silently
substitute.  So I think this could be made to substitute the invalid
character marker without much problem.
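
A rough sketch of what I have in mind, not tested against Interchange
itself.  The helper name decode_lenient is made up for illustration.
It assumes, going by the Encode documentation, that a check value of
Encode::FB_DEFAULT substitutes U+FFFD for malformed input, whereas
Encode::FB_CROAK would die instead:

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: decode raw octets as UTF-8 and turn every
# invalid sequence into "?" rather than dying at runtime.
sub decode_lenient {
    my ($octets) = @_;

    # FB_DEFAULT asks Encode to substitute U+FFFD (the Unicode
    # replacement character) for anything it cannot decode.
    my $text = decode('UTF-8', $octets, Encode::FB_DEFAULT);

    # Map the replacement character to a plain "?".  Caveat: this also
    # rewrites any U+FFFD that was legitimately present in the source.
    $text =~ tr/\x{FFFD}/?/;

    return $text;
}

# "\xE9" is a Latin-1 e-acute, which is not valid UTF-8 on its own.
binmode(STDOUT, ':encoding(UTF-8)');
print decode_lenient("caf\xE9 ok"), "\n";
```

Valid UTF-8 passes through untouched, so only sources in the wrong
encoding would show the "?" marks.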

Regards,

David
--
David Christensen
End Point Corporation
david at endpoint.com
212-929-6923
http://www.endpoint.com/





