[Omaha.pm] Fwd: the joys of encoding
Hmm... Sean: This looks like a 5.10.0 bug that people prepared patches for in 2008/2009, but I can't tell from that thread whether the patches were ever applied.
http://groups.google.com/group/perl.perl5.porters/browse_thread/thread/94914fe2d844f5ec/2860ab39fbae7c9c?lnk=raot&pli=1
You might check the little program George Greer posted on Apr 19 2009 and see if that fails on your perl but works on more recent ones. And try isolating your real-data test too...?
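One way to isolate the real-data case is a standalone script that rebuilds the string from the debugger session and makes the same call outside the application. What follows is a minimal sketch, not George Greer's program: the helper name try_encode is made up for illustration, and it assumes the assertion surfaces as a trappable die, which may not hold on every build.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# The string from the debugger session, rebuilt with escapes so the script
# does not depend on the encoding of this source file.
my $str = "<b>Ce forfait inclut:</b><br>Chambre Luxueuse, petit d\x{e9}jeuner "
        . "buffet par jour, par personne, sac \x{201C}Omni\x{201D} pour les "
        . "enfants, lait & biscuits (soir de l\x{2019}arriv\x{e9}e).";

# Hypothetical helper: load HTML::Entities at run time and trap any die,
# so a missing module or a failing call is reported instead of aborting.
sub try_encode {
    my ($text, $unsafe) = @_;
    my $out = eval {
        require HTML::Entities;
        HTML::Entities::encode_entities($text, $unsafe);
    };
    return defined $out ? (1, $out) : (0, $@);
}

# Same "unsafe" range as in the debugger session.
my ($ok, $result) = try_encode($str, "\200-\377");

binmode STDOUT, ':encoding(UTF-8)';
printf "perl %vd\n", $^V;
print "HTML::Entities ", HTML::Entities->VERSION, "\n" if $ok;
print $ok ? "ok: $result\n" : "FAILED: $result";
```

Running the same file under each perl (the 5.8 and 5.10.0 installs, say) and comparing the output should show whether the failure follows the interpreter or the data.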
HTH,
Jay Hannah
Software Architect
jhannah@mutationgrid.com | http://mutationgrid.com | 1-402-598-7782
> On Dec 28, 2010, at 4:54 PM, Sean Baker wrote:
>> This seems to work in Perl 5.8 and dies in Perl 5.10:
>>
>> DB<2> x encode_entities($str, "\200-\377")
>> Assertion rx->sublen >= (s - rx->subbeg) + i failed: file "regcomp.c", line 5109 at (eval 4652)[/usr/lib/perl5/site_perl/5.10.0/x86_64-linux-thread-multi/HTML/Entities.pm:456] line 1.
>> DB<3> x $str
>> 0 '<b>Ce forfait inclut:</b><br>Chambre Luxueuse, petit déjeuner buffet par jour, par personne, sac \x{201C}Omni\x{201D} pour les enfants, lait & biscuits (soir de l\x{2019}arrivée).'
>
> Is HTML::Entities current (3.68)?
>
> Can this go on Omaha.pm?
From: "Sean Baker"
Date: December 29, 2010 8:00:15 AM CST
To: "Jay Hannah" <jhannah@mutationgrid.com>
> Sure. Yes, it's on a current version. I'm going to go through production packages and compare against QA & dev and roll anything up or back that I need to.
>
> Module id = HTML::Entities
>     DESCRIPTION  Encode/decode HTML entities
>     CPAN_USERID  LWWWP (The libwww-perl mailing list <libwww@perl.org>)
>     CPAN_VERSION 3.68
>     CPAN_FILE    G/GA/GAAS/HTML-Parser-3.68.tar.gz
>     UPLOAD_DATE  2010-09-01
>     DSLIP_STATUS Rmpf? (released,mailing-list,perl,functions,)
>     MANPAGE      HTML::Entities - Encode or decode strings with HTML entities
>     INST_FILE    /usr/lib/perl5/site_perl/5.10.0/x86_64-linux-thread-multi/HTML/Entities.pm
>     INST_VERSION 3.68
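
For comparing production against QA and dev, a small report of module versions and file locations can be run on each box and the outputs diffed. This is only a sketch: the helper name module_report and the default module list are illustrative assumptions, not something from the thread.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: for each module name, report its version and the file
# it was loaded from, or note that it could not be loaded on this perl.
sub module_report {
    my @lines;
    for my $module (@_) {
        (my $file = "$module.pm") =~ s{::}{/}g;
        if (eval { require $file; 1 }) {
            my $version = eval { $module->VERSION };
            $version = 'unknown' unless defined $version;
            push @lines, "$module\t$version\t$INC{$file}";
        }
        else {
            push @lines, "$module\tnot loadable";
        }
    }
    return @lines;
}

# Module names may be given on the command line; the default list is an example.
my @modules = @ARGV ? @ARGV : qw(HTML::Entities HTML::Parser);

printf "perl\t%vd\t%s\n", $^V, $^X;
print "$_\n" for module_report(@modules);
```

Saving the output on each environment and diffing the files shows at a glance which boxes differ in perl or module version.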