Hello, Have you check that your database connection uses uft8 ?
Thomas On Mon, Jun 30, 2008 at 10:09 AM, Olivier Revollat <[EMAIL PROTECTED]> wrote: > My problem: I can't find accentuated static content with with sfLucene !! > > My config : I use symfony 1.0.6 with sfLucenePlugin 0.1.6. > > > > frontend/config/settings.yml : > > "charset: utf-8" > > > > Search.yml (project level) : > > MyIndex: > > models: > > index: > > cultures: [en, fr] > > encoding: utf-8 > > analyzer: utf8 > > case_sensitive: off > > mb_string: on > > Search.yml (app level) : > > MyIndex: > > models: > > Actualites: > > route: common/actualite?actu=%Actualiteid% > > fields: > > Actualiteid: unindexed > > Textefr: > > boost: 1.5 > > type: text > > > > I use i18N with XLIFF files and for example I have this content: > > > > <trans-unit id="258"> > > <source>Detail_PresentationDeveloppement</source> > > <target><![CDATA[<p> blablabla est une société au service > blablabl</p>]]></target> > > </trans-unit> > > > > Notice the accents in 'société' … > > I build the index using this command: > > php symfony lucene-rebuild frontend > > > > And if I look into the index to find the string I see : > > > > <p> blablabla est une sociÃ(c)tÃ(c) au service blablabla </p> > > > > Now if I search for "société" in lucene module I got no results … and if I > search for "sociÃ(c)tÃ(c)" I got some results!!! > > And if I try to encode the index in utf-8, so I can see "société" instead of > "sociÃ(c)tÃ(c)" in notepad++ then, there is an error : > > Largest supported segment size (for 32-bit mode) is 2Gb > > Can anyone help me ? > > Thanks :) > > > > > > -- Thomas Rabaix Internet Consultant --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "symfony users" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/symfony-users?hl=en -~----------~----~----~----~------~----~------~--~---
