Hi Mika,

Are both systems using the same OS version and the same version of Java?
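
If it helps, here is a minimal sketch for comparing the two machines. It prints the JVM settings worth checking on each server, then shows how text containing ä and ö turns into "?" when it passes through a charset that cannot represent those characters. The class name EncodingCheck and the roundTrip helper are made up for illustration. The idea that the extraction step falls back to the platform default charset is only an assumption and has not been confirmed for filter-media.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of DSpace.
public class EncodingCheck {

    // Encode the text to bytes in the given charset, then decode it again.
    // Characters the charset cannot represent are replaced by the charset's
    // replacement byte, which is '?' for US-ASCII.
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        // Run this on both servers and compare the values.
        System.out.println("java.version:    " + System.getProperty("java.version"));
        System.out.println("os.name:         " + System.getProperty("os.name"));
        System.out.println("os.version:      " + System.getProperty("os.version"));
        System.out.println("file.encoding:   " + System.getProperty("file.encoding"));
        System.out.println("default charset: " + Charset.defaultCharset());

        // "SISÄLTÖ", written with Unicode escapes so that the encoding of
        // this source file does not matter.
        String sample = "SIS\u00C4LT\u00D6";

        // UTF-8 can represent every character, so the text survives intact.
        System.out.println(roundTrip(sample, StandardCharsets.UTF_8).equals(sample)); // prints true

        // US-ASCII cannot represent the two umlauts.
        System.out.println(roundTrip(sample, StandardCharsets.US_ASCII)); // prints SIS?LT?
    }
}
```

If the default charset differs between the servers, starting the JVM with -Dfile.encoding=UTF-8 or setting a UTF-8 locale for the user that runs filter-media would be the first things I would try.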

Best regards,

Terrance

--
Web Applications Programmer
Institute for Clean and Secure Energy
University of Utah
http://www.ices.utah.edu


On Jun 15, 2009, at 2:01 AM, mikan.d.dspace listmail wrote:

> Hi Terrance,
>
> I double-checked the indexes in the configuration and they do match. What
> I noticed, though, is that the text extracted from PDF files differs,
> which might be the cause of this problem. It seems that when
> filter-media extracts the text on the other server, it messes up some
> special characters, thus making them unsearchable. What might be
> causing this? Both databases were set to UNICODE when created. Is
> there some other system setting that might be causing this?
>
> Example of extracted text is below:
>
> Server 1: (correct encoding)
> 3. PUNAISEN KIRJAN SISÄLTÖ
> Jaettiin punaisen kirjan sisällön päivitystä varten vastuuhenkilöt  
> seuraavaksi:
> 3.1 Yleisasu ja kirjan sisällön järjestys miettii ja tarkastelee  
> Tiina Sairanen
>
> Server 2: (Messed up characters)
>
> 3. PUNAISEN KIRJAN SIS?LT?
> Jaettiin punaisen kirjan sis?ll?n p?ivityst? varten vastuuhenkil?t  
> seuraavaksi:
> 3.1 Yleisasu ja kirjan sis?ll?n j?rjestys miettii ja tarkastelee  
> Tiina Sairanen
>
>
> Thanks for any help,
> Mika
>
>
> 2009/6/12 Terrance Davis <[email protected]>:
>> Hi Mika,
>> My first guess is that your config files don't match. You might want to
>> check the server that is returning 40 results. If the configured search
>> indexes have any white space (such as a tab) after the properties, they
>> might not be matching up with the Dublin Core fields and not indexing
>> properly. No trim() is happening on the configured search index properties
>> from the 1.5.2 dspace.cfg, so they may look the same, but be thrown off by
>> extra unwanted white space.
>> Best regards,
>> Terrance Davis
>> --
>> Web Applications Programmer
>> Institute for Clean and Secure Energy
>> University of Utah
>> http://www.ices.utah.edu/
>>
>>
>>
>> On Jun 12, 2009, at 5:24 AM, mikan.d.dspace listmail wrote:
>>
>> I'm confused by the way DSpace search works. I cloned our DSpace 1.5.2
>> instance to another server. They both have the same config, same items,
>> etc. However, when I run a search I get different results. With the same
>> search term, one server shows 40 results and the other 72. I've
>> forced reindexing and media-filters but nothing changes. What could be
>> the cause of this?
>>
>> Thanks,
>> Mika
>>
>> ------------------------------------------------------------------------------
>> Crystal Reports - New Free Runtime and 30 Day Trial
>> Check out the new simplified licensing option that enables unlimited
>> royalty-free distribution of the report engine for externally facing
>> server and web deployment.
>> http://p.sf.net/sfu/businessobjects
>> _______________________________________________
>> DSpace-tech mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>
>>

