Hi Mika, Are both systems using the same OS version and the same version of Java?
Best regards, Terrance -- Web Applications Programmer Institute for Clean and Secure Energy University of Utah http://www.ices.utah.edu On Jun 15, 2009, at 2:01 AM, mikan.d.dspace listmail wrote: > Hi Terrance, > > I double-checked the indexes in configuration and they do match. What > I noticed though, is that the text extracted from pdf files differ, > which might be the cause of this problem. It seems that when > filter-media extracts the text on the other server, it messes up some > special characters, thus making them unsearchable. What might be > causing this? Both databases are set to UNICODE when created. Is > there some other system setting that might be causing this? > > Example of extracted text is below: > > Server 1: (correct encoding) > 3. PUNAISEN KIRJAN SISÄLTÖ > Jaettiin punaisen kirjan sisällön päivitystä varten vastuuhenkilöt > seuraavaksi: > 3.1 Yleisasu ja kirjan sisällön järjestys miettii ja tarkastelee > Tiina Sairanen > > Server 2: (Messed up characters) > > 3. PUNAISEN KIRJAN SIS?LT? > Jaettiin punaisen kirjan sis?ll?n p?ivityst? varten vastuuhenkil?t > seuraavaksi: > 3.1 Yleisasu ja kirjan sis?ll?n j?rjestys miettii ja tarkastelee > Tiina Sairanen > > > Thanks for any help, > Mika > > > 2009/6/12 Terrance Davis <[email protected]>: >> Hi Mika, >> My first guess is that your config files don't match. You might >> want to >> check the server that is returning 40 results. If the configured >> search >> indexes have any white space (such as a tab) after the properties, >> they >> might not be matching up with the dublin core and not indexing >> properly. >> No trim() is happening on the configured search index properties >> from the >> 1.5.2 dspace.cfg, so they may look the same, but be thrown off by >> extra >> unwanted white space. >> Best regards, >> Terrance Davis >> -- >> Web Applications Programmer >> Institute for Clean and Secure Energy >> University of Utah >> http://www.ices.utah.edu/ >> >> >> >> On Jun 12, 2009, at 5:24 AM, mikan.d.dspace listmail wrote: >> >> Im confused by the way DSpace search works. I cloned our Dspace 1.5.2 >> instance to another server. They both have the same config, same >> items >> etc. However when I run search I get different results?! With the >> same >> search term the other search shows 40 results and the other 72. I've >> forced reindexing and media-filters but nothing changes. What could >> be >> the cause of this? >> >> Thanks, >> Mika >> >> ------------------------------------------------------------------------------ >> Crystal Reports - New Free Runtime and 30 Day Trial >> Check out the new simplified licensing option that enables unlimited >> royalty-free distribution of the report engine for externally facing >> server and web deployment. >> http://p.sf.net/sfu/businessobjects >> _______________________________________________ >> DSpace-tech mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/dspace-tech >> >> ------------------------------------------------------------------------------ Crystal Reports - New Free Runtime and 30 Day Trial Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment. http://p.sf.net/sfu/businessobjects _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

