Re: Inverse English and digits in Arabic Text

2020-09-08 Thread adeq8

Thank you for the support.

I upload the PDF file page by page. In this case the left-to-right (LTR) or
right-to-left (RTL) reading direction applies to the whole document, not to
each specific text block (separately for Arabic, separately for English).
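To illustrate what per-block direction handling could look like, here is a minimal sketch in Python that splits a string into directional runs using the Unicode bidirectional class of each character. It is not what Solr or the PDF extractor does internally; the function names `direction` and `split_runs` are hypothetical, and digits and punctuation are simply treated as neutral so they stay with the surrounding run.

```python
import unicodedata

def direction(ch):
    """Map a character to 'rtl', 'ltr', or 'neutral' via its Unicode bidi class."""
    bidi = unicodedata.bidirectional(ch)
    if bidi in ("R", "AL"):      # Hebrew / Arabic letters
        return "rtl"
    if bidi == "L":              # Latin and other left-to-right letters
        return "ltr"
    return "neutral"             # digits, spaces, punctuation (assumption: keep with current run)

def split_runs(text):
    """Split text into (direction, substring) runs; neutrals join the current run."""
    runs, buf, current = [], [], None
    for ch in text:
        d = direction(ch)
        # A strong character of the opposite direction closes the current run.
        if d != "neutral" and current is not None and d != current:
            runs.append((current, "".join(buf)))
            buf = []
        if d != "neutral":
            current = d
        buf.append(ch)
    if buf:
        runs.append((current or "neutral", "".join(buf)))
    return runs

# Arabic word (written with escapes to avoid editor bidi issues) followed by a URL.
sample = "\u0645\u0648\u0642\u0639 www.apache.org"
for run_direction, run_text in split_runs(sample):
    print(run_direction, repr(run_text))
```

With runs separated like this, each block can be handled in its own direction instead of applying one direction to the whole page.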

I can see the same behavior in the output of both the /select and the /browse
calls.
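One way to check whether the indexed text itself is reversed, rather than only its display, might be to query /select for the term in both its normal and its reversed form and compare the hits. The sketch below only builds the two request URLs. The base URL, the core name `collection1`, and the field name `content` are assumptions, and whether a phrase query matches depends on the analyzer configured for the field.

```python
from urllib.parse import urlencode

def build_select_url(base_url, core, field, term):
    """Build a /select URL that searches `field` for `term` as a phrase."""
    params = {
        "q": f'{field}:"{term}"',   # phrase query on the (hypothetical) text field
        "fl": f"id,{field}",        # return only the id and the stored text
        "wt": "json",
        "rows": 5,
    }
    return f"{base_url}/{core}/select?{urlencode(params)}"

def probe_urls(base_url, core, field, term):
    """Return (forward_url, reversed_url) for comparing hit counts by hand."""
    return (
        build_select_url(base_url, core, field, term),
        build_select_url(base_url, core, field, term[::-1]),
    )

# Hypothetical host, core, and field; adjust to the actual environment.
forward, backward = probe_urls(
    "http://localhost:8983/solr", "collection1", "content", "www.apache.org"
)
print(forward)
print(backward)
```

If only the reversed query returns the Arabic documents, that would suggest the text was already reversed before it reached the index.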

I am almost sure the problem arises during upload.
 

But adding this to the 
   and later to another analyzer does not change the 
result.




Inverse English and digits in Arabic Text

2020-09-07 Thread adeq8
Hi,

Could you please help me resolve an issue? I upload/index several documents in
English and Arabic to SOLR. In addition, I use a handler for the Arabic
language:
  
   
    
    
     
     
    
    

  
  
    
    
    
     
      
    
    

  

There are two environments:

Local machine:
    - SOLR version: 4.2
    - Windows version: 10

DEV env:
    - SOLR version: 4.1, as part of the Cloudera suite
    - Linux kernel version: 3.10.0-862

The issue appears when uploading documents:

Local machine:
    - Doc in English with English words only: ok (for example,
      "www.apache.org")
    - Doc in Arabic with some English words: ok (for example,
      "www.apache.org")

DEV env:
    - Doc in English with English words only: ok (for example,
      "www.apache.org")
    - Doc in Arabic with some English words: the English text is inverted
      (for example, "gro.echapa.www"), which makes keyword search impossible.
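As a possible stopgap until the root cause is found, the extracted text could be post-processed before indexing so that reversed Latin and digit runs are flipped back. The sketch below is only an illustration under assumptions: the function name `unreverse_latin_runs` is hypothetical, the regular expression covers only ASCII letters, digits, and a few joining characters, and it must be applied only to documents known to be affected, since it would reverse correct English text as well.

```python
import re

# ASCII letters/digits, optionally chained by common URL or number punctuation.
LATIN_RUN = re.compile(r"[A-Za-z0-9]+(?:[.\-_/:@][A-Za-z0-9]+)*")

def unreverse_latin_runs(text):
    """Reverse every Latin/digit run in place; Arabic text is left untouched."""
    return LATIN_RUN.sub(lambda m: m.group(0)[::-1], text)

# Simulate the DEV behavior: an Arabic word followed by a character-reversed URL.
arabic = "\u0645\u0648\u0642\u0639"
broken = arabic + " " + "www.apache.org"[::-1]
print(unreverse_latin_runs(broken))
```

Because the operation is its own inverse, running it twice restores the input, so it should be applied exactly once and only on the affected environment.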

Please advise whether this is fixable, and if so, how.

Thank you in advance!