Fwd: Fwd: Data Impor Handlert

2020-12-22 Thread DINSD | SPAutores

Hi,

Trying do apply the procedures described in 
https://github.com/rohitbemax/dataimporthandler


However when apply "bin/solr start -c -Denable.packages=true",

i get the followning message at web interface :


Also my predefined /cores , /don't appear.

Has anyone been through this situation?

Best regards
Assinatura SPA
**
*Rui Pimentel*


**
*DINSD - Departamento de Informática / SPA Digital*
Av. Duque de Loulé, 31 - 1069-153 Lisboa  PORTUGAL
*T * (+ 351) 21 359 44 36 */* (+ 351) 21 359 44 00 *F* (+ 351) 21 353 02 57
 informat...@spautores.pt
www.SPAutores.pt
 
 


Please consider the environment before printing this email

Esta mensagem electrónica, incluindo qualquer dos seus anexos, contém 
informação PRIVADA, CONFIDENCIAL e de DIVULGAÇÃO PROIBIDA,e destina-se 
unicamente à pessoa e endereço electrónico acima indicados. Se não for o 
destinatário desta mensagem, agradecemos que a elimine e nos comunique 
de imediato através do telefone +351 21 359 44 00 ou por email para: 
ge...@spautores.pt 


This electronic mail transmission including any attachment hereof, 
contains information that is PRIVATE, CONFIDENTIAL and PROTECTED FROM 
DISCLOSURE, and it is only for the use of the person and the e-mail 
address above indicated. If you have received this electronic mail 
transmission in error, please destroy it and notify us immediately 
through the telephone number +351 21 359 44 00 or at the e-mail address: 
ge...@spautores.pt

Assinatura SPA


Re: Data Impor Handlert

2020-10-15 Thread DINSD | SPAutores

Hi Alexander,

Very useful the provided information.

Assinatura SPA Many thanks and best regards
**
*Rui Pimentel*


**
*DINSD - Departamento de Informática / SPA Digital*
Av. Duque de Loulé, 31 - 1069-153 Lisboa PORTUGAL
*T * (+ 351) 21 359 44 36 */* (+ 351) 21 359 44 00 *F* (+ 351) 21 353 02 57
 informat...@spautores.pt
www.SPAutores.pt
 
 


Please consider the environment before printing this email

Esta mensagem electrónica, incluindo qualquer dos seus anexos, contém 
informação PRIVADA, CONFIDENCIAL e de DIVULGAÇÃO PROIBIDA,e destina-se 
unicamente à pessoa e endereço electrónico acima indicados. Se não for o 
destinatário desta mensagem, agradecemos que a elimine e nos comunique 
de imediato através do telefone +351 21 359 44 00 ou por email para: 
ge...@spautores.pt 


This electronic mail transmission including any attachment hereof, 
contains information that is PRIVATE, CONFIDENTIAL and PROTECTED FROM 
DISCLOSURE, and it is only for the use of the person and the e-mail 
address above indicated. If you have received this electronic mail 
transmission in error, please destroy it and notify us immediately 
through the telephone number +351 21 359 44 00 or at the e-mail address: 
ge...@spautores.pt

Assinatura SPA
On 2020-10-15 15:33, Alexandre Rafalovitch wrote:
Solr now has package managers and DIH is one of the packages to 
reflect the fact that its development cycle is not locked to Solr's 
and to reduce core download. Tika may be heading the same way, as 
running Tika inside the Solr process could cause memory issues with 
complex PDFs.


In terms of other ways of pre-process and load data into Solr, there 
are things like:

1) Apache Camel https://camel.apache.org/
2) Apache NiFi https://nifi.apache.org/

Other commercial solutions also exist, such as StreamSets:
3) 
https://streamsets.com/documentation/datacollector/latest/help//datacollector/UserGuide/Destinations/Solr.html


And, of course, you can always roll your own with SolrJ.

Regards,
  Alex.



On Thu, 15 Oct 2020 at 10:08, DINSD | SPAutores 
 wrote:


Hi

Based on this document there are two ways to index document on the
Solr platform, https://lucidworks.com/post/indexing-with-solrj/

Quote:
"Two popular methods of indexing existing data are the Data Import
Handler (DIH) and Tika (Solr Cell)/ExtractingRequestHandler"

Now that DHI has been discontinued, only supported by a community
package, are there any other options?

Best regards
*Rui Pimentel*

**
*Rui Pimentel*


**
*DINSD - Departamento de Informática / SPA Digital*
Av. Duque de Loulé, 31 - 1069-153 Lisboa  PORTUGAL
*T * (+ 351) 21 359 44 36 */* (+ 351) 21 359 44 00 *F* (+ 351) 21
353 02 57
 informat...@spautores.pt

www.SPAutores.pt 




Please consider the environment before printing this email

Esta mensagem electrónica, incluindo qualquer dos seus anexos,
contém informação PRIVADA, CONFIDENCIAL e de DIVULGAÇÃO PROIBIDA,e
destina-se unicamente à pessoa e endereço electrónico acima
indicados. Se não for o destinatário desta mensagem, agradecemos
que a elimine e nos comunique de imediato através do telefone +351
21 359 44 00 ou por email para: ge...@spautores.pt


This electronic mail transmission including any attachment hereof,
contains information that is PRIVATE, CONFIDENTIAL and PROTECTED
FROM DISCLOSURE, and it is only for the use of the person and the
e-mail address above indicated. If you have received this
electronic mail transmission in error, please destroy it and
notify us immediately through the telephone number +351 21 359 44
00 or at the e-mail address: ge...@spautores.pt




Re: Data Impor Handlert

2020-10-15 Thread Alexandre Rafalovitch
Solr now has package managers and DIH is one of the packages to reflect the
fact that its development cycle is not locked to Solr's and to reduce core
download. Tika may be heading the same way, as running Tika inside the Solr
process could cause memory issues with complex PDFs.

In terms of other ways of pre-process and load data into Solr, there are
things like:
1) Apache Camel https://camel.apache.org/
2) Apache NiFi https://nifi.apache.org/

Other commercial solutions also exist, such as StreamSets:
3)
https://streamsets.com/documentation/datacollector/latest/help//datacollector/UserGuide/Destinations/Solr.html

And, of course, you can always roll your own with SolrJ.

Regards,
  Alex.



On Thu, 15 Oct 2020 at 10:08, DINSD | SPAutores
 wrote:

> Hi
>
> Based on this document there are two ways to index document on the Solr
> platform, https://lucidworks.com/post/indexing-with-solrj/
>
> Quote:
> "Two popular methods of indexing existing data are the Data Import Handler
> (DIH) and Tika (Solr Cell)/ExtractingRequestHandler"
>
> Now that DHI has been discontinued, only supported by a community package,
> are there any other options?
>
> Best regards
> *Rui Pimentel*
>
> *Rui Pimentel*
>
>
> * DINSD - Departamento de Informática / SPA Digital*
> Av. Duque de Loulé, 31 - 1069-153 Lisboa  PORTUGAL
> *T * (+ 351) 21 359 44 36 */* (+ 351) 21 359 44 00  *F* (+ 351) 21 353 02
> 57
> <%7bmailsector...@spautores.pt> informat...@spautores.pt
>  www.SPAutores.pt
> 
> 
> 
> 
> Please consider the environment before printing this email
>
> Esta mensagem electrónica, incluindo qualquer dos seus anexos, contém
> informação PRIVADA, CONFIDENCIAL e de DIVULGAÇÃO PROIBIDA,e destina-se
> unicamente à pessoa e endereço electrónico acima indicados. Se não for o
> destinatário desta mensagem, agradecemos que a elimine e nos comunique de
> imediato através do telefone  +351 21 359 44 00 ou por email para:
> ge...@spautores.pt
>
> This electronic mail transmission including any attachment hereof,
> contains information that is PRIVATE, CONFIDENTIAL and PROTECTED FROM
> DISCLOSURE, and it is only for the use of the person and the e-mail address
> above indicated. If you have received this electronic mail transmission in
> error, please destroy it and notify us immediately through the telephone
> number  +351 21 359 44 00 or at the e-mail address:  ge...@spautores.pt
>
>


Data Impor Handlert

2020-10-15 Thread DINSD | SPAutores

Hi

Based on this document there are two ways to index document on the Solr 
platform, https://lucidworks.com/post/indexing-with-solrj/


Quote:
"Two popular methods of indexing existing data are the Data Import 
Handler (DIH) and Tika (Solr Cell)/ExtractingRequestHandler"


Now that DHI has been discontinued, only supported by a community 
package, are there any other options?


Best regards
*Rui Pimentel*
Assinatura SPA
**
*Rui Pimentel*


**
*DINSD - Departamento de Informática / SPA Digital*
Av. Duque de Loulé, 31 - 1069-153 Lisboa PORTUGAL
*T * (+ 351) 21 359 44 36 */* (+ 351) 21 359 44 00 *F* (+ 351) 21 353 02 57
 informat...@spautores.pt
www.SPAutores.pt
 
 


Please consider the environment before printing this email

Esta mensagem electrónica, incluindo qualquer dos seus anexos, contém 
informação PRIVADA, CONFIDENCIAL e de DIVULGAÇÃO PROIBIDA,e destina-se 
unicamente à pessoa e endereço electrónico acima indicados. Se não for o 
destinatário desta mensagem, agradecemos que a elimine e nos comunique 
de imediato através do telefone +351 21 359 44 00 ou por email para: 
ge...@spautores.pt 


This electronic mail transmission including any attachment hereof, 
contains information that is PRIVATE, CONFIDENTIAL and PROTECTED FROM 
DISCLOSURE, and it is only for the use of the person and the e-mail 
address above indicated. If you have received this electronic mail 
transmission in error, please destroy it and notify us immediately 
through the telephone number +351 21 359 44 00 or at the e-mail address: 
ge...@spautores.pt

Assinatura SPA