indexing rich data with solr 5.3

2016-01-11 Thread kostali hassan
Some MS Word and PDF files are not indexed using dataimport; I get this error: Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to read content Processing Document # 2 at org.apache.solr.handl

Re: indexing rich data with solr 5.3

2016-01-12 Thread kostali hassan
va:73) 2016-01-12 1:23 GMT+00:00 Erick Erickson : > Looks like a bad file. Do you have any success using DIH on any files? > > What happens if you just send that particular file through the > ExtractingRequestHandler? > > Best, > Erick > > On Mon, Jan 11, 2016 at 3:51 PM, k

Fwd: indexing rich data with solr 5.3

2016-01-14 Thread kostali hassan
Thank you Eric, I have a problem with these files. Last question: how can I identify or get the list of files that cannot be indexed (bad files)?

Re: indexing rich data with solr 5.3

2016-01-15 Thread kostali hassan
t; — > Erik Hatcher, Senior Solutions Architect > http://www.lucidworks.com > > > > > On Jan 14, 2016, at 10:55 AM, Erick Erickson > wrote: > > > > No good way except to try them. For getting details on Tika parsing > > failures, I much prefer the SolrJ proces
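A note on the "try them" advice in this thread: one way to collect the list of bad files is to post each file to the ExtractingRequestHandler on its own and record the ones that fail. The following is only a minimal Python sketch under stated assumptions, not the SolrJ process mentioned in the reply. The core name `mycore`, the URL and the helper names `post_to_extract` and `find_bad_files` are hypothetical; `literal.id` and `commit` are regular /update/extract parameters.

```python
import mimetypes
import urllib.error
import urllib.parse
import urllib.request
from pathlib import Path

# Assumption: a core named "mycore" on the default local port.
EXTRACT_URL = "http://localhost:8983/solr/mycore/update/extract"


def post_to_extract(path, url=EXTRACT_URL):
    """Send one file to the ExtractingRequestHandler; raises on HTTP errors."""
    params = urllib.parse.urlencode({"literal.id": str(path), "commit": "false"})
    content_type = mimetypes.guess_type(str(path))[0] or "application/octet-stream"
    request = urllib.request.Request(
        f"{url}?{params}",
        data=Path(path).read_bytes(),  # passing data makes this a POST
        headers={"Content-Type": content_type},
    )
    with urllib.request.urlopen(request) as response:
        response.read()


def find_bad_files(paths, send=post_to_extract):
    """Try every file and return (path, error message) for each failure."""
    failures = []
    for path in paths:
        try:
            send(path)
        except (urllib.error.URLError, OSError, ValueError) as exc:
            # Keep going: one unreadable file should not stop the whole run.
            failures.append((str(path), str(exc)))
    return failures
```

Calling `find_bad_files(Path("D:/Lucene/document").glob("*.pdf"))` would then return the files that Solr rejected, while the remaining files are still indexed (a commit has to be sent separately at the end).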

indexing rich data with solr 5.3.1 integreting in Ubuntu server

2016-01-23 Thread kostali hassan
I have a problem integrating Solr on an Ubuntu server. Before using Solr on the Ubuntu server I tested it on my Mac, where it was working perfectly for the DIH request handler and update/extract

indexing rich data with solr 5.3.1 integreting in Ubuntu server

2016-01-25 Thread kostali hassan
I have a problem integrating Solr on an Ubuntu server. Before using Solr on the Ubuntu server I tested it on my Mac, where it was working perfectly for the DIH request handler and update/extract

Re: indexing rich data with solr 5.3.1 integreting in Ubuntu server

2016-01-26 Thread kostali hassan
te > path to libs and see in logs if they are loaded. > > regex=".*\.jar" /> > > > Thanks, > Emir > > On 25.01.2016 15:16, kostali hassan wrote: > >> 0down votefavorite >> < >> http://stackoverflow.com/questions/34962280/solr-index

Re: sorry, no dataimport-handler defined!

2016-02-03 Thread kostali hassan
In the data import request handler in solrconfig.xml, set the config to tika-data-config.xml, then define your file tika-data-config.xml and put it in the config directory of your core. 2016-02-02 17:35 GMT+00:00 Jean-Jacques Monot : > Exact. Newbie user ! > > OK i have seen what is m

DIH cant index adresses web

2016-03-22 Thread kostali hassan
I am trying to index rich data (MS Word and PDF), but when a document's content contains multiple links (web addresses) I get an ERROR in the log. What do I have to add to my tika-config.xml to index web paths?

indexing rich data using DIH from solr 5.4.1

2016-03-25 Thread kostali hassan
Some documents have content that cannot be extracted and they get stuck in the Solr JVM; I get this ERROR: 24/03/2016 à 19:26:59 ERROR null DocBuilder Exception while processing: files document : null:org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to read content Processing Document # 1

Re: Indexing docuements in Solr 5 Using Tika extraction error

2016-03-25 Thread kostali hassan
Thank you Shawn; but if I use the Solarium PHP client in production, what do I have to do in this case? 2016-03-25 13:44 GMT+00:00 Shawn Heisey : > On 3/25/2016 5:44 AM, Moncif Aidi wrote: > > Im Using solr 5.4.1 for indexing thousands of documents, and it works > > perfectly.The issue comes when so

Fwd: Indexing rich data (msword and pdf) in apache solr-5.3.1

2015-12-01 Thread kostali hassan
I started working with Solr 5.x by extracting Solr into D://solr and running the Solr server with: D:\solr\solr-5.3.1\bin>solr start ; then I created a core in standalone mode: D:\solr\solr-5.3.1\bin>solr create -c mycore I need to index files from the file system (Word and PDF) and the schema API doesn’t have a field “na

Fwd: index rich data with solarium php solr Client

2015-12-01 Thread kostali hassan
I get this error: Invalid character encoding detected after position 79 of query string / form data (while parsing as UTF-8). This is my function to index rich data from direct

indexing rich data from directory using solarium

2015-12-02 Thread kostali hassan
How can I index rich data (MS Word and PDF files) with Solarium from a directory that contains many files? My config is: $config = array( "endpoint" => array("localhost" => array("host"=>"127.0.0.1", "port"=>"8983", "path"=>"/solr", "core"=>"demo",) ) ); I tried this code:

Re: indexing rich data from directory using solarium

2015-12-02 Thread kostali hassan
implePostTool D:\Lucene\document ; 2015-12-02 11:09 GMT+00:00 Gora Mohanty : > On 2 December 2015 at 16:32, kostali hassan > wrote: > [...] > > > > When i execute it i get this ERROR: > > > > org.apache.solr.common.SolrException: URLDecoder: Invalid character > &g

Re: indexing rich data from directory using solarium

2015-12-02 Thread kostali hassan
-79> This is my post on Stack Overflow: http://stackoverflow.com/questions/34017889/solrexception-urldecoder-invalid-character-encoding-detected-after-position-79 2015-12-02 16:18 GMT+00:00 Gora Mohanty : > On 2 December 2015 at 17:16, kostali hassan > wrote: > > yes its logic Than

Re: indexing rich data from directory using solarium

2015-12-02 Thread kostali hassan
led demo. When I indexed all the documents, the field name exists but is still empty for every indexed document. My question is how I can get just the name of each document (MS Word and PDF), not the path as in the field “id” or the field “ressource_name”; do I have to create a new field type, or is there another way? 2015-12-

Re: indexing rich data from directory using solarium

2015-12-02 Thread kostali hassan
('Content-Type:multipart/form-data'); I guess it will work if the indexing time does not exceed the 30-second timeout of the WAMP server. 2015-12-02 16:32 GMT+00:00 Gora Mohanty : > On 2 December 2015 at 21:55, kostali hassan > wrote: > > yes they are a

Re: indexing rich data from directory using solarium

2015-12-02 Thread kostali hassan
etOmitHeader(false); $doc = $query->createDocument(); $doc->id =$file->pwd(); $doc->name = $file->name; $doc->title = $file->name(); $query->setDocument($doc); $request = $client->createRequest($query); $request->addHeaders($headers); $result = $client-

Solr extract performance

2015-12-02 Thread kostali hassan
I am looking for the optimal way to extract and commit rich data from a directory that contains many MS Word and PDF files, because I have a problem with the 30-second timeout of the WAMP server. This is my function to index documents in CakePHP using Solarium: App::import('Vendor','autoload',array('file'=>'solarium/ven

curl adapter in solarium 3x

2015-12-03 Thread kostali hassan
How can I force the connection to close explicitly when it has finished processing, so that it is not pooled for reuse? Is there a way to do this? The server may send a keep-alive timeout (with a default Apache install, it is 15 seconds or 100 requests, whichever comes first) - but cURL will just open another conn

Re: curl adapter in solarium 3x

2015-12-03 Thread kostali hassan
Thank you Gora; in fact cURL is the default adapter for solarium-3x and I am not using Zend Framework. 2015-12-03 11:05 GMT+00:00 Gora Mohanty : > On 3 December 2015 at 16:20, kostali hassan > wrote: > > How to force the connection to explicitly close when it has finished > > proc

schema fileds and Typefield in solr-5.3.1

2015-12-03 Thread kostali hassan
I started working with Solr 5.x by extracting Solr into D://solr and running the Solr server with: D:\solr\solr-5.3.1\bin>solr start ; then I created a core in standalone mode: D:\solr\solr-5.3.1\bin>solr create -c mycore I need to index files from the file system (Word and PDF) and the schema API doesn’t have a field “na

Re: schema fileds and Typefield in solr-5.3.1

2015-12-04 Thread kostali hassan
Thank you Erick, I followed your advice and took a look at the Apache Tika config. I have modified my request handler /update/extract: last_modified ignored_ true links ignored_ D:\solr\solr-5.3.1\server\solr\tika-data-config.xml and the Tika config: data

Re: schema fileds and Typefield in solr-5.3.1

2015-12-04 Thread kostali hassan
ust your mappings and configurations accordingly. > > * And apologies that bin/post isn’t Windows savvy at this point, but > you’ve got the hang of the Windows-compatible command-line it looks like. > > — > Erik Hatcher, Senior Solutions Architect > http://www.lucidworks.com >

Re: schema fileds and Typefield in solr-5.3.1

2015-12-05 Thread kostali hassan
simpleposttool just go to the web admin for dataimport and try and execute a full import. 2015-12-04 17:05 GMT+00:00 kostali hassan : > thank you , that's why I choose to add the exact value using solarium PHP > Client, but the time out stop indexing after 30seconde: > >

secure solr 5.3.1

2015-12-07 Thread kostali hassan
How should I secure my Solr 5.3.1 server in single-node mode? I am searching for the best way to secure my Solr server, but I have only found instructions for cloud mode.

Re: secure solr 5.3.1

2015-12-08 Thread kostali hassan
asic Auth, Kerberos and > authorization support. > > > Bosco > > > > > > On 12/7/15, 7:03 AM, "kostali hassan" wrote: > > >How I shoud secure my server of solr 5 .3.1 in single-node Mode. I Am > >searching for the best way to secure my server solr but I found only for > >cloud mode. > >

Re: secure solr 5.3.1

2015-12-08 Thread kostali hassan
2015-12-08 1:58 GMT+00:00 Don Bosco Durai : > > > Have you considered running your Solr as SolrCloud with embedded > zookeeper? > > > > If you do, you have multiple options. Basic Auth, Kerberos and > > authorization support. > > > > > > Bo

Re: secure solr 5.3.1

2015-12-09 Thread kostali hassan
run it in standalone mode, but it is not something > that has been well tested yet. > > On Tue, Dec 8, 2015 at 11:02 PM, Don Bosco Durai wrote: > > > It was tested and meant to work only in SolrCloud mode. > > > > > > > > > > > > > > On Tue,

kerberos and solr5 Service Principals and Keytab Files

2015-12-09 Thread kostali hassan
I am trying to secure Solr using the Kerberos plugin. I want to test Kerberos on localhost, but I don't know how to create a Kerberos principal at the KDC server, or where to generate the keytab file from the KDC server’s /tmp/107.keytab.

Re: secure solr 5.3.1

2015-12-09 Thread kostali hassan
g/confluence/display/RANGER/How+to+configure+Solr+Cloud+with+Kerberos+for+Ranger+0.5 > > > > > > Bosco > > > > > > > > > > > > On 12/9/15, 3:14 AM, "kostali hassan" wrote: > > > > >how I setting up Solr to use Kerbero

Re: secure solr 5.3.1

2015-12-09 Thread kostali hassan
/tmp/107.keytab HTTP/192.168.0.107 > > Then copy the keytab file to your solr node to the appropriate places. > > > On Thu, Dec 10, 2015 at 12:08 AM, kostali hassan < > med.has.kost...@gmail.com> > wrote: > > > I folow this two resources and Iam stuck in > > > >

Re: Re: secure solr 5.3.1

2015-12-10 Thread kostali hassan
I am looking to secure my Solr running in standalone mode on Windows; the Kerberos plugin is only able to secure Solr in standalone mode. How do I create a principal and its password? 2015-12-10 9:35 GMT+00:00 kostali hassan : > Iam looking to secure my solr runing in standalone Mode the kerber

Re: Re: secure solr 5.3.1

2015-12-10 Thread kostali hassan
t; > All the best, > Regards, > Ishan > > > On Thu, Dec 10, 2015 at 5:19 PM, kostali hassan > > wrote: > > > Iam looking to secure my solr runing in standalone Mode within windows > ;the > > kerberose plugin is only able to secure solr in standalone mode. h

Re: trying DIH but get 'Sorry, no dataimport-handler defined!'

2016-05-24 Thread kostali hassan
If you have this path, server/solr/configsets/testdih/conf, you should write this on your command line: 'bin\solr>solr create -c your_core -d testdih -p 8983 to create a core with the example config testdih. 2016-05-24 9:35 GMT+01:00 scott.chu : > > I do following things: > > * I create folder :

"data import handler : import data from sql database :how to search in all fields"

2016-05-26 Thread kostali hassan
I import data from SQL databases with DIH. I am looking to search for a term in all fields, not field by field.

Re: "data import handler : import data from sql database :how to search in all fields"

2016-05-26 Thread kostali hassan
Thank you, it makes sense. Have a good day. 2016-05-26 15:31 GMT+01:00 Siddhartha Singh Sandhu : > The schema.xml/managed_schema defines the default search field as `text`. > > You can make all fields that you want searchable type `text`. > > On Thu, May 26, 2016 at 10:23 AM,

Re: "data import handler : import data from sql database :how to search in all fields"

2016-05-26 Thread kostali hassan
can copy all of the fields into an "uber field" using the > copyField directive and just search the "uber field". > > Best, > Erick > > On Thu, May 26, 2016 at 7:35 AM, kostali hassan > wrote: > > thank you it make sence . > > have a good day &g
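A note on the "uber field" advice above: on a core that uses a managed schema, the copyField rule can also be added through the Schema API instead of editing schema.xml by hand. The sketch below only builds the JSON command and the search parameters. The destination field name `all_text` is an assumption (it must already exist and be multiValued), and the two helper names are hypothetical; `add-copy-field` and the `df` parameter are standard Solr features.

```python
import json


def copy_all_fields_command(dest="all_text"):
    """Build the Schema API body that copies every field into one catch-all field."""
    # Assumption: `dest` already exists in the schema as a multiValued text field.
    return json.dumps({"add-copy-field": {"source": "*", "dest": dest}})


def search_all_fields_params(term, dest="all_text"):
    """Query parameters that search only the catch-all field."""
    # `df` sets the default field, so the term needs no field prefix.
    return {"q": term, "df": dest, "wt": "json"}
```

The command body would be POSTed with `Content-Type: application/json` to the core's `/schema` endpoint, followed by a full re-import so that the new field is populated.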

data import handler for solr 5.4.1 to index rich Data

2016-06-02 Thread kostali hassan
I am looking to define a multivalued field, for example the field links, to extract all links from the field text of each file. I defined a regex for the link expression in tika.config.xml, but when the indexing process finishes I get just one value, even though in schema.xml I define the field link

solr5.4.1 : data import handler for index rich data

2016-06-06 Thread kostali hassan
I am looking to add a new field whose values are extracted from the field text: for example the field links, to extract all links from the field text of each file. I defined a regex for the link expression in tika.config.xml, but when the indexing process finishes I get just one value even if

how collect a list of damaged file they can not be indexed

2016-06-22 Thread kostali hassan
I started using Solr 5.4.1 to index rich data (PDF and MS Word) using the data import handler. In the file tika-config.xml I wrote: onError="skip" I want to recover the corrupted files

Fwd: how collect a list of damaged file they can not be indexed

2016-06-22 Thread kostali hassan
-- Forwarded message -- From: "kostali hassan" Date: 22 June 2016 14:00 Subject: how collect a list of damaged file they can not be indexed To: Cc: I started using Solr 5.4.1 to index rich data (PDF and MS Word) using the data import handler. In the file tika-config.xml I wrote: onE

Re: Searching Home's, Homes and Home

2016-07-12 Thread kostali hassan
Or you can create a file called synonym.txt in the config directory of your core. On 11 Jul 2016 17:06, "Surender" wrote: > Thanks... > > I am applying these filters and will share update on this issue. It will > take couple of days. > > Thanks, > Surender Singh > > > > -- > View this messag

Update index

2016-07-13 Thread kostali hassan
I am using Solr 5.4.1 to index an SQL database with the data import handler. I am looking to update the index automatically when the database is modified or new values are inserted into it.

Re: Update index

2016-07-13 Thread kostali hassan
ied just date_creation 2016-07-13 14:11 GMT+01:00 Jamal, Sarfaraz < sarfaraz.ja...@verizonwireless.com.invalid>: > Hi Kostali, > > I would look at the Delta Queries - > > Sas > > -Original Message- > From: kostali hassan [mailto:med.has.kost...@gmail.com] &g
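A note on the Delta Queries suggestion: once the DIH entity defines a deltaQuery, an update is triggered by calling the handler with `command=delta-import`, and something outside Solr (a cron job or scheduled task, for example) has to make that call periodically. A minimal Python sketch that only builds the URL; the core name, the base URL and the helper name `delta_import_url` are assumptions.

```python
from urllib.parse import urlencode


def delta_import_url(core, base="http://localhost:8983/solr"):
    """URL that asks the DataImportHandler to import only changed rows."""
    params = urlencode({
        "command": "delta-import",
        "clean": "false",   # keep documents that are already indexed
        "commit": "true",   # make the changes searchable when the run ends
    })
    return f"{base}/{core}/dataimport?{params}"
```

Fetching this URL on a schedule (with `urllib.request.urlopen`, curl, or similar) keeps the index close to the database without running a full import each time.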

Re: Fwd: how collect a list of damaged file they can not be indexed

2016-07-13 Thread kostali hassan
06-24 12:39 GMT+00:00 Rick Leir : > Do you mean that some of your pdf's are corrupt and Tika cannot index > them? There should be some mention in the log file, so you can know which > pdf is a problem. Fix it somehow and re-index. > > > On June 22, 2016 9

DIH:damaged files

2016-07-14 Thread kostali hassan
I am trying to index many MS Word and PDF files using solr-5.4.1. In the Solr log I get only the description of the ERROR, not the file that caused it. How can I get a list of the files that are corrupt and that Tika cannot index? And even if Solr tries to index a corrupt file and fails, how can I force Solr to continue indexing the

index sql databases

2016-07-15 Thread kostali hassan
I use Solr 5.4.1. When an attribute of type date is null (:00:00), the indexing process stops and the log shows an error. What do I have to change in driver="com.mysql.jdbc.Driver" to ignore null dates? Last question: how do I set the field to date and time< hh:mm:ss>

Re: index sql databases

2016-07-15 Thread kostali hassan
Thank you Shawn. The problem is that when I display a field of type date I get the value in this form: -MM-dd'T'hh:mm:ss'Z'

Re: index sql databases

2016-07-18 Thread kostali hassan
Can we use transformer="RegexTransformer" and set it in db_data_config.xml? 2016-07-16 18:18 GMT+01:00 Shawn Heisey : > On 7/15/2016 3:10 PM, kostali hassan wrote: > > Thank you Shawn the prb is when I display the field type date I get > > the value in this forme -MM-

More like this in solr5.4.1

2016-07-19 Thread kostali hassan
I want to introduce MoreLikeThis to get similar documents for each query. I have indexed rich data (PDF and MS Word). I guess the field to use for similarity is CONTENT, which is also used for highlighting document content. In my case, what is the best way to build MLT: MoreLikeThisHandler
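A note on the question above: besides the dedicated MoreLikeThisHandler, MoreLikeThis can be switched on for an ordinary query through request parameters. The Python sketch below only builds such a request URL. The core name `mycore`, the field name `content` and the helper name `mlt_query_url` are assumptions; `mlt`, `mlt.fl`, `mlt.count`, `mlt.mintf` and `mlt.mindf` are standard MoreLikeThis parameters.

```python
from urllib.parse import urlencode


def mlt_query_url(term, core="mycore", field="content", count=5,
                  base="http://localhost:8983/solr"):
    """Build a /select URL that returns similar documents for each hit."""
    params = urlencode({
        "q": term,
        "mlt": "true",       # enable the MoreLikeThis component
        "mlt.fl": field,     # field used to compute similarity
        "mlt.count": count,  # similar documents returned per hit
        "mlt.mintf": 1,      # minimum term frequency in the source document
        "mlt.mindf": 1,      # minimum number of documents containing the term
        "wt": "json",
    })
    return f"{base}/{core}/select?{params}"
```

The similarity field generally works best when it is stored or has term vectors enabled; the low `mlt.mintf` and `mlt.mindf` values are only a starting point for a small test index.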

Re: index sql databases

2016-07-19 Thread kostali hassan
type date I get the > value in this forme -MM-dd'T'hh:mm:ss'Z' > > A regex in the _input_ side will have > no effect on what Solr returns. You'd have > to use a DocTransformer to change the output > on the query side. DIH is in the indexing side
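A note on the reply above: since a regex on the input side does not change what Solr returns, a simple alternative to a DocTransformer is to reformat the date in the client after the query. This is a swapped-in, client-side technique and not the DocTransformer route named in the reply. A minimal Python sketch, assuming the value arrives as a UTC timestamp without milliseconds; the output format and the helper name `format_solr_date` are assumptions.

```python
from datetime import datetime, timezone

# Shape of the date strings as returned in query results (UTC, no milliseconds).
SOLR_DATE_FORMAT = "%Y-%m-%dT%H:%M:%SZ"


def format_solr_date(value, output_format="%d/%m/%Y %H:%M:%S"):
    """Parse a date string returned by Solr and render it for display."""
    parsed = datetime.strptime(value, SOLR_DATE_FORMAT).replace(tzinfo=timezone.utc)
    return parsed.strftime(output_format)
```

For example, `format_solr_date("2016-07-15T21:10:00Z")` gives `15/07/2016 21:10:00`; values that carry milliseconds would need a different input format.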

solr AND riche Data

2016-09-05 Thread kostali hassan
I index rich data in Solr 5.4.1 and I use Solarium to search for terms in the index on the field text. How can I display, for each term, its category, synonyms, similar results, suggestions, autocomplete... For example, the term to search is q=java; similar terms are: javascript,javaEE.. frameworks:Hibernate,Jboss,strut,s

Fwd: solr AND riche Data

2016-09-19 Thread kostali hassan
I index rich data in Solr 5.4.1 and I use Solarium to search for terms in the index on the field text. How can I display, for each term, its category, synonyms, similar results, suggestions, autocomplete... For example, the term to search is q=java; similar terms are: javascript,javaEE.. frameworks:Hibernate,Jboss,strut,s