Re: Can't search formatted text in solr

2020-07-22 Thread Khare, Kushal (MIND)
kens indexed. My bet is that your doc format isn’t being analyzed properly, perhaps to do markup and the second case doesn’t get indexed the way you think it should. You can use the terms handler to examine exactly what’s in the index Best, Erick > On Jul 22, 2020, at 12:42 PM, Khare, Kushal

Can't search formatted text in solr

2020-07-22 Thread Khare, Kushal (MIND)
Hello guys, I have been using solr for my java application to carry out content search from the saved docs. I am facing a problem in searching for a word - 'load' There are 2 cases, in 1st search is working good but in second case with the same doc and same query - 'load' am not getting the resul

RE: Delete documents from the Solr index using SolrJ

2019-11-05 Thread Khare, Kushal (MIND)
index data, delete data for old version number. -Original Message- From: Khare, Kushal (MIND) [mailto:kushal.kh...@mind-infotech.com] Sent: 04 November 2019 15:03 To: solr-user@lucene.apache.org Subject: [EXTERNAL] RE: Delete documents from the Solr index using SolrJ Thanks! Actual

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
do the trick unless im missing something significant in what youre trying to do here. you can just pass in an xml command: '".$kill_query."' On Mon, Nov 4, 2019 at 9:37 AM Khare, Kushal (MIND) < kushal.kh...@mind-infotech.com> wrote: > In my case, id won't be

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
ubject: Re: Delete documents from the Solr index using SolrJ when you add a new document using the same "id" value as another it just over writes it On Mon, Nov 4, 2019 at 9:30 AM Khare, Kushal (MIND) < kushal.kh...@mind-infotech.com> wrote: > Could you please le

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
more clear now. -Original Message- From: Khare, Kushal (MIND) [mailto:kushal.kh...@mind-infotech.com] Sent: 04 November 2019 20:00 To: solr-user@lucene.apache.org Subject: RE: Delete documents from the Solr index using SolrJ Could you please let me know how to achieve that

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
. However why don’t you simply overwrite the existing document instead of add+delete > Am 04.11.2019 um 15:12 schrieb Khare, Kushal (MIND) > : > > Hello mates! > I want to know how we can delete the documents from the Solr index . Suppose > for my system, I have a document tha

Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
Hello mates! I want to know how we can delete the documents from the Solr index . Suppose for my system, I have a document that has been indexed, now its newer version is into use, so I want to use the latest one, for that I want the previous one to be deleted from the index. Kindly help me a wa

Integrating Solr with my existing java web application

2019-10-31 Thread Khare, Kushal (MIND)
Hello mates ! Hope you people are doing good ! Well, I am trying to integrate the SolrJ code for indexing and querying the documents through Solr with my java web app. I am facing a very wired issue, that when I run my method for Solr as java app (independently using main() function) it works f

RE: Skip Headers & Footers while text extraction using Apache Tika parsing for PPT & PDF formats

2019-09-03 Thread Khare, Kushal (MIND)
at you are currently doing. Here is the documentation on how to get a different output format: https://tika.apache.org/1.8/examples.html#Parsing_using_the_Auto-Detect_Parser > Am 04.09.2019 um 08:29 schrieb Khare, Kushal (MIND) > : > > I already spent a lot of time reading on the in

RE: Skip Headers & Footers while text extraction using Apache Tika parsing for PPT & PDF formats

2019-09-03 Thread Khare, Kushal (MIND)
che POI. Feel free to contribute a solution to those problems to the Apache Tika project. > Am 04.09.2019 um 05:42 schrieb Khare, Kushal (MIND) > : > > Guys, could I get any help ? Or it's useless posting queries over here ? > > On Sep 3, 2019 4:00 PM, "Khare, Kushal (MI

Re: Skip Headers & Footers while text extraction using Apache Tika parsing for PPT & PDF formats

2019-09-03 Thread Khare, Kushal (MIND)
Guys, could I get any help ? Or it's useless posting queries over here ? On Sep 3, 2019 4:00 PM, "Khare, Kushal (MIND)" wrote: Hello, mates ! I am extracting content from my documents using Apache Tika. I need to exclude the headers & footers of the documents. I have already

Skip Headers & Footers while text extraction using Apache Tika parsing for PPT & PDF formats

2019-09-03 Thread Khare, Kushal (MIND)
Hello, mates ! I am extracting content from my documents using Apache Tika. I need to exclude the headers & footers of the documents. I have already done this for Word & Excel format using OfficeParseConfig, but need to implement the same for PPT & PDF. How to achieve that ? ___

RE: Require searching only for file content and not metadata

2019-08-29 Thread Khare, Kushal (MIND)
ct: Re: Require searching only for file content and not metadata I already provided feedback, you haven’t evidenced any attempt to follow up on it. Best, Erick > On Aug 29, 2019, at 4:54 AM, Khare, Kushal (MIND) > wrote: > > Erick, > I am using the code that I posted yesterday. But,

RE: Require searching only for file content and not metadata

2019-08-29 Thread Khare, Kushal (MIND)
ta for file: " + fileName); >for (String name : metadata.names()) { > System.out.println(name + ":" + metadata.get(name)); >} >System.out.println("x.."); > } > } > > > Also, I am attaching th

RE: Require searching only for file content and not metadata

2019-08-28 Thread Khare, Kushal (MIND)
adata metadata) { > System.out.println("Dumping metadata for file: " + fileName); >for (String name : metadata.names()) { > System.out.println(name + ":" + metadata.get(name)); >} >System.out.println("x..

RE: Require searching only for file content and not metadata

2019-08-28 Thread Khare, Kushal (MIND)
ocList.clear(); > } >} > } > > // Just to show all the metadata that's available. > private void dumpMetadata(String fileName, Metadata metadata) { > System.out.println("Dumping metadata for file: " + fileName); >for (String name : metadata.names())

RE: Require searching only for file content and not metadata

2019-08-28 Thread Khare, Kushal (MIND)
meta:word-count 20 \n extended-properties:PresentationFormat Widescreen \n dc:creator Khare, Kushal (MIND) \n extended-properties:Company MIND \n Word-Count 20 \n dcterms:created 2019-06-18T07:25:29Z \n dcterms:modified 2019-06-24T09:52:33Z \n Last-Modified 2019-06-24T09:52:33Z \n Last

RE: Require searching only for file content and not metadata

2019-08-28 Thread Khare, Kushal (MIND)
Attaching managed-schema.xml -Original Message- From: Khare, Kushal (MIND) [mailto:kushal.kh...@mind-infotech.com] Sent: 28 August 2019 16:30 To: solr-user@lucene.apache.org Subject: RE: Require searching only for file content and not metadata I already tried this example, I am currently

RE: Require searching only for file content and not metadata

2019-08-28 Thread Khare, Kushal (MIND)
ching the scrollconfig.xml & Managed-schema.xml for my collection. Please see to it & suggest where I am getting wrong. I can't even get to see the _text_ field in the query result, instead of stored parameter being true. Any help would really be appreciated. Thanks ! -Original Message

RE: Require searching only for file content and not metadata

2019-08-28 Thread Khare, Kushal (MIND)
adata from? Have you read the Solr reference guide? Have you read a book about Solr? > Am 28.08.2019 um 08:10 schrieb Khare, Kushal (MIND) > : > > Could anyone please help me with how to use this approach ? I humbly request > all the users to please help me get th

RE: Require searching only for file content and not metadata

2019-08-27 Thread Khare, Kushal (MIND)
t Handler Update Extract configurations, but > none of them worked for me. > Please help me resolve this as I am badly stuck in this. > > -Original Message- > From: Khare, Kushal (MIND) [mailto:kushal.kh...@mind-infotech.com] > Sent: 27 August 2019 12:59 > To: solr-user@lu

RE: Require searching only for file content and not metadata

2019-08-27 Thread Khare, Kushal (MIND)
this as I am badly stuck in this. -Original Message- From: Khare, Kushal (MIND) [mailto:kushal.kh...@mind-infotech.com] Sent: 27 August 2019 12:59 To: solr-user@lucene.apache.org; ch...@christopherschultz.net Subject: RE: Require searching only for file content and not metadata Chris, What

RE: Require searching only for file content and not metadata

2019-08-27 Thread Khare, Kushal (MIND)
To: solr-user@lucene.apache.org Subject: Re: Require searching only for file content and not metadata -BEGIN PGP SIGNED MESSAGE----- Hash: SHA256 Kushal, On 8/26/19 07:52, Khare, Kushal (MIND) wrote: > This is Kushal Khare, a new addition to the user-list. I started > working with S

Require searching only for file content and not metadata

2019-08-26 Thread Khare, Kushal (MIND)
Hello Guys! This is Kushal Khare, a new addition to the user-list. I started working with Solr few days ago for implementing it in my project. Now, I have the basics done, and reached the query stage. My problem is – I need to restrict the solr to search only for the file content and not the meta

Query regarding Solr Search

2019-08-07 Thread Khare, Kushal (MIND)
Hello People ! Hope you all are doing good ! Well, I am new to the solr server and want to use it for content search in one of my applications. I have already been working upon it since quite a few days, and have the basics done. The issue that I have is, I want my search to be restricted only