Re: Solr documents update on index

2013-09-06 Thread Luís Portela Afonso
Hi,

But i'm indexing rss feeds. I want that solr indexes that without change the 
existing information of a document with the same uniqueKey.
The best approach is that solr updates the doc if changes are detected, but i 
can leave without that.

I really would like that solr does not update the document if it already exists.

I'm using the DataImportScheduler to solr index launch the scheduled index.

Appreciate any possible help.

On Sep 6, 2013, at 9:16 AM, Shalin Shekhar Mangar  
wrote:

> Yes, if a document with the same key exists, then the old document
> will be deleted and replaced with the new document. You can also
> partially update documents (we call it atomic updates) which reads the
> old document from local index, updates it according to the request and
> then replaces the old document with the new one.
> 
> See 
> https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-UpdatingOnlyPartofaDocument
> 
> On Fri, Sep 6, 2013 at 1:03 AM, Luis Portela Afonso
>  wrote:
>> Hi,
>> 
>> I'm having a problem when solr indexes.
>> It is updating documents already indexed. Is this a normal behavior?
>> If a document with the same key already exists is it supposed to be updated?
>> I has thinking that is supposed to just update if the information on the
>> rss has changed.
>> 
>> Appreciate your help
>> 
>> --
>> Sent from Gmail Mobile
> 
> 
> 
> -- 
> Regards,
> Shalin Shekhar Mangar.



smime.p7s
Description: S/MIME cryptographic signature


Re: Solr documents update on index

2013-09-06 Thread Shalin Shekhar Mangar
Yes, if a document with the same key exists, then the old document
will be deleted and replaced with the new document. You can also
partially update documents (we call it atomic updates) which reads the
old document from local index, updates it according to the request and
then replaces the old document with the new one.

See 
https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-UpdatingOnlyPartofaDocument

On Fri, Sep 6, 2013 at 1:03 AM, Luis Portela Afonso
 wrote:
> Hi,
>
> I'm having a problem when solr indexes.
> It is updating documents already indexed. Is this a normal behavior?
> If a document with the same key already exists is it supposed to be updated?
> I has thinking that is supposed to just update if the information on the
> rss has changed.
>
> Appreciate your help
>
> --
> Sent from Gmail Mobile



-- 
Regards,
Shalin Shekhar Mangar.


Solr documents update on index

2013-09-05 Thread Luis Portela Afonso
Hi,

I'm having a problem when solr indexes.
It is updating documents already indexed. Is this a normal behavior?
If a document with the same key already exists is it supposed to be updated?
I has thinking that is supposed to just update if the information on the
rss has changed.

Appreciate your help

-- 
Sent from Gmail Mobile


Re: Trigger documents update in a collection

2013-04-15 Thread Otis Gospodnetic
Hi,

Doable with a custom Update Request Processor, yes.

Otis
Solr & ElasticSearch Support
http://sematext.com/
On Apr 15, 2013 3:14 PM, "Francois Perron" <
francois.per...@wantedanalytics.com> wrote:

> Hi all,
>
> I want to use Solr4 as a NoSQL.
>
> My 'ideal' workflow is to add/update documents in a collection (NoSQL) and
> automatically update changes in another collection with more specific
> search capabilities. The nosql collection will contains all my documents
> (750M docs).  The 'searchable' collection will only contain a subset of
> this collection (active documents based on a field).
>
> Is it possible ?
>
> Thank you


Trigger documents update in a collection

2013-04-15 Thread Francois Perron
Hi all,

I want to use Solr4 as a NoSQL.  

My 'ideal' workflow is to add/update documents in a collection (NoSQL) and 
automatically update changes in another collection with more specific search 
capabilities. The nosql collection will contains all my documents (750M docs).  
The 'searchable' collection will only contain a subset of this collection 
(active documents based on a field).

Is it possible ?

Thank you

Re: Documents update

2011-06-07 Thread Denis Kuzmenok
Created  file,  reloaded  solr  -  externalfilefield  works fine, if i
change  change  external  files  and  do "curl
http://127.0.0.1:4900/solr/site/update -H "Content-Type: text/xml" 
--data-binary ''"
then  no  thanges are made. If i start solr without external files and
then create them - they are not working..
What is wrong?

PS: Solr 3.2

> http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html

> On Tuesday 31 May 2011 15:41:32 Denis Kuzmenok wrote:
>> Flags   are   stored  to filter results and it's pretty highloaded, it's
>> working  fine,  but i can't update index very often just to make flags
>> up to time =\
>> Where can i read about using external fields / files?
>> 
>> > And it wouldn't work unless all the data is stored anyway. Currently
>> > there's no way to update a single field in a document, although there's
>> > work being done in that direction (see the "column stride" JIRA).
>> > 
>> > What do you want to do with these fields? If it's to influence scoring,
>> > you could look at external fields.
>> > 
>> > If the flags are a selection criteria, it's...harder. What are the flags
>> > used for? Could you consider essentially storing a map of the
>> > uniqueKey's and flags in a "special" document and having your app
>> > read that document and merge the results with the output? If this seems
>> > irrelevant, a more complete statement of the use-case would be helpful.
>> > 
>> > Best
>> > Erick





Re: Documents update

2011-06-01 Thread Alexey Serba
> Will it be slow if there are 3-5 million key/value rows?
AFAIK it shouldn't affect search time significantly as Solr caches it
in memory after you reloading Solr core / issuing commit.

But obviously you need more memory and commit/reload will take more time.


Re: Documents update

2011-05-31 Thread Denis Kuzmenok
Will it be slow if there are 3-5 million key/value rows?

> http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html

> On Tuesday 31 May 2011 15:41:32 Denis Kuzmenok wrote:
>> Flags   are   stored  to filter results and it's pretty highloaded, it's
>> working  fine,  but i can't update index very often just to make flags
>> up to time =\
>> Where can i read about using external fields / files?






Re: Documents update

2011-05-31 Thread Markus Jelsma
http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html

On Tuesday 31 May 2011 15:41:32 Denis Kuzmenok wrote:
> Flags   are   stored  to filter results and it's pretty highloaded, it's
> working  fine,  but i can't update index very often just to make flags
> up to time =\
> Where can i read about using external fields / files?
> 
> > And it wouldn't work unless all the data is stored anyway. Currently
> > there's no way to update a single field in a document, although there's
> > work being done in that direction (see the "column stride" JIRA).
> > 
> > What do you want to do with these fields? If it's to influence scoring,
> > you could look at external fields.
> > 
> > If the flags are a selection criteria, it's...harder. What are the flags
> > used for? Could you consider essentially storing a map of the
> > uniqueKey's and flags in a "special" document and having your app
> > read that document and merge the results with the output? If this seems
> > irrelevant, a more complete statement of the use-case would be helpful.
> > 
> > Best
> > Erick

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350


Re: Documents update

2011-05-31 Thread Denis Kuzmenok
Flags   are   stored  to filter results and it's pretty highloaded, it's
working  fine,  but i can't update index very often just to make flags
up to time =\
Where can i read about using external fields / files?


> And it wouldn't work unless all the data is stored anyway. Currently there's
> no way to update a single field in a document, although there's work being
> done in that direction (see the "column stride" JIRA).

> What do you want to do with these fields? If it's to influence scoring, you
> could look at external fields.

> If the flags are a selection criteria, it's...harder. What are the flags
> used for? Could you consider essentially storing a map of the
> uniqueKey's and flags in a "special" document and having your app
> read that document and merge the results with the output? If this seems
> irrelevant, a more complete statement of the use-case would be helpful.

> Best
> Erick







Re: Documents update

2011-05-31 Thread Erick Erickson
And it wouldn't work unless all the data is stored anyway. Currently there's
no way to update a single field in a document, although there's work being
done in that direction (see the "column stride" JIRA).

What do you want to do with these fields? If it's to influence scoring, you
could look at external fields.

If the flags are a selection criteria, it's...harder. What are the flags
used for? Could you consider essentially storing a map of the
uniqueKey's and flags in a "special" document and having your app
read that document and merge the results with the output? If this seems
irrelevant, a more complete statement of the use-case would be helpful.

Best
Erick

On Fri, May 27, 2011 at 4:33 AM, Denis Kuzmenok  wrote:
> I'm  using  3.1  now.  Indexing  lasts for a few hours, and have big
> plain size. Getting all documents would be rather slow :(
>
>
>> Not with 1.4, but apparently there is a patch for trunk. Not
>> sure if it is in 3.1.
>
>> If you are on 1.4, you could first query Solr to get the data
>> for the document to be changed, change the modified values,
>> and make a complete XML, including all fields, for post.jar.
>
>> Regards,
>> Gora
>
>
>
>
>


Re: Documents update

2011-05-27 Thread Denis Kuzmenok
I'm  using  3.1  now.  Indexing  lasts for a few hours, and have big
plain size. Getting all documents would be rather slow :(


> Not with 1.4, but apparently there is a patch for trunk. Not
> sure if it is in 3.1.

> If you are on 1.4, you could first query Solr to get the data
> for the document to be changed, change the modified values,
> and make a complete XML, including all fields, for post.jar.

> Regards,
> Gora






Re: Documents update

2011-05-27 Thread Gora Mohanty
2011/5/27 Denis Kuzmenok :
> Hi.
>
> I  have  and  indexed  database  which  is indexed few times a day and
> contain  tinyint  flag  (like is_enabled, is_active, etc), and content
> isn't changed too often, but flags are.
> So  if i index via post.jar only flags then entire document is deleted
> and there's only unique key and flags.
> Is  there  any  way  to  index  certain columns, and not to change all
> document?
[...]

Not with 1.4, but apparently there is a patch for trunk. Not
sure if it is in 3.1.

If you are on 1.4, you could first query Solr to get the data
for the document to be changed, change the modified values,
and make a complete XML, including all fields, for post.jar.

Regards,
Gora


Documents update

2011-05-27 Thread Denis Kuzmenok
Hi.

I  have  and  indexed  database  which  is indexed few times a day and
contain  tinyint  flag  (like is_enabled, is_active, etc), and content
isn't changed too often, but flags are.
So  if i index via post.jar only flags then entire document is deleted
and there's only unique key and flags.
Is  there  any  way  to  index  certain columns, and not to change all
document?

Thanks