Re: If you could have one feature in Solr...

2010-02-24 Thread fergus mcmenemie
-- == Fergus McMenemieEmail:fer...@twig.me.uk Techmore Limited, Phone:(UK) 07721 376021 Old Stables, Far End, Home: (UK) 01522 810839 Boothby Graffoe, Lincoln, LN5 0LG, England Unix/Mac/Intranets/WWW

Re: Question about DIH execution order

2009-11-02 Thread Fergus McMenemie
-- Noble Paul | Principal Engineer | AOL | http://aol.com

Re: Error when indexing XML files

2009-10-16 Thread Fergus McMenemie
Hi, Please find the schema file attached. Please let me know what I am doing wrong. Regards Chaitali --- On Wed, 10/14/09, Fergus McMenemie fer...@twig.me.uk wrote: From: Fergus McMenemie fer...@twig.me.uk Subject: Re: Error when indexing XML files To: solr-user@lucene.apache.org Date

Re: Error when indexing XML files

2009-10-16 Thread Fergus McMenemie
Hi, Please find the schema file attached. Please let me know what I am doing wrong. Regards Chaitali --- On Wed, 10/14/09, Fergus McMenemie fer...@twig.me.uk wrote: From: Fergus McMenemie fer...@twig.me.uk Subject: Re: Error when indexing XML files To: solr-user@lucene.apache.org Date

Re: Using DIH's special commands....Help needed

2009-10-15 Thread Fergus McMenemie
to delete these rows using DIH? In other words, where/how do I specify this? The $deleteDocByQuery is for deleting Solr documents by a Solr query and not DB rows. -- Regards, Shalin Shekhar Mangar.

Re: Error when indexing XML files

2009-10-14 Thread Fergus McMenemie
Hi, I am trying to index XML files using SolrJ. The original XML file contains nested elements. For example, the following is the snippet of the XML file. <entry> <name>SOMETHING</name> <facility>SOME_OTHER_THING</facility> </entry> I have added the elements name and facility in schema.xml

Re: Query filters/analyzers

2009-10-02 Thread Fergus McMenemie
On Thu, Oct 1, 2009 at 7:59 PM, Claudio Martella claudio.marte...@tis.bz.it wrote: About the copyField issue in general: as it copies the content to the other field, what is the sense of defining analyzers for the destination field? The source is already analyzed so I guess that the RESULT of

Number of terms in a SOLR field

2009-09-30 Thread Fergus McMenemie
Hi all, I am attempting to test some changes I made to my DIH based indexing process. The changes only affect the way I describe my fields in data-config.xml, there should be no changes to the way the data is indexed or stored. As a QA check I was wanting to compare the results from indexing

Re: Number of terms in a SOLR field

2009-09-30 Thread Fergus McMenemie
Fergus McMenemie wrote: Hi all, I am attempting to test some changes I made to my DIH based indexing process. The changes only affect the way I describe my fields in data-config.xml, there should be no changes to the way the data is indexed or stored. As a QA check I was wanting

Re: Number of terms in a SOLR field

2009-09-30 Thread Fergus McMenemie
Fergus McMenemie wrote: Fergus McMenemie wrote: Hi all, I am attempting to test some changes I made to my DIH based indexing process. The changes only affect the way I describe my fields in data-config.xml, there should be no changes to the way the data is indexed or stored. As a QA

Re: Extract info from parent node during data import (redirect:)

2009-09-17 Thread Fergus McMenemie
a JIRA for improving XPathRecordReader. Please go ahead. You can paste the contents of this mail in the list. There may be others with similar ideas. Noble.

Re: FileListEntityProcessor and LineEntityProcessor

2009-09-16 Thread Fergus McMenemie
: http://www.nabble.com/FileListEntityProcessor-and-LineEntityProcessor-tp25476443p25476443.html

Re: [DIH] Multiple repeat XPath stmts

2009-09-13 Thread Fergus McMenemie
to the transformers and I think we will have a Turing complete language:-) fergus. Thanks, Grant

Re: Extract info from parent node during data import

2009-09-12 Thread Fergus McMenemie
="true" /> <field name="author" type="string" indexed="true" stored="true"/> <field name="category" type="string" indexed="true" stored="true"/> </fields> <uniqueKey>id</uniqueKey> <defaultSearchField>id</defaultSearchField>

RE: Extract info from parent node during data import

2009-09-10 Thread Fergus McMenemie

Re: Specifying multiple documents in DataImportHandler dataConfig

2009-09-09 Thread Fergus McMenemie
of how DIH works. My 2 content types are indeed separate so they logically represent two document types, not one. Is this correct? What am I missing here? Thanks -Rupert

Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Fergus McMenemie
files. PS: I am a total NetBeans newbie.

Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Fergus McMenemie
On Mon, Sep 7, 2009 at 5:58 PM, Fergus McMenemie fer...@twig.me.uk wrote: This testcase is quite independent of anything in Solr. It is a standalone utility and the only dependency is stax. Disclaimer: I run these testcases from IntelliJ and the command line. BTW are you using XpathRecordReader

Re: Aliases for fields

2009-08-18 Thread Fergus McMenemie
multiValued="false" termVectors="false" alias="source.date"/> Is there any related JIRA issue? Thx -- Lici

Re: Using Multiple fields in UniqueKey

2009-07-15 Thread Fergus McMenemie

Re: Updating Solr index from XML files

2009-07-08 Thread Fergus McMenemie
BTW: We are using WebLogic to deploy the solr.war, and by default Solr in WebLogic uses port 7001, not 8983. Thanks Francis

Re: DIH: Limited xpath syntax unable to parse all xml elements

2009-07-02 Thread Fergus McMenemie
without the markup? Thanks, -Jay

Re: DIH: Limited xpath syntax unable to parse all xml elements

2009-07-02 Thread Fergus McMenemie
Shalin Shekhar Mangar wrote: On Thu, Jul 2, 2009 at 11:08 PM, Mark Miller markrmil...@gmail.com wrote: It looks like DIH implements its own subset of the Xpath spec. Right, DIH has a streaming implementation supporting a subset of XPath only. The supported things are in the wiki

RE: plans for switching to maven2 (after 1.4 release)?

2009-06-30 Thread Fergus McMenemie
FWIW I strongly agree with your sentiments, Manual. One of the neat maven features that isn't well known is just being able to do mvn jetty:run and have Jetty load up right away (no creating of a web-app directory or packaging of a war or anything like that). What I hate about ant based projects

Re: fq vs. q

2009-06-16 Thread Fergus McMenemie
Fergus McMenemie schrieb: The article could explain the difference between fq= and facet.query= and when you should use one in preference to the other. My understanding is that while these query modifiers rely on the same implementation (cached filters) to boost performance

Re: fq vs. q

2009-06-12 Thread Fergus McMenemie
. Regards Fergus.

RE: ExtractingRequestHandler and local files

2009-06-10 Thread Fergus McMenemie
I had also been wondering about this, but was too lazy/busy to post a question. Now that it is resolved it would help lots if you could post an example of how you invoked enableRemoteStreaming for your document(s)? Rgds Fergus. Thanks for the quick response, Grant. We tried it and it seems

Re: fq vs. q

2009-06-10 Thread Fergus McMenemie
On Tue, Jun 9, 2009 at 7:25 PM, Michael Ludwig m...@as-guides.com wrote: A filter query is cached, which means that it is the more useful the more often it is repeated. We know how often certain queries arise, or at least have the means to collect that data - so we know what might be
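The caching point in the snippet above can be sketched as follows. This is only an illustration, assuming a hypothetical Solr instance at localhost:8983 and made-up field names (category, inStock, text): the restrictions that repeat from search to search go into fq, where a cached filter can be reused, while the terms that vary per search stay in q.

```python
from urllib.parse import urlencode

# Hypothetical base URL; adjust host, port and path for a real deployment.
SOLR_SELECT = "http://localhost:8983/solr/select"


def build_query_url(user_terms, repeated_filters):
    """Return a select URL with the varying terms in q and the repeated
    restrictions as separate fq parameters."""
    params = [("q", user_terms)]
    # One fq parameter per restriction, so a filter that recurs across
    # requests is expressed identically each time it is sent.
    params.extend(("fq", restriction) for restriction in repeated_filters)
    return SOLR_SELECT + "?" + urlencode(params)


# Field names here are assumptions for illustration only.
url = build_query_url("text:solr", ["category:books", "inStock:true"])
print(url)
```

Whether a given restriction belongs in fq depends on how often it actually repeats, which is the data the quoted message says can be collected.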

Re: Customizing results

2009-06-05 Thread Fergus McMenemie
Generally a good idea, but be prepared to entertain requests that should also ask you to be able to perform the query using those aliases. I mean when you talk about something similar to aliases in SQL, those aliases can be used in SQL scripts in the where clause too. Cheers Avlesh I am using

Re: [Solr Wiki] Update of FrontPage by OscarBernal

2009-05-27 Thread Fergus McMenemie
] + === Solr Clients === * IntegratingSolr

Re: DataImportHandler Template Transformer

2009-05-19 Thread Fergus McMenemie
template="${jc.fileAbsolutePath}${x.vurl}" /> This can be used instead:- <field column="id" regex="^(.*)$" replaceWith="$1${x.vurl}" sourceColName="fileAbsolutePath" /> So I guess we have the best of both worlds! Regards Fergus.

Re: Documents in facet results

2009-05-17 Thread Fergus McMenemie

Re: Solr vs Sphinx

2009-05-17 Thread Fergus McMenemie
Something that would be interesting is to share solr configs for various types of indexing tasks. From a solr configuration aimed at indexing web pages to one doing large amounts of text to one that indexes specific structured data. I could see those being posted on the wiki and helping

Re: query regarding Indexing xml files -db-data-config.xml

2009-05-16 Thread Fergus McMenemie

Re: UK Solr users meeting?

2009-05-14 Thread Fergus McMenemie
I was wondering if there is an interest in a UK (South East) Solr user group meeting. Please let me know if you are interested. I am happy to organize. Regards, Colin. Yes, very interested. I am in Lincolnshire.

Re: Delete documents from index with dataimport

2009-05-14 Thread Fergus McMenemie
/ Notes. 1) the entity is assumed to have name=jc. 2) the uniqueKey field is assumed to be called id. 3) the entity needs to have transformer=RegexTransformer 2009/5/13 Fergus McMenemie fer...@twig.me.uk: Hi Is it possible, through dataimport handler to remove an existing document

Re: indexing txt file

2009-04-15 Thread Fergus McMenemie
a txt file? Where should I put my txt file I want to index? thank you, Alex V.

Re: [solr-user] Upgrade from 1.2 to 1.3 gives 3x slowdown

2009-04-15 Thread Fergus McMenemie
On Apr 2, 2009, at 9:23 AM, Fergus McMenemie wrote: Grant, I should note, however, that the speed difference you are seeing may not be as pronounced as it appears. If I recall during ApacheCon, I commented on how long it takes to shutdown your Solr instance when exiting it. That time

looking at the results of a distributed search using shards.

2009-04-15 Thread Fergus McMenemie
such the source document can be linked to, and to do so I think I need to know which shard a particular result came from. Is this a FAQ? Regards

Re: looking at the results of a distributed search using shards.

2009-04-15 Thread Fergus McMenemie
On Apr 15, 2009, at 11:18 AM, Fergus McMenemie wrote: Hi, Having all kinds of fun with distributed search using shards:-) I have 30K documents indexed using DIH into an index. Another index contains documents indexed using solr-cell. I am using shards to search across both indexes. I am

Re: Using ExtractingRequestHandler to index a large PDF ~solved

2009-04-14 Thread Fergus McMenemie
On Apr 6, 2009, at 10:16 AM, Fergus McMenemie wrote: Hmmm, Not sure how this all hangs together. But editing my solrconfig.xml as follows sorted the problem:- <requestParsers enableRemoteStreaming="false" multipartUploadLimitInKB="2048" /> to <requestParsers enableRemoteStreaming

Re: Searching on mulit-core Solr

2009-04-09 Thread Fergus McMenemie
solr instance? Thanks, -vivek On Mon, Apr 6, 2009 at 2:40 PM, Fergus McMenemie fer...@twig.me.uk wrote: vivek, 404 from the URL you provided in the message! Similar URLs work OK for me. hmm try http://localhost:8080/solr/admin/cores?action=status and see if that gives a 404. Also

Re: DIH; Hardcode field value/replacement based on source column

2009-04-08 Thread Fergus McMenemie
=~ m/^(.*)/g ); $c=0; print d-match, ++$c, ='$1'\n while( $s =~ m/(.*)$/g );

Re: How could I avoid reindexing same files?

2009-04-08 Thread Fergus McMenemie
Hi Fergus, On Tue, Apr 07, 2009 at 05:06:23PM +0100, Fergus McMenemie wrote: Thank you much Fergus, I was considering implementing a database which would hold a path name and an MD5 sum of each file. Snap. That is close to what we did. However due to our previous duff full text search
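The path-plus-MD5 idea mentioned above can be sketched roughly as below. This is not the scheme either poster actually built, which the truncated snippet does not show; it assumes a plain in-memory dict in place of the database, and the function names are hypothetical.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_needing_index(paths, manifest):
    """Return the paths whose checksum is new or differs from the manifest.

    manifest maps path -> MD5 hex digest (a stand-in for the database) and
    is updated in place, so an unchanged file is skipped on the next run.
    """
    changed = []
    for path in paths:
        checksum = md5_of_file(path)
        if manifest.get(path) != checksum:
            changed.append(path)
            manifest[path] = checksum
    return changed
```

Only the paths returned by files_needing_index would then be handed to whatever indexing step is in use; persisting the manifest between runs is left out of the sketch.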

Re: How could I avoid reindexing same files?

2009-04-07 Thread Fergus McMenemie
and skipping a file if it has already been indexed and not changed since? Thank you. Regards, Veselin K

Re: How could I avoid reindexing same files?

2009-04-07 Thread Fergus McMenemie
from solr if/when you wanted to validate your folder structure and or indexes. Regards, Veselin K On Tue, Apr 07, 2009 at 09:01:31AM +0100, Fergus McMenemie wrote: Veselin, Well, as far as solr is concerned, there are two issues here:- 1) To stop the same document ending up in the indexes

Re: DIH API for specifying a either specific or all configurations imported

2009-04-06 Thread Fergus McMenemie
-importentity=jc See the docs at:- http://wiki.apache.org/solr/DataImportHandler#head-1582242c1bfc1f3e89f4025bf2055791848acefb Fergus.

Re: Using ExtractingRequestHandler to index a large PDF ~solved

2009-04-06 Thread Fergus McMenemie
) at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105) Although the PDF is big, it contains very little text; it is a map. java -jar solr/lib/tika-0.3.jar -g appears to have no bother with it. Fergus...

Re: Additive filter queries

2009-04-03 Thread Fergus McMenemie
! As best I understand, you somehow need to arrange for each different combination of colour, size and width to be indexed as a separate Solr document.

Re: [solr-user] Upgrade from 1.2 to 1.3 gives 3x slowdown

2009-04-02 Thread Fergus McMenemie
On Apr 1, 2009, at 9:39 AM, Fergus McMenemie wrote: Grant, Redoing the work with your patch applied does not seem to make a difference! Is this the expected result? No, I didn't expect Solr 1095 to fix the problem. Overwrite = false + 1095, does, however, AFAICT by your last line, right

Problem using ExtractingRequestHandler with tomcat

2009-04-02 Thread Fergus McMenemie
) at org.apache.solr.core.RequestHandlers$1.create(RequestHandlers.java:154) at org.apache.solr.core.RequestHandlers$1.create(RequestHandlers.java:163) Any ideas?

Re: Problem using ExtractingRequestHandler with tomcat

2009-04-02 Thread Fergus McMenemie
On Apr 2, 2009, at 4:26 AM, Fergus McMenemie wrote: I can't get ExtractingRequestHandler to work with tomcat. Using the latest version from svn and then a make clean dist and copying the war file to a clean tomcat does not work. make?! :) Oops! try ant example to see if that gets it working

Using ExtractingRequestHandler to index a large PDF

2009-04-02 Thread Fergus McMenemie

Re: [solr-user] Upgrade from 1.2 to 1.3 gives 3x slowdown

2009-04-01 Thread Fergus McMenemie

Re: DIH; Hardcode field value/replacement based on source column

2009-03-31 Thread Fergus McMenemie
for all documents. Any idea why this DIH instruction would see constant value appear twice??

Re: [solr-user] Upgrade from 1.2 to 1.3 gives 3x slowdown

2009-03-31 Thread Fergus McMenemie
. UNLESS there are duplicate gaz entries. In the meantime, I'm trying to see if I can pinpoint down a specific change and see if there is anything that might help it perform better. -Grant

Re: [solr-user] Upgrade from 1.2 to 1.3 gives 3x slowdown

2009-03-30 Thread Fergus McMenemie
Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene: http://www.lucidimagination.com/search

Re: [solr-user] Upgrade from 1.2 to 1.3 gives 3x slowdown

2009-03-30 Thread Fergus McMenemie

Clarifying use of lst name=appends within a requestHandler

2009-03-27 Thread fergus mcmenemie
Hello, Due to limitations with the way my content is organised and DIH I have to add “-imgCaption:[* TO *]” to some of my queries. I discovered the name=”appends” functionality tucked away inside solrconfig.xml. This looks a very useful feature, and I created a new requestHandler to deal with my

Re: Scheduling DIH

2009-03-26 Thread fergus mcmenemie
H, my tuppence worth! IMHO I do not think this should be built into solr. Doing it properly leads to all kinds of nasty platform dependent issues... will we then want to add notification features on success/failure? via email? Ideally, all the scheduled activities on a system should be

Re: DIH - read datasource param values from property file or configure JNDI datasource

2009-03-19 Thread Fergus McMenemie
I am looking for an implementation of DIH feature: It also takes in a properties file for the data source configuration (http://issues.apache.org/jira/browse/SOLR-469) I want to externalize the data source parameters like driver, url, user and password to property file outside the solr. My aim

Problem encoding ':' char in a solr query

2009-03-18 Thread Fergus McMenemie
not work! Help!
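The snippet is too short to show which layer was failing in this thread. One common cause of trouble with ':' is that the query parser reads it as the field:value separator, so a literal colon inside a term usually needs a backslash in front of it. A minimal sketch of that escaping, assuming the standard Lucene query syntax (the helper name is hypothetical):

```python
import re

# Characters with special meaning in the Lucene query syntax. "&&" and "||"
# are handled by escaping each "&" and "|" individually.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\/&|])')


def escape_term(term):
    """Backslash-escape query-syntax characters so the term is matched literally."""
    return _SPECIAL.sub(r"\\\1", term)


# A colon inside a value is escaped; ordinary text passes through unchanged.
print(escape_term("urn:isbn:123"))
```

The escaped term still has to be URL-encoded when it is placed in the request, which is a separate step from the query-syntax escaping shown here.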

Re: Problem using DIH templatetransformer to create uniqueKey: solved

2009-03-12 Thread Fergus McMenemie
that currently right? The replace stuff in the config files does though. Erik On Feb 13, 2009, at 8:17 AM, Fergus McMenemie wrote: Paul, Following up your usenet suggestion: <field column="id" template="${jc.fileAbsolutePath}${x.vurl}" ignoreMissingVariables="true"/> and to add more

DIH use of the ?command=full-import entity= command option

2009-03-12 Thread Fergus McMenemie

Re: DIH use of the ?command=full-import entity= command option

2009-03-12 Thread Fergus McMenemie
, Fergus McMenemie fer...@twig.me.uk wrote: Hello, Can anybody describe the intended purpose, or provide a few examples, of how the DIH entity= command option works. Am I supposed to build a data-conf.xml file which contains many different alternate entities.. or With the entity

Re: a new DIH manifestEnityProcessor SOLR-1060 on jira

2009-03-10 Thread Fergus McMenemie
on a regex 3) extract parts (named parts) from the line using another regex Noble On Tue, Mar 10, 2009 at 1:50 AM, Fergus McMenemie fer...@twig.me.uk wrote: Hi Fergus, The idea is that we have something generic which can be applicable to a large set of users. If the manifest is a text

Re: passing parameters into the XSLTResponseWriter: particularly hostname

2009-03-09 Thread Fergus McMenemie
where your stylesheet has access to it. -Hoss Doh! of course. Thanks.

a new DIH manifestEnityProcessor

2009-03-09 Thread Fergus McMenemie
? Is DIH the right place to add this? Suggestions for a different name? Suggestions on how to do the delete bitty from within an entity? Regards Fergus.

Re: a new DIH manifestEnityProcessor

2009-03-09 Thread Fergus McMenemie
On Mon, Mar 9, 2009 at 8:30 PM, Fergus McMenemie fer...@twig.me.uk wrote: Hello, I have almost finished a new DIH EntityProcessor which I am calling the manifestEnityProcessor. It is designed around the idea that whatever demon is used to maintain your set of a few 100,000 xml documents

Re: DIH with a list of changed documents?

2009-03-09 Thread Fergus McMenemie
find the thread titled:- a new DIH manifestEnityProcessor is your list of changed documents a list of additions and updates only, or does it contain deletes as well? Fergus.

Re: DIH with a list of changed documents?

2009-03-09 Thread Fergus McMenemie
Le 09-mars-09 à 22:29, Fergus McMenemie a écrit : how would I implement entity-processor if I were able to get the list of recently changed documents of our sites? H, this sounds like a job for my manifestEnityProcessor see if you can find the thread titled:- a new DIH

passing parameters into the XSLTResponseWriter: particularly hostname

2009-02-27 Thread Fergus McMenemie

Re: DIH transformers - sect 2 - SOLR-1033

2009-02-21 Thread Fergus McMenemie
I have created SOLR-1033 in JIRA to address this issue. At 13:32 + 21/2/09, Fergus McMenemie wrote: On Mon, Feb 16, 2009 at 3:22 PM, Fergus McMenemie fer...@twig.me.uk wrote: 2) Having used TemplateTransformer to assign a value to an entity column that column cannot be used in other

Re: DIH transformers - sect 2

2009-02-17 Thread Fergus McMenemie
On Mon, Feb 16, 2009 at 3:22 PM, Fergus McMenemie fer...@twig.me.uk wrote: 2) Having used TemplateTransformer to assign a value to an entity column that column cannot be used in other TemplateTransformer operations. In my project I am attempting to reuse x.fileWebPath. To fix

DIH transformers

2009-02-16 Thread Fergus McMenemie
template="${jc.fileAbsolutePath}#${x.vurl}" /> </entity> </entity> </document> </dataConfig> Regards Fergus.

Re: Is this DIH entity forEach expression OK? ... yes

2009-02-13 Thread Fergus McMenemie
. But

Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
if there was a good reason for this behavior. Regards.

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
to create id, the uniqueKey fails for the parent document /record I am hacking around with TemplateTransformer.java to sort this but was wondering if there was a good reason for this behavior.

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
On Feb 13, 2009, at 8:17 AM, Fergus McMenemie wrote: Paul, Following up your usenet suggestion: <field column="id" template="${jc.fileAbsolutePath}${x.vurl}" ignoreMissingVariables="true"/> and to add more to what I was thinking... if the field is undefined in the input document

Is this DIH entity forEach expression OK?

2009-02-12 Thread Fergus McMenemie
xpath="/record/mediaBlock/caption" /> Is it OK to have an xpath expression within forEach which is a child of another of the forEach xpath expressions? Or.. is there a better way of doing this? Regards

DIH fails to import after svn update

2009-02-11 Thread Fergus McMenemie
Regards to all.

ant dist of a nightly download fails

2009-02-11 Thread Fergus McMenemie

Re: DIH fails to import after svn update

2009-02-11 Thread Fergus McMenemie
Thanks, That fixed it. On Wed, Feb 11, 2009 at 4:19 PM, Fergus McMenemie fer...@twig.me.uk wrote: java.lang.NoSuchFieldError: docCount at org.apache.solr.handler.dataimport.SolrWriter.getDocCount(SolrWriter.java:231

Re: DIH using values from solrconfig.xml inside data-config.xml

2009-02-04 Thread Fergus McMenemie
of //para would cover many of the use cases, and what was left could be covered by a preceding XSLT transform.

Re: DIH, assigning multiple xpaths to the same solr field: solved

2009-02-04 Thread Fergus McMenemie
/f/g/para / Regards Fergus On Wed, Feb 4, 2009 at 1:35 AM, Fergus McMenemie fer...@twig.me.uk wrote: entity name=x dataSource=myfilereader processor=XPathEntityProcessor url=${jc.fileAbsolutePath} stream=false forEach=/record field column=para xpath

DIH using values from solrconfig.xml inside data-config.xml

2009-02-02 Thread Fergus McMenemie
) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:365) Is there some simple escape or other syntax to be used or is this an enhancement? Regards Fergus.

Re: DIH FileListEntityProcessor recursion and fileName clash

2009-02-02 Thread Fergus McMenemie
, Feb 2, 2009 at 2:36 AM, Fergus McMenemie fer...@twig.me.uk wrote: Hello I have been trying to find out why DIH in FileListEntityProcessor mode did not appear to be recursing into subdirectories. Going through FileListEntityProcessor.java I eventually tumbled to the fact that my filename

Re: DIH using values from solrconfig.xml inside data-config.xml

2009-02-02 Thread Fergus McMenemie

DIH FileListEntityProcessor recursion and fileName clash

2009-02-01 Thread Fergus McMenemie
(aFile.lastModified()); if (biggerThan != -1 && sz <= biggerThan)

Re: How to make Relationships work for Multi-valued Index Fields?

2009-01-24 Thread Fergus McMenemie
have completely goofed on a way to set it up - much appreciate any direction on it. I am using SOLR 1.3 Regards, Guna

Re: DIH XPathEntityProcessor fails with docs containing !DOCTYPE

2009-01-23 Thread Fergus McMenemie
Seems to work fine on this morning's 23-jan-2009 nightly. Thanks very much. On Wed, Jan 21, 2009 at 6:05 PM, Fergus McMenemie fer...@twig.me.uk wrote: After looking at http://issues.apache.org/jira/browse/SOLR-964, where it seems this issue has been addressed, I had another go

Re: Cant get HTMLStripTransformer's stripHTML to work in DIH.

2009-01-21 Thread Fergus McMenemie
and build from the trunk if you need this immediately. On Mon, Jan 19, 2009 at 7:02 PM, Fergus McMenemie fer...@twig.me.uk wrote: Hmmm, Just to clarify I retested the thing using the nightly as of today 18-jan-2009. The problem is still there and this traceback is from that nightly. This looks

Re: DIH XPathEntityProcessor fails with docs containing !DOCTYPE

2009-01-21 Thread Fergus McMenemie
everything. I know that use of DOCTYPE is out of fashion, and it does not exist in our newer documents, however there are lots of older XML docs about!

Re: Cant get HTMLStripTransformer's stripHTML to work in DIH.

2009-01-21 Thread Fergus McMenemie
; at least not on that line:-) On Wed, Jan 21, 2009 at 5:40 PM, Fergus McMenemie fer...@twig.me.uk wrote: Shalin Downloaded nightly for 21jan and tried DIH again. It's better but still broken. Dozens of embedded tags are stripped from documents but it now fails every few documents

Re: getting DIH to read my XML files: solved

2009-01-19 Thread Fergus McMenemie
/subje...@qualifier='pubAbbrev']/ field column=pubdate xpath=/record/metadata/da...@qualifier='pubDate']/ /entity /entity /document /dataConfig Regards Fergus.

Cant get HTMLStripTransformer's stripHTML to work in DIH.

2009-01-19 Thread Fergus McMenemie
/metadata/da...@qualifier='pubDate'] dateTimeFormat=MMdd / /entity /entity /document /dataConfig

Re: Cant get HTMLStripTransformer's stripHTML to work in DIH.

2009-01-19 Thread Fergus McMenemie
: start rollback Jan 19, 2009 11:14:06 AM org.apache.solr.update.DirectUpdateHandler2 rollback INFO: end_rollback On Mon, Jan 19, 2009 at 4:14 PM, Fergus McMenemie fer...@twig.me.uk wrote: Hello all, I have the following DIH data-config.xml file. Adding HTMLStripTransformer and the associated

Re: Cant get HTMLStripTransformer's stripHTML to work in DIH.

2009-01-19 Thread Fergus McMenemie
, Fergus McMenemie fer...@twig.me.uk wrote: Hello all, I have the following DIH data-config.xml file. Adding HTMLStripTransformer and the associated stripHTML on the para tag seems to have broken things. I am using a nightly build from 12-jan-2009. The /record/sect1/para contains HTML sub tags

DIH XPathEntityProcessor fails with docs containing !DOCTYPE

2009-01-16 Thread Fergus McMenemie
there are lots of older XML docs about! Regards Fergus.
