Re: Regex replacement not working!

2011-06-29 Thread Adam Estrada
I have had the same problems with regex and I went with the regular pattern replace filter rather than the charfilter. When I added it to the very end of the chain, only then would it work...I am on Solr 3.2. I have also noticed that the HTML filter factory is not working either. When I dump the fi

REGEX Proper Usage?

2011-06-17 Thread Adam Estrada
All, I am having trouble getting my regex pattern to work properly. I have tried PatternReplaceFilterFactory after the standard tokenizer and PatternReplaceCharFilterFactory before it. It looks like this should work to remove everything except letters and numbers.

Re: Mahout & Solr

2011-06-16 Thread Adam Estrada
You're right...It would be nice to be able to see the cluster results coming from Solr though... Adam On Thu, Jun 16, 2011 at 3:21 AM, Andrew Clegg wrote: > Well, it does have the ability to pull TermVectors from an index: > > > https://cwiki.apache.org/MAHOUT/creating-vectors-from-text.html#Cr

[Handling] empty fields

2011-06-15 Thread Adam Estrada
All, I have a field "foo" with several thousand blank or non-existing records in it. This is also my faceting field. My question is, how can I deal with this field so that I don't get a blank facet at query time? 5000 vs. 1000 Adam

Re: Mahout & Solr

2011-06-15 Thread Adam Estrada
The only integration at this point (as far as I can tell) is that Mahout can read the lucene index created by Solr. I agree that it would be nice to swap out the Carrot2 clustering engine with Mahout's set of algorithms but that has not been done yet. Grant has pointed out that you can use Solr's c

Re: Finding Keywords/Phrases

2011-06-12 Thread Adam Estrada
Hi Frank, I have been working on something very similar and I am at the point where I don't believe (and I could be totally wrong) that a pure Solr solution is going to do this. I would look at Mahout and play with some of the machine learning algorithms that it can run against a Lucene index. I h

Re: [Free Text] Field Tokenizing

2011-06-09 Thread Adam Estrada
query time and make them into a > literal phrase > (i.e. with quotationmarks)? > > Best > Erick > > On Thu, Jun 9, 2011 at 10:56 AM, Adam Estrada > wrote: > > All, > > > > I am at a bit of a loss here so any help would be greatly appreciated. I > am > &g

Re: [Mahout] Integration with Solr

2011-06-09 Thread Adam Estrada
change engines flawlessly (just need an Adapter). > Regards, > Tommaso > > 2011/6/9 Adam Estrada > > > Has anyone integrated Mahout with Solr? I know that Carrot2 is part of > the > > core build but the docs say that it's not very good for very large > indexes. > > Anyone have thoughts on this? > > > > Thanks, > > Adam > > >

[Free Text] Field Tokenizing

2011-06-09 Thread Adam Estrada
All, I am at a bit of a loss here so any help would be greatly appreciated. I am using the DIH to grab data from a DB. The field that I am most interested in has anywhere from 1 word to several paragraphs worth of free text. What I would really like to do is pull out phrases like "Joe's coffee sho

[Mahout] Integration with Solr

2011-06-09 Thread Adam Estrada
Has anyone integrated Mahout with Solr? I know that Carrot2 is part of the core build but the docs say that it's not very good for very large indexes. Anyone have thoughts on this? Thanks, Adam

Re: [Visualizations] from Query Results

2011-06-03 Thread Adam Estrada
search :: http://search-lucene.com/ > > > > - Original Message > > From: Adam Estrada > > To: solr-user@lucene.apache.org > > Sent: Fri, June 3, 2011 7:13:39 AM > > Subject: [Visualizations] from Query Results > > > > Dear Solr experts, > &

[Visualizations] from Query Results

2011-06-03 Thread Adam Estrada
Dear Solr experts, I am curious to learn what visualization tools are out there to help me "visualize" my query results. I am not talking about a language specific client per se but something more like Carrot2 which breaks clusters in to their knowledge tree and expandable pie chart. Sorry if thos

Re: GeoJSON Response Writer

2011-05-29 Thread Adam Estrada
t; Cheers, > Chris > > On May 29, 2011, at 11:14 AM, Adam Estrada wrote: > > > All, > > > > Has anyone modified the current json response writer to include the > GeoJSON > > geospatial encoding s

GeoJSON Response Writer

2011-05-29 Thread Adam Estrada
All, Has anyone modified the current json response writer to include the GeoJSON geospatial encoding standard. See here: http://geojson.org/ Just curious... Adam

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Adam Estrada
Well...by default there is a pretty decent schema that you can use as a template in the example project that builds with Solr. Tika is the library that does the actual content extraction so it would be a good idea to try the example project out first. Adam 2011/4/6 Ezequiel Calderara > Another

Re: dataimport

2011-03-09 Thread Adam Estrada
Brian, I had the same problem a while back and set the JAVA_OPTS env variable to something my machine could handle. That may also be an option for you going forward. Adam On Wed, Mar 9, 2011 at 9:33 AM, Brian Lamb wrote: > This has since been fixed. The problem was that there was not enough mem

Re: Tomcat EXE Source Code

2011-02-25 Thread Adam Estrada
Some of these links may help... http://www.google.com/search?client=safari&rls=en&q=apache+tomcat+download&ie=UTF-8&oe=UTF-8 Adam On Feb 25, 2011, at 3:16 AM, rajini maski wrote: > Can anybody help me to get the source code of the Tomcat exe > file i.e, source code of the installa

Re: Datetime problems with dataimport

2011-02-22 Thread Adam Estrada
I logged an issue in Jira that relates to this and it looks like Yonik picked it up. https://issues.apache.org/jira/browse/SOLR-2286 Adam On Feb 22, 2011, at 9:07 AM, MOuli wrote: > > Ok i got it. > > It should look like -mm-ddThh:mm:ssZ > for example: 2011-02-22T15:07:00Z > -- > View

Re: Index Autocad

2011-02-19 Thread Adam Estrada
I think you may have already posted this same question but please check VoyagerGIS out. They have some shit-hot software that is geared specifically towards the archive and retrieval of geospatial data. I suggest that you check it out!!! w/r, Adam On Sat, Feb 19, 2011 at 2:33 AM, lucene lucene

Re: Indexing AutoCAD files

2011-02-19 Thread Adam Estrada
Hi Vignesh, I believe that you would have to incorporate GDAL in to Tika in order to read the file and extract the proper metadata. This is entirely doable but I don't know how to do it. There are companies out there that specialize in this sort of thing so hopefully, one of them has already conta

Re: Indexing AutoCAD files

2011-02-19 Thread Adam Estrada
Hi Vignesh, I believe that you would have to incorporate GDAL in to Tika in order to read the file and extract the proper metadata. This is entirely doable but I don't know how to do it. There are companies out there that specialize in this sort of thing so hopefully, one of them has already conta

[Solr] and CouchDB

2011-02-19 Thread Adam Estrada
I am curious to see if anyone has messed around with Solr and the Couch-Lucene incarnation that is out there...I was passed this article this morning and it really opened my eyes about CouchDB http://m.readwriteweb.com/hack/2011/02/hacker-chat-max-ogden.php Thoughts, Adam

Re: Passing parameters to DataImportHandler

2011-02-15 Thread Adam Estrada
Yep...Take a look at this example. Map your SQL query to the appropriate fields in your index. create a directory under called DataImportHandler and reference it in your update command using curl or whatever. example /solr/conf/dataimporthandler

Re: Difference between Solr and Lucidworks distribution

2011-02-13 Thread Adam Estrada
I believe that the Lucid Works distro for Solr is free and as you mentioned they only appear to sell their services for it. I have used that version for several demos because it does seem to have all the bells and whistles already included and it's super easy to set up. The only downside in my c

Re: Architecture decisions with Solr

2011-02-09 Thread Adam Estrada
I tried the multi-core route and it gets too complicated and cumbersome to maintain. That is just from my own personal testing...It was suggested that each user have their own ID in a single index that you can query against accordingly. In the example schema.xml I believe there is a field called

Re: [WKT] Spatial Searching

2011-02-09 Thread Adam Estrada
is it's license, unfortunately. Otherwise, I think > it would be done already! We could, since it's LGPL, make it an optional > dependency, assuming someone can stub it out. > > On Feb 8, 2011, at 11:18 PM, Adam Estrada wrote: > >> I just came across a ~nudge post

Re: [WKT] Spatial Searching

2011-02-09 Thread Adam Estrada
cy, assuming someone can stub it out. > > On Feb 8, 2011, at 11:18 PM, Adam Estrada wrote: > >> I just came across a ~nudge post over in the SIS list on what the status is >> for that project. This got me looking more in to spatial mods with Solr4.0. >> I found this enha

[WKT] Spatial Searching

2011-02-08 Thread Adam Estrada
I just came across a ~nudge post over in the SIS list on what the status is for that project. This got me looking more in to spatial mods with Solr4.0. I found this enhancement in Jira. https://issues.apache.org/jira/browse/SOLR-2155. In this issue, David mentions that he's already integrated

Re: Time fields

2011-02-02 Thread Adam Estrada
If your using a DIH you can configure it however you want. Here is a snippet of my code. Note the DateTimeTransformer. On Wed, Feb 2, 2011 at 7:28 PM, Dennis Gearon wrote: > For time of d

Re: [Failure] to start Solr 4.0

2011-01-28 Thread Adam Estrada
I found the problem...You HAVE to build the Solr directory using "ant example" in order for the web application to start properly. Sorry to post so many times. Adam On Jan 28, 2011, at 5:20 PM, Adam Estrada wrote: > All, > > I've checked out the latest code and built

[Failure] to start Solr 4.0

2011-01-28 Thread Adam Estrada
All, I've checked out the latest code and built the root directory with ant compile and then I build the solr directory again using the ant dist command which gives me the lucene-libs directory and a couple others. Now Solr won't start. What am I missing??? This is as far as it gets. mini:ex

Re: Tika config in ExtractingRequestHandler

2011-01-27 Thread Adam Estrada
I believe that as along as Tika is included in a folder that is referenced by solrconfig.xml you should be good. Solr will automatically throw mime types to Tika for parsing. Can anyone else add to this? Thanks, Adam On Thu, Jan 27, 2011 at 5:06 AM, Erlend Garåsen wrote: > > The wiki page for th

Re: DIH From various File system locations

2011-01-25 Thread Adam Estrada
m On Tue, Jan 25, 2011 at 3:21 PM, Adam Estrada wrote: > There are a few tutorials out there. > > 1. http://wiki.apache.org/nutch/RunningNutchAndSolr (not the most practical) > 2. http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ (similar to 1.) > 3. Build the lates

Re: DIH From various File system locations

2011-01-25 Thread Adam Estrada
There are a few tutorials out there. 1. http://wiki.apache.org/nutch/RunningNutchAndSolr (not the most practical) 2. http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ (similar to 1.) 3. Build the latest from branch http://svn.apache.org/repos/asf/nutch/branches/branch-1.3/ and read this

Re: Indexing spatial columns

2011-01-24 Thread Adam Estrada
Hi MapButcher, There are a couple things that are going on here. 1. The spatial functionality is confusing between versions of Solr. I wish someone would update the solr Spatial Search wiki page. 2. You will want to use the jTDS Driver here instead of the one from Microsoft. http://jtds.sour

Re: [Building] Solr4.0 on Windows

2011-01-23 Thread Adam Estrada
d save a lot of time. Where in the build script would I need to change this? Thanks, Adam On Jan 23, 2011, at 9:31 PM, Adam Estrada wrote: > All, > > I am having problems building Solr trunk on my windows 7 machine. I > get the following errors... > > BUILD FAILED >

Re: Indexing FTP Documents through SOLR??

2011-01-23 Thread Adam Estrada
+1 on Nutch! On Fri, Jan 21, 2011 at 4:11 PM, Markus Jelsma wrote: > Hi, > > Please take a look at Apache Nutch. I can crawl through a file system over > FTP. > After crawling, it can use Tika to extract the content from your PDF files and > other. Finally you can then send the data to your Solr

[Building] Solr4.0 on Windows

2011-01-23 Thread Adam Estrada
All, I am having problems building Solr trunk on my windows 7 machine. I get the following errors... BUILD FAILED C:\Apache\Solr-Nightly\build.xml:23: The following error occurred while executin g this line: C:\Apache\Solr-Nightly\lucene\common-build.xml:529: The following error occurred while ex

Re: Solr Out of Memory Error

2011-01-19 Thread Adam Estrada
Is anyone familiar with the environment variable, JAVA_OPTS? I set mine to a much larger heap size and never had any of these issues again. JAVA_OPTS = -server -Xms4048m -Xmx4048m Adam On Wed, Jan 19, 2011 at 3:29 AM, Isan Fulia wrote: > Hi all, > By adding more servers do u mean sharding of in

Re: boilerpipe solr tika howto please

2011-01-14 Thread Adam Estrada
r URL > > And other type of this application, is 'Readability' which is more for a > end-user (http://lab.arc90.com/experiments/readability/) > > > Le 14.01.2011 16:51, Adam Estrada a écrit : > > Is there a drastic difference between this and TagSoup which is alre

Re: boilerpipe solr tika howto please

2011-01-14 Thread Adam Estrada
Is there a drastic difference between this and TagSoup which is already included in Solr? On Fri, Jan 14, 2011 at 6:57 AM, arnaud gaudinat wrote: > Hello, > > I would like to use BoilerPipe (a very good program which cleans the html > content from surplus "clutter"). > I saw that BoilerPipe is in

[sfield] Missing in Spatial Search

2011-01-13 Thread Adam Estrada
According to the documentation here: http://wiki.apache.org/solr/SpatialSearch the field that identifies the spatial point data is "sfield". See the console output below. Jan 13, 2011 6:49:40 PM org.apache.solr.core.SolrCore execute INFO: [] webapp=/solr path=/select params={spellcheck=true&f.jtyp

Re: Multi-word exact keyword case-insensitive search suggestions

2011-01-13 Thread Adam Estrada
Hi, the following seems to work pretty well.

Re: Solr 4.0 => Spatial Search - How to

2011-01-12 Thread Adam Estrada
In my case, I am getting data from a database and am able to concatenate the lat/long as a coordinate pair to store in my coords field. To test this, I randomized the lat/long values and generated about 6000 documents. Adam On Wed, Jan 12, 2011 at 8:29 PM, caman wrote: > > Adam, > > thanks. Yes

Re: Solr 4.0 => Spatial Search - How to

2011-01-12 Thread Adam Estrada
q=*:*+AND+eventdate:[2006-01-21T00:00:000Z+TO+2007-01-21T00:00:000Z]&fq={!geofilt%20pt=32.15,-93.85%20sfield=coords%20d=5000}>Please let me know what you find. Adam On Wed, Jan 12, 2011 at 8:24 PM, Adam Estrada wrote: > I believe this is what you are looking for. I renamed the field

Re: Solr 4.0 => Spatial Search - How to

2011-01-12 Thread Adam Estrada
I believe this is what you are looking for. I renamed the field called "store" to "coords" in the schema.xml file. The tricky part is building out the query. I am using SolrNet to do this though and have not yet cracked the problem. http://localhost:8983/solr/select?q=*:*+AND+eventdate:[2006-01-21

[Example] Compound Queries

2011-01-11 Thread Adam Estrada
All, I have the following query which works just fine for querying a date range. Now I would like to add any kind of spatial query to the mix. Would someone be so kind as to help me out with an example spatial query that works in conjunction with my date range query? http://localhost:8983/solr/se

Re: DIH - Closing ResultSet in JdbcDataSource

2011-01-07 Thread Adam Estrada
This is my configuration which seems to work just fine. >From there it's just a matter of running the select statement and mapping it against the correct fields in your index. Adam On Fri, Jan 7, 2011 at 2:40 PM, Shane Perry wrote: > Hi, > > I am in the process of migrating our system

[Solr4.0] Release Date

2011-01-05 Thread Adam Estrada
I know this is a bit premature but does anyone have an anticipated release date for Solr4.0? I looked at Jira and there are still quite a few unresolved issues in there. Do we think it will be spring, summer or fall for this release? Thanks, Adam

Re: [sqljdbc4.jar] Errors

2011-01-05 Thread Adam Estrada
I can't tell any difference in performance but it does work like a charm. At least the messaging in the console is a lot more verbose. Thank you very much for the heads up on this one ;-) Adam On Wed, Jan 5, 2011 at 4:29 AM, Gora Mohanty wrote: > On Wed, Jan 5, 2011 at 10:18 AM, Estrada Groups

Re: [sqljdbc4.jar] Errors

2011-01-04 Thread Adam Estrada
Can you try by downgrading your Java version to java 5. > > However i am using Java 6u13 with sqljdbc4.jar , i however do not > > get any error. > > If possible, can you pleease also try with some other version of > > Java 6. > > > >

[sqljdbc4.jar] Errors

2011-01-04 Thread Adam Estrada
Can anyone help me with the following error. I upgraded my database to SQL Server 2008 SP2 and now I get the following error. It was working with SQL Server 2005. Caused by: java.lang.UnsupportedOperationException: Java Runtime Environment (JR E) version 1.6 is not supported by this driver. Use t

Re: [Nutch] and Solr integration

2011-01-03 Thread Adam Estrada
crawl -threads 10 -depth 100 -topN 50* -solr* http://localhost:8983/solr <http://localhost:8983/solr>The correct param is -solr NOT -solrindex. Cheers, Adam On Mon, Jan 3, 2011 at 11:45 AM, Adam Estrada wrote: > All, > > I realize that the documentation says that you crawl first the

Re: SpatialTierQueryParserPlugin Loading Error

2011-01-03 Thread Adam Estrada
No just yet, Grant...I have been sidetracked on a couple other things but I will keep you posted. Thanks for the response, Adam On Mon, Jan 3, 2011 at 10:22 AM, Grant Ingersoll wrote: > Sorry, I just saw this, Adam. Were you able to get it working? > > On Dec 28, 2010, at 8:54 PM, Ada

Re: [Nutch] and Solr integration

2011-01-03 Thread Adam Estrada
very useful blog > http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ > I integrated nutch and solr and it works well. > > Thanks > > On Tue, Dec 21, 2010 at 1:57 AM, Adam Estrada-2 [via Lucene] < > ml-node+2122347-622655030-146...@n3.nabble.com > > > > > wrote: > >

Re: [DIH] and XML Namespaces

2010-12-29 Thread Adam Estrada
s '' the mapping should just contain'subject').Easy, isn't it? And you didn't need to write one line of code! Enjoy [image: :)]" On Wed, Dec 29, 2010 at 12:05 PM, Adam Estrada < estrada.adam.gro...@gmail.com> wrote: > All, > > I am indexing

[DIH] and XML Namespaces

2010-12-29 Thread Adam Estrada
All, I am indexing some RSS feeds that are bound to specific namespaces. See below... http://ws.geonames.org/rssToGeoRSS?geoRSS=simple&feedUrl=http://www.cbp.gov/xp/cgov/admin/rss/?rssUrl=/home.xml " processor="XPathEntityProcessor" forEach="/rss/channel | /rss/ch

Re: SpatialTierQueryParserPlugin Loading Error

2010-12-28 Thread Adam Estrada
ctly. > > > On Dec 13, 2010, at 10:06 PM, Adam Estrada wrote: > > > All, > > > > Can anyone shed some light on this error. I can't seem to get this > > class to load. I am using the distribution of Solr from Lucid > > Imagination and the Spatial Plugin fr

Re: [Import Timeout] using /dataimport

2010-12-24 Thread Adam Estrada
ll timeout issues and adding the params suggested in that article don't seem to fix it. Adam On Tue, Dec 21, 2010 at 8:04 PM, Koji Sekiguchi wrote: > (10/12/22 9:35), Adam Estrada wrote: > >> All, >> >> I've noticed that there are some RSS feeds that are slow to re

[Import Timeout] using /dataimport

2010-12-21 Thread Adam Estrada
All, I've noticed that there are some RSS feeds that are slow to respond, especially during high usage times throughout the day. Is there a way to set the timeout to something really high or have it just wait until the feed is returned? The entire thing stops working when the feed doesn't respond.

Re: [Reload-Config] not working

2010-12-21 Thread Adam Estrada
s, Adm On Mon, Dec 20, 2010 at 10:58 AM, Adam Estrada wrote: > This is the response I get...Does it matter that the configuration file is > called something other than data-config.xml? After I get this I still have > to restart the service. I wonder...do I need to commit the change

Re: [Nutch] and Solr integration

2010-12-20 Thread Adam Estrada
gt; > Also for nutch-solr integration this is very useful blog > http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ > I integrated nutch and solr and it works well. > > Thanks > > On Tue, Dec 21, 2010 at 1:57 AM, Adam Estrada-2 [via Lucene] < > ml-node+2122347-622655

[Nutch] and Solr integration

2010-12-20 Thread Adam Estrada
All, I have a couple websites that I need to crawl and the following command line used to work I think. Solr is up and running and everything is fine there and I can go through and index the site but I really need the results added to Solr after the crawl. Does anyone have any idea on how to make

Re: [Reload-Config] not working

2010-12-20 Thread Adam Estrada
This is the response I get...Does it matter that the configuration file is called something other than data-config.xml? After I get this I still have to restart the service. I wonder...do I need to commit the change? -

[Reload-Config] not working

2010-12-19 Thread Adam Estrada
http://localhost:8983/solr/select?clean=false&commit=true&qt=%2Fdataimport&command=full-import";>Full Import http://localhost:8983/solr/select?clean=false&commit=true&qt=%2Fdataimport&command=reload-config";>Reload Configuration All, The links above are meant for me to reload the configuration fi

Re: bulk commits

2010-12-16 Thread Adam Estrada
ng > > > > It is always a good idea to learn from your own mistakes. It is usually a > better > > idea to learn from others’ mistakes, so you do not have to make them > yourself. > > from 'http://blogs.techrepublic.com.com/security/?p=4501&tag=

Re: bulk commits

2010-12-16 Thread Adam Estrada
vation,gtopo30,timezone,modificationdate&stream.file=C :\tmp\xao.csv&overwrite=true&stream.contentType=text/plain;charset=utf-8" curl " http://localhost:8983/solr/update/csv?commit=true&separator=%2C&fieldnames=id,name,asciiname,latitude,longitude,featureclass,fea

Re: bulk commits

2010-12-16 Thread Adam Estrada
what is it that you are trying to commit? a On Thu, Dec 16, 2010 at 1:03 PM, Dennis Gearon wrote: > What have people found as the best way to do bulk commits either from the > web or > from a file on the system? > > Dennis Gearon > > > Signature Warning > > It is always a good

Re: indexing a lot of XML dokuments

2010-12-16 Thread Adam Estrada
I have been very successful in following this example http://wiki.apache.org/solr/DataImportHandler#HttpDataSource_Example Adam On Thu, Dec 16, 2010 at 5:44 AM, Jörg Agatz wrote: > hi, users, i serch e way to indexing a lot of

Thank you!

2010-12-15 Thread Adam Estrada
I just want to say that this list serve has been invaluable to a newbie like me ;-) I posted a question earlier today and literally 10 minutes later I got an answer that helped me solve my problem. This is proof that there is a experienced and energetic community behind this FOSS group of projects

Re: [Adding] Entities when indexing a DB

2010-12-15 Thread Adam Estrada
entity i > usually prefix it with the entity name in the value mapped to the schema id > field > > On 15 Dec 2010, at 20:49, Adam Estrada wrote: > > > All, > > > > I have successfully indexed a single entity but when I try multiple > entities > > is the secon

[Adding] Entities when indexing a DB

2010-12-15 Thread Adam Estrada
All, I have successfully indexed a single entity but when I try multiple entities is the second is skipped all together. Is there something wrong with my config file?

Re: [DIH] Example for SQL Server

2010-12-15 Thread Adam Estrada
I got it to work! This is an excellent article for importing SQL Server data in to your index. http://www.chrisumbel.com/article/lucene_solr_sql_server <http://www.chrisumbel.com/article/lucene_solr_sql_server>Adam On Wed, Dec 15, 2010 at 8:43 AM, Adam Estrada wrote: > Thanks All, &g

Re: Dataimport performance

2010-12-15 Thread Adam Estrada
What version of Solr are you using? Adam 2010/12/15 Robert Gründler > Hi, > > we're looking for some comparison-benchmarks for importing large tables > from a mysql database (full import). > > Currently, a full-import of ~ 8 Million rows from a MySQL database takes > around 3 hours, on a QuadCo

Re: [DIH] Example for SQL Server

2010-12-15 Thread Adam Estrada
u seen the example in the > > distribution somewhere like > > /example/example-DIH/solr/db/conf/db-data-config.xml? > > > > Also, there's a magic URL for debugging DIH at: > > .../solr/admin/dataimport.jsp > > > > If none of that is useful, could you post your a

[DIH] Example for SQL Server

2010-12-14 Thread Adam Estrada
Does anyone have an example config.xml file I can take a look at for SQL Server? I need to index a lot of data from a DB and can't seem to figure out the right syntax so any help would be greatly appreciated. What is the correct /jar file to use and where do I put it in order for it to work? Thank

SpatialTierQueryParserPlugin Loading Error

2010-12-13 Thread Adam Estrada
All, Can anyone shed some light on this error. I can't seem to get this class to load. I am using the distribution of Solr from Lucid Imagination and the Spatial Plugin from here https://issues.apache.org/jira/browse/SOLR-773. I don't know how to apply a patch but the jar file is in there. What el

Re: [pubDate] is not converting correctly

2010-12-13 Thread Adam Estrada
; New Issue' for the Solr project. > > On Mon, Dec 13, 2010 at 2:13 PM, Lance Norskog wrote: > > Please file a JIRA requesting this. > > > > On Mon, Dec 13, 2010 at 6:29 AM, Adam Estrada > wrote: > >> +1 If I knew enough about how to do this in Java I wou

Re: Indexing pdf files - question.

2010-12-13 Thread Adam Estrada
Hi, I use the following command to post PDF files. $ curl "http://localhost:8983/solr/update/extract?stream.file=C :\temp\document.docx&stream.contentType=application/msword&literal.id =esc.doc&commit=true" $ curl "http://localhost:8983/solr/update/extract?stream.file=C :\temp\features.pdf&stream

Re: [pubDate] is not converting correctly

2010-12-13 Thread Adam Estrada
rs: Is it kosher to add this to the Solr > distribution? It's not in the JDK and is also com.sun.* > > On Sun, Dec 12, 2010 at 5:33 PM, Adam Estrada > wrote: > > Thanks for the feedback! There are quite a few formats that can be used. > I > > am experiencing at least

Re: [pubDate] is not converting correctly

2010-12-12 Thread Adam Estrada
31 PM, Koji Sekiguchi wrote: > (10/12/13 8:49), Adam Estrada wrote: > >> All, >> >> I am having some difficu"lties parsing the pubDate field that is part of >> the? >> RSS spec (I believe). I get the warnin

[pubDate] is not converting correctly

2010-12-12 Thread Adam Estrada
All, I am having some difficu"lties parsing the pubDate field that is part of the RSS spec (I believe). I get the warning that "states, "Dec 12, 2010 6:45:26 PM org.apache.solr.handler.dataimport.DateFormatTransformer transformRow WARNING: Could not parse a Date field java.text.ParseException: Un

Re: [Multiple] RSS Feeds at a time...

2010-12-12 Thread Adam Estrada
sformer"> As you can see, I can add sub-entities from what appears to be as many times as I want. The idea was to reload the xml file after each entity is added. What else

Re: SOLR geospatial

2010-12-12 Thread Adam Estrada
there either. Here's a place to start if you want to see the JIRA > discussions: https://issues.apache.org/jira/browse/SOLR-1568 > > Best > Erick > > > On Sun, Dec 12, 2010 at 11:23 AM, Adam Estrada >wrote: > > > I am particularly interested in storing and quer

Re: SOLR geospatial

2010-12-12 Thread Adam Estrada
I am particularly interested in storing and querying polygons. That sort of thing looks like its on their roadmap so does anyone know what the status is on that? Also, integration with JTS would make this a core component of any GIS. Again, anyone know what the status is on that? *What’s on the ro

[Parsing] Date Fields

2010-12-11 Thread Adam Estrada
All, I am ingesting a lot of RSS feeds as part of my application and I keep getting the same error. WARNING: Could not parse a Date field java.text.ParseException: Unparseable date: "Mon, 06 Dec 2010 23:31:38 +" at java.text.DateFormat.parse(Unknown Source) at org.apache.solr.

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Adam Estrada
You are da man! w00t! adam On Sat, Dec 11, 2010 at 4:48 PM, Ahmet Arslan wrote: > > I found that you can have a single config file that can > > have several > > entities in it. My question now is how can I add entities > > without restarting > > the Solr service? > > You mean changing and re-lo

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Adam Estrada
, Dec 10, 2010 at 10:38 PM, Lance Norskog wrote: > There is I believe no way to do this without separate copies of your > script. Each 'handler=/dataimport' has to refer to a separate config > file. > > You can make several copies and name them config1.xml, config2.xml > et

Re: Indexing documents with SOLR

2010-12-11 Thread Adam Estrada
: > Hi Adam, >Thanks a lot for pointing me out to NUTCH. >Can you please tell me, is through NUTCH Can I read teh directory on > local system or on a shared file system. > > Will wait for your response. > > / Pankaj Bhatt > > > On Fri, Dec 10, 2010 at 9:35 P

[Multiple] RSS Feeds at a time...

2010-12-10 Thread Adam Estrada
All, Right now I am using the default DIH config that comes with the Solr examples. I update my index using the dataimport handler here http://localhost:8983/solr/admin/dataimport.jsp?handler=/dataimport This works fine but I want to be able to index more than just one feed at a time and more im

Re: Indexing documents with SOLR

2010-12-10 Thread Adam Estrada
Nutch is also a great option if you want a crawler. I have found that you will need to use the latest version of PDFBox and a it's dependencies for better results. Also, make sure to set JAVA_OPT to something really large so that you won't exceed your heap size. Adam On Fri, Dec 10, 2010 at 6:27

Re: [Multiple] RSS Feeds and Source Field

2010-12-09 Thread Adam Estrada
t; that every outgoing document has a value > > If that's irrelevant, can we see your schema file? > > You can send as many documents in a packet as you want. > > Best > Erick > > On Thu, Dec 9, 2010 at 3:23 PM, Adam Estrada > wrote: > > > All, > > >

Re: Open source Solr UI with multiple select faceting?

2010-12-09 Thread Adam Estrada
SolrNet has a great example application that you can use...There is a great Javascript project called SolrAjax but I don't know what the state of it is. Adam On Thu, Dec 9, 2010 at 4:53 PM, Andy wrote: > Hi, > > Any open source Solr UI's that support selecting multiple facet values > ("OR" face

Re: [Multiple] RSS Feeds and Source Field

2010-12-09 Thread Adam Estrada
ny more of your suggestions on the index would be greatly appreciated Adam On Thu, Dec 9, 2010 at 3:46 PM, Jonathan Rochkind wrote: > You look at what index in Luke? I bet you $10 there is no index called > "source*" in your index. With an asterisk in it. > > > On 12

[Multiple] RSS Feeds and Source Field

2010-12-09 Thread Adam Estrada
All, I am indexing RSS feeds from several sources so I have a couple questions. 1. There is only 1 source for each RSS feed which is typically the name of the feed, I get an error in my app stating " *Value cannot be null. Parameter name: source* I look at the index in Luke and there are data val

Re: [Casting] values on update/csv

2010-12-08 Thread Adam Estrada
Hi, I am using curl to run the following and as soon as I convert the field type from string to tdouble, I get the errors you see below. 0:0:0:0:0:0:0:1 - - [08/12/2010:23:28:27 +] "GET /solr/update/csv?commit=true&separator=%2C&fieldnames=id,name,asciiname,lat,lng,countrycode,population,el

[Casting] values on update/csv

2010-12-08 Thread Adam Estrada
All, I have a csv file and I want to store one of the fields as a tdouble type. It does not like that at all...Is there a way to cast the string value to a tdouble? Thanks, Adam

Re: Batch Update Fields

2010-12-05 Thread Adam Estrada
; On Sat, Dec 4, 2010 at 12:16 PM, Adam Estrada < > estrada.adam.gro...@gmail.com > > wrote: > > > Synonyms eh? I have a synonym list like the following so how do I > identify > > the synonyms on a specific field. The only place the field is used is as > a > > fa

Re: Batch Update Fields

2010-12-04 Thread Adam Estrada
ata... > > That said, if you pre-processing is working for you it may not be worth > your while > to worry about doing it differently > > Best > Erick > > On Fri, Dec 3, 2010 at 12:51 PM, Adam Estrada < > estrada.adam.gro...@gmail.com > > wrote: > >

Re: Batch Update Fields

2010-12-03 Thread Adam Estrada
er solution... > > Best > Erick > > On Fri, Dec 3, 2010 at 12:20 PM, Adam Estrada < > estrada.adam.gro...@gmail.com > > wrote: > > > I wonder...I know that sed would work to find and replace the terms in > all > > of the csv files that I am indexing b

  1   2   >