First off...I know enough about Solr to be VERY dangerous so please bare with me ;-) I am indexing the geonames database which only provides country codes. I can facet the codes but to the end user who may not know all 249 codes, it isn't really all that helpful. Therefore, I want to map the full country names to the country codes provided in the geonames db. http://download.geonames.org/export/dump/
<http://download.geonames.org/export/dump/>I used a simple split function to chop the 850 meg txt file in to manageable csv's that I can import in to Solr. Now that all 7 million + documents are in there, I want to change the country codes to the actual country names. I would of liked to have done it in the index but finding and replacing the strings in the csv seems to be working fine. After that I can just reindex the entire thing. Adam On Fri, Dec 3, 2010 at 12:42 PM, Erick Erickson <erickerick...@gmail.com>wrote: > Have you consider defining synonyms for your code <->country > conversion at index time (or query time for that matter)? > > We may have an XY problem here. Could you state the high-level > problem you're trying to solve? Maybe there's a better solution... > > Best > Erick > > On Fri, Dec 3, 2010 at 12:20 PM, Adam Estrada < > estrada.adam.gro...@gmail.com > > wrote: > > > I wonder...I know that sed would work to find and replace the terms in > all > > of the csv files that I am indexing but would it work to find and replace > > key terms in the index? > > > > find C:\\tmp\\index\\data -type f -exec sed -i 's/AF/AFGHANISTAN/g' {} \; > > > > That command would iterate through all the files in the data directory > and > > replace the country code with the full country name. I many just back up > > the > > directory and try it. I have it running on csv files right now and it's > > working wonderfully. For those of you interested, I am indexing the > entire > > Geonames dataset > http://download.geonames.org/export/dump/(allCountries.zip) > > which gives me a pretty comprehensive world gazetteer. My next step is > > gonna > > be to display the results as KML to view over a google globe. > > > > Thoughts? > > > > Adam > > > > On Fri, Dec 3, 2010 at 7:57 AM, Erick Erickson <erickerick...@gmail.com > > >wrote: > > > > > No, there's no equivalent to SQL update for all values in a column. > > You'll > > > have to reindex all the documents. > > > > > > On Thu, Dec 2, 2010 at 10:52 PM, Adam Estrada < > > > estrada.adam.gro...@gmail.com > > > > wrote: > > > > > > > OK part 2 of my previous question... > > > > > > > > Is there a way to batch update field values based on a certain > > criteria? > > > > For example, if thousands of documents have a field value of 'US' can > I > > > > update all of them to 'United States' programmatically? > > > > > > > > Adam > > > > > >