[jira] [Comment Edited] (SOLR-6900) bin/post improvements needed
[ https://issues.apache.org/jira/browse/SOLR-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277938#comment-14277938 ] Erik Hatcher edited comment on SOLR-6900 at 1/15/15 2:39 PM: - Latest improvements: * Error handling: script now checks several things like collection specified, files/directories not mixed with URLs, and that one or more are specified * Spaces in file names now handled properly * Script works when run from any working directory Open issues: * Windows version not implemented yet (volunteers to get this in for 5.0? Otherwise will be deferred to a later version) * args (direct string to post to Solr) and stdin not yet supported was (Author: ehatcher): Latest improvements: * Error handling: script now checks several things like collection specified, files/directories not mixed with URLs, and that one more are specified * Spaces in file names now handled properly * Script works when run from any working directory Open issues: * Windows version not implemented yet (volunteers to get this in for 5.0? Otherwise will be deferred to a later version) * args (direct string to post to Solr) and stdin not yet supported bin/post improvements needed Key: SOLR-6900 URL: https://issues.apache.org/jira/browse/SOLR-6900 Project: Solr Issue Type: Bug Affects Versions: 5.0, Trunk Reporter: Erik Hatcher Assignee: Erik Hatcher Priority: Blocker Fix For: 5.0, Trunk * Fix glob patterns. They don't work as expected: bin/post collection1 \*.xml expands \*.xml such that the script gets all the file names as parameters not just literally \*.xml * Add error handling to check that the collection exists * Create Windows version -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Comment Edited] (SOLR-6900) bin/post improvements needed
[ https://issues.apache.org/jira/browse/SOLR-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277921#comment-14277921 ] Erik Hatcher edited comment on SOLR-6900 at 1/15/15 2:50 AM: - Several improvements committed. Here's the latest: {code} $ bin/post -h Usage: post -c collection/core files|directories|urls [OPTIONS] or post -help collection name defaults to DEFAULT_SOLR_COLLECTION if not specified OPTIONS === Solr options: -url base Solr update URL (overrides collection, host, and port) -host host (default: localhost) -port port (default: 8983) -commit yes|no (default: yes) Web crawl options: -recursive depth (default: 1) -delay seconds (default=10) Directory crawl options: -delay seconds (default=0) Other options: -filetypes type[,type,...] (default: xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log) -params key=value[key=value...] (values must be URL-encoded; these pass through to Solr update request) -out yes|no (default=no; yes outputs Solr response to console) Examples: JSON file: bin/post -c wizbang events.json XML files: bin/post -c records article*.xml CSV file: bin/post -c signals LATEST-signals.csv Directory of files: bin/post -c myfiles ~/Documents Web crawl: bin/post -c gettingstarted http://lucidworks.com -recursive 2 -delay 1 {code} was (Author: ehatcher): Several improvements committed. Here's the latest: {code} $ bin/post -h Usage: post -c collection/core files|directories|urls [OPTIONS] or post -help collection name defaults to DEFAULT_SOLR_COLLECTION if not specified OPTIONS === Solr options: url=base Solr update URL (overrides collection, host, and port) host=host (default: localhost) port=port (default: 8983) commit=yes|no (default: yes) Web crawl options: recursive=depth (default: 1) delay=seconds (default=10) Directory crawl options: delay=seconds (default=0) Other options: filetypes=type[,type,...] (default: xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log) params=key=value[key=value...] (values must be URL-encoded) out=yes|no (default=no; yes outputs Solr response to console) Examples: JSON file: bin/post -c wizbang events.json XML files: bin/post -c records article*.xml CSV file: bin/post -c signals LATEST-signals.csv Directory of files: bin/post -c myfiles ~/Documents Web crawl: bin/post -c gettingstarted http://lucidworks.com recursive=2 delay=1 {code} bin/post improvements needed Key: SOLR-6900 URL: https://issues.apache.org/jira/browse/SOLR-6900 Project: Solr Issue Type: Bug Affects Versions: 5.0, Trunk Reporter: Erik Hatcher Assignee: Erik Hatcher Priority: Blocker Fix For: 5.0, Trunk * Fix glob patterns. They don't work as expected: bin/post collection1 \*.xml expands \*.xml such that the script gets all the file names as parameters not just literally \*.xml * Add error handling to check that the collection exists * Create Windows version -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Comment Edited] (SOLR-6900) bin/post improvements needed
[ https://issues.apache.org/jira/browse/SOLR-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270017#comment-14270017 ] Erik Hatcher edited comment on SOLR-6900 at 1/8/15 8:44 PM: bin/post collection file with spaces.csv does not work, breaks arguments at whitespace apparently. was (Author: ehatcher): Files with whitespace in their names does not work - needs fixing here too. bin/post improvements needed Key: SOLR-6900 URL: https://issues.apache.org/jira/browse/SOLR-6900 Project: Solr Issue Type: Bug Affects Versions: 5.0, Trunk Reporter: Erik Hatcher Assignee: Erik Hatcher Fix For: 5.0, Trunk * Fix glob patterns. They don't work as expected: bin/post collection1 \*.xml expands \*.xml such that the script gets all the file names as parameters not just literally \*.xml * Add error handling to check that the collection exists * Create Windows version -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Comment Edited] (SOLR-6900) bin/post improvements needed
[ https://issues.apache.org/jira/browse/SOLR-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270021#comment-14270021 ] Grant Ingersoll edited comment on SOLR-6900 at 1/8/15 8:45 PM: --- I tried: {code} bin/post citi /foo/projects/content/citi-bike/2013-07 - Citi Bike trip data.csv bin/post citi /foo/projects/content/citi-bike/2013-07\ -\ Citi\ Bike\ trip\ bin/post citi /foo/content/citi-bike/2013-07\ -\ Citi\ Bike\ trip\ data.csv {code} All failed w/ errors in parsing spaces. was (Author: gsingers): I tried: {code} bin/post citi /foo/projects/content/citi-bike/2013-07 - Citi Bike trip data.csv bin/post citi /foo/projects/content/citi-bike/2013-07\ -\ Citi\ Bike\ trip\ bin/post citi /foo/content/citi-bike/2013-07\ -\ Citi\ Bike\ trip\ data.csv {code} bin/post improvements needed Key: SOLR-6900 URL: https://issues.apache.org/jira/browse/SOLR-6900 Project: Solr Issue Type: Bug Affects Versions: 5.0, Trunk Reporter: Erik Hatcher Assignee: Erik Hatcher Fix For: 5.0, Trunk * Fix glob patterns. They don't work as expected: bin/post collection1 \*.xml expands \*.xml such that the script gets all the file names as parameters not just literally \*.xml * Add error handling to check that the collection exists * Create Windows version -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Comment Edited] (SOLR-6900) bin/post improvements needed
[ https://issues.apache.org/jira/browse/SOLR-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14262644#comment-14262644 ] Alexandre Rafalovitch edited comment on SOLR-6900 at 1/1/15 7:27 PM: - Also, the help message is problematic. The underlying simplepost tool talks about using *-h* but you can't actually use that as the first parameter is the collection name. A bit confusing. But even if I give dummy collection name and -h, I am not sure the examples given will actually work. So, maybe it's own help is needed for the tool. was (Author: arafalov): Also, the help message is problematic. The underlying simplepost tool talks about using *-h* but you can't actually use that as the first parameter is the collection name. A bit confusing. bin/post improvements needed Key: SOLR-6900 URL: https://issues.apache.org/jira/browse/SOLR-6900 Project: Solr Issue Type: Bug Affects Versions: 5.0, Trunk Reporter: Erik Hatcher Assignee: Erik Hatcher Fix For: 5.0, Trunk * Fix glob patterns. They don't work as expected: bin/post collection1 \*.xml expands \*.xml such that the script gets all the file names as parameters not just literally \*.xml * Add error handling to check that the collection exists * Create Windows version -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org