Author: chetanm
Date: Wed Jun 21 09:05:54 2017
New Revision: 1799407
URL: http://svn.apache.org/viewvc?rev=1799407&view=rev
Log:
OAK-6370 - Improve documentation for text pre-extraction
Remove reference to --nodestore option which is not supported for trunk
Modified:
jackrabbit/oak/trunk/oak-doc/src/site/markdown/query/pre-extract-text.md
Modified:
jackrabbit/oak/trunk/oak-doc/src/site/markdown/query/pre-extract-text.md
URL:
http://svn.apache.org/viewvc/jackrabbit/oak/trunk/oak-doc/src/site/markdown/query/pre-extract-text.md?rev=1799407&r1=1799406&r2=1799407&view=diff
==============================================================================
--- jackrabbit/oak/trunk/oak-doc/src/site/markdown/query/pre-extract-text.md
(original)
+++ jackrabbit/oak/trunk/oak-doc/src/site/markdown/query/pre-extract-text.md
Wed Jun 21 09:05:54 2017
@@ -45,14 +45,14 @@ To generate the csv file use the `--gene
java -jar oak-run.jar tika \
--fds-path /path/to/datastore \
- --nodestore /path/to/segmentstore --data-file oak-binary-stats.csv
--generate
+ /path/to/segmentstore --data-file oak-binary-stats.csv --generate
If connecting to S3 this command can take long time because checking binary id
currently triggers download of the
actual binary content which we do not require. To speed up here we can use the
Fake DataStore support of oak-run
java -jar oak-run.jar tika \
--fake-ds-path=temp \
- --nodestore /path/to/segmentstore --data-file oak-binary-stats.csv
--generate
+ /path/to/segmentstore --data-file oak-binary-stats.csv --generate
This would generate a csv file with content like below