lewismc opened a new pull request #165: URL: https://github.com/apache/any23/pull/165
This PR addresses https://issues.apache.org/jira/browse/ANY23-458 PR address the following improved info to STDOUT ``` rover Apache Any23 Command Line Tool. Usage: rover [options] input IRIs {<url>|<file>}+ Options: -d, --defaultns Override the default namespace used to produce statements. -e, --extractors a comma-separated list of extractors, e.g. rdf-xml,rdf-turtle, etc. A complete extractor list can be obtained by calling ./any23 extractor --list Default: [csv, html-embedded-jsonld, html-head-icbm, html-head-links, html-head-meta, html-head-title, html-mf-adr, html-mf-geo, html-mf-hcalendar, html-mf-hcard, html-mf-hlisting, html-mf-hrecipe, html-mf-hresume, html-mf-hreview, html-mf-hreview-aggregate, html-mf-license, html-mf-species, html-mf-xfn, html-microdata, html-rdfa11, html-xpath, ical, jcal, owl-functional, owl-manchester, rdf-jsonld, rdf-nq, rdf-nt, rdf-trix, rdf-turtle, rdf-xml, xcal, yaml] -f, --format a comma-separated list of writer factories, e.g. json,jsonld,nquads,notrivial,ntriples,trix,turtle,uri Default: [ntriples] -l, --log Produce log within a file. -n, --nesting Disable production of nesting triples. Default: false -t, --notrivial Filter trivial statements (e.g. CSS related ones). [DEPRECATED: As of version 2.3, use --format instead.] Default: false -o, --output Specify Output file (defaults to standard output) Default: java.io.PrintStream@2b2948e2 -p, --pedantic Validate and fixes HTML content detecting commons issues. Default: false -s, --stats Print out extraction statistics. Default: false ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
