[ 
https://issues.apache.org/jira/browse/CONNECTORS-277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13130060#comment-13130060
 ] 

Karl Wright commented on CONNECTORS-277:
----------------------------------------

nvm, I found a URL they claim will return the namespace list:


http://www.mediawiki.org/w/api.php?action=query&meta=siteinfo&siprop=namespaces 

This seems to return XML of the following form:

<?xml version="1.0"?>
<api>
  <query>
    <namespaces>
      <ns id="-2" case="first-letter" canonical="Media" 
xml:space="preserve">Media</ns>
      <ns id="-1" case="first-letter" canonical="Special" 
xml:space="preserve">Special</ns>
      <ns id="0" case="first-letter" subpages="" content="" 
xml:space="preserve" />
      <ns id="1" case="first-letter" subpages="" canonical="Talk" 
xml:space="preserve">Talk</ns>
      <ns id="2" case="first-letter" subpages="" canonical="User" 
xml:space="preserve">User</ns>
      <ns id="3" case="first-letter" subpages="" canonical="User talk" 
xml:space="preserve">User talk</ns>
      <ns id="4" case="first-letter" subpages="" canonical="Project" 
xml:space="preserve">Project</ns>
      <ns id="5" case="first-letter" subpages="" canonical="Project talk" 
xml:space="preserve">Project talk</ns>
      <ns id="6" case="first-letter" canonical="File" 
xml:space="preserve">File</ns>
      <ns id="7" case="first-letter" subpages="" canonical="File talk" 
xml:space="preserve">File talk</ns>
      <ns id="8" case="first-letter" canonical="MediaWiki" 
xml:space="preserve">MediaWiki</ns>
      <ns id="9" case="first-letter" subpages="" canonical="MediaWiki talk" 
xml:space="preserve">MediaWiki talk</ns>
      <ns id="10" case="first-letter" subpages="" canonical="Template" 
xml:space="preserve">Template</ns>
      <ns id="11" case="first-letter" subpages="" canonical="Template talk" 
xml:space="preserve">Template talk</ns>
      <ns id="12" case="first-letter" subpages="" canonical="Help" 
xml:space="preserve">Help</ns>
      <ns id="13" case="first-letter" subpages="" canonical="Help talk" 
xml:space="preserve">Help talk</ns>
      <ns id="14" case="first-letter" subpages="" canonical="Category" 
xml:space="preserve">Category</ns>
      <ns id="15" case="first-letter" canonical="Category talk" 
xml:space="preserve">Category talk</ns>
      <ns id="90" case="first-letter" canonical="Thread" 
xml:space="preserve">Thread</ns>
      <ns id="91" case="first-letter" canonical="Thread talk" 
xml:space="preserve">Thread talk</ns>
      <ns id="92" case="first-letter" canonical="Summary" 
xml:space="preserve">Summary</ns>
      <ns id="93" case="first-letter" canonical="Summary talk" 
xml:space="preserve">Summary talk</ns>
      <ns id="100" case="first-letter" subpages="" canonical="Manual" 
content="" xml:space="preserve">Manual</ns>
      <ns id="101" case="first-letter" subpages="" canonical="Manual talk" 
xml:space="preserve">Manual talk</ns>
      <ns id="102" case="first-letter" subpages="" canonical="Extension" 
content="" xml:space="preserve">Extension</ns>
      <ns id="103" case="first-letter" subpages="" canonical="Extension talk" 
xml:space="preserve">Extension talk</ns>
      <ns id="104" case="first-letter" subpages="" canonical="API" 
xml:space="preserve">API</ns>
      <ns id="105" case="first-letter" subpages="" canonical="API talk" 
xml:space="preserve">API talk</ns>
    </namespaces>
  </query>
</api>


                
> WikiConnector - option to limit crawl by namespace
> --------------------------------------------------
>
>                 Key: CONNECTORS-277
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-277
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: Wiki connector
>    Affects Versions: ManifoldCF 0.4
>            Reporter: Tobias Wunderlich
>            Assignee: Karl Wright
>            Priority: Minor
>             Fix For: ManifoldCF 0.4
>
>
> At the moment, the WikiConnector crawls the whole Wiki. This can take up a 
> lot of time. For testing purposes an option to limit the pages to crawl by 
> namespaces(title) would be great.
> Tobias

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to