Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change 
notification.

The "VirtualMachine" page has been changed by ChrisMattmann:
https://wiki.apache.org/tika/VirtualMachine?action=diff&rev1=13&rev2=14

Comment:
- add dump commands

  
  7. `groovy rmBugged.groovy`
  
+ ==== prep nsfpolardata ====
+ 
+ 1. `scp -r <user>@nsfpolardata.dyndns.org:/usr/local/ndeploy/data/AcadisCrawl 
.`
+ 
+ 2. (go get some coffee)
+ 
+ 3. `scp -r 
<user>@nsfpolardata.dyndns.org:/usr/local/ndeploy/data/AcadisCrawl2 .`
+ 
+ 4. (go get some coffee)
+ 
+ 5. `scp -r 
<user>@nsfpolardata.dyndns.org:/home/mattmann/polar-data/nutch_trunk/runtime/local/bin/crawlId
 .`
+ 
+ 6. (go get some coffee)
+ 
+ 7. `cd /data1/public/archives/nsf-polar-data/`
+ 
+ 8. `export NUTCH_OPTS="-Xmx8192m -XX:MaxPermSize=8192m"`
+ 
+ 9. `./bin/nutch dump -outputDir out -segment 
/data1/public/archives/nsf-polar-data/acadis/AcadisCrawl/segments/`
+ 
+ 10. `./bin/nutch dump -outputDir out2 -segment 
/data1/public/archives/nsf-polar-data/acadis/AcadisCrawl2/segments/`
+ 
+ 11. `./bin/nutch dump -outputDir out3 -segment 
/data1/public/archives/nsf-polar-data/nasa-amd/crawlId/segments/`
+ 
+ 
+ 
  == add more disc ==
  From Rackspace website, add block storage volume and attach it to server.
  

Reply via email to