[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-11-12 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15660270#comment-15660270 ] Tristan Nixon commented on OPENNLP-776: --- I'm not seeing any changes on

[jira] [Commented] (OPENNLP-857) ParserTool should take use Tokenizer instance. It should not use java.util.StringTokenizer

2016-11-12 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15660268#comment-15660268 ] Tristan Nixon commented on OPENNLP-857: --- I'm not seeing this in the t

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-11-07 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15644259#comment-15644259 ] Tristan Nixon commented on OPENNLP-776: --- I've been swamped with other wo

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-28 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15616763#comment-15616763 ] Tristan Nixon commented on OPENNLP-776: --- Great, I'll give the patch at

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-28 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15616742#comment-15616742 ] Tristan Nixon commented on OPENNLP-776: --- Thanks, I think this patch looks goo

[jira] [Comment Edited] (OPENNLP-776) Model Objects should be Serializable

2016-10-04 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15545918#comment-15545918 ] Tristan Nixon edited comment on OPENNLP-776 at 10/4/16 4:4

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-04 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15545918#comment-15545918 ] Tristan Nixon commented on OPENNLP-776: --- Sorry, I probably should have rem

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-04 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15545832#comment-15545832 ] Tristan Nixon commented on OPENNLP-776: --- But that's not what my imple

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-04 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15545791#comment-15545791 ] Tristan Nixon commented on OPENNLP-776: --- Well, it's a bit of a m

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-04 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15545610#comment-15545610 ] Tristan Nixon commented on OPENNLP-776: --- It's not about the JVM vers

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-10-04 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15545536#comment-15545536 ] Tristan Nixon commented on OPENNLP-776: --- Thanks, that's great! While i

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-08-19 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: serializable-basemodel.patch Patch to make BaseModel serializable > Model Obje

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-08-19 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428454#comment-15428454 ] Tristan Nixon commented on OPENNLP-776: --- Good point. I thought the only wa

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-08-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: serialization_proxy.patch Patch containing modifications to model classes to
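The attachment name points at the serialization proxy pattern. The patch itself is not reproduced in this listing, so the following is only a generic sketch of that pattern, not OpenNLP code: `ToyModel`, its fields, and `SerializationProxyDemo` are hypothetical names made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InvalidObjectException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.Arrays;

/** Hypothetical immutable model class; stands in for a real model type. */
final class ToyModel implements Serializable {
    private static final long serialVersionUID = 1L;
    private final String language;
    private final double[] weights;

    ToyModel(String language, double[] weights) {
        if (language == null || language.isEmpty()) {
            throw new IllegalArgumentException("language is required");
        }
        this.language = language;
        this.weights = weights.clone();
    }

    String language() { return language; }
    double[] weights() { return weights.clone(); }

    // Serialize the proxy instead of this object.
    private Object writeReplace() {
        return new Proxy(this);
    }

    // Reject any stream that tries to deserialize ToyModel directly.
    private void readObject(ObjectInputStream in) throws InvalidObjectException {
        throw new InvalidObjectException("Serialization proxy required");
    }

    /** Carries only the logical state; rebuilds the model via its constructor. */
    private static final class Proxy implements Serializable {
        private static final long serialVersionUID = 1L;
        private final String language;
        private final double[] weights;

        Proxy(ToyModel model) {
            this.language = model.language;
            this.weights = model.weights;
        }

        // Runs after deserialization, so constructor validation is re-applied.
        private Object readResolve() {
            return new ToyModel(language, weights);
        }
    }
}

final class SerializationProxyDemo {
    /** Serializes the model to a byte array and reads it back. */
    static ToyModel roundTrip(ToyModel model) throws IOException, ClassNotFoundException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(model);
        }
        try (ObjectInputStream in =
                 new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
            return (ToyModel) in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        ToyModel copy = roundTrip(new ToyModel("en", new double[] {0.5, 1.5}));
        System.out.println(copy.language() + " " + Arrays.toString(copy.weights()));
    }
}
```

One reason people reach for this pattern is that the model class can keep its fields final and its invariants enforced by the constructor, since deserialization always goes through `readResolve`.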

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-08-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15427213#comment-15427213 ] Tristan Nixon commented on OPENNLP-776: --- True, you don't need se

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-08-08 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15412332#comment-15412332 ] Tristan Nixon commented on OPENNLP-776: --- This pattern is quite commo

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-08-08 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15412320#comment-15412320 ] Tristan Nixon commented on OPENNLP-776: --- There are some slight differe

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-07-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: (was: externalizable.patch) > Model Objects should be Serializa

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-07-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: externalizable.patch Also model classes can't be final if we're going

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-07-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: externalizable.patch Actually, there is one more thing that must happen for this

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-07-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: (was: model-constructors.patch) > Model Objects should be Serializa

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2016-07-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: (was: BaseModel-serialization.patch) > Model Objects should be Serializa

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2016-07-18 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15383308#comment-15383308 ] Tristan Nixon commented on OPENNLP-776: --- Finally returning to this after more

[jira] [Updated] (OPENNLP-857) ParserTool should take use Tokenizer instance. It should not use java.util.StringTokenizer

2016-07-09 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-857: -- Attachment: ParserToolTokenize.patch My patch > ParserTool should take use Tokenizer insta

[jira] [Created] (OPENNLP-857) ParserTool should take use Tokenizer instance. It should not use java.util.StringTokenizer

2016-07-09 Thread Tristan Nixon (JIRA)
Tristan Nixon created OPENNLP-857: - Summary: ParserTool should take use Tokenizer instance. It should not use java.util.StringTokenizer Key: OPENNLP-857 URL: https://issues.apache.org/jira/browse/OPENNLP-857
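To illustrate why the issue asks for a pluggable tokenizer, here is a minimal sketch comparing `java.util.StringTokenizer` with a punctuation-aware alternative. `SimpleTokenizer`, `TokenizerDemo`, and `PUNCT_AWARE` are hypothetical names for this example only; they are not the OpenNLP API and not the attached patch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.StringTokenizer;

/** Hypothetical stand-in for a pluggable tokenizer interface. */
interface SimpleTokenizer {
    String[] tokenize(String text);
}

final class TokenizerDemo {
    /** What StringTokenizer does by default: split on whitespace only. */
    static String[] whitespaceSplit(String text) {
        StringTokenizer st = new StringTokenizer(text);
        List<String> tokens = new ArrayList<>();
        while (st.hasMoreTokens()) {
            tokens.add(st.nextToken());
        }
        return tokens.toArray(new String[0]);
    }

    /** Toy tokenizer: runs of letters/digits are words, each other non-space char is its own token. */
    static final SimpleTokenizer PUNCT_AWARE = text -> {
        List<String> tokens = new ArrayList<>();
        StringBuilder word = new StringBuilder();
        for (char c : text.toCharArray()) {
            if (Character.isLetterOrDigit(c)) {
                word.append(c);
                continue;
            }
            // Any non-word character ends the current word.
            if (word.length() > 0) {
                tokens.add(word.toString());
                word.setLength(0);
            }
            if (!Character.isWhitespace(c)) {
                tokens.add(String.valueOf(c));
            }
        }
        if (word.length() > 0) {
            tokens.add(word.toString());
        }
        return tokens.toArray(new String[0]);
    };

    public static void main(String[] args) {
        String sentence = "Hello, world!";
        System.out.println(Arrays.toString(whitespaceSplit(sentence)));
        System.out.println(Arrays.toString(PUNCT_AWARE.tokenize(sentence)));
    }
}
```

The whitespace split leaves the comma and exclamation mark glued to the neighbouring words, which is usually not what a parser trained on separately tokenized punctuation expects.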

Re: How to distribute dependent files (.so , jar ) across spark worker nodes

2016-03-14 Thread Tristan Nixon
> > > On Mon, Mar 14, 2016 at 2:31 PM, Jakob Odersky <mailto:ja...@odersky.com>> wrote: > Have you tried setting the configuration > `spark.executor.extraLibraryPath` to point to a location where your > .so's are available? (Not sure if non-local files, such as HDF

Re: How to distribute dependent files (.so , jar ) across spark worker nodes

2016-03-14 Thread Tristan Nixon
What build system are you using to compile your code? If you use a dependency management system like maven or sbt, then you should be able to instruct it to build a single jar that contains all the other dependencies, including third-party jars and .so’s. I am a maven user myself, and I use the

Re: Newbie question - Help with runtime error on augmentString

2016-03-11 Thread Tristan Nixon
in, but with its troublesome scalap dependency removed. > On Mar 11, 2016, at 6:34 PM, Vasu Parameswaran wrote: > > Added these to the pom and still the same error :-(. I will look into sbt as > well. > > > > On Fri, Mar 11, 2016 at 2:31 PM, Tristan Nixon <m

Re: udf StructField to JSON String

2016-03-11 Thread Tristan Nixon
So I think in your case you’d do something more like: val jsontrans = new JsonSerializationTransformer[StructType]().setInputCol("event").setOutputCol("eventJSON") > On Mar 11, 2016, at 3:51 PM, Tristan Nixon wrote: > > val jsontrans = new > JsonSerializationTransformer

Re: How to distribute dependent files (.so , jar ) across spark worker nodes

2016-03-11 Thread Tristan Nixon
I recommend you package all your dependencies (jars, .so’s, etc.) into a single uber-jar and then submit that. It’s much more convenient than trying to manage including everything in the --jars arg of spark-submit. If you build with maven then the shade plugin will do this for you nicely: https:

Re: Newbie question - Help with runtime error on augmentString

2016-03-11 Thread Tristan Nixon
You must be relying on IntelliJ to compile your scala, because you haven’t set up any scala plugin to compile it from maven. You should have something like this in your plugins: net.alchim31.maven scala-maven-plugin scala-compile-first process-resources compile

Re: udf StructField to JSON String

2016-03-11 Thread Tristan Nixon
into a JSON-formatted string. * Created by Tristan Nixon on 3/11/16. */ class JsonSerializationTransformer[T](override val uid: String) extends UnaryTransformer[T,String,JsonSerializationTransformer[T]] { def this() = this(Identifiable.randomUID("JsonSerializationTransformer"))

Re: udf StructField to JSON String

2016-03-11 Thread Tristan Nixon
(nullable = true) > > > > I want to transform the Column event into String (formatted as JSON). > > I was trying to use udf but without success. > > > On Fri, Mar 11, 2016 at 1:53 PM Tristan Nixon <st...@memeticlabs.org> wrot

Re: udf StructField to JSON String

2016-03-11 Thread Tristan Nixon
Have you looked at DataFrame.write.json( path )? https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter > On Mar 11, 2016, at 7:15 AM, Caires Vinicius wrote: > > I have one DataFrame with nested StructField and I want to convert to JSON > String. There is

Re: Can we use spark inside a web service?

2016-03-10 Thread Tristan Nixon
Hear, hear. That’s why I’m here :) > On Mar 10, 2016, at 7:32 PM, Chris Fregly wrote: > > Anyway, thanks for the good discussion, everyone! This is why we have these > lists, right! :)

Re: Can we use spark inside a web service?

2016-03-10 Thread Tristan Nixon
Very interested, Evan, thanks for the link. It has given me some food for thought. I’m also in the process of building a web application which leverages Spark on the back-end for some heavy lifting. I would be curious about your thoughts on my proposed architecture: I was planning on running a s

Re: Installing Spark on Mac

2016-03-10 Thread Tristan Nixon
r I'm running it as super user. > > I have java version 1.8.0_73 and SCALA version 2.11.7 > > Sent from my iPhone > >> On 9 Mar 2016, at 21:58, Tristan Nixon wrote: >> >> That’s very strange. I just un-set my SPARK_HOME env param, downloaded a >> fresh

Re: log4j pains

2016-03-10 Thread Tristan Nixon
Hmmm… that should be right. > On Mar 10, 2016, at 11:26 AM, Ashic Mahtab wrote: > > src/main/resources/log4j.properties > > Subject: Re: log4j pains > From: st...@memeticlabs.org > Date: Thu, 10 Mar 2016 11:08:46 -0600 > CC: user@spark.apache.org > To: as...@live.com > > Where in the jar is th

Re: log4j pains

2016-03-10 Thread Tristan Nixon
Where in the jar is the log4j.properties file? > On Mar 10, 2016, at 9:40 AM, Ashic Mahtab wrote: > > 1. Fat jar with logging dependencies included. log4j.properties in fat jar. > Spark doesn't pick up the properties file, so uses its defaults.

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
It really shouldn’t; if anything, running as superuser should ALLOW you to bind to ports 0, 1 etc. It seems very strange that it should even be trying to bind to these ports - maybe a JVM issue? I wonder if the old Apple JVM implementations could have used some different native libraries for cor

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
That’s very strange. I just un-set my SPARK_HOME env param, downloaded a fresh 1.6.0 tarball, unzipped it to local dir (~/Downloads), and it ran just fine - the driver port is some randomly generated large number. So SPARK_HOME is definitely not needed to run this. Aida, you are not running thi

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
ts to a > single machine(local host) > > Sent from my iPhone > >> On 9 Mar 2016, at 19:59, Tristan Nixon wrote: >> >> Also, do you have the SPARK_HOME environment variable set in your shell, and >> if so what is it set to? >> >>> On Mar 9,

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
Also, do you have the SPARK_HOME environment variable set in your shell, and if so what is it set to? > On Mar 9, 2016, at 1:53 PM, Tristan Nixon wrote: > > There should be a /conf sub-directory wherever you installed spark, which > contains several configuration files. > I b

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
r message > > When I look at the spark-defaults.conf.template it shows a spark > example(spark://master:7077) where the port is 7077 > > When you say look to the conf scripts, how do you mean? > > Sent from my iPhone > >> On 9 Mar 2016, at 19:32, Tristan Nixon wrote: &

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
Yeah, according to the standalone documentation http://spark.apache.org/docs/latest/spark-standalone.html the default port should be 7077, which means that something must be overriding this on your installation - look to the conf scripts! > On Mar 9, 2016, at 1:26 PM, Tristan Nixon wr

Re: Installing Spark on Mac

2016-03-09 Thread Tristan Nixon
Looks like it’s trying to bind on port 0, then 1. Often the low-numbered ports are restricted to system processes and “established” servers (web, ssh, etc.) and so user programs are prevented from binding on them. The default should be to run on a high-numbered port like 8080 or such. What do yo

Re: Specify log4j properties file

2016-03-09 Thread Tristan Nixon
You can also package an alternative log4j config in your jar files > On Mar 9, 2016, at 12:20 PM, Ashic Mahtab wrote: > > Found it. > > You can pass in the jvm parameter log4j.configuration. The following works: > > -Dlog4j.configuration=file:path/to/log4j.properties > > It doesn't work with

Re: SparkFiles.get() returns with driver path Instead of Worker Path

2016-03-08 Thread Tristan Nixon
ew to spark and I am just messing around with it. > > On Mar 8, 2016 10:23 PM, "Tristan Nixon" <st...@memeticlabs.org> wrote: > My understanding of the model is that you’re supposed to execute > SparkFiles.get(…) on each worker node, not on the driver. >

Re: Analyzing json Data streams using sparkSQL in spark streaming returns java.lang.ClassNotFoundException

2016-03-08 Thread Tristan Nixon
this is a bit strange, because you’re trying to create an RDD inside of a foreach function (the jsonElements). This executes on the workers, and so will actually produce a different instance in each JVM on each worker, not one single RDD referenced by the driver, which is what I think you’re try

Re: SparkFiles.get() returns with driver path Instead of Worker Path

2016-03-08 Thread Tristan Nixon
My understanding of the model is that you’re supposed to execute SparkFiles.get(…) on each worker node, not on the driver. Since you already know where the files are on the driver, if you want to load these into an RDD with SparkContext.textFile, then this will distribute it out to the workers,

Re: OOM exception during Broadcast

2016-03-07 Thread Tristan Nixon
happening. > > On Mon, Mar 7, 2016 at 5:55 PM, Tristan Nixon <st...@memeticlabs.org> wrote: > I’m not sure I understand - if it was already distributed over the cluster in > an RDD, why would you want to collect and then re-send it as a broadcast > variable? Why

Re: OOM exception during Broadcast

2016-03-07 Thread Tristan Nixon
> Hi Tristan, > > This is not static, I actually collect it from an RDD to the driver. > > On Mon, Mar 7, 2016 at 5:42 PM, Tristan Nixon <mailto:st...@memeticlabs.org>> wrote: > Hi Arash, > > is this static data? Have you considered including it in your jars an

Re: OOM exception during Broadcast

2016-03-07 Thread Tristan Nixon
Hi Arash, is this static data? Have you considered including it in your jars and de-serializing it from jar on each worker node? It’s not pretty, but it’s a workaround for serialization troubles. > On Mar 7, 2016, at 5:29 PM, Arash wrote: > > Hello all, > > I'm trying to broadcast a variable
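A rough sketch of that workaround, assuming the data was written with plain Java serialization and bundled into the jar: each JVM loads it lazily, at most once, instead of receiving it as a broadcast. `LazyResource` and the resource name in the usage note are hypothetical, and nothing here is Spark API.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.ObjectInputStream;
import java.io.UncheckedIOException;
import java.util.function.Supplier;

/** Deserializes an object from a stream at most once per instance, on first use. */
final class LazyResource<T> {
    private final Supplier<InputStream> source;
    private volatile T value;

    LazyResource(Supplier<InputStream> source) {
        this.source = source;
    }

    @SuppressWarnings("unchecked")
    T get() {
        T local = value;
        if (local == null) {
            synchronized (this) {
                local = value;
                if (local == null) {
                    // The supplier is only invoked here, so the stream is opened once.
                    try (ObjectInputStream in = new ObjectInputStream(source.get())) {
                        local = (T) in.readObject();
                    } catch (IOException e) {
                        throw new UncheckedIOException(e);
                    } catch (ClassNotFoundException e) {
                        throw new IllegalStateException(e);
                    }
                    value = local;
                }
            }
        }
        return local;
    }
}
```

In a job, one would typically hold this in a static field, for example `static final LazyResource<Map<String, Integer>> LOOKUP = new LazyResource<>(() -> MyJob.class.getResourceAsStream("/lookup.ser"));` (class and file name made up), and call `LOOKUP.get()` inside the function that runs on the workers, so each executor JVM reads its own copy from the jar.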

[jira] [Commented] (OPENNLP-808) Parser is not thread safe

2016-02-26 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169432#comment-15169432 ] Tristan Nixon commented on OPENNLP-808: --- A simple way to deal with this is to

[jira] [Commented] (TIKA-1362) Add GoogleTranslate implementation of Translation API

2015-07-10 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14622999#comment-14622999 ] Tristan Nixon commented on TIKA-1362: - Great to hear, and thanks for the invite.

[jira] [Commented] (TIKA-1362) Add GoogleTranslate implementation of Translation API

2015-07-10 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14622982#comment-14622982 ] Tristan Nixon commented on TIKA-1362: - Storing the API key in the properties fil

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2015-05-19 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14550604#comment-14550604 ] Tristan Nixon commented on OPENNLP-776: --- It does not make the (de-)serializa

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2015-05-19 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: model-constructors.patch I realized that for automatic de-serialization, all

[jira] [Commented] (OPENNLP-776) Model Objects should be Serializable

2015-05-19 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14550561#comment-14550561 ] Tristan Nixon commented on OPENNLP-776: --- You're totally welcome! Let me

[jira] [Updated] (OPENNLP-776) Model Objects should be Serializable

2015-05-14 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/OPENNLP-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tristan Nixon updated OPENNLP-776: -- Attachment: BaseModel-serialization.patch My patch > Model Objects should be Serializa

[jira] [Created] (OPENNLP-776) Model Objects should be Serializable

2015-05-14 Thread Tristan Nixon (JIRA)
Tristan Nixon created OPENNLP-776: - Summary: Model Objects should be Serializable Key: OPENNLP-776 URL: https://issues.apache.org/jira/browse/OPENNLP-776 Project: OpenNLP Issue Type

[jira] [Commented] (SPARK-4414) SparkContext.wholeTextFiles Doesn't work with S3 Buckets

2015-04-28 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14517886#comment-14517886 ] Tristan Nixon commented on SPARK-4414: -- Thanks, [~petedmarsh], I was having

[SAtalk] order of preferences in white / black listing

2003-11-11 Thread Tristan Nixon
Hello all, I have a question regarding the way in which SA deals with whitelisting & blacklisting. If I want to whitelist all but a few select entries from a domain, how would I do it? Should the following work? whitelist_from [EMAIL PROTECTED] unwhitelist_from [EMAIL PROTECTED] unwhitelist_from

please disregard

2001-06-24 Thread Tristan Nixon
Hehe, Please disregard my last post - since I had only version 1.6.2 ( redhat still hasn't put out version 1.7 ) I didn't see that all the features I wanted had been included - I got version 1.7 and read the man page only to find that I'd emailed all my suggestions too soon - thanks guys you are r

what's up?

2001-06-24 Thread Tristan Nixon
Hello, I have been trying to use wget for some quick and simple mirroring of individual pages, and I've noticed that there are some features which would be nice to have that it does not seem to have. I would be more than willing to code these myself, but I don't want to duplicate anything that ma