Definetly agree on that, what would be the steps to get the database from the server?
On Mon, Oct 27, 2014 at 10:19 PM, Andrew Douglas Pitonyak < and...@pitonyak.org> wrote: > > In its current state it is mostly useless. > > I took an 8 GB dump by crawling the site in September. I think that it has > as much spam as content in that dump. I think that I grabbed roughly 250K > threads. If I had more time, I would write some scripts to assist cleaning > up the spam, but that is especially difficult from an HTML dump that links > some of the files together. I started looking at the data, but, it really > is not the way to attempt to obtain clean data. Would be better if we could > grab the DB level stuff and migrate to a static site. > > > On 10/27/2014 09:51 PM, Alexandro Colorado wrote: > >> I see, I would hate to see this site die. Lots of information there, >> even if the site is held as a static site. >> >> On 10/27/14, Andrew Douglas Pitonyak <and...@pitonyak.org> wrote: >> >>> On 10/27/2014 07:16 PM, Alexandro Colorado wrote: >>> >>>> The conversation kinda went away and wonder what was the resolution >>>> about the oooforum and if there is any hope to recover the database at >>>> least to get some kind of plain browsable dump of the content. >>>> >>>> Regards. >>>> >>>> There was some discussion on the AOO private list, but I am not on that >>> list. I was copied for some of the stuff. Since I have heard nothing, my >>> guess is that the owner has been non-responsive. >>> >>> I attempted to perform a data dump, but that failed because there is a >>> limit to the size I can pull through the interface and compression is >>> disabled. >>> >> > -- > Andrew Pitonyak > My Macro Document: http://www.pitonyak.org/AndrewMacro.odt > Info: http://www.pitonyak.org/oo.php > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org > For additional commands, e-mail: dev-h...@openoffice.apache.org > > -- Alexandro Colorado Apache OpenOffice Contributor 882C 4389 3C27 E8DF 41B9 5C4C 1DB7 9D1C 7F4C 2614