Hi Maf.

Thanks for the header string from a custom report - that is promising search string.

I was in fact referring to the Chart of Accounts and agree that if I can find the data file I should be fine.

The problem is that the recovery operation (using Testdisk/Photorec) results in files and file fragments that may or may not be correctly identified by file extensions. And in the vast majority of cases the filename is of the form 'f765987234' plus an extension. If the filetype couldn't be determined from a list of 400 or so file signatures that Photorec has then it gets a '.txt' extension.

And in all cases the file folder structure is lost, replaced instead by folders that have 500 or so recovered files and file fragments.

PDFs, JPGs, etc seem to be largely recovered, with the identification and sorting challenge still ahead. Problematic are files that Photorec doesn't recognize ahead of time or which have no real "signature". And things like an encrypted database of passwords will probably only be found by dumb luck.

So if I can identify the data file, you're right, I should be fine. It's finding it that is the problem, assuming it's not corrupted. That has led me to focus on a "plan b" which would be a process for find either backups of transactions, plus something describing the Chart of Accounts that I could ideally import, plus any customized reports or styling. Which is why I'm hunting for strings that could be considered "signatures."

In going through the forum, I've identified
'trans_guid', "acc_guid', 'split_guid', 'PluginPageAccountTree', '<act:id', '<trn:id', '<split:id', 'gnc:account version', 'gnc:transaction version', '<trn:split>' as likely-unique "markers" for gnucash-related files, showing up for example in some .xml.gz files that I have yet to explore.

So I'm hopeful that I can identify a minimum number of files that I can manually import or stitch together to get back up and running.

When I do so, I'm happy to detail the process, assuming it's not considered too far off topic.

Thanks for your help and everyone's patience.
Matt

On 2017-06-29 07:13, Maf. King wrote:
Hi Matt.

What do you mean by category structure? If you're talking about the Chart of Accounts (ie the "tree"), that is stored in the data file. Recover it (or one of the auto-backup copied in the same directory) and I'd expect all to be
well.

Custom reports are in ~/.gnucash/saved-reports-2.4 - this head extract from mine suggests that a sequence of semicolons might be a good search string? or
"gnc:report" etc.

<SNIP>
maf@janus:~/.gnucash> head saved-reports-2.4
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;; Options for saved report "VAT Box 1 - Last Qtr", based on template
"2fe3b9833af044abb929a88d5a59620f"
(let ()
 (define (options-gen)
  (let ((options (gnc:report-template-new-options/report-guid
"2fe3b9833af044abb929a88d5a59620f" "Transaction Report")))

; Section: General

(let ((option (gnc:lookup-option options

<SNAP>

Maf.


On Thursday, 29 June 2017 11:38:28 BST MattC wrote:
Thank you. That is exactly what I'm looking for. Fwiw, it appears that searching for "trans_guild" identifies log files/fragments. So I'm hoping that piecing those together migjt be a successful "plan b". I think that leaves the category structure to try to figure out, as well as - though
less importantly - customized reports.

I'll report back how it goes, but if anyone has suggestions for other files
to search for, I'd appreciate the input.

Matt


Sent from my Verizon Wireless 4G LTE smartphone

<div>-------- Original message --------</div><div>From: "Maf. King"
<[email protected]> </div><div>Date:06/29/2017  4:31 AM  (GMT-05:00)
</div><div>To: [email protected] </div><div>Cc: [email protected]
</div><div>Subject: Re: File signatures?? </div><div>
</div>On Wednesday, 28 June 2017 14:40:09 BST [email protected] wrote:
> Hi,
>
> I think this is the right venue to ask this question.  If not, I can
> hopefully get a pointer to where else to turn.
>
> I need to figure out what - if any - file signatures could be used to
> identify gnucash data files.  The need arises from a harddisk crash and
> recovery effort, the result of which was a *lot* of files and file
> fragments recovered, but at the expense of the harddisk's directory
> structure and filenames (for the most part).  The harddisk in question
> has terabytes of data on it, so going through the disk manually is not
> practical.
>
> On this disk were the data files for a non-profit which had a somewhat
> customized account tree structure.  What I am trying to figure out is if
> there are any unique headers to a minimum number of files that could be
> used to recreate the transactions and other data in gnucash?  If there
> are keywords or byte strings I can use, then I can use disk search tools
> for look for the files and fragments that are relevant and try to stitch
> things back together.
>
> FWIW, I believe the account data was stored as XML rather than in a
> database.  And the version of gnucash I was using was whatever version
> was stable at the beginning of this calendar year.
>
> Thanks in advance for any help or pointers.
>
> Matt
>
> PS I already understand the wisdom of having some backup elsewhere, so I
> can forgo that pointer.  The problem in this case was that this unit was
> also the backup.

Hi Matt

from my system:

~> head myfile.gnucash
<?xml version="1.0" encoding="utf-8" ?>
<gnc-v2
     xmlns:gnc="http://www.gnucash.org/XML/gnc";
     xmlns:act="http://www.gnucash.org/XML/act";
     xmlns:book="http://www.gnucash.org/XML/book";
     xmlns:cd="http://www.gnucash.org/XML/cd";
     xmlns:cmdty="http://www.gnucash.org/XML/cmdty";
     xmlns:price="http://www.gnucash.org/XML/price";
     xmlns:slot="http://www.gnucash.org/XML/slot";
     xmlns:split="http://www.gnucash.org/XML/split";

Now, this is an uncompressed gc file from v 2.6.16, but dating back years; a
file created this year has exactly the same first few lines though.

If your file was saved with compression turned on, then your task is
probably harder - look for gz compressed files.

HTH,
Maf.
_______________________________________________
gnucash-user mailing list
[email protected]
https://lists.gnucash.org/mailman/listinfo/gnucash-user
-----
Please remember to CC this list on all your replies.
You can do this by using Reply-To-List or Reply-All.

Reply via email to