Hi,

I have been able to fix the NTFS image I have been
allowed to download.

Richard W.M. Jones wrote:
> On Wed, Apr 06, 2016 at 12:17:48PM +0200, Jean-Pierre André wrote:
>> There are several ways to tackle this issue, but before
>> doing anything, it would be useful to define what is
>> needed : just identifying which files are bad (and
>> recover them from some archive), recovering a few
>> files, etc. The issue is in a Windows system directory,
>> so there is probably nothing user related there.
>

There are four bad file names in the system32 directory,
they have a similar name with a bad surrogate pair
followed by ".log". A fifth one has a similar name, but
it has a valid surrogate pair.

They are small files (108 or 132 bytes) created at different
dates. One of them is recent, maybe the customer can
remember something specific being started on Monday
(mornings in the US).

Here are the creation dates :

Thu Feb 18 10:47:30 2016 UTC
Mon Aug 18 14:08:27 2014 UTC
Mon Oct  5 13:25:12 2015 UTC
Mon Jun  4 11:43:29 2012 UTC
Mon Aug 18 14:02:22 2014 UTC

> The problem is not really with this particular image.  It's not that
> we need to recover files from it or anything like that.  The problems
> are:
>
>   - How did this situation arise in the first place?

As I said surrogate pairs are present, which make them
unlikely to have been created by Windows XP. The pairs
are :

da5c dc93 (this is the valid one)
dc5c dc93
dd5c dc93
de24 dc93
de5c dc93

I have been able to replace them with valid characters,
and I can provide the disk patches if the customer wants
to get a clean image.

>   - Could it happen again for other Windows guests.

As (bad) surrogate pairs are present in a Windows XP
image, and these files have no short names, I am inclined
to think ntfs-3g has something to do with it. There has
been no change in utf8 to utf16 translations for six years,
and I am not aware of any other bug report related to the
translations.

More examination needed for telling what has happened.

> Plus, it'd be nice if ntfs-3g could ignore (or at least not give a
> hard error) in these cases.  It's actually the getdents(2) system call
> which fails, so any access at all to the directory returns -EILSEQ.

This will mean (optional) cheating with the translations
so that bad Unicode characters can translate to utf8 and
back to bad Unicode.

> We were trying to read a few files from \Windows\System32, it's most
> likely that the "corrupt" file is not a file that we care about.

I can provide disk patches if you want to delete them.

Regards

Jean-Pierre


------------------------------------------------------------------------------
_______________________________________________
ntfs-3g-devel mailing list
ntfs-3g-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ntfs-3g-devel

Reply via email to