Johannes Sixt <> writes:

> Am 18.05.2013 09:42, schrieb Andreas Leha:
>>> Am 14.05.2013 15:17, schrieb Andreas Leha:
>>>> Hi all,
>>>> how can I make git ignore the time stamp(s) in a PDF.  Two PDFs that
>>>> differ only in these time stamps should be considered identical.
>>>> ...
>>>> What I tried is a filter:
>>>> ,----[ ~/.gitconfig ]
>>>> | [filter "pdfresetdate"]
>>>> |         clean = pdfresetdate
>>>> `----
>>>> This 'works' as far as the committed pdf indeed has the date reset to my
>>>> default value.
>>>> However, when I re-checkout the files, they are marked modified by git.
>>> I'm using cleaned files every now and then, but not on Linux. I have
>>> never observed this behavior recently.
>>> If you 'git add' the file, does it keep its modified state? Does 'git
>> yes.
>>> diff' tell a difference?
>> no.
> I do not believe you. I'm sure that "Binary files differ" was
> reported.

You are correct, of course.  I had forgotten that I also had enabled a
special diff for pdf files, that reports the difference in the pdfinfo

> The reason is that your pdfresetdate script is not idempotent. Look:
> $ pdfresetdate < x.pdf > y.pdf
> $ pdfresetdate < y.pdf > z.pdf
> $ md5sum x.pdf y.pdf z.pdf
> c46a7097574a035e89d1a46d93c83528  x.pdf
> 8e6d942b4cc7d8a4dfe6898867573617  y.pdf
> e6333bc0f8ab9781d3e1d811a392d516  z.pdf

Thanks for that.  I had not noticed due to the non-binary diff I had

> A file that was already cleaned by the clean filter must not be
> modified, i.e., the y.pdf and z.pdf should be identical. But they are not.
> Fix your clean filter.

I will (try to) do.  Anyway, git seems unresponsible for my issue.

Thanks for that clear analysis!


To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to
More majordomo info at

Reply via email to