On Fri, Oct 27, 2006 at 03:55:16PM +0200, bbackde at googlemail.com wrote:
> >Why would the key be different? As long as he uses the same MIME type
> >and the same filename, it should be the same.
> 
> I wrote he changed the filename because he wanted another name. If he
> forgets the old name, or if he does not use the old name explicitely,
> then he will get a different key.
> This is a user error, I know. If he makes no mistake the key is the
> same. But this is also not exactly user friendly ;)
> 
> But nevertheless, no problem for me. And I know what you meant with
> the checksum, you could do it, and it would be nice to use SHA-256, as
> Frost does *g*.

Of course. It's been suggested for ease of integration with other
systems that we provide one or more enforced checksums in the top-level
metadata. If we can spare the CPU cycles we could provide the full range
- SHA-256, SHA-1, MD5. But realistically probably just SHA-256, which
hopefully others will use anyway. This isn't entirely trivial to
implement; how important is it?

> If you want to implement it please tell me about this. The reason is
> that the new Frost will rely on the fact that the same file creates
> the same CHK key. I assume if you add a checksum to metadata the CHK
> key changes... and yes I know its alpha and everything could change
> later, but then I will have to reset the filebase for Frost and start
> with a new filelist. I think also thaw and freemule would have a hit
> when the CHK keys for same content change.

Well ... the same data might yield a different key, but unless the top
level manifest has disappeared, it would still propagate the original
insert. I doubt it would affect Thaw. As a general rule, metadata
changes will always be back-compatible, and are likely to be infrequent,
but they may happen.
> 
> You mentioned one thing that sounds really good to me. If it would be
> possible to have one CHK key for the plain complete binary file (octet
> stream, no mime type, no filename) and multiple different CHKs that
> transport additional informations (filename,mime), this would be
> really great. This offers the most flexiblity, allows to compare files
> if content is the same and more. If the only disadvantage is that you
> have to insert 1 additional metadata CHK, then please do it. For 100+
> MB files this makes no difference...
> What do you think? Could this be done?

And for small files, it makes a huge difference. This requires an
arbitrary threshold in the node. How do we determine one?

I don't see why MIME should vary from inserter to inserter really. Only
broken clients use application/octet-stream. :)

You can force it to work this way if you want by inserting a CHK with no
filename, and then inserting a redirect to it. But you wouldn't at
present be able to ask the node for the redirect target.
> 
> On 10/27/06, toad <toad at amphibian.dyndns.org> wrote:
> >On Fri, Oct 27, 2006 at 02:51:25PM +0200, bbackde at googlemail.com wrote:
> >> I understand your points.
> >>
> >> I was not on the techlist before 2 months, thats why I ask now.
> >
> >Ok.
> >>
> >> After reading your text I think Frost has no hit if this behaviour is
> >> changed. Frost will just provide an empty "TargetFilename=" and it
> >> will never use a filename for requests or inserts, Frost will only
> >> request the pure CHK@ key.
> >> As I understood this continues to work, an application can still use
> >> an empty TargetFilename??? And this will work forever, true?
> >
> >Sure.
> >>
> >> As long as this works as is nice for me...
> >>
> >> Thanks for your clarifications.
> >>
> >> >Why should Freenet URIs behave so radically differently to any other
> >> >kind of URI in the known universe?
> >>
> >> imho this was a big advantage of freenet. In freenet all is free and
> >> you can't trust noone. But if someone gained some trust e.g. on Frost
> >> or over a freesite, and this person provides a CHK key, everyone
> >> always knows that the data behind this key are ok (theoretically).
> >> Now, if there are different keys for the same file, this does not work
> >> any longer. Sample: a trusted person uploads a CHK with a filename.
> >> Someone downloads the file and archives it, but he renamed it. After
> >> months the original trusted uploader is wasted, and the downloader
> >> reinserts the file because someone asked for it. If he use thaw (for
> >> example), then the file gets another CHK key because of the different
> >> filename. Then the reinserter provides the key to the public, but
> >> noone can verify that the reinserted file is really the same file as
> >> uploaded by the trusted person...
> >
> >Why would the key be different? As long as he uses the same MIME type
> >and the same filename, it should be the same.
> >
> >> Maybe some checksum would help, but I don't know how and where to add 
> >it...
> >
> >I can add a checksum to the metadata, that's what I was suggesting.
> >>
> >> On 10/27/06, toad <toad at amphibian.dyndns.org> wrote:
> >> >On Fri, Oct 27, 2006 at 08:37:56AM +0200, bbackde at googlemail.com wrote:
> >> >> I heard that you want to make "CHK with filename" mandatory. What does
> >> >> this mean ...
> >> >
> >> >It means that IF you specify a filename when you insert a CHK, that
> >> >filename becomes a necessary part of the URI. If you don't, it doesn't.
> >> >>
> >> >> I have to insert a file with URI=CHK@ and TargetFilename=abc. Then I
> >> >> must request this file with URI=CHK at bla/abc? the CHK key itself is 
> >> >> not
> >> >> enough?
> >> >>
> >> >> The same file, but with different names, get another CHK key?
> >> >
> >> >Yes. Although beyond the top splitfile level they will share all blocks.
> >> >>
> >> >> What is the sense of this? You said it would be easier to find the
> >> >> file in store, but why is this easier, the CHK is already unique? The
> >> >> filename just adds another abstraction layer and you have to deal with
> >> >> files of same name, but different content. Because of this you would
> >> >> always have to use key+filename for lookups, why is'nt the key itself
> >> >> enough for this?
> >> >
> >> >The CHK is not unique. Here is the problem:
> >> >
> >> >CHK at blah,blah,blah - is invalid, or is a simple key
> >> >CHK at blah,blah,blah/something - could be a simple key, or could be 
> >> >"fetch
> >> >CHK at blah,blah,blah then look up the file called something in the
> >> >manifest"
> >> >CHK at blah,blah,blah/something/else - could be
> >> >a) a simple key (CHK at blah,blah,blah)
> >> >b) a single container lookup (lookup something in CHK at blah,blah,blah)
> >> >c) a double container lookup (lookup something in CHK at blah,blah,blah
> >> >then else in the returned manifest)
> >> >
> >> >You see the problem? A slash delimits a directory, in virtually all URI
> >> >schemes. You should not be able to add arbitrary subdirectories to a URI
> >> >while still returning the original file. Being able to do so means that
> >> >you cannot compare two URIs with any level of confidence, as well as
> >> >being counter-intuitive. Also, there is additional ambiguity if we
> >> >support implicit containers (something.zip/filename-in-zip).
> >> >
> >> >Thus, there is a two-stage solution:
> >> >
> >> >1. If the user specifies a filename, insert the file as a single-file
> >> >manifest so that the filename is required to fetch the file.
> >> >2. Stop accepting superfluous path components.
> >> >
> >> >Then we have no ambiguity. A slash always indicates a manifest lookup -
> >> >or a part of an SSK or USK url, which is much the same thing for our
> >> >purposes (an SSK always has one path component before the manifests, a
> >> >USK always has two).
> >> >>
> >> >> IMHO this concept leads to problems.
> >> >
> >> >The current situation leads to problems.
> >> >
> >> >> I know that same files with
> >> >> different names only have another key because the manifest is
> >> >> different, the (hidden) datablocks itself have the same key. But how
> >> >> should applications for file sharing and insert-on-demand work with
> >> >> this concept?
> >> >
> >> >Having read my explanation above, you really think the current (well,
> >> >older) situation is better?
> >> >
> >> >> We share the file with a unique SHA identifier. If 2
> >> >> users have the same file, but with different names, this works well.
> >> >> Now if there are the same files, but with different CHK keys because
> >> >> the filename is different, the application would have to maintain a
> >> >> list of all known CHK keys for a file with same SHA checksum, and it
> >> >> would have to try to download one key after the other until a download
> >> >> is successful.
> >> >
> >> >Or you could just not use a filename.
> >> >>
> >> >> As you see, this concept adds more complexity to FCP2, with a
> >> >> questionable benefit.
> >> >
> >> >It reduces ambiguity and complexity by making keys behave the same way
> >> >as any other URI for any other protocol.
> >> >
> >> >Really, what is the alternative? As far as I can see these are our
> >> >options:
> >> >
> >> >1. Make CHKs not have filenames at all.
> >> >PRO: Easy, unambiguous
> >> >CON: Not exactly user-friendly
> >> >
> >> >2. CHKs may have any filename, we ignore it.
> >> >
> >> >The first path component is always ignored.
> >> >PRO: Easy, unambiguous
> >> >CON: Breaks existing CHK freesites
> >> >CON: Counterintuitive: part of URI can be tampered with
> >> >
> >> >3. If a filename is specified on insert, it is enforced.
> >> >(Currently working towards this)
> >> >
> >> >PRO: Easy, unambiguous
> >> >CON: See above
> >> >
> >> >4. Make CHKs have optional filenames.
> >> >
> >> >PRO: Makes Frost work
> >> >CON: Ambiguous: any number of bogus pathname components can be appended
> >> >to a URI and it still work
> >> >
> >> >Unless you have any better ideas?
> >> >
> >> >Now, I would be willing to include an enforced checksum in a key, if
> >> >that helps you. I might even be willing to always insert a separate
> >> >block for the data itself and the MIME type, and then have a redirect to
> >> >this to add on the filename. However this would reduce performance by
> >> >introducing an extra block fetch. So I'm not sure it's a good idea.
> >> >
> >> >Could you explain the usage scenario here in a bit more detail?
> >> >
> >> >> Simple applications that count on this concept
> >> >> would have problems later if there are alot of files from many users
> >> >> floating around... they should respect that only the CHK key is the
> >> >> URI for a file. This worked great on 0.5, everyone understand this and
> >> >> its easier for all...
> >> >
> >> >Everyone who has several years of experience with Freenet understood
> >> >this. Nobody else did. URIs should behave like URIs. I'd rather have
> >> >option 1 than option 4!
> >> >>
> >> >> Please don't make this mandatory.
> >> >
> >> >Why should Freenet URIs behave so radically differently to any other
> >> >kind of URI in the known universe?
> >> >>
> >> >> rgds, bback.
> >> >
> >> >
> >> >-----BEGIN PGP SIGNATURE-----
> >> >Version: GnuPG v1.4.5 (GNU/Linux)
> >> >
> >> >iD8DBQFFQe+nA9rUluQ9pFARAtHdAJ0VnNXyk/SVOpqI1NvjNNSaTPWkYwCeNZUi
> >> >baZto3+Gpm92nqZpUQ2rGQE=
> >> >=+aYc
> >> >-----END PGP SIGNATURE-----
> >> >
> >> >
> >> >_______________________________________________
> >> >Tech mailing list
> >> >Tech at freenetproject.org
> >> >http://emu.freenetproject.org/cgi-bin/mailman/listinfo/tech
> >> >
> >> >
> >> _______________________________________________
> >> Tech mailing list
> >> Tech at freenetproject.org
> >> http://emu.freenetproject.org/cgi-bin/mailman/listinfo/tech
> >>
> >
> >
> >-----BEGIN PGP SIGNATURE-----
> >Version: GnuPG v1.4.5 (GNU/Linux)
> >
> >iD8DBQFFQgx9A9rUluQ9pFARAivWAKCq1BU/AJ6DdS/RE7gwsDR0PPd/9gCfVmha
> >WcN6y9iqMi0RcLs/EoBodHE=
> >=T65Q
> >-----END PGP SIGNATURE-----
> >
> >
> >_______________________________________________
> >Tech mailing list
> >Tech at freenetproject.org
> >http://emu.freenetproject.org/cgi-bin/mailman/listinfo/tech
> >
> >
> _______________________________________________
> Tech mailing list
> Tech at freenetproject.org
> http://emu.freenetproject.org/cgi-bin/mailman/listinfo/tech
> 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL: 
<https://emu.freenetproject.org/pipermail/tech/attachments/20061027/6c94a841/attachment.pgp>

Reply via email to