Cory Helfrich wrote:

Hi!

>> Exactly. ;) I'd suggest to base such a DB not on mass but on
>> quality.
> Agreed. We should decide how we will treat things like ambiguous  
> player names.

Well, I'd suggest normalising the names against Scid's usual
spellcheck/ratings file provided by Franz.

> Also, my opinion is that the seven-tag roster plus the  
> source tag needs to be as complete as possible.

I fully agree. Additionally, if other tags are available,
you could include them as well. Tags that are often
available:

    ECO
    WhiteElo
    BlackElo
    WhiteCountry
    BlackCountry
    WhiteTeamCountry
    BlackTeamCountry
    Source
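
To make "as complete as possible" checkable, here is a minimal Python
sketch that reports which tags of the seven-tag roster plus Source are
missing from one game's header. The function name missing_tags and the
choice to treat "?" as missing are my own assumptions, not anything
Scid does.

```python
import re

# Seven Tag Roster from the PGN standard, plus the Source tag discussed here.
REQUIRED_TAGS = ["Event", "Site", "Date", "Round",
                 "White", "Black", "Result", "Source"]

# Matches one tag pair line such as: [Event "Some Open"]
TAG_LINE = re.compile(r'^\[(\w+)\s+"(.*)"\]\s*$')


def missing_tags(header_text):
    """Return the required tags that are absent, empty or "?" in one header."""
    found = {}
    for line in header_text.splitlines():
        m = TAG_LINE.match(line.strip())
        if m:
            found[m.group(1)] = m.group(2)
    # Assumption: a value of "?" (unknown) counts as missing.
    return [tag for tag in REQUIRED_TAGS if found.get(tag, "") in ("", "?")]
```

Called on a header that has an unknown Site and no Source line, it
returns ['Site', 'Source'], so incomplete games can be listed before
they go into the DB.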

> [...]
>> Actually, right now I just add some
>>
>> [Source "TWIC 654"]
>>
>> lines to the header, and similar ones if they're from other
>> sources.
>>
> The one I use looks like this: [Source "twic706.pgn"]. However, I can  
> easily change my tag to suit.

I'd suggest using "institution tags" and issues. I regard
TWIC like a normal journal. The journal is called TWIC, and
the issue in your example is 706. Therefore I used

[Source "TWIC 706"]

as I'd also do for a book, e.g.

[Source "Kasparov: My Great Predecessors, III p. 287"]

If you work in, e.g., files from the Pittsburgh archive, one
could think about

[Source "Pittsburgh Chess Archive"]
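
The "institution plus issue" convention could be sketched in Python
roughly as follows. The helper source_tag is hypothetical; the
escaping of backslashes and quotes follows the PGN string rules.

```python
def source_tag(institution, issue=None):
    """Build a Source tag line from an institution and an optional issue."""
    # Journals and books carry an issue or page; plain archives do not.
    value = institution if issue is None else f"{institution} {issue}"
    # PGN strings escape backslash and double quote with a backslash.
    value = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'[Source "{value}"]'


print(source_tag("TWIC", 706))
print(source_tag("Kasparov: My Great Predecessors, III", "p. 287"))
```

An archive without issues would simply be passed as the institution
alone.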

>> I for my person am currently just to busy to add another
>> project to my desk, sorry for that. But if you want to
>> please just feel free to start right away. From my
>> experience stuff gets going easier if someone just starts
>> doing something. I could provide some scripts of course
>> to automatically fetch TWIC or also my pgnaddtag that
>> adds the above mentioned header line to all games in a
>> pgn file.
> OK. I can start with the games in the Pitt Archive. I have
> downloaded  all of the games in the "Events" folder there.

I'd be careful not to be too ambitious. Maybe you should
start with some thoughts about how to build up the DB in the
first place. E.g. I do not know whether their tags are
normalised well. Besides player names, the "Event" and
"Site" tags are often a mess, and that is one of the main
inconveniences of my own reference base. Mostly, though,
PGNs from a single site share one common style. Therefore,
if you e.g. decide to use TWIC for the continuous updating
of the DB (probably a wise decision), one could base the
tagging conventions on TWIC, then start adding tournaments
and tag them accordingly.

> Please let me know the  format of the source tag I should
> use. Since the files in this  archive appear to be
> separated by Event, does it make sense to have a different
> source tag for each file?

For the Pittsburgh Archive I'd probably flag all games as

[Source "Pittsburgh Chess Archive"]

since they do not have issues there, as far as I know. The
event should of course be encoded in the Event tag, just as
the site belongs in the Site tag.
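
For illustration, flagging every game in a file with one Source line
could look roughly like the Python sketch below. This is not my
pgnaddtag script, just a line-based approximation; the name
add_source_tag is made up, and it assumes that tag pairs sit on their
own lines, that no comment line in the move text looks like a tag
pair, and that the source value contains no double quotes.

```python
import re

# Matches one tag pair line such as: [Event "Some Open"]
TAG_LINE = re.compile(r'^\[(\w+)\s+".*"\]\s*$')


def add_source_tag(pgn_text, source):
    """Append [Source "..."] to every game header that lacks a Source tag."""
    out, header = [], []

    def flush():
        # Emit the collected header, adding the Source line if it is missing.
        if header:
            names = {TAG_LINE.match(h).group(1) for h in header}
            out.extend(header)
            if "Source" not in names:
                out.append(f'[Source "{source}"]')
            header.clear()

    for line in pgn_text.splitlines():
        if TAG_LINE.match(line):
            header.append(line)   # still inside a header block
        else:
            flush()               # header ended (blank line or moves)
            out.append(line)
    flush()
    return "\n".join(out) + "\n"
```

Games that already carry a Source tag are left untouched, so the
function can be run again on a file without duplicating the line.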

-- 

Kind regards,                /                 War is Peace.
                             |            Freedom is Slavery.
Alexander Wagner            |         Ignorance is Strength.
                             |
                             | Theory     : G. Orwell, "1984"
                            /  In practice:   USA, since 2001
