andrei raevsky wrote:

Hi!

> *I) find how many times this exact position *
> 
> r1bqkbnr/pp3ppp/2n1p3/2ppP3/3P4/2P1B3/PP3PPP/RN1QKBNR b KQkq - 2 5
> 
> can be found in this database. No problem here.  I used the 
> "search->current board" selected 'exact position' and search in 
> reference database was set to '1.74 million'. The current filter was set 
> to 'ignore'.  The search resulted in 292 games.  So far so good.  Now, 
> what I would like to do is
> 
> a) select only the games in which both players had an Elo of 2400+
> 
> and
> 
> b) select only the games in which Kupreichik played White.
> 
> How do I do this?

Given I) just invoke "Header search" set the ELO values for
both players to 2400-4000, insert Kupreichik as name for the
white player, check that "No" is selected in "ignore
colours" line, use "AND (restrict filter)" in the header
search, hit "Search" and you're done.

> *II) create a single "reference" database for me:*
> 
> There are five large free databases out there: the 1.74 million games 
> database 
> <http://www.top-5000.nl/dl.htm?file=dl/Million%20Base%201.74.rar>, the 
> ICOfY IB109PGN database <http://sourceforge.net/projects/icofybase/>, 
> the UAB CIS "enormous" database 
> <ftp://ftp.cis.uab.edu/pub/hyatt/pgn/enormous.zip.>, the Chess Analysis 
> Project database <http://cap.connx.com/>, and the Encyclopedia of Chess 
> Openings database <http://observer.homestead.com/openings.html>.  I 
> suppose that many of these databases list the sames games, but with 
> different variations.  I would like to create a single large "main" 
> reference database which I could then search by various criteria.  Does 
> it make sense for me to do the following.
> 
> a) import all these databases into one huge SCID4 database and then
> b) have run a complete 'maintenance window->cleaner' with all the fields 
> set to 'yes'

It is maybe not senseless but you'll hardly get rid of all
dupes that way and probably get a number of player names
actually corrected to the wrong players. Not cause of bad
info in the spell checking file, but simply cause names are
not unique and especially as foreighn names are usually
transcribed to some target language.

IMHO the problem is not to create a really huge DB but to
get a suitalby large good DB. Quality is surely an issue
here. It could well be that it's better to stick to only one
of the sources you list above to get a better base at the
end of the day.

> Would that give me a good single-source 'reference'
> database to do opening reports, player reports, update
> weekly with TWIC, etc?

Difficult to answer.

> Has somebody else already done that?  Would it not be a
> good idea for the SCID project to have "our own" database
> (based on other free databases) available for download?

Well you could fetch RefCorr and TWIC from
http://www.stellarcom.org/scid/RefCorr.zip
http://www.stellarcom.org/scid/TWIC.zip

They're not that large as your project however. (RefCorr is
essentially a merger of CC games, TWIC is what it says it is
;)

-- 

Kind regards,                /                 War is Peace.
                             |            Freedom is Slavery.
Alexander Wagner            |         Ignorance is Strength.
                             |
                             | Theory     : G. Orwell, "1984"
                            /  In practice:   USA, since 2001

------------------------------------------------------------------------------
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference
_______________________________________________
Scid-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scid-users

Reply via email to