It's not 1000 of each email, it's 200 (by default). If autolearning is enabled (also the default), you should see bayes kick in pretty quickly, provided your spams are scored high enough and your hams are scored low enough. You could train manually, but that requires more work... it's really up to you. I'd let it autolearn for a bit and see how long it actually takes to reach 200. I don't think it would take that long.
Ugh.. IMO, and in past opinions expressed by the developers, this is bad advice.
I'd agree that you can use autolearning to pick up some of the 200/200 messages, but I'd do at least *some* hand training.
The autolearner isn't perfect, and an autolearn-only bayes database has a noticeable chance of ending up poisoned. It doesn't happen every time, but there's a distinct chance of it.
Giving it a small hand-trained head start helps keep the autolearner from going awry, because of the "never autolearn something that would strongly contradict existing bayes learning" rule.
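
If you want to see how far the database is from the 200/200 mark, something like the following might help. It's only a rough sketch: it assumes you've saved the output of `sa-learn --dump magic` to a string, and that the "non-token data: nspam" and "non-token data: nham" lines carry the count in the third column. The sample text and its numbers are made up for illustration, and the function names are mine, not anything from SpamAssassin.

```python
# Threshold taken from the thread: 200 spam and 200 ham by default.
MIN_MESSAGES = 200

# Made-up sample in the layout assumed for `sa-learn --dump magic` output.
SAMPLE_DUMP = """\
0.000          0          3          0  non-token data: bayes db version
0.000          0        210          0  non-token data: nspam
0.000          0         57          0  non-token data: nham
0.000          0      41200          0  non-token data: ntokens
"""


def parse_counts(dump_text):
    """Return (nspam, nham) read from dump text; missing lines count as 0."""
    counts = {"nspam": 0, "nham": 0}
    for line in dump_text.splitlines():
        fields = line.split()
        # Assumed layout: the value sits in the third column and the label
        # ("nspam" or "nham") is the last word on a "non-token data" line.
        if len(fields) >= 5 and "non-token" in line and fields[-1] in counts:
            counts[fields[-1]] = int(fields[2])
    return counts["nspam"], counts["nham"]


def bayes_active(nspam, nham, minimum=MIN_MESSAGES):
    """True once both counts have reached the minimum."""
    return nspam >= minimum and nham >= minimum


if __name__ == "__main__":
    nspam, nham = parse_counts(SAMPLE_DUMP)
    print("spam learned:", nspam, "ham learned:", nham)
    print("enough for bayes to kick in:", bayes_active(nspam, nham))
```

With the sample above, the ham count is still short of 200, so that is the side to hand-train first (sa-learn's `--ham` and `--spam` options take the messages to learn from).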
