Repository: crunch Updated Branches: refs/heads/master f1d074c2a -> 5d237b366
http://git-wip-us.apache.org/repos/asf/crunch/blob/5d237b36/crunch-test/src/main/resources/shakes.txt ---------------------------------------------------------------------- diff --git a/crunch-test/src/main/resources/shakes.txt b/crunch-test/src/main/resources/shakes.txt index 63acf18..0d2b5a6 100644 --- a/crunch-test/src/main/resources/shakes.txt +++ b/crunch-test/src/main/resources/shakes.txt @@ -1,385 +1,3 @@ -***The Project Gutenberg's Etext of Shakespeare's First Folio*** -********************The Tragedie of Macbeth********************* - -This is our 3rd edition of most of these plays. See the index. - - -Copyright laws are changing all over the world, be sure to check -the copyright laws for your country before posting these files!! - -Please take a look at the important information in this header. -We encourage you to keep this file on your own disk, keeping an -electronic path open for the next readers. Do not remove this. - - -**Welcome To The World of Free Plain Vanilla Electronic Texts** - -**Etexts Readable By Both Humans and By Computers, Since 1971** - -*These Etexts Prepared By Hundreds of Volunteers and Donations* - -Information on contacting Project Gutenberg to get Etexts, and -further information is included below. We need your donations. - - -The Tragedie of Macbeth - -by William Shakespeare - -July, 2000 [Etext #2264] - - -***The Project Gutenberg's Etext of Shakespeare's First Folio*** -********************The Tragedie of Macbeth********************* - -*****This file should be named 0ws3410.txt or 0ws3410.zip****** - -Corrected EDITIONS of our etexts get a new NUMBER, 0ws3411.txt -VERSIONS based on separate sources get new LETTER, 0ws3410a.txt - - -Project Gutenberg Etexts are usually created from multiple editions, -all of which are in the Public Domain in the United States, unless a -copyright notice is included. Therefore, we usually do NOT keep any -of these books in compliance with any particular paper edition. - - -We are now trying to release all our books one month in advance -of the official release dates, leaving time for better editing. - -Please note: neither this list nor its contents are final till -midnight of the last day of the month of any such announcement. -The official release date of all Project Gutenberg Etexts is at -Midnight, Central Time, of the last day of the stated month. A -preliminary version may often be posted for suggestion, comment -and editing by those who wish to do so. To be sure you have an -up to date first edition [xxxxx10x.xxx] please check file sizes -in the first week of the next month. Since our ftp program has -a bug in it that scrambles the date [tried to fix and failed] a -look at the file size will have to do, but we will try to see a -new copy has at least one byte more or less. - - -Information about Project Gutenberg (one page) - -We produce about two million dollars for each hour we work. The -time it takes us, a rather conservative estimate, is fifty hours -to get any etext selected, entered, proofread, edited, copyright -searched and analyzed, the copyright letters written, etc. This -projected audience is one hundred million readers. If our value -per text is nominally estimated at one dollar then we produce $2 -million dollars per hour this year as we release thirty-six text -files per month, or 432 more Etexts in 1999 for a total of 2000+ -If these reach just 10% of the computerized population, then the -total should reach over 200 billion Etexts given away this year. - -The Goal of Project Gutenberg is to Give Away One Trillion Etext -Files by December 31, 2001. [10,000 x 100,000,000 = 1 Trillion] -This is ten thousand titles each to one hundred million readers, -which is only ~5% of the present number of computer users. - -At our revised rates of production, we will reach only one-third -of that goal by the end of 2001, or about 3,333 Etexts unless we -manage to get some real funding; currently our funding is mostly -from Michael Hart's salary at Carnegie-Mellon University, and an -assortment of sporadic gifts; this salary is only good for a few -more years, so we are looking for something to replace it, as we -don't want Project Gutenberg to be so dependent on one person. - -We need your donations more than ever! - - -All donations should be made to "Project Gutenberg/CMU": and are -tax deductible to the extent allowable by law. (CMU = Carnegie- -Mellon University). - -For these and other matters, please mail to: - -Project Gutenberg -P. O. Box 2782 -Champaign, IL 61825 - -When all other email fails. . .try our Executive Director: -Michael S. Hart <[email protected]> [email protected] forwards to [email protected] and archive.org -if your mail bounces from archive.org, I will still see it, if -it bounces from prairienet.org, better resend later on. . . . - -We would prefer to send you this information by email. - -****** - -To access Project Gutenberg etexts, use any Web browser -to view http://promo.net/pg. This site lists Etexts by -author and by title, and includes information about how -to get involved with Project Gutenberg. You could also -download our past Newsletters, or subscribe here. This -is one of our major sites, please email [email protected], -for a more complete list of our various sites. - -To go directly to the etext collections, use FTP or any -Web browser to visit a Project Gutenberg mirror (mirror -sites are available on 7 continents; mirrors are listed -at http://promo.net/pg). - -Mac users, do NOT point and click, typing works better. - -Example FTP session: - -ftp sunsite.unc.edu -login: anonymous -password: your@login -cd pub/docs/books/gutenberg -cd etext90 through etext99 -dir [to see files] -get or mget [to get files. . .set bin for zip files] -GET GUTINDEX.?? [to get a year's listing of books, e.g., GUTINDEX.99] -GET GUTINDEX.ALL [to get a listing of ALL books] - -*** - -**Information prepared by the Project Gutenberg legal advisor** - -(Three Pages) - - -***START**THE SMALL PRINT!**FOR PUBLIC DOMAIN ETEXTS**START*** -Why is this "Small Print!" statement here? You know: lawyers. -They tell us you might sue us if there is something wrong with -your copy of this etext, even if you got it for free from -someone other than us, and even if what's wrong is not our -fault. So, among other things, this "Small Print!" statement -disclaims most of our liability to you. It also tells you how -you can distribute copies of this etext if you want to. - -*BEFORE!* YOU USE OR READ THIS ETEXT -By using or reading any part of this PROJECT GUTENBERG-tm -etext, you indicate that you understand, agree to and accept -this "Small Print!" statement. If you do not, you can receive -a refund of the money (if any) you paid for this etext by -sending a request within 30 days of receiving it to the person -you got it from. If you received this etext on a physical -medium (such as a disk), you must return it with your request. - -ABOUT PROJECT GUTENBERG-TM ETEXTS -This PROJECT GUTENBERG-tm etext, like most PROJECT GUTENBERG- -tm etexts, is a "public domain" work distributed by Professor -Michael S. Hart through the Project Gutenberg Association at -Carnegie-Mellon University (the "Project"). Among other -things, this means that no one owns a United States copyright -on or for this work, so the Project (and you!) can copy and -distribute it in the United States without permission and -without paying copyright royalties. Special rules, set forth -below, apply if you wish to copy and distribute this etext -under the Project's "PROJECT GUTENBERG" trademark. - -To create these etexts, the Project expends considerable -efforts to identify, transcribe and proofread public domain -works. Despite these efforts, the Project's etexts and any -medium they may be on may contain "Defects". Among other -things, Defects may take the form of incomplete, inaccurate or -corrupt data, transcription errors, a copyright or other -intellectual property infringement, a defective or damaged -disk or other etext medium, a computer virus, or computer -codes that damage or cannot be read by your equipment. - -LIMITED WARRANTY; DISCLAIMER OF DAMAGES -But for the "Right of Replacement or Refund" described below, -[1] the Project (and any other party you may receive this -etext from as a PROJECT GUTENBERG-tm etext) disclaims all -liability to you for damages, costs and expenses, including -legal fees, and [2] YOU HAVE NO REMEDIES FOR NEGLIGENCE OR -UNDER STRICT LIABILITY, OR FOR BREACH OF WARRANTY OR CONTRACT, -INCLUDING BUT NOT LIMITED TO INDIRECT, CONSEQUENTIAL, PUNITIVE -OR INCIDENTAL DAMAGES, EVEN IF YOU GIVE NOTICE OF THE -POSSIBILITY OF SUCH DAMAGES. - -If you discover a Defect in this etext within 90 days of -receiving it, you can receive a refund of the money (if any) -you paid for it by sending an explanatory note within that -time to the person you received it from. If you received it -on a physical medium, you must return it with your note, and -such person may choose to alternatively give you a replacement -copy. If you received it electronically, such person may -choose to alternatively give you a second opportunity to -receive it electronically. - -THIS ETEXT IS OTHERWISE PROVIDED TO YOU "AS-IS". NO OTHER -WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED, ARE MADE TO YOU AS -TO THE ETEXT OR ANY MEDIUM IT MAY BE ON, INCLUDING BUT NOT -LIMITED TO WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A -PARTICULAR PURPOSE. - -Some states do not allow disclaimers of implied warranties or -the exclusion or limitation of consequential damages, so the -above disclaimers and exclusions may not apply to you, and you -may have other legal rights. - -INDEMNITY -You will indemnify and hold the Project, its directors, -officers, members and agents harmless from all liability, cost -and expense, including legal fees, that arise directly or -indirectly from any of the following that you do or cause: -[1] distribution of this etext, [2] alteration, modification, -or addition to the etext, or [3] any Defect. - -DISTRIBUTION UNDER "PROJECT GUTENBERG-tm" -You may distribute copies of this etext electronically, or by -disk, book or any other medium if you either delete this -"Small Print!" and all other references to Project Gutenberg, -or: - -[1] Only give exact copies of it. Among other things, this - requires that you do not remove, alter or modify the - etext or this "small print!" statement. You may however, - if you wish, distribute this etext in machine readable - binary, compressed, mark-up, or proprietary form, - including any form resulting from conversion by word pro- - cessing or hypertext software, but only so long as - *EITHER*: - - [*] The etext, when displayed, is clearly readable, and - does *not* contain characters other than those - intended by the author of the work, although tilde - (~), asterisk (*) and underline (_) characters may - be used to convey punctuation intended by the - author, and additional characters may be used to - indicate hypertext links; OR - - [*] The etext may be readily converted by the reader at - no expense into plain ASCII, EBCDIC or equivalent - form by the program that displays the etext (as is - the case, for instance, with most word processors); - OR - - [*] You provide, or agree to also provide on request at - no additional cost, fee or expense, a copy of the - etext in its original plain ASCII form (or in EBCDIC - or other equivalent proprietary form). - -[2] Honor the etext refund and replacement provisions of this - "Small Print!" statement. - -[3] Pay a trademark license fee to the Project of 20% of the - net profits you derive calculated using the method you - already use to calculate your applicable taxes. If you - don't derive profits, no royalty is due. Royalties are - payable to "Project Gutenberg Association/Carnegie-Mellon - University" within the 60 days following each - date you prepare (or were legally required to prepare) - your annual (or equivalent periodic) tax return. - -WHAT IF YOU *WANT* TO SEND MONEY EVEN IF YOU DON'T HAVE TO? -The Project gratefully accepts contributions in money, time, -scanning machines, OCR software, public domain etexts, royalty -free copyright licenses, and every other sort of contribution -you can think of. Money should be paid to "Project Gutenberg -Association / Carnegie-Mellon University". - -*END*THE SMALL PRINT! FOR PUBLIC DOMAIN ETEXTS*Ver.04.29.93*END* - - - - - -Project Gutenberg's Etext of Shakespeare's The Tragedie of Macbeth - - - - - -Executive Director's Notes: - -In addition to the notes below, and so you will *NOT* think all -the spelling errors introduced by the printers of the time have -been corrected, here are the first few lines of Hamlet, as they -are presented herein: - - Barnardo. Who's there? - Fran. Nay answer me: Stand & vnfold -your selfe - - Bar. Long liue the King - -*** - -As I understand it, the printers often ran out of certain words -or letters they had often packed into a "cliche". . .this is the -original meaning of the term cliche. . .and thus, being unwilling -to unpack the cliches, and thus you will see some substitutions -that look very odd. . .such as the exchanges of u for v, v for u, -above. . .and you may wonder why they did it this way, presuming -Shakespeare did not actually write the play in this manner. . . . - -The answer is that they MAY have packed "liue" into a cliche at a -time when they were out of "v"'s. . .possibly having used "vv" in -place of some "w"'s, etc. This was a common practice of the day, -as print was still quite expensive, and they didn't want to spend -more on a wider selection of characters than they had to. - -You will find a lot of these kinds of "errors" in this text, as I -have mentioned in other times and places, many "scholars" have an -extreme attachment to these errors, and many have accorded them a -very high place in the "canon" of Shakespeare. My father read an -assortment of these made available to him by Cambridge University -in England for several months in a glass room constructed for the -purpose. To the best of my knowledge he read ALL those available -. . .in great detail. . .and determined from the various changes, -that Shakespeare most likely did not write in nearly as many of a -variety of errors we credit him for, even though he was in/famous -for signing his name with several different spellings. - -So, please take this into account when reading the comments below -made by our volunteer who prepared this file: you may see errors -that are "not" errors. . . . - -So. . .with this caveat. . .we have NOT changed the canon errors, -here is the Project Gutenberg Etext of Shakespeare's The Tragedie -of Macbeth. - -Michael S. Hart -Project Gutenberg -Executive Director - - -*** - - -Scanner's Notes: What this is and isn't. This was taken from -a copy of Shakespeare's first folio and it is as close as I can -come in ASCII to the printed text. - -The elongated S's have been changed to small s's and the -conjoined ae have been changed to ae. I have left the spelling, -punctuation, capitalization as close as possible to the -printed text. I have corrected some spelling mistakes (I have put -together a spelling dictionary devised from the spellings of the -Geneva Bible and Shakespeare's First Folio and have unified -spellings according to this template), typo's and expanded -abbreviations as I have come across them. Everything within -brackets [] is what I have added. So if you don't like that -you can delete everything within the brackets if you want a -purer Shakespeare. - -Another thing that you should be aware of is that there are textual -differences between various copies of the first folio. So there may -be differences (other than what I have mentioned above) between -this and other first folio editions. This is due to the printer's -habit of setting the type and running off a number of copies and -then proofing the printed copy and correcting the type and then -continuing the printing run. The proof run wasn't thrown away but -incorporated into the printed copies. This is just the way it is. -The text I have used was a composite of more than 30 different -First Folio editions' best pages. - -If you find any scanning errors, out and out typos, punctuation -errors, or if you disagree with my spelling choices please feel -free to email me those errors. I wish to make this the best -etext possible. My email address for right now are [email protected] -and [email protected]. I hope that you enjoy this. - -David Reed - The Tragedie of Macbeth Actus Primus. Scoena Prima.
