On Apr 4, 5:31 pm, "Daniel M. Eldridge" <[email protected]> wrote:
> I have a file that consists of approx 400K lines and 25762K chars
>
> this work is the result of someone who has copied existing files onto itself
> multiple times.
>
> In an effort to "decrypt" this mess I am:
> a) inserting .pp to indicate paragraph beginnings
> b) replacing sentence endings with \r so that each line consists of one
> "sentence."
> c) sorting all of the lines so that I can use uniq to generate a list of
> unique sentences.
>

Note that Vim's :sort command can remove duplicate lines by itself: with the u flag (:sort u) it keeps only the first of each run of identical lines, so a separate pass through uniq is not needed. It can also ignore part of each line when sorting: :sort /pattern/ skips the text matched by the pattern and sorts on what comes after it. See :help :sort for the details.
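
For anyone who wants to try the same idea outside Vim, here is a rough Python sketch of step (c): sort the lines and drop exact duplicates, optionally ignoring a leading pattern when ordering. The function name sort_unique and its skip parameter are made up for this illustration, and it is only a loose analogue of :sort u and :sort /pattern/. In particular, Vim handles lines that do not match the pattern differently, so do not expect identical output.

```python
import re


def sort_unique(lines, skip=None):
    """Return the lines sorted, with exact duplicates removed.

    skip is an optional regex (hypothetical parameter). When it matches,
    the text up to the end of the first match is ignored for ordering,
    loosely like Vim's :sort /pattern/. Lines that do not match are
    ordered by their full text (unlike Vim).
    """
    def key(line):
        if skip is None:
            return (line, line)
        m = re.search(skip, line)
        # Order on what follows the match; fall back to the whole line
        # as a tie-breaker so the result is deterministic.
        return (line[m.end():] if m else line, line)

    # set() drops exact duplicates; sorted() then orders what is left.
    return sorted(set(lines), key=key)


if __name__ == "__main__":
    sample = [".pp Zebra.", "Apple.", ".pp Apple.", "Apple."]
    for line in sort_unique(sample, skip=r"^\.pp "):
        print(line)
```

Because the duplicates are removed by exact comparison, ".pp Apple." and "Apple." both survive; strip the .pp markers first if they should count as the same sentence.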
