Hello Roger,

Thank you very much for the suggestion. I had thought about doing it this 
way as well, but I have a lot of files to go through and wanted to see if I 
could find a way to work things out that didn't involve running find and 
replace on each file twice. Jean's suggestion worked perfectly. 

Thanks again for the suggestion - I appreciate the help.

Gavin

On Wednesday, November 3, 2021 at 9:40:03 PM UTC+9 Roger Moffat wrote:

> When I need to sort something like this I don’t use GREP since I’m not 
> familiar enough with it to get it right with little effort, but you can use 
> the basic Find and Replace
>
> Find
>
> \r \r (there’s a space between them because you said there was a space 
> between the paragraphs. If you just meant there was an empty line, use \r\r)
>
> Replace with
>
> xxxxxxx
>
> This preserves the gap between paragraphs as xxxxxxx, and now you have a 
> whole bunch of lines ending in a return
>
> Then find
>
> \r
>
> replace with
>
> “ “ (a single space)
>
> Then Find
>
> xxxxxxx
>
> Replace with
>
> \r\r
>
> Will separate out all the paragraphs again with a blank line between them. 
> Then in case you’ve ended up with 2 spaces anywhere
>
> Find “two spaces” (means 2 spaces, not the words in quotes)
>
> Replace with “one space” (same as above - a single space, not the two 
> words)
>
> Roger
>
>
>
> On Nov 2, 2021, at 11:04 PM, Gavin Brooks <[email protected]> wrote:
>
> I have a lot of text that has been copied from a PDF, that I need to make 
> into text files for research purposes. When the text copied over, there was 
> a hard line return at the end of every sentences, and a blank space between 
> paragraphs. I need to the lines within the paragraph joined but the 
> paragraphs to remain separate and am looking for a regex to do that:
>
> *For example, the text looks like this:*
>
> *This is the 1st paragraph. *
> *With a few lines that*
> *need connected.*
>
> *This is another paragraph with*
> *a blank line between*
> *it and the previous paragraph.*
>
> *I want it to look like this:*
>
> *This is the 1st paragraph. With a few lines that need connected.*
>
> *This is another paragraph with a blank line between it and the previous 
> paragraph.*
>
> *The best that I have been able to come up with is this:*
>
> ([^\r\n])\R(?=[^\r\n])
>
> *But that will also remove the last character of each line, so I get this:*
>
> *This is the 1st paragraph With a few lines tha need connected.*
>
> *This is another paragraph wit a blank line betwee it and the previous 
> paragraph.*
>
> Any suggestions about how to rewrite the above so that it does not remove 
> the final character?
>
> Gavin
>
>
>
> -- 
> This is the BBEdit Talk public discussion group. If you have a feature 
> request or need technical support, please email "[email protected]" 
> rather than posting here. Follow @bbedit on Twitter: <
> https://twitter.com/bbedit>
> --- 
> You received this message because you are subscribed to the Google Groups 
> "BBEdit Talk" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to [email protected].
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/bbedit/7d23bc98-d5b6-4739-94e8-c08f1efabb28n%40googlegroups.com
>  
> <https://groups.google.com/d/msgid/bbedit/7d23bc98-d5b6-4739-94e8-c08f1efabb28n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>
>
>

-- 
This is the BBEdit Talk public discussion group. If you have a feature request 
or need technical support, please email "[email protected]" rather than 
posting here. Follow @bbedit on Twitter: <https://twitter.com/bbedit>
--- 
You received this message because you are subscribed to the Google Groups 
"BBEdit Talk" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/bbedit/d44cf44f-0205-4aa1-8f34-f2b2d777a489n%40googlegroups.com.

Reply via email to