Re: [CODE4LIB] Regex Question

2015-07-09 Thread Matt Sherman
Of Harper, Cynthia Sent: woensdag 8 juli 2015 19:51 To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Regex Question I like this regex add-in for Excel: http://www.codedawn.com/index/new-excel-add-in-regex-find-replace Cindy Harper -Original Message- From: Code for Libraries

Re: [CODE4LIB] Regex Question

2015-07-08 Thread Harper, Cynthia
Subject: Re: [CODE4LIB] Regex Question For clarity, Word does regex, not just wildcards. It's not quite as complete as what you'd get with some other environments such as OpenOffice Writer since matching is lazy rather than greedy which can be a big deal depending on what you're doing

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Ivan Goldsmith
University of Pennsylvania Libraries - Original Message - From: Matt Sherman matt.r.sher...@gmail.com To: CODE4LIB@LISTSERV.ND.EDU Sent: Tuesday, July 7, 2015 11:56:15 AM Subject: [CODE4LIB] Regex Question Hi all, I am working my way through teaching myself regex to parse

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Eric Phetteplace
Hi Matt! You can match a string of all caps letters like [A-Z]. Those brackets say match anything inside and the hyphen indicates the full range of capital letters. You cannot, unfortunately, match italics since that's formatting and not text. Regex is really only meant for strings of characters

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Brian Zelip
I think I figured out the all-caps need, see http://regexr.com/3bbfi Cheers bzelip On Tue, Jul 7, 2015 at 12:32 PM, Thomas Krichel kric...@openlib.org wrote: Eric Phetteplace writes You can match a string of all caps letters like [A-Z] This works if you are limited to English. But

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Jason R Peak
In the case of xml, I think xpath is the simpler tool. Brian Zelip wrote Hi Matt. Re: finding words in all caps, yes it's possible. See this SO answer to help: http://stackoverflow.com/a/4255225/2145103 Re: italics, my hunch is that you could do so if you got hold of the xml behind

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Brian Zelip
Hi Matt. Re: finding words in all caps, yes it's possible. See this SO answer to help: http://stackoverflow.com/a/4255225/2145103 Re: italics, my hunch is that you could do so if you got hold of the xml behind the word doc, which I'd assume would have something like an `italic` tags or attribute

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Matt Sherman
Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee kyle.baner...@gmail.com wrote: Y'all are doing this the hard way. Word allows regex replacements as well as format based criteria. For

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
Y'all are doing this the hard way. Word allows regex replacements as well as format based criteria. For this particular use case: 1. Open the find/replace dialog (CTL+H) 2. In the Find what box, put (*) -- make sure the option for Use Wildcards is selected, and for the format, specify

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Gordon, Bonnie
Sent: Tuesday, July 07, 2015 12:45 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Regex Question Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee kyle.baner...@gmail.com wrote

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Katherine N. Deibel
@LISTSERV.ND.EDU] On Behalf Of Matt Sherman Sent: Tuesday, July 7, 2015 9:45 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Regex Question Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
] Regex Question Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee kyle.baner...@gmail.com wrote: Y'all are doing this the hard way. Word allows regex replacements as well as format based