Re: [R] Regex matching that gives byte offset?

2009-11-02 Thread Johannes Graumann
Hmmm ... that should do it, thanks. But how would one use this on a file without reading it into memory completely?

Joh

On Wednesday 28 October 2009 16:29:00 Prof Brian Ripley wrote:
> Do you mean like regexpr() (on the same help page)? Depending on your locale, you might actually prefer the

Re: [R] Regex matching that gives byte offset?

2009-11-02 Thread Prof Brian Ripley
On Mon, 2 Nov 2009, Johannes Graumann wrote:
> Hmmm ... that should do it, thanks. But how would one use this on a file without reading it into memory completely?

?file, ?readLines, ?readBin will tell you about connections.

> Joh
>
> On Wednesday 28 October 2009 16:29:00 Prof Brian Ripley wrote:
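For anyone finding this thread later, here is a minimal sketch of what the connection-based approach could look like. It is not code from the thread: the function name `find_byte_offsets` and its `chunk_size` argument are made up for illustration, and it uses `grepRaw()` (base R, but added after this thread was written) for a fixed, byte-wise search instead of `regexpr()`. The file is read in raw chunks with `readBin()`, and the last few bytes of each chunk are carried over so a match straddling a chunk boundary is not missed.

```r
# Hypothetical helper: report 0-based byte offsets of a fixed pattern in a
# file, reading the file chunk by chunk instead of loading it whole.
find_byte_offsets <- function(path, pattern, chunk_size = 1024L * 1024L) {
  con <- file(path, open = "rb")
  on.exit(close(con))

  # Bytes to carry over so a match split across two chunks is still seen.
  overlap <- nchar(pattern, type = "bytes") - 1L
  offsets <- numeric(0)
  carry <- raw(0)
  base <- 0  # file offset of the first byte currently held in `buf`

  repeat {
    chunk <- readBin(con, what = "raw", n = chunk_size)
    if (length(chunk) == 0L) break

    buf <- c(carry, chunk)
    # 1-based start positions of all matches within `buf`.
    hits <- grepRaw(pattern, buf, fixed = TRUE, all = TRUE)
    offsets <- c(offsets, base + hits - 1)

    keep <- min(overlap, length(buf))
    carry <- if (keep > 0L) tail(buf, keep) else raw(0)
    base <- base + length(buf) - keep
  }
  offsets
}
```

The returned offsets are 0-based, so they can be passed to `seek(con, where = offset)` on a binary connection to jump straight to a match. Patterns that can overlap themselves may be reported differently depending on where the chunk boundaries fall, so treat this as a starting point only.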

Re: [R] Regex matching that gives byte offset?

2009-11-02 Thread Johannes Graumann
On Monday 02 November 2009 13:41:45 Prof Brian Ripley wrote:
> On Mon, 2 Nov 2009, Johannes Graumann wrote:
> > Hmmm ... that should do it, thanks. But how would one use this on a file without reading it into memory completely?
>
> ?file, ?readLines, ?readBin will tell you about connections.

...

[R] Regex matching that gives byte offset?

2009-10-28 Thread Johannes Graumann
Hi,

Is there any way of doing 'grep' or something like it on the content of a text file and extracting the byte position of the match in the file? I'm facing the need to access rather largish (600MB) XML files and would like to be able to index them ...

Thanks for any help or flogging,

Joh

Re: [R] Regex matching that gives byte offset?

2009-10-28 Thread Prof Brian Ripley
Do you mean like regexpr() (on the same help page)? Depending on your locale, you might actually prefer the character offset: if you want to match in an MBCS and have byte offsets, you will need to work a bit harder if useBytes=TRUE is not sufficient for you.

On Wed, 28 Oct 2009, Johannes
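To make the character-versus-byte distinction concrete, here is a small sketch. The sample string is invented for illustration; it is forced to UTF-8 with `enc2utf8()` so that the accented character occupies two bytes. The same `regexpr()` call is run once with the default and once with `useBytes = TRUE`.

```r
# Invented sample: a two-byte UTF-8 character (e-acute) precedes the target.
x <- enc2utf8("caf\u00e9 <entry>")

# Default: the match position is counted in characters.
char_pos <- regexpr("<entry>", x, fixed = TRUE)

# useBytes = TRUE: matching is byte-wise, so the position is counted in bytes.
byte_pos <- regexpr("<entry>", x, fixed = TRUE, useBytes = TRUE)

# The gap between the two is the number of extra bytes taken up by
# multibyte characters before the match.
extra_bytes <- as.integer(byte_pos) - as.integer(char_pos)

# Cross-check: bytes versus characters in the text preceding the match.
prefix <- substr(x, 1L, as.integer(char_pos) - 1L)
stopifnot(extra_bytes == nchar(prefix, type = "bytes") - nchar(prefix, type = "chars"))
```

Both positions are 1-based offsets into the string, not into a file. For pure ASCII input the two agree, which is why the difference only matters in a multibyte encoding.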