Re: [Jprogramming] Split on first found character

'Pascal Jasmin' via Programming Tue, 25 Nov 2014 07:29:09 -0800

The spec (Fasta) can be revised to splitting on all LFs, where the first box is 
the control data, and subsequent boxes are the package.


http://rosalind.info/glossary/fasta-format/

If so, you can grab all of the records from the sample link with this 
expression (copy records on clipboard, uses j8 clipboard access)

cutLF each '>' cut wdclippaste ''

The reason for the LFs in the payload as I understand it is to separate it into 
processing chunks.  Having them boxed works for this, but if you ever need to 
put the LFs back, or join them into a single string

 LF joinstring  ;/ 'abc' 
a 
b 
c

 '' joinstring  ;/ 'abc' 
 ;  ;/ 'abc'  NB. or this
abc 


to extend Joe's solution 

(>@:{. ; ;@}.)@:cutLF each '>' cut




----- Original Message -----
From: Joe Bogner <joebog...@gmail.com>
To: programm...@jsoftware.com
Cc: 
Sent: Tuesday, November 25, 2014 6:00 AM
Subject: Re: [Jprogramming] Split on first found character

I might do it this way:

(>@:{. ; ;@}.)@(LF&cut) x

┌───────┬─────────────┐

│case_id│GCTAGTCGACGTC│

└───────┴─────────────┘




On Tue, Nov 25, 2014 at 5:47 AM, Jan-Pieter Jacobs <
janpieter.jac...@gmail.com> wrote:

> I think I would solve it like this (with intermediate steps):
>
> x=: 'case_id', LF, 'GCTAGTCG', LF, 'ACGTC', LF
>
> NB. Find the locations of LF's in the string
> (LF = ]) x
>
> NB. assign 0 to everything before the LF, assign 1 to everything after it
> ([: +./\  LF = ]) x
>
> NB. The wonderful "key" adverb lets you apply verbs based on it's key
> (another array), eg. box
> (</.~ [: +./\ LF =]) x
>
> NB. Get rid of the LF's by replacing < by a version removing LF:
> ((<@#~ LF~:])/.~ [: +./\ LF = ]) x
>
> This is what I came up with, but smarter people will probably suggest more
> elegant ways.
>
> I hope this is useful.
> Jan-Pieter
>
> 2014-11-25 11:00 GMT+01:00 Ryan <rec...@bwh.harvard.edu>:
>
> > I have a character array with LF's, and want to split it on the
> > first LF, and remove the remaining LF's.  I'm wondering if there's a
> > simpler
> > way than what I'm doing now:
> >
> > x=: 'case_id', LF, 'GCTAGTCG', LF, 'ACGTC', LF
> >
> > filterLF=: #~ ~:&LF
> > headLF=: {.~ i.&LF
> > tailLF=: }.~ i.&LF
> > (headLF ; filterLF@tailLF) x
> > ┌────────┬─────────────┐
> > │>case_id│GCTAGTCGACGTC│
> > └────────┴─────────────┘
> >
> > (I'm reading text files in fasta format: http://rosalind.info/glossary/
> > fasta-format/)
> >
> > Thanks for any suggestions,
> > Ryan
> >
> >
> >
> > The information in this e-mail is intended only for the person to whom it
> > is
> > addressed. If you believe this e-mail was sent to you in error and the
> > e-mail
> > contains patient information, please contact the Partners Compliance
> > HelpLine at
> > http://www.partners.org/complianceline . If the e-mail was sent to you
> in
> > error
> > but does not contain patient information, please contact the sender and
> > properly
> > dispose of the e-mail.
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm

> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
>
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Re: [Jprogramming] Split on first found character

Reply via email to