On Tue, 2003-10-21 at 16:49, Kevin Pfeiffer wrote: > In article <[EMAIL PROTECTED]>, Kevin Old wrote: > [...] > > I need to parse the following data: > > > > "COUNTRY MUSIC HALL OF FAME UPC#: 0-84296-22922-2"||||||"COUNTRY FEMALE > > PARTY SONGS VOL. 2 UPC#: 0-84296-28682-9"||||||"COUNTRY MALE PARTY > > SONGS VOL. 2 UPC#: 0-84296-28652-2"||| |||||||||||||||||||||||||||||| > [...] > > I'm working on this myself and am wondering if -- er, what the better way is > to split the header line into 3 pieces. > > # get header > if (/^[^|]/ and [EMAIL PROTECTED]) { > @header = split(/"?[|"]+"?/, $_, 5); > pop @header; shift @header; > } > > Data (see below) looks something like: > "Piece One"||||||"Piece Two"||||||"Piece Three"||| > > I'm splitting as you see, but into 5 pieces in order to get rid of quote marks > and all pipes; then I have to throw out the first and last item. > > -K > > __DATA__ > "ALL-TIME FAVORITE COUNTRY LOVE SONGS UPC#: 0-84296-33172-7"||||||"COUNTRY MALE > PARTY SONGS VOL. 1 UPC#: 0-84296-21772-4"||||||"COUNTRY FEMALE PARTY SONGS VOL. 1 > UPC#: 0-84296-21712-0"|||
Here's what I've got so far on this issue: while(<KB>) { chomp; while(/^(.*?)\s+UPC#:\s+0-84296-(\d+)-\d/g) { print "Title: $1 UPC: $2\n"; } } ***** OUTPUT (for entire file) ***** Title: "ALL-TIME FAVORITE COUNTRY LOVE SONGS UPC: 33172 Title: "COUNTRY MUSIC HALL OF FAME UPC: 22922 Title: "GREAT COUNTRY LOVE SONGS VOL. 2 UPC: 30662 Title: "POP/R&B FEMALE PARTY SONGS UPC: 32222 Title: "CRAZY PARTY SONGS VOL. 1 UPC: 30582 Title: "CLASSIC HITS FROM THE MOVIES UPC: 33162 Title: "AMERICAN GRAFFITI 50'S UPC: 22972 Title: "THOSE 70'S HITS UPC: 30982 Title: "ROCK & ROLL PARTY SONGS UPC: 30432 Title: "PURE FUNK UPC: 22962 Title: "DUET HITS UPC: 30752 Title: "STORMY MONDAY BLUES UPC: 30702 Title: "CHICKS RULE THE CHARTS UPC: 30652 Now, just getting the tracks associated with them is the next step. I'll let you know as I come up with more.... Kevin -- Kevin Old <[EMAIL PROTECTED]> -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]