On Tue, 2003-10-21 at 16:49, Kevin Pfeiffer wrote:
> In article <[EMAIL PROTECTED]>, Kevin Old wrote:
> [...]
> > I need to parse the following data:
> > 
> > "COUNTRY MUSIC HALL OF FAME UPC#: 0-84296-22922-2"||||||"COUNTRY FEMALE
> > PARTY SONGS VOL. 2  UPC#:  0-84296-28682-9"||||||"COUNTRY MALE PARTY
> > SONGS VOL. 2   UPC#:  0-84296-28652-2"||| ||||||||||||||||||||||||||||||
> [...]
> 
> I'm working on this myself and am wondering if -- er, what the better way is
> to split the header line into 3 pieces.
> 
>     # get header
>     if (/^[^|]/ and [EMAIL PROTECTED]) {
>         @header = split(/"?[|"]+"?/, $_, 5);
>         pop @header; shift @header;
>     }
> 
> Data (see below) looks something like:
> "Piece One"||||||"Piece Two"||||||"Piece Three"|||
> 
> I'm splitting as you see, but into 5 pieces in order to get rid of quote marks 
> and all pipes; then I have to throw out the first and last item.
> 
> -K
> 
> __DATA__
> "ALL-TIME FAVORITE COUNTRY LOVE SONGS UPC#: 0-84296-33172-7"||||||"COUNTRY MALE 
> PARTY SONGS VOL. 1 UPC#: 0-84296-21772-4"||||||"COUNTRY FEMALE PARTY SONGS VOL. 1 
> UPC#: 0-84296-21712-0"|||
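
On the split question above: one possible alternative to splitting into 5 pieces and discarding the ends is to capture the quoted fields directly. This is only a sketch. It assumes each header sits on a single line in the real file (the wrapping above comes from the mail client) and that titles never contain a double quote. The name header_fields is made up for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): return the quoted
# fields of one header line, without quotes or pipes.
sub header_fields {
    my ($line) = @_;
    # In list context, a /g match returns every capture, i.e. the
    # text between each pair of double quotes.
    return $line =~ /"([^"]*)"/g;
}

my $line   = '"Piece One"||||||"Piece Two"||||||"Piece Three"|||';
my @header = header_fields($line);
print "$_\n" for @header;
```

Because the match only looks at what is inside the quotes, there are no empty leading or trailing items to pop or shift afterwards.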

Here's what I've got so far on this issue:

while(<KB>) {
        chomp;
        while(/^(.*?)\s+UPC#:\s+0-84296-(\d+)-\d/g) {
                print "Title: $1 UPC: $2\n";
        }
}

***** OUTPUT (for entire file) *****
Title: "ALL-TIME FAVORITE COUNTRY LOVE SONGS UPC: 33172
Title: "COUNTRY MUSIC HALL OF FAME UPC: 22922
Title: "GREAT COUNTRY LOVE SONGS VOL. 2 UPC: 30662
Title: "POP/R&B FEMALE PARTY SONGS UPC: 32222
Title: "CRAZY PARTY SONGS VOL. 1 UPC: 30582
Title: "CLASSIC HITS FROM THE MOVIES UPC: 33162
Title: "AMERICAN GRAFFITI 50'S UPC: 22972
Title: "THOSE 70'S HITS UPC: 30982
Title: "ROCK & ROLL PARTY SONGS UPC: 30432
Title: "PURE FUNK UPC: 22962
Title: "DUET HITS UPC: 30752
Title: "STORMY MONDAY BLUES UPC: 30702
Title: "CHICKS RULE THE CHARTS UPC: 30652
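
Two things show up in that output: the opening quote ends up in $1, and because the pattern is anchored with ^, the /g loop only ever matches the first title on each line. A minimal sketch of one way around both, assuming each header is on a single line and keeping the same UPC prefix as above (titles_and_upcs is a made-up name for illustration):

```perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): return [title, upc]
# pairs for every quoted field on one header line.
sub titles_and_upcs {
    my ($line) = @_;
    my @pairs;
    # No ^ anchor, so /g can continue on to the next quoted field.
    # The opening quote is matched outside the capture, so it does
    # not become part of the title.
    while ($line =~ /"([^"]*?)\s+UPC#:\s+0-84296-(\d+)-\d/g) {
        push @pairs, [$1, $2];
    }
    return @pairs;
}

my $line = '"ALL-TIME FAVORITE COUNTRY LOVE SONGS UPC#: 0-84296-33172-7"||||||'
         . '"COUNTRY MALE PARTY SONGS VOL. 1 UPC#: 0-84296-21772-4"||||||'
         . '"COUNTRY FEMALE PARTY SONGS VOL. 1 UPC#: 0-84296-21712-0"|||';

for my $pair (titles_and_upcs($line)) {
    print "Title: $pair->[0] UPC: $pair->[1]\n";
}
```

Returning the pairs instead of printing inside the loop should make it easier to attach the track lists to each title later.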

The next step is getting the tracks associated with each title.

I'll let you know as I come up with more....

Kevin
-- 
Kevin Old <[EMAIL PROTECTED]>

