Maybe

    FileMetaInfo::Miner::StarOffice
    FileMetaInfo::Miner::HTML
    etc

Tim.

On Wed, Jul 17, 2002 at 05:41:17PM -0500, Midh Mulpuri wrote:
> You are right. These modules are not general purpose parsers. In fact, I
> am using HTML::Parser to implement a HTML miner to extract data from
> HTML/HEAD/META. There is a miner that wraps around stat() to make the
> data provided by stat() in the Miner/Store framework. The beauty of the
> framework is that it provides a uniform way to get Meta data on many
> sources in the same application and to use several miners to obtain
> information on the same file. It is also fairly easy to write a Miner
> that analyzes a particular file and constructs data such as number of
> words, keywords, version number. Since file formats vary, I believe that
> a uniform way to obtain this information would be useful.
> 
> I am at a loss as far as the Namespace is concerned because there don't
> seem to be related modules in CPAN. There is a Metadata module but it
> implements interfaces that I believe are not useful for what I am trying
> to do. This is what the modules do: Process files to obtain Metadata.
> 
> The best alternative I can come up with is File::Metadata. Is this any
> better?
> 
> 
> -----Original Message-----
> From: Ade Olonoh [mailto:[EMAIL PROTECTED]] 
> Sent: Tuesday, July 16, 2002 8:27 PM
> To: Midh Mulpuri
> Cc: [EMAIL PROTECTED]
> Subject: Re: New Module Advice
> 
> What kind of meta data do you mean?  It sounds like there is
> application-specific functionality that the Miner/Store modules provide,
> rather than being a general Star/Open Office parser or an HTML parser. 
> The type of meta data you're retrieving from the files would probably
> hint towards a better name, since (IMHO) "MetaInfo" is too vague to
> signal what kind of problem could be solved with the module.
> 
> --Ade.
> 
> 
> On Mon, 2002-07-15 at 17:36, Midh Mulpuri wrote:
> >        I am writing an application that collects and stores meta data
> > from a variety of files (e.g. Star/Open Office files, HTML files) etc.
> > The application is written as one set of modules that extract this
> meta
> > data from a file and another set of modules that store this
> information.
> > The information is exchanged between a miner and a store in a hash.
> >        I believe that the miner modules would be useful to every one.
> I
> > would like to release to CPAN. At the same time the store modules are
> a
> > nice way to store this meta information but they do not implement
> > anything that is useful separate from the Miner modules. 
> >        Would it be a good thing to release both the Miner and Store
> > modules. At last count I have four of the former and two of the
> latter.
> > There is one store Module that write the Metadata to a XML file and
> > another to a DBI supported database.
> >        Another problem would be the Namespace. MetaInfo sees
> available.
> > I could release the modules as MetaInfo::Miner::--------- and
> > MetaInfo::Store::------ if I wanted to release both sets. On the other
> > hand I could release just the Miners under the MetaInfo namespace.
> >        Any advice and pointers would be appreciated since this would
> be
> > my first release to CPAN.
> >        - Midh Mulpuri
> 

Reply via email to