Re: [fpc-pascal] Re: html scrapping library

2011-02-15 Thread Felipe Monteiro de Carvalho
On Tue, Feb 15, 2011 at 3:09 PM, Gene Buckle ge...@deltasoft.com wrote:
 The person may mean scraping (to scrape).

Didn't help much at least to me.

-- 
Felipe Monteiro de Carvalho
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


Re: [fpc-pascal] Re: html scrapping library

2011-02-15 Thread Cees Binkhorst
http://en.wikipedia.org/wiki/ScraperWiki

On 02/15/2011 03:12 PM, Felipe Monteiro de Carvalho wrote:
 On Tue, Feb 15, 2011 at 3:09 PM, Gene Buckle ge...@deltasoft.com wrote:
 The person may mean scraping (to scrape).
 
 Didn't help much at least to me.
 
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


Re: [fpc-pascal] Re: html scrapping library

2011-02-15 Thread ik
On Tue, Feb 15, 2011 at 16:04, Ben ben.smith.li...@gmail.com wrote:

 What does a html scrapping library do?


I have a spelling error, it's HTML scraping, it means to read HTML (and few
other contents on the web), and extract information using for example CSS
elements (that is #id .class:first) will brings the first data inside of
id of the element with the class content.

Another way is to extract information using XPath (for xml).

At the end it's a data mining for specific type of information.






 --

Ben.

 ___
 fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
 http://lists.freepascal.org/mailman/listinfo/fpc-pascal


Ido
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal

Re[2]: [fpc-pascal] Re: html scrapping library

2011-02-15 Thread José Mejuto
Hello FPC-Pascal,

Tuesday, February 15, 2011, 6:08:29 PM, you wrote:

i I have a spelling error, it's HTML scraping, it means to read HTML (and few
i other contents on the web), and extract information using for example CSS
i elements (that is #id .class:first) will brings the first data inside of
i id of the element with the class content.
i Another way is to extract information using XPath (for xml).
i At the end it's a data mining for specific type of information.

Almost all scrapers I had used in the past were built using regular
expressions, so the regexp engine should be more than enought.

-- 
Best regards,
 José

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal