On Fri, Jan 21, 2011 at 3:58 AM, Rajarshi Guha <[email protected]> wrote:
>
> On Jan 19, 2011, at 10:19 AM, Carcharoth wrote:
>
>> On Wed, Jan 19, 2011 at 3:10 PM, Andrew Gray <[email protected]
>> > wrote:
>>
>> I'm curious as well. I'm also curious as to why the user wants to
>> extract this information, given that they should (going by their
>> signature) have access to databases that already have this sort of
>> information (the sort of databases that should be supplying the
>> information in the Wikipedia infoboxes). There probably is a reason,
>> but I can't immediately think of one.
>
> Partly because Wikipedia has done an aggregation on the multiple data
> sources

It might be better to extract links to the sources, rather than the
actual data itself, which could be in a vandalised state at the time
of extraction. What I guess I'm saying is that the data is better
obtained from the sources, rather than Wikipedia. Or at the least
cross-checking with the sources needs to be done, depending on what
the data will be used for.

Carcharoth

_______________________________________________
WikiEN-l mailing list
[email protected]
To unsubscribe from this mailing list, visit:
https://lists.wikimedia.org/mailman/listinfo/wikien-l

Reply via email to