Re: [backstage] Proposal for a new API source

2007-06-06 Thread Mr I Forrester

Sam Smith wrote:

On Mon, 4 Jun 2007, Ian Forrester wrote:

So how about we started to archive the RSS feeds into one
large database, index them by time and date, slap a nice
REST interface on them and let you guys have access to it?


You mean a more complex version of:
http://sebastian.foriru.co.uk/~sams/bbc/pouncing/


If anyone thinks time/date is useful, I can stick those in
the feed.


cheers
Sam

Time and date would be useful. So if I'm correct your now index all of 
BBC News feeds from May 13th onwards?


Ian
-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/


[backstage] Proposal for a new API source

2007-06-04 Thread Ian Forrester
Hi All,

I've been thinking about what data you guys find useful and what data could be 
made available without getting rights clearance involved.

Anyway, one thing the BBC does a lot of is RSS feeds. But there not exactly 
that useful for larger applications and prototypes because of lack of structure 
and partial nature (best way I can explain it).

So how about we started to archive the RSS feeds into one large database, index 
them by time and date, slap a nice REST interface on them and let you guys have 
access to it?

I imagine within a few months, you could data mine out keywords and trends. It 
would also make a great archive of the news at the time (suggesting we use BBC 
News, World Service and Sport feeds to start).

Ideally in the future, you would all be able to add metadata around the 
original items, setup unique queries and copy the whole database.

How's this sound because its something I think could be built quite easily. If 
people are interested I can go into the details later on the list.

So what do you guys all think?

Ian Forrester

This e-mail is: [] private; [] ask first; [ x ] bloggable

Senior Producer, BBC Backstage
BC5 C3, Media Village, 201 Wood Lane, London W12 7TP
e: [EMAIL PROTECTED]
p: +44 (0)2080083965

-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/


Re: [backstage] Proposal for a new API source

2007-06-04 Thread Daithi O Crualaoich

That sounds like fun.  It would be a nice research tool to pick out
all the BBC News stories about a particular topic and see how it
developed.

Was it just the RSS feeds you wanted to stored?  Because the See Also
links on an news story page would be great to cross reference in the
database.  Is there any other story metadata floating about that could
be incorporated as well?

If it's easy(famous last words) then definitely build it.  Sounds like
a great idea to me.


Daithi


On 6/4/07, Ian Forrester [EMAIL PROTECTED] wrote:

Hi All,

I've been thinking about what data you guys find useful and what data could be 
made available without getting rights clearance involved.

Anyway, one thing the BBC does a lot of is RSS feeds. But there not exactly 
that useful for larger applications and prototypes because of lack of structure 
and partial nature (best way I can explain it).

So how about we started to archive the RSS feeds into one large database, index 
them by time and date, slap a nice REST interface on them and let you guys have 
access to it?

I imagine within a few months, you could data mine out keywords and trends. It 
would also make a great archive of the news at the time (suggesting we use BBC 
News, World Service and Sport feeds to start).

Ideally in the future, you would all be able to add metadata around the 
original items, setup unique queries and copy the whole database.

How's this sound because its something I think could be built quite easily. If 
people are interested I can go into the details later on the list.

So what do you guys all think?

Ian Forrester

This e-mail is: [] private; [] ask first; [ x ] bloggable

Senior Producer, BBC Backstage
BC5 C3, Media Village, 201 Wood Lane, London W12 7TP
e: [EMAIL PROTECTED]
p: +44 (0)2080083965

-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/


RE: [backstage] Proposal for a new API source

2007-06-04 Thread Brian Butterworth
Is there any way to add the physical locations (ie, latitude and longitude)
so that you can search for 'local' stories too?   For example, Sports
reports would have the location of the stadium,  News reports would list the
relevant locations etc
 
Brian Butterworth
www.ukfree.tv
 

 -Original Message-
 From: [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] On Behalf Of Ian Forrester
 Sent: 04 June 2007 15:04
 To: backstage@lists.bbc.co.uk
 Subject: [backstage] Proposal for a new API source
 
 Hi All,
 
 I've been thinking about what data you guys find useful and 
 what data could be made available without getting rights 
 clearance involved.
 
 Anyway, one thing the BBC does a lot of is RSS feeds. But 
 there not exactly that useful for larger applications and 
 prototypes because of lack of structure and partial nature 
 (best way I can explain it).
 
 So how about we started to archive the RSS feeds into one 
 large database, index them by time and date, slap a nice REST 
 interface on them and let you guys have access to it?
 
 I imagine within a few months, you could data mine out 
 keywords and trends. It would also make a great archive of 
 the news at the time (suggesting we use BBC News, World 
 Service and Sport feeds to start).
 
 Ideally in the future, you would all be able to add metadata 
 around the original items, setup unique queries and copy the 
 whole database.
 
 How's this sound because its something I think could be built 
 quite easily. If people are interested I can go into the 
 details later on the list.
 
 So what do you guys all think?
 
 Ian Forrester
 
 This e-mail is: [] private; [] ask first; [ x ] bloggable
 
 Senior Producer, BBC Backstage
 BC5 C3, Media Village, 201 Wood Lane, London W12 7TP
 e: [EMAIL PROTECTED]
 p: +44 (0)2080083965
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To 
 unsubscribe, please visit 
 http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
   Unofficial list archive: 
 http://www.mail-archive.com/backstage@lists.bbc.co.uk/
 
 No virus found in this incoming message.
 Checked by AVG Free Edition. 
 Version: 7.5.472 / Virus Database: 269.8.7/830 - Release 
 Date: 03/06/2007 12:47
  
 

No virus found in this outgoing message.
Checked by AVG Free Edition. 
Version: 7.5.472 / Virus Database: 269.8.7/830 - Release Date: 03/06/2007
12:47
 

-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/


Re: [backstage] Proposal for a new API source

2007-06-04 Thread Matthew Somerville
Ian Forrester wrote, reordered slightly:
 So what do you guys all think?

Sounds like a great idea; especially as I've been doing this for the BBC
News front page (similar to the RSS feed, I guess) since 2005? :-)

 I imagine within a few months, you could data mine out keywords and
 trends.

You certainly could. :) Perhaps with a tag cloud:
http://www.dracos.co.uk/work/bbc-news-archive/tardis/tag-cloud/

Or watching how the word Iran is used over time:
http://www.dracos.co.uk/work/bbc-news-archive/tardis/search-headline/?s=iran

Or which pictures the BBC decides to illustrate stories about the Olympics:
http://www.dracos.co.uk/work/bbc-news-archive/tardis/image-search/?s=olympics

 It would also make a great archive of the news at the time

Here's my archive of the front page of BBC News for the past 6 months or so:
http://www.dracos.co.uk/work/bbc-news-archive/archive.php

 Ideally in the future, you would all be able to add metadata around the
 original items, setup unique queries and copy the whole database.

Sadly, not sure I have the bandwidth for that. ;) You can create a unique
timeline of how an event appeared on the front page:
http://www.dracos.co.uk/work/bbc-news-archive/tardis/timeline/?t=Richard+Hammond%27s+crash

 So how about we started to archive the RSS feeds into one large database,
 index them by time and date, slap a nice REST interface on them and let 
 you guys have access to it?

Seems odd for you to have to export the data from somewhere it should
already be a database to RSS to import it back in again - perhaps public
read access to the actual BBC News content management database would be a
bit hard to justify. ;-) Apologies that my dataset doesn't have an API
beyond what's there; I'd be happy to add things if people asked (no-one ever
has :) ) and I had the time. But I can do database queries, e.g. what were
the BBC News front page main headlines on New Years Day 2006?

--8---
msbbc= select url,headline,start,description from headline where
date_trunc('day', start)='2006-01-01' and type='headline' order by start;

 /1/hi/uk/4570980.stm   | Revellers ready to welcome 2006   |
2006-01-01 00:00:00 | Millions in the UK prepare to see in the New Year,
while in London a Tube strike appears not to have caused chaos.
 /1/hi/uk/4570980.stm   | Revellers welcome 2006 across UK  |
2006-01-01 00:05:00 | Millions in the UK celebrate the New Year, while in
London a Tube strike appears not to have caused chaos.
 /1/hi/uk/4570980.stm   | Revellers welcome 2006 across UK  |
2006-01-01 04:35:00 | Millions in the UK celebrate the New Year, with
parties and spectacular firework displays in many cities.
 /1/hi/health/4529762.stm   | Rift over greater hospital choice |
2006-01-01 13:40:00 | People in England are given a choice of hospitals for
non-emergency surgery, but unions voice concerns.
 /1/hi/world/europe/4573572.stm | Ukraine gas crisis 'could hit EU' |
2006-01-01 16:20:00 | Gas exports to the European Union could be hit by the
axeing of Russian supplies to Ukraine, officials say.
 /1/hi/world/europe/4573572.stm | Ukraine gas row hits EU supplies  |
2006-01-01 17:40:00 | EU countries start to suffer knock-on cuts in gas
imports as Russia axes supplies to Ukraine.
 /1/hi/world/europe/4573572.stm | Ukraine gas row hits EU supplies  |
2006-01-01 17:45:00 | EU countries start to suffer knock-on cuts in gas
imports as Russia axes supplies to Ukraine in a row over prices.
--8---

Might be useful to someone, you never know!

-- 
ATB,
Matthew  |  http://www.dracos.co.uk/
-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/