Re: [backstage] Greater access to news data
This might help... http://feeds.feedburner.com/bbcnewsfrontpagefullfeed Regards, Duncan On 17/02/2008, Richard Askew [EMAIL PROTECTED] wrote: Hello everyone, firs time poster here! I wondered if you could help me. I am currently in my final year of University and I am currently drawing up ideas for my dissertation. I am looking to do some work with the BBC news feeds. At the moment I can receive the feeds and get the headline and brief description of the story. Is there a way in which I could go on and retrieve the whole news story for that particular feed so it can be presented in an application? Thank you for your time and I look forward to hearing from you. Richard Askew * To view the terms under which this email is distributed, please go to http://www.hull.ac.uk/legal/email_disclaimer.html * -- --- Mob: 07976 914666 Web: http://www.barnesdmd.co.uk - Sent via the backstage.bbc.co.uk discussion group. To unsubscribe, please visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html. Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/
Re: [backstage] Greater access to news data
Richard, If you are not into coding too much, follow the URLs in the feeds, but exchange the /hi/ for a /low/ http://news.bbc.co.uk/1/hi/business/7250068.stm becomes http://news.bbc.co.uk/1/low/business/7250068.stm Then look for the first h2 in the data. If you are using PHP, you can get a long way using.. $strContents=strstr(strip_tags(join(,file( http://news.bbc.co.uk/1/low/business/7250068.stm;)),h2),h2); On 17/02/2008, Richard Askew [EMAIL PROTECTED] wrote: Hello everyone, firs time poster here! I wondered if you could help me. I am currently in my final year of University and I am currently drawing up ideas for my dissertation. I am looking to do some work with the BBC news feeds. At the moment I can receive the feeds and get the headline and brief description of the story. Is there a way in which I could go on and retrieve the whole news story for that particular feed so it can be presented in an application? Thank you for your time and I look forward to hearing from you. Richard Askew * To view the terms under which this email is distributed, please go to http://www.hull.ac.uk/legal/email_disclaimer.html * -- Please email me back if you need any more help. Brian Butterworth http://www.ukfree.tv
RE: [backstage] Greater access to news data
Thank you for evryones help with this, I feel I can move on now. When I have a protoytpe running I will post one here so you can all have a look to see what you think. Cheers Richard Askew From: [EMAIL PROTECTED] on behalf of Brian Butterworth Sent: Mon 18/02/2008 11:05 To: backstage@lists.bbc.co.uk Subject: Re: [backstage] Greater access to news data Richard, If you are not into coding too much, follow the URLs in the feeds, but exchange the /hi/ for a /low/ http://news.bbc.co.uk/1/hi/business/7250068.stm becomes http://news.bbc.co.uk/1/low/business/7250068.stm Then look for the first h2 in the data. If you are using PHP, you can get a long way using.. $strContents=strstr(strip_tags(join(,file( http://news.bbc.co.uk/1/low/business/7250068.stm ), On 17/02/2008, Richard Askew [EMAIL PROTECTED] wrote: Hello everyone, firs time poster here! I wondered if you could help me. I am currently in my final year of University and I am currently drawing up ideas for my dissertation. I am looking to do some work with the BBC news feeds. At the moment I can receive the feeds and get the headline and brief description of the story. Is there a way in which I could go on and retrieve the whole news story for that particular feed so it can be presented in an application? Thank you for your time and I look forward to hearing from you. Richard Askew * To view the terms under which this email is distributed, please go to http://www.hull.ac.uk/legal/email_disclaimer.html * -- Please email me back if you need any more help. Brian Butterworth http://www.ukfree.tv http://www.ukfree.tv/ winmail.dat* To view the terms under which this email is distributed, please go to http://www.hull.ac.uk/legal/email_disclaimer.html *
[backstage] Greater access to news data
Hello everyone, firs time poster here! I wondered if you could help me. I am currently in my final year of University and I am currently drawing up ideas for my dissertation. I am looking to do some work with the BBC news feeds. At the moment I can receive the feeds and get the headline and brief description of the story. Is there a way in which I could go on and retrieve the whole news story for that particular feed so it can be presented in an application? Thank you for your time and I look forward to hearing from you. Richard Askew* To view the terms under which this email is distributed, please go to http://www.hull.ac.uk/legal/email_disclaimer.html *
Re: [backstage] Greater access to news data
Could you use something like python's Beautiful Soup library? http://www.crummy.com/software/BeautifulSoup/ It'll scrape the page and you can drill down to isolate the main content block. It's what I used to make a script that parses rss feeds, scrapes the stories from the sites then clusters the stories based on the story, rather than the rss feed, content. You can see the pre-alpha result, it's very much a work in progress (especially the UI): www.codemeup.com Whilst this example only displays the rss title, the script has gone to the actual page and pulled out the content for word frequency analysis. hope that's useful. S. On Feb 17, 2008 9:26 AM, Richard Askew [EMAIL PROTECTED] wrote: Hello everyone, firs time poster here! I wondered if you could help me. I am currently in my final year of University and I am currently drawing up ideas for my dissertation. I am looking to do some work with the BBC news feeds. At the moment I can receive the feeds and get the headline and brief description of the story. Is there a way in which I could go on and retrieve the whole news story for that particular feed so it can be presented in an application? Thank you for your time and I look forward to hearing from you. Richard Askew * To view the terms under which this email is distributed, please go to http://www.hull.ac.uk/legal/email_disclaimer.html *