Well, it's obvious that if the XML couldn't parse the XML, then it's not valid XML. Add to this the fact that it contains a "meta" tag and it's clear that what's at the other end of the link was actually HTML. If you go to http://www.nytimes.com/rss, you'll notice this yourself.
The real feed can be found by following the browser feed icon in the URL bar. It takes you to what the page describes as a feed in the tag <link rel="alternate" type="application/rss+xml" ... />. If you follow it, you will get to this URL: http://feeds.nytimes.com/nyt/rss/HomePage Try it, it should work. Vassil On Fri, Sep 3, 2010 at 2:35 AM, Imtiaz Ahmed H E <[email protected]> wrote: > I have an action > > every 1 mins rss:http://www.nytimes.com/rss > > feed link got thro' right-click on RSS icon at bottom of nytimes.com and > "copy link location" in windows context menu and paste into esme with rss: > prepended. > > and, > > every 1 mins atom:http://twitter.com/statuses/user_timeline/esjewett.atom > > and I get in the Tomcat Window, > > WARN - Going to buffer response body of large or unknown size. Using > getResponse > BodyAsStream instead is recommended. > :37:31: expected closing tag of meta > <a > href="http://www.nytimes.com/pages/sports/index.html">Sports< > /a> > ^ > > > and, apparently for the twitter feed, > > WARN - Going to buffer response body of large or unknown size. Using > getResponse > BodyAsStream instead is recommended. > :37:31: expected closing tag of meta > <a > href="http://www.nytimes.com/pages/sports/index.html">Sports< > /a> > ^ > WARN - Cookie rejected: "$Version=0; k=122.167.31.233.1283470309076286; > $Path=/; > $Domain=.twitter.com". Illegal domain attribute ".twitter.com". Domain of > origi > n: "twitter.com" > WARN - Cookie rejected: "$Version=0; > _twitter_sess=BAh7CDoPY3JlYXRlZF9hdGwrCNq2y > tQqAToHaWQiJWRjMTNkMjQ2ZGRiYjA2%250AOTU1ZGZjMTc1NjMxMTZhN2I4IgpmbGFzaElDOidBY3Rp > b25Db250cm9sbGVy%250AOjpGbGFzaDo6Rmxhc2hIYXNoewAGOgpAdXNlZHsA--f65405470eedc4a64 > defa69a0e78d22bd676cc0c; $Path=/; $Domain=.twitter.com". Illegal domain > attribut > e ".twitter.com". Domain of origin: "twitter.com" > > > > No feed updates in Esme. > > Vassil, would you want to fix this or shall I look into it. > > Imtiaz > > ----- Original Message ----- From: "Vassil Dichev" <[email protected]> > To: <[email protected]> > Sent: Friday, September 03, 2010 1:29 AM > Subject: Re: ESME-267 - Pooled links in popular links list > > >> Fixed. Now if you post the same link to a pool and to the public, you >> will notice that the href attribute points to the internal shortened >> URL in the former case and to the target URL in the latter case. This >> means that popularity statistics will only be gathered when links on >> public messages are clicked. >> >> An unique ID is still generated for all URLs but for links in pooled >> messages they're not visible. >> >> This should fix the problem. Does someone want to verify that we have >> indeed the correct behavior? >> >> Vassil >> >> >> On Wed, Sep 1, 2010 at 8:18 AM, Richard Hirsch <[email protected]> >> wrote: >>> >>> Sounds like a god idea. >>> >>> D. >>> >>> On 8/31/10, Vassil Dichev <[email protected]> wrote: >>>> >>>> Right, we just don't generate and store a unique ID for links in pools >>>> and will generate a different object on parsing. This way links which >>>> come from pools will point directly to the target URL and links from >>>> public messages will be redirected through the internal shortened URL, >>>> which will allow statistics to be collected. This won't break any >>>> functionality and I think it could be done fairly easily. >>>> >>>> I will assign ESME-267 to me if nobody objects to the proposed solution. >>>> >>>> Vassil >>>> >>>> >>>> On Tue, Aug 31, 2010 at 9:20 PM, Richard Hirsch <[email protected]> >>>> wrote: >>>>> >>>>> Leave original link but just don't add it to PopularLinks. >>>>> >>>>> On 8/31/10, Ethan Jewett <[email protected]> wrote: >>>>>> >>>>>> Oh, I see. Yes, that would make sense. So we would just leave the >>>>>> original link in there, right? >>>>>> >>>>>> Ethan >>>>>> >>>>>> On Tue, Aug 31, 2010 at 8:12 PM, Richard Hirsch >>>>>> <[email protected]> >>>>>> wrote: >>>>>>> >>>>>>> I agree with the solution of just removing those links that originate >>>>>>> in >>>>>>> pools. >>>>>>> >>>>>>> D. >>>>>>> >>>>>>> On 8/31/10, Vassil Dichev <[email protected]> wrote: >>>>>>>> >>>>>>>> OK, I think this is a worse example, because there are many ways to >>>>>>>> find a list of URLs in a wiki (which were generally just not >>>>>>>> designed >>>>>>>> with privacy/security in mind). >>>>>>>> >>>>>>>> If you're willing to sacrifice convenience for security, the easiest >>>>>>>> change is not to parse URLs in messages in pools- it will appear as >>>>>>>> normal text, not as a hyperlink. The next thing we can do is set up >>>>>>>> a >>>>>>>> different type of URL which doesn't take you to the shortened URL, >>>>>>>> but >>>>>>>> directly to the target URL. >>>>>>>> >>>>>>>> If one really insists on shortening URLs in pools, then there must >>>>>>>> be >>>>>>>> one set of shortened URLs per pool. I don't think anyone will claim >>>>>>>> that this idea makes sense. >>>>>>>> >>>>>>>> Vassil >>>>>>>> >>>>>>>> >>>>>>>> On Tue, Aug 31, 2010 at 11:35 AM, Ethan Jewett <[email protected]> >>>>>>>> wrote: >>>>>>>>> >>>>>>>>> I agree in theory with your assessment of the google docs >>>>>>>>> situation, >>>>>>>>> but I still think we're violating the expectation of security >>>>>>>>> around >>>>>>>>> pools. >>>>>>>>> >>>>>>>>> Take another example: An HR department is using a secure wiki to >>>>>>>>> discuss and organize an upcoming layoff. The wiki page is titled >>>>>>>>> "October layoff planning" and the URL is >>>>>>>>> https://hrwiki.corp.internal/October-layoff-planning. Someone posts >>>>>>>>> this URL to the layoff-planning pool on esme (the same group of >>>>>>>>> people >>>>>>>>> with access to the wiki page) and a bunch of people in the pool >>>>>>>>> click >>>>>>>>> on it. Suddenly, the upcoming layoff has been announced to every >>>>>>>>> esme >>>>>>>>> user in the corporation. Whoops! >>>>>>>>> >>>>>>>>> The point is, maybe that private information shouldn't be in the >>>>>>>>> URL, >>>>>>>>> but a lot of applications do this whether or not it is a good idea. >>>>>>>>> I >>>>>>>>> think we need to take that reality into account and change the way >>>>>>>>> this works to avoid the possibility of these scenarios. >>>>>>>>> >>>>>>>>> Ethan >>>>>>>>> >>>>>>>>> On Tuesday, August 31, 2010, Vassil Dichev <[email protected]> >>>>>>>>> wrote: >>>>>>>>>> >>>>>>>>>> Ethan, this defeats the purpose of having an URL shortener and it >>>>>>>>>> only >>>>>>>>>> gives you a false sense of security. Read my previous mail. >>>>>>>>>> >>>>>>>>>> Links have no notion of a pool. A link could come from messages in >>>>>>>>>> different pools or it might not be clicked "inside a message" at >>>>>>>>>> all. >>>>>>>>>> >>>>>>>>>> Let me know what you think. >>>>>>>>>> >>>>>>>>>> Vassil >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> On Tue, Aug 31, 2010 at 9:44 AM, Ethan Jewett <[email protected]> >>>>>>>>>> wrote: >>>>>>>>>>> >>>>>>>>>>> [Changed subject to start a new thread. Was: "New issues - a >>>>>>>>>>> couple >>>>>>>>>>> of >>>>>>>>>>> blockers for 1.1 release"] >>>>>>>>>>> >>>>>>>>>>> That's correct. The "Popular messages" functionality just keeps a >>>>>>>>>>> counter of how many times a message has been resent. If you look >>>>>>>>>>> at >>>>>>>>>>> the UserActor.scala, lines 197 & 198, you'll see that the >>>>>>>>>>> statistic >>>>>>>>>>> "ResendStat" is incremented when a message is resent, but only if >>>>>>>>>>> the >>>>>>>>>>> message is not in a pool. Then when we want to find out what the >>>>>>>>>>> most >>>>>>>>>>> popular messages are, we ask the PopStatsActor - for example in >>>>>>>>>>> the >>>>>>>>>>> "popular" method of UserSnip.scala - line 213. >>>>>>>>>>> >>>>>>>>>>> On the other hand, the "LinkClicked is incremented in >>>>>>>>>>> UrlStore.scala >>>>>>>>>>> - >>>>>>>>>>> line 40. Here there is never a check to see if the link came from >>>>>>>>>>> a >>>>>>>>>>> message in a pool. (This counter is used in the "links" method in >>>>>>>>>>> UserSnip.scala, after the "popular" method.) >>>>>>>>>>> >>>>>>>>>>> I think we need to check if a link came from a pool before >>>>>>>>>>> incrementing the counter, but in order to do this we need to >>>>>>>>>>> record >>>>>>>>>>> what pool a link belonged to, so I think we need to make pool >>>>>>>>>>> part >>>>>>>>>>> of >>>>>>>>>>> the key of the UrlStore object and then populate this field when >>>>>>>>>>> a >>>>>>>>>>> new >>>>>>>>>>> link is created. >>>>>>>>>>> >>>>>>>>>>> Ethan >>>>>>>>>>> >>>>>>>>>>> On Tue, Aug 31, 2010 at 8:11 AM, Imtiaz Ahmed H E >>>>>>>>>>> <[email protected]> >>>>>>>>>>> wrote: >>>>>>>>>>>> >>>>>>>>>>>> In the home when I type in a message sharing it with one pool >>>>>>>>>>>> and >>>>>>>>>>>> click >>>>>>>>>>>> resend it does not show up in Popular Messages. But if the >>>>>>>>>>>> message >>>>>>>>>>>> is >>>>>>>>>>>> public >>>>>>>>>>>> it shows up on resend in Popular Pessages. >>>>>>>>>>>> >>>>>>>>>>>> Can you explain. Haven't gotten to Popular Links yet. >>>>>>>>>>>> >>>>>>>>>>>> Imtiaz >>>>>>>>>>>> >>>>>>>>>>>> ----- Original Message ----- From: "Ethan Jewett" >>>>>>>>>>>> <[email protected]> >>>>>>>>>>>> To: <[email protected]> >>>>>>>>>>>> Sent: Tuesday, August 31, 2010 11:37 AM >>>>>>>>>>>> Subject: Re: New issues - a couple of blockers for 1.1 release >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> Hi, >>>>>>>>>>>> >>>>>>>>>>>> The issue doesn't happen with Popular Messages, only with >>>>>>>>>>>> Popular >>>>>>>>>>>> Links. >>>>>>>>>>>> >>>>>>>>>>>> I need to look into the implementation, but I have a feeling the >>>>>>>>>>>> Popular Links issue is going to be a headache. I believe that >>>>>>>>>>>> for a >>>>>>>>>>>> given link there is no way to tell what message it shows up in, >>>>>>>>>>>> which >>>>>>>>>>>> would make it impossible to tell if it is a link from a pooled >>>>>>>>>>>> message >>>>>>>>>>>> or not. We may have to modify the data model for storing links >>>>>>>>>>>> to >>>>>>>>>>>> flag >>>>>>>>>>>> the ones that started out in a pooled message... >>>>>>>>>>>> >>>>>>>>>>>> Regarding Pubsubhubbub, as Dick said, there's no hurry. I don't >>>>>>>>>>>> think >>>>>>>>>>>> I'll be working on it over the next couple of weeks. >>>>>>>>>>>> >>>>>>>>>>>> Thanks for all your efforts! >>>>>>>>>>>> >>>>>>>>>>>> Ethan >>>>>>>>>>>> >>>>>>>>>>>> On Tue, Aug 31, 2010 at 4:20 AM, Imtiaz Ahmed H E >>>>>>>>>>>> <[email protected]> >>>>>>>>>>>> wrote: >>>>>>>>>>>>> >>>>>>>>>>>>> Re https://issues.apache.org/jira/browse/ESME-267 >>>>>>>>>>>>> >>>>>>>>>>>>> I haven't tried this but plan to fix it right away. >>>>>>>>>>>>> >>>>>>>>>>>>> Tell me, is it only the links showing up in 'Popular Links' or >>>>>>>>>>>>> is >>>>>>>>>>>>> that >>>>>>>>>>>>> a >>>>>>>>>>>>> problem with the message itself also showing up in >>>>>>>>>>>>> 'PopularMessages' >>>>>>>>>>>>> >>>>>>>>>>>>> Looks like I'll never get going with pubsubhubub ! First there >>>>>>>>>>>>> was >>>>>>>>>>>>> Dick's >>>>>>>>>>>>> Release Planning mail with the pending 1.1 issues and now here >>>>>>>>>>>>> are >>>>>>>>>>>>> some >>>>>>>>>>>>> more. Plan to get going after all 1.1 ending issues are >>>>>>>>>>>>> resolved. >>>>>>>>>>>>> >>>>>>>>>>>>> However, Ethan it was your issue originally and if you feel you >>>>>>>>>>>>> want >>>>>>>>>>>>> to >>>>>>>>>>>>> take >>>>>>>>>>>>> it back again to push it to closure faster or something please >>>>>>>>>>>>> do, >>>>>>>>>>>>> otherwise >>>>>>>>>>>>> I'll re-start on it once 1.1 is done... >>>>>>>>>>>>> >>>>>>>>>>>>> Imtiaz >>>>>>>>>>>>> >>>>>>>>>>>>> ----- Original Message ----- From: "Richard Hirsch" >>>>>>>>>>>>> <[email protected]> >>>>>>>>>>>>> To: < >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> > >
