[twitter-dev] Re: Woe is me, I can't seek what I find (or Search is failing me)
Hi, I too have looked at the streaming API for our use but the restrictions of single keywords has stopped us from implementing it. We are also having issues in the other thread related to this issue and almost 0 data for some of our geolocated search terms. Nick On Oct 12, 8:04 am, @IDisposable idisposa...@gmail.com wrote: From that thread ticket 1930 was filed on our issue tracker which we will update when a fix is deployed: http://code.google.com/p/twitter-api/issues/detail?id=1930 Excellent, I hope it gets fixed while there is still time to back-fill some of this data,,,otherwise we're going to have a silly-looking hole in the next State of Twitter in St. Louis report :) I understand your reasons for the location tracking using the Search API but wondered if you knew that the mentions search you are doing can be carried out on using the Streaming API filter method. That should cut down on the number or REST queries you need to make. More information on that method is here: http://dev.twitter.com/pages/streaming_api_methods#statuses-filter Yes, I really need to switch to streaming for that... I just haven't had he bandwidth as of yet... we are using a Search (nee Summize) based infrastructure from a long while back and me being the one guy in the room, I've not had a chance to really skim through and update our stuff for streaming. Out of curiosity what is the third column of your figures represent? It may be possible to track that one using the Streaming API as well. We do about 68 searches (mostly hashtags, a couple keyword or user searches--for legacy/coverage guarantees) and 64 timeline follows (mostly lists, one hometimel). Each of these sources applies a label based on the source of incoming data (which search/timeline) for our various categories (seehttp://stltweets.comand click the category menus e.g. Blues). For ALL of these searches, we also apply a top-level category (e.g. Sports) and finally ALL of the tweets get a label of Everything for ease of seperating various sub-sites. Thus, the Everything column in my numbers is the overall volume of tweets from all sources. SO, am I to assume that the geocode search bug, once fixed, will go back to returning the tweets from people whose _profile location_ reads something near St. Louis like before? Thanks, Marc -- Twitter developer documentation and resources: http://dev.twitter.com/doc API updates via Twitter: http://twitter.com/twitterapi Issues/Enhancements Tracker: http://code.google.com/p/twitter-api/issues/list Change your membership to this group: http://groups.google.com/group/twitter-development-talk
[twitter-dev] Re: Woe is me, I can't seek what I find (or Search is failing me)
The Location search has been VERY unstable, and uses this typical search:http://search.twitter.com/search.atom?rpp=100geocode=38.627522%2C-90... It's getting worse all the time! Is this what we can expect going forward? If so, how can I follow all 20+ people we used to get tweets from on the location search? I'll happily create an account and manage the lists/follows... but I'm pretty sure that will get me killed, and it will only be a snapshot based on current profile location strings that we have... Sure, I could suck the *-pipe, but without a filter criteria, I'm going to be seeing all tweets from the entire universe, which seems hella-wasteful to twitter and me... Day MentionsLocationEverything 2010-09-13 498546801 53503 2010-09-14 471948110 54589 2010-09-15 477947599 54209 2010-09-16 514347087 54312 2010-09-17 525648363 55581 2010-09-18 488840943 47237 2010-09-19 587146008 53843 2010-09-20 499046219 52826 2010-09-21 49274 55933 2010-09-22 536451567 58999 2010-09-23 686642495 52967 2010-09-24 619141107 50679 2010-09-25 567336321 43950 2010-09-26 678435168 44664 2010-09-27 634632580 42192 2010-09-28 544832528 41792 2010-09-29 603840677 50472 2010-09-30 596438116 47713 2010-10-01 661538360 48302 2010-10-02 561223107 32024 2010-10-03 672822802 33328 2010-10-04 552823990 33491 2010-10-05 511638733 47023 2010-10-06 542739041 47856 2010-10-07 573330855 40742 2010-10-08 6355945922235 2010-10-09 5894835218691 2010-10-10 7240839920861 2010-10-11 4017558713010 -- Twitter developer documentation and resources: http://dev.twitter.com/doc API updates via Twitter: http://twitter.com/twitterapi Issues/Enhancements Tracker: http://code.google.com/p/twitter-api/issues/list Change your membership to this group: http://groups.google.com/group/twitter-development-talk
[twitter-dev] Re: Woe is me, I can't seek what I find (or Search is failing me)
Thanks for publishing this information. There is another thread discussing the issue with the Geocode search not respecting the radius of a search here: http://groups.google.com/group/twitter-development-talk/browse_thread/thread/a80db3eff77a88fe From that thread ticket 1930 was filed on our issue tracker which we will update when a fix is deployed: http://code.google.com/p/twitter-api/issues/detail?id=1930 I understand your reasons for the location tracking using the Search API but wondered if you knew that the mentions search you are doing can be carried out on using the Streaming API filter method. That should cut down on the number or REST queries you need to make. More information on that method is here: http://dev.twitter.com/pages/streaming_api_methods#statuses-filter Out of curiosity what is the third column of your figures represent? It may be possible to track that one using the Streaming API as well. Best @themattharris On Oct 11, 10:21 am, @IDisposable idisposa...@gmail.com wrote: The Location search has been VERY unstable, and uses this typical search:http://search.twitter.com/search.atom?rpp=100geocode=38.627522%2C-90... It's getting worse all the time! Is this what we can expect going forward? If so, how can I follow all 20+ people we used to get tweets from on the location search? I'll happily create an account and manage the lists/follows... but I'm pretty sure that will get me killed, and it will only be a snapshot based on current profile location strings that we have... Sure, I could suck the *-pipe, but without a filter criteria, I'm going to be seeing all tweets from the entire universe, which seems hella-wasteful to twitter and me... Day Mentions Location Everything 2010-09-13 4985 46801 53503 2010-09-14 4719 48110 54589 2010-09-15 4779 47599 54209 2010-09-16 5143 47087 54312 2010-09-17 5256 48363 55581 2010-09-18 4888 40943 47237 2010-09-19 5871 46008 53843 2010-09-20 4990 46219 52826 2010-09-21 49274 55933 2010-09-22 5364 51567 58999 2010-09-23 6866 42495 52967 2010-09-24 6191 41107 50679 2010-09-25 5673 36321 43950 2010-09-26 6784 35168 44664 2010-09-27 6346 32580 42192 2010-09-28 5448 32528 41792 2010-09-29 6038 40677 50472 2010-09-30 5964 38116 47713 2010-10-01 6615 38360 48302 2010-10-02 5612 23107 32024 2010-10-03 6728 22802 33328 2010-10-04 5528 23990 33491 2010-10-05 5116 38733 47023 2010-10-06 5427 39041 47856 2010-10-07 5733 30855 40742 2010-10-08 6355 9459 22235 2010-10-09 5894 8352 18691 2010-10-10 7240 8399 20861 2010-10-11 4017 5587 13010 -- Twitter developer documentation and resources: http://dev.twitter.com/doc API updates via Twitter: http://twitter.com/twitterapi Issues/Enhancements Tracker: http://code.google.com/p/twitter-api/issues/list Change your membership to this group: http://groups.google.com/group/twitter-development-talk
[twitter-dev] Re: Woe is me, I can't seek what I find (or Search is failing me)
From that thread ticket 1930 was filed on our issue tracker which we will update when a fix is deployed: http://code.google.com/p/twitter-api/issues/detail?id=1930 Excellent, I hope it gets fixed while there is still time to back-fill some of this data,,,otherwise we're going to have a silly-looking hole in the next State of Twitter in St. Louis report :) I understand your reasons for the location tracking using the Search API but wondered if you knew that the mentions search you are doing can be carried out on using the Streaming API filter method. That should cut down on the number or REST queries you need to make. More information on that method is here: http://dev.twitter.com/pages/streaming_api_methods#statuses-filter Yes, I really need to switch to streaming for that... I just haven't had he bandwidth as of yet... we are using a Search (nee Summize) based infrastructure from a long while back and me being the one guy in the room, I've not had a chance to really skim through and update our stuff for streaming. Out of curiosity what is the third column of your figures represent? It may be possible to track that one using the Streaming API as well. We do about 68 searches (mostly hashtags, a couple keyword or user searches--for legacy/coverage guarantees) and 64 timeline follows (mostly lists, one hometimel). Each of these sources applies a label based on the source of incoming data (which search/timeline) for our various categories (see http://stltweets.com and click the category menus e.g. Blues). For ALL of these searches, we also apply a top-level category (e.g. Sports) and finally ALL of the tweets get a label of Everything for ease of seperating various sub-sites. Thus, the Everything column in my numbers is the overall volume of tweets from all sources. SO, am I to assume that the geocode search bug, once fixed, will go back to returning the tweets from people whose _profile location_ reads something near St. Louis like before? Thanks, Marc -- Twitter developer documentation and resources: http://dev.twitter.com/doc API updates via Twitter: http://twitter.com/twitterapi Issues/Enhancements Tracker: http://code.google.com/p/twitter-api/issues/list Change your membership to this group: http://groups.google.com/group/twitter-development-talk
[twitter-dev] Re: Woe is me, I can't seek what I find (or Search is failing me)
I'm seeing this problem too, but it only started today, around five hours ago. Here's an example search: http://search.twitter.com/search?q=near%3Aedmonton That's returning a fraction of the tweets it was before. This problem happens occasionally, but not usually for this long. On Oct 7, 3:10 pm, @IDisposable idisposa...@gmail.com wrote: Over the last couple months, we've seen some wierd behavior in the responses to search queries. First, I understand the rules about search being non-covering, and that we are at the mercy of the index. That said, I've noticed some odd behavior lately. As background material, we run many searches (and we're white-listed by IP and OAuth account), but the two I want to reference are the Mentions and the Location searches. The Mentions search seems pretty stable and uses this typical search (and then we exclude a bunch of things like Bay St. Louis, etc.):http://search.twitter.com/search.atom?rpp=100q=stl+OR+%23stl+OR+stlo... The Location search has been VERY unstable, and uses this typical search:http://search.twitter.com/search.atom?rpp=100geocode=38.627522%2C-90... As the day progresses, we move up the high-water mark in the since_id to track what we've already received so we should be getting minimal gaps. We almost never see two 100-entry polls in a row, so I think we're keeping up with whatever coverage the search index is offering. I've posted in a Google Spreadsheet a graph of the tweet counts we're seeing since 7/1/2010 so you can see the trends http://bit.ly/9wnnFM (sheet two is the graph). Some interesting things to note: 1) The Mentions search is very consistent. 2) The Location search likes to bounce around a bit. 3) In mid August, we started to have issues with more 403s and error about since_id being too old. We were also getting rate-limited in our calls to get the tweep details (since the ATOM feed is so meager). Due to a bug, I wasn't committing all the tweets when this happened. 4) On or about Sept 1st, you guys did something that broke our ability to stay caught up... we started getting almost no tweets and lots of errors about since_id being too old. I thought this was due to your new tweet id assignment being rolled out. 5) On Sept 5th, I got back from vacation and added logic to understand and use the no new tweets, roll the tweet id forward to this driven by parsing the link rel=refresh node in the ATOM feed. 6) I also, around this time, added better logic to the tweep-lookup detail, only asking you for tweeps I don't have at least a minimal row on. This reduced the number of rate-limiting issues. 7) We were very stable and until 9/23 when volume falls off a lot, and never really recovers. I think this is the new search engine rollout. To research a little more, I tried the Twitter advanced search page and asking for the RSS (atom, really) feed from the advanced search page I get this URL now:http://search.twitter.com/search.atom?geocode=38.627522,-90.19841,30 Which starts off like ours, but adds the (seemingly redundant) human- readable search criteria q=+near:38.627522,-90.19841+within:30mi. Oddly, if we remove that and do the same search at nearly the same instant, I DO get vastly different tweets sets... probably due to volume, possibly just sorting, but I would hope that with the same since_id value, I would get the same tweets... but I don't. So, I'm asking... what's going on? Why are we seeing so much volume fall-off? What can we do about it? Should I be running both searches (my current one and one with the human-readable query) to get better coverage? Is there any hope/expectation of the volume returning to normal? Doesn't anyone else care about tweep-location searches? Now, before you tell me that I should be using Site Streams (which I want to do), realize that I _NEED_ tweets from people whose profile location says they are in St. Louis (and similar) like the old Summize search honored. I can't just get by with the _tweet_ location being STL. Marc Brooks Chief guy getting yelled at,http://stltweets.comhttp://taste.stltweets.comhttp://loufest.stltweets.com -- Twitter developer documentation and resources: http://dev.twitter.com/doc API updates via Twitter: http://twitter.com/twitterapi Issues/Enhancements Tracker: http://code.google.com/p/twitter-api/issues/list Change your membership to this group: http://groups.google.com/group/twitter-development-talk