[twitter-dev] Missing Search API Results

John Barratt Tue, 17 Nov 2009 16:40:46 -0800

Hi All,

We have been noticing gaps appearing in search results at times whendoing geocoded searches in particular. For example with this searchover South Eastern Australia :


http://search.twitter.com/search.json?rpp=100&lang=en&page=1&geocode=-35.2,144.0,1000km

Occasionally produces results with large gaps in the created_at for thetweets. For example, I just got these created_at for tweets returned :


...
Tue, 17 Nov 2009 22:59:50 +0000
Tue, 17 Nov 2009 22:59:49 +0000
Tue, 17 Nov 2009 22:59:48 +0000
Tue, 17 Nov 2009 22:59:43 +0000
Tue, 17 Nov 2009 22:52:04 +0000
Tue, 17 Nov 2009 22:52:04 +0000
Tue, 17 Nov 2009 22:51:34 +0000
Tue, 17 Nov 2009 22:50:21 +0000
Tue, 17 Nov 2009 22:50:20 +0000
Tue, 17 Nov 2009 22:43:37 +0000
...

This area is producing multiple tweets per second, but there are somegaps there many minutes long. A subsequent search 10's of seconds, to afew minutes later 'fills in these gaps' with many many more tweets fromthe periods in between these minutes-long gaps, confirming that theinitial search was in fact sparse.


The same effect exists via the normal web search interface also.

It has previously been possible to just use the maximum id of the tweetsfrom the previous search result set, and then only page through resultsuntil you saw that id again. But having the search results appear outof order means this method is not possible. It means you would have tosearch across all 15 pages x 100 rpp continuously in order to ensuresomething approaching a complete result set. Even then it will notalways be possible if ~ 1500 of results 'appear at once'. This is notsustainable for both the app doing the searching, or from the point ofview of the many additional requests that would have to hit Twitter'sservers.

Is this a problem to be resolved, or moving forward are we going tocontinue to get results appearing via search out of order from theircreation date like this?


Thanks,

JB.

[twitter-dev] Missing Search API Results

Reply via email to