Re: [twitter-dev] Re: Upcoming changes to the way status IDs are sequenced

Mark McBride Thu, 01 Apr 2010 09:39:29 -0700

Just out of curiosity, what applications are you building that require
sub-second sorting resolution for tweets?


  ---Mark

http://twitter.com/mccv


On Wed, Mar 31, 2010 at 11:01 PM, Aki <[email protected]> wrote:

> It actually makes sense to use tweet ID to sort tweets, because
> timestamp is not a valid source of information for accurate sorting.
> It is a very common case to have multiple tweets posted at the exact
> same second, and it is not possible to reproduce the correct ordering
> of tweets on the client side. This can be improved by having better
> precision for timestamp (maybe milliseconds), but it is still possible
> to get tweets posted at the exact same milliseconds (although it is
> very rare).
>
> If Twitter really needs to change the tweet ID scheme, I think better
> solution for sorting is required to be provided through API.
>
> On Mar 27, 7:41 am, Taylor Singletary <[email protected]>
> wrote:
> > Hi Developers,
> >
> > It's no secret that Twitter is growing exponentially. The tweets keep
> coming
> > with ever increasing velocity, thanks in large part to your great
> > applications.
> >
> > Twitter has adapted to the increasing number of tweets in ways that have
> > affected you in the past: We moved from 32 bit unsigned integers to
> 64-bit
> > unsigned integers for status IDs some time ago. You all weathered that
> storm
> > with ease. The tweetapoclypse was averted, and the tweets kept flowing.
> >
> > Now we're reaching the scalability limit of our current tweet ID
> generation
> > scheme. Unlike the previous tweet ID migrations, the solution to the
> current
> > issue is significantly different. However, in most cases the new approach
> we
> > will take will not result in any noticeable differences to you the
> developer
> > or your users.
> >
> > We are planning to replace our current sequential tweet ID generation
> > routine with a simple, more scalable solution. IDs will still be 64-bit
> > unsigned integers. However, this new solution is no longer guaranteed to
> > generate sequential IDs.  Instead IDs will be derived based on time: the
> > most significant bits being sourced from a timestamp and the least
> > significant bits will be effectively random.
> >
> > Please don't depend on the exact format of the ID. As our infrastructure
> > needs evolve, we might need to tweak the generation algorithm again.
> >
> > If you've been trying to divine meaning from status IDs aside from their
> > role as a primary key, you won't be able to anymore. Likewise for usage
> of
> > IDs in mathematical operations -- for instance, subtracting two status
> IDs
> > to determine the number of tweets in between will no longer be possible.
> >
> > For the majority of applications we think this scheme switch will be a
> > non-event. Before implementing these changes, we'd like to know if your
> > applications currently depend on the sequential nature of IDs. Do you
> depend
> > on the density of the tweet sequence being constant?  Are you trying to
> > analyze the IDs as anything other than opaque, ordered identifiers? Aside
> > for guaranteed sequential tweet ID ordering, what APIs can we provide you
> to
> > accomplish your goals?
> >
> > Taylor Singletary
> > Developer Advocate, Twitterhttp://twitter.com/episod
>
>
> --
> To unsubscribe, reply using "remove me" as the subject.
>

Re: [twitter-dev] Re: Upcoming changes to the way status IDs are sequenced

Reply via email to