Re: RFR(m) 2: 8072722: add stream support to Scanner

Peter Levart Thu, 17 Sep 2015 00:12:18 -0700

As an alternative to an additional boolean field, you could use one bit of the expectedCount/modCount int field(s):


- let initial value of expectedCount be 1 (odd value)
- instead of (expectedCount >= 0) ==> (expectedCount != 1)
- let initial value of modCount be 0 (even value)
- instead of modCount++ ==> modCount += 2;
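
The four points above could be sketched roughly as follows. This is only an illustration, not code from the webrev: the class name ParityModCount, its methods, and the constructor that takes a starting modCount are hypothetical, and the real Scanner code is not reproduced here.

```java
import java.util.ConcurrentModificationException;

/** Hypothetical stand-in for the modCount bookkeeping; not the actual Scanner code. */
class ParityModCount {
    // Always even: starts even and advances by 2 per modification.
    private int modCount;
    // Odd sentinel 1 means "not captured yet"; it can never equal an even modCount.
    private int expectedCount = 1;

    ParityModCount() {
        this(0);
    }

    // Hypothetical constructor so the wraparound case can be exercised directly.
    ParityModCount(int evenStart) {
        if ((evenStart & 1) != 0) {
            throw new IllegalArgumentException("modCount must start even");
        }
        this.modCount = evenStart;
    }

    /** Used wherever the original code did modCount++. */
    void recordModification() {
        modCount += 2; // stays even, including across int overflow
    }

    /** The check to run immediately before handing a token to the consumer. */
    void checkForComodification() {
        if (expectedCount == 1) {
            expectedCount = modCount; // first use: capture the current count
        }
        if (expectedCount != modCount) {
            throw new ConcurrentModificationException();
        }
    }

    int modCount() {
        return modCount;
    }
}
```

Because only the parity of the value is used as the "initialized" flag, the check keeps working after modCount overflows to a negative value.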


Regards, Peter

On 09/17/2015 01:08 AM, Stuart Marks wrote:

On 9/16/15 8:43 AM, Xueming Shen wrote:
I'm talking about the check "immediately" prior to the call to accept(). It will not function after the modCount tips over to the negative int value,
because of the "expectedCount >= 0" check.

Consider the use scenario where the Scanner is on top of an endless input
stream, and you have a token stream on top of it. The check before the
"accept(token)" will not be performed until the expectedCount/modCount tips
back to a positive value again from the negative, then off, then on... During
the off period (it will take a while to go from negative back to positive), the
stream will just work fine to feed accept() the "next" token even if
another thread keeps "stealing" tokens from the same scanner, if the
timing is right. Looks like not really "fail-fast" in this scenario.
Right, after modCount wraps around to negative, the CME checking becomes dysfunctional. It doesn't do any harm, but it ceases to perform proper checking. This is kind of a corner case but ... well I admit I did have to do a fair bit of puzzling to figure out what the behavior would be and to prove to myself that it was benign.
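
To make the wraparound case concrete, here is a small sketch. It assumes the guard has the shape "expectedCount >= 0 && expectedCount != modCount", which is inferred from the description in this thread rather than copied from the webrev; the class and method names are hypothetical.

```java
/** Hypothetical demo of a sign-based guard; the real webrev code may differ. */
class SignGuardDemo {
    /** Assumed guard: a negative expectedCount is treated as "not captured yet". */
    static boolean detectsMismatch(int expectedCount, int modCount) {
        return expectedCount >= 0 && expectedCount != modCount;
    }

    public static void main(String[] args) {
        // Normal range: a modification made behind the stream's back is detected.
        System.out.println(detectsMismatch(5, 6)); // prints "true"

        // After overflow the captured count is negative, so the guard is skipped.
        int wrapped = Integer.MAX_VALUE + 1; // wraps to Integer.MIN_VALUE
        System.out.println(detectsMismatch(wrapped, wrapped + 1)); // prints "false"
    }
}
```

Under that assumption, the check stays silent for as long as the captured count is negative, which matches the "off period" described above.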
This can be "easily" addressed, if you have a separate boolean field such as
"initialized". The code can look like below in tryAdvance(...)
[edited per your subsequent message]
     if (!initialized) {
         expectedCount = modCount;
         initialized = true;
     }
     if (expectedCount != modCount) {
         throw new CME();
     }
     ...
Well, if you think this is an unlikely use scenario and the intention of the
check/guard here is mainly to prevent wrongdoing within the pipe
operation, then it might not be worth the extra field, and I'm fine with the
latest webrev.
The cost of the additional field is negligible. I haven't written out the code but I suspect that an explicit "initialized" field will be easier to reason about, certainly easier to understand than the behavior that occurs if modCount wraps around to negative.
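
For illustration only, an explicit "initialized" field might look something like the sketch below. TokenSource is a hypothetical stand-in for Scanner, and the spliterator is a simplified guess at the structure, not the code that was pushed or the code of the follow-up fix.

```java
import java.util.ArrayDeque;
import java.util.ConcurrentModificationException;
import java.util.Deque;
import java.util.List;
import java.util.Spliterator;
import java.util.Spliterators;
import java.util.function.Consumer;

/** Hypothetical token source standing in for Scanner. */
class TokenSource {
    private final Deque<String> tokens;
    private int modCount; // may wrap freely; only equality is ever tested

    TokenSource(List<String> tokens) {
        this.tokens = new ArrayDeque<>(tokens);
    }

    /** Consumes a token directly, like calling next() outside the stream. */
    String next() {
        modCount++;
        return tokens.poll();
    }

    Spliterator<String> spliterator() {
        return new Spliterators.AbstractSpliterator<String>(
                Long.MAX_VALUE, Spliterator.ORDERED | Spliterator.NONNULL) {
            private boolean initialized; // replaces the sign-based sentinel
            private int expectedCount;

            @Override
            public boolean tryAdvance(Consumer<? super String> action) {
                if (!initialized) {
                    expectedCount = modCount;
                    initialized = true;
                }
                if (expectedCount != modCount) {
                    throw new ConcurrentModificationException();
                }
                String token = tokens.poll();
                if (token == null) {
                    return false;
                }
                modCount++;
                expectedCount = modCount; // our own advance is not a foreign change
                action.accept(token);
                return true;
            }
        };
    }
}
```

Since the flag carries the "captured or not" state, every int value of modCount, negative or not, takes part in the equality check.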
Note that this also applies to Matcher, although it's less likely since Matcher's input is a CharSequence instead of an indefinite-sized source such as a file or an input stream. In talking to Paul Sandoz about this (author of the streams stuff for Matcher) he felt it was important to keep the behaviors of Matcher and Scanner consistent.
But this has dragged out somewhat and I don't really want to add Matcher changes to this changeset. How about I do the following:
1) I'll push the latest webrev as it stands.
2) I'll file a separate bug to clean up Scanner's and Matcher's spliterators' modCount checking to avoid the overflow issue.
3) I'll fix all the spliterators at the same time.

How does that sound?

s'marks
