paulirwin commented on code in PR #1058:
URL: https://github.com/apache/lucenenet/pull/1058#discussion_r1997463102


##########
src/Lucene.Net.Memory/MemoryIndex.cs:
##########
@@ -275,23 +284,45 @@ public virtual TokenStream 
KeywordTokenStream<T>(ICollection<T> keywords)
             return new TokenStreamAnonymousClass<T>(keywords);
         }
 
+        /// <summary>
+        /// An anonymous implementation of <see cref="TokenStream"/> for
+        /// <see cref="KeywordTokenStream{T}(ICollection{T})"/>.
+        /// </summary>
+        /// <typeparam name="T">The type of item in the collection.</typeparam>
+        /// <remarks>
+        /// LUCENENET specific - This class originally got an enumerator in 
the constructor and stored it to a field
+        /// that was never reset, which meant that it could not be reused 
(since most IEnumerator implementations can
+        /// only be iterated once and throw on <see 
cref="System.Collections.IEnumerator.Reset()"/>). This class has

Review Comment:
   First, I should have worded this differently; I should have said "many" 
instead of "most." I meant to communicate that we generally can't rely on using 
it in Reset, because many implementations do throw. For example, any 
enumerators created by LINQ or the Enumerable class throw on Reset, which are 
pretty commonly used. Note that [Microsoft's own docs on 
IEnumerator](https://learn.microsoft.com/en-us/dotnet/api/system.collections.ienumerator?view=net-9.0#remarks)
 say that Reset "is provided for COM interoperability and does not need to be 
fully implemented; instead, the implementer can throw a 
[NotSupportedException](https://learn.microsoft.com/en-us/dotnet/api/system.notsupportedexception?view=net-9.0)."
 I think it's likely enough that it will throw that alternative 1 is not a good 
idea.
   
   Alternative 3 seems like it would be more costly than just allowing it to 
possibly allocate on reset. Enumerators for BCL types are highly optimized 
nowadays, and for some of the common ones you'd at most have a boxed struct. It 
seems to me like having try/catch unwinding would be more costly in runtime 
performance.
   
   Alternative 2 is fine, if you really think we need it. I don't think we do. 
It is quite common to get and dispose of an enumerator in .NET. There are over 
2,000 uses of `foreach` in our codebase which get a new enumerator each time 
that I doubt we are applying such scrutiny to. But, if you think it's 
important, I'd choose this option. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@lucenenet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to