alamb commented on code in PR #8220:
URL: https://github.com/apache/arrow-rs/pull/8220#discussion_r2300634248


##########
arrow-avro/src/reader/record.rs:
##########
@@ -1471,4 +1753,196 @@ mod tests {
         assert!(int_array.is_null(0)); // row1 is null
         assert_eq!(int_array.value(1), 42); // row3 value is 42
     }
+
+    fn make_record_resolved_decoder(

Review Comment:
   I didn't fully follow these tests, but I couldn't find any coverage for 
skipping the nested types (Lists, Maps, Structs). 
   
   I ran `llvm-cov` to double check and it seems to imply this code isn't 
tested:
   
   ```shell
   cargo llvm-cov --html -p arrow-avro
   ```
   
   Report is here:  
[coverage.zip](https://github.com/user-attachments/files/21986560/coverage.zip)
   
   
   For example 
   
`coverage/Users/andrewlamb/Software/arrow-rs/arrow-avro/src/reader/record.rs.html`
   
   <img width="885" height="794" alt="Screenshot 2025-08-26 at 7 05 43 AM" 
src="https://github.com/user-attachments/assets/2b5b432a-550a-41a4-b1a1-1b159ff0da01"
 />
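
To make the gap concrete, here is a minimal sketch of the kind of check that could cover skipping an array. It assumes only the Avro binary encoding: a `long` is a zigzag varint, and an array is a series of blocks, each starting with a count, where a negative count is followed by the block's size in bytes and a zero count ends the array. `read_long` and `skip_long_array` are hypothetical helpers written for illustration; they are not the arrow-avro API and not the code in this PR.

```rust
/// Read one Avro `long` (zigzag varint) from the front of `buf`, advancing it.
/// Hypothetical helper for illustration, not part of arrow-avro.
fn read_long(buf: &mut &[u8]) -> Option<i64> {
    let mut value: u64 = 0;
    let mut shift: u32 = 0;
    loop {
        let (&byte, rest) = buf.split_first()?; // None on truncated input
        *buf = rest;
        if shift >= 64 {
            return None; // varint too long to be a valid i64
        }
        value |= u64::from(byte & 0x7f) << shift;
        if byte & 0x80 == 0 {
            break;
        }
        shift += 7;
    }
    // Undo the zigzag mapping: 0, 1, 2, 3, ... -> 0, -1, 1, -2, ...
    Some((value >> 1) as i64 ^ -((value & 1) as i64))
}

/// Skip an Avro array whose items are `long`s, without materializing them.
/// Hypothetical helper for illustration, not part of arrow-avro.
fn skip_long_array(buf: &mut &[u8]) -> Option<()> {
    loop {
        let count = read_long(buf)?;
        if count == 0 {
            return Some(()); // a zero count terminates the array
        }
        if count < 0 {
            // Negative count: the next long is the block size in bytes,
            // so the whole block can be skipped without decoding items.
            let size = usize::try_from(read_long(buf)?).ok()?;
            *buf = buf.get(size..)?;
        } else {
            // Positive count: no byte size is given, so skip item by item.
            for _ in 0..count {
                read_long(buf)?;
            }
        }
    }
}
```

A test for the skipper would then encode a nested value followed by a projected value, skip the first, and assert that the second still decodes correctly. The same shape should apply to maps (blocks of key/value pairs) and records (fields in schema order).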
   



##########
arrow-avro/src/reader/record.rs:
##########
@@ -736,6 +858,166 @@ fn sign_extend_to<const N: usize>(raw: &[u8]) -> Result<[u8; N], ArrowError> {
     Ok(arr)
 }
 
+/// Lightweight skipping decoder for writer-only fields

Review Comment:
   I found the term "writer-only field" somewhat confusing. I think the same 
concept (not decoding fields into Arrow that were not requested) is called 
"non-projected fields" in the parquet, json, and csv readers.
   
   I think the name `skipper` is quite clear; this is just a high-level 
comment about the terminology in the comments (I know, 🙄 )



##########
arrow-avro/src/reader/mod.rs:
##########
@@ -1537,6 +1564,57 @@ mod test {
         assert!(batch.column(0).as_any().is::<StringViewArray>());
     }
 
+    #[test]
+    fn test_alltypes_skip_writer_fields_keep_double_only() {
+        let file = arrow_test_data("avro/alltypes_plain.avro");

Review Comment:
   - It is so cool to see the files added by @Igosuki in 
https://github.com/apache/arrow-testing/pull/62 keep paying off 
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
