eitsupi commented on issue #34291:
URL: https://github.com/apache/arrow/issues/34291#issuecomment-1439988827

   @elgabbas Thank you for uploading this file.
   
   Unfortunately, however, it seems that the first row is failing to load on my 
end.
   Could it be possible for you to show me the complete log at yours?
   
   ```r
   > arrow::read_tsv_arrow("Arrow_parse_Example.txt")
   Error:
   ! Invalid: CSV parse error: Expected 259 columns, got 322: 2417931730        
                                         DSS004390000131N                       
                                                  CC0_1_0                       
National Museum of Nat ...
   Run `rlang::last_error()` to see where the error occurred.
   ```
   
   Since `"` is at the end of the first row, it seems that if I read by 
`readr::read_tsv`, the last column (`eventType`) of the first row will contain 
the contents of the second row.
   
   ```r
   > readr::read_tsv("Arrow_parse_Example.txt", show_col_types = FALSE)
   # A tibble: 1 × 259                                                          
                                        
         gbifID abstract accessR…¹ accru…² accru…³ accru…⁴ alter…⁵ audie…⁶ 
avail…⁷ bibli…⁸ confo…⁹ contr…˟ cover…˟ created
          <dbl> <lgl>    <lgl>     <lgl>   <lgl>   <lgl>   <lgl>   <lgl>   
<lgl>   <lgl>   <lgl>   <lgl>   <lgl>   <lgl>  
   1 2417931730 NA       NA        NA      NA      NA      NA      NA      NA   
   NA      NA      NA      NA      NA     
   # … with 245 more variables: creator <lgl>, date <lgl>, dateAccepted <lgl>, 
dateCopyrighted <lgl>,
   #   dateSubmitted <lgl>, description <lgl>, educationLevel <lgl>, extent 
<lgl>, format <lgl>, hasFormat <lgl>,
   #   hasPart <lgl>, hasVersion <lgl>, identifier <chr>, instructionalMethod 
<lgl>, isFormatOf <lgl>, isPartOf <lgl>,
   #   isReferencedBy <lgl>, isReplacedBy <lgl>, isRequiredBy <lgl>, 
isVersionOf <lgl>, issued <lgl>, language <lgl>,
   #   license <chr>, mediator <lgl>, medium <lgl>, modified <lgl>, provenance 
<lgl>, publisher <chr>, references <lgl>,
   #   relation <lgl>, replaces <lgl>, requires <lgl>, rights <lgl>, 
rightsHolder <lgl>, source <lgl>, spatial <lgl>,
   #   subject <lgl>, tableOfContents <lgl>, temporal <lgl>, title <lgl>, type 
<lgl>, valid <lgl>, institutionID <chr>, …
   # ℹ Use `colnames()` to see all variable names
   
   > readr::read_tsv("Arrow_parse_Example.txt", show_col_types = 
FALSE)$eventType
   [1] 
"\n2417934775\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tDSS00439000014FB\t\t\t\t\t\t\t\t\t\tCC0_1_0\t\t\t\t\tNational
 Museum of Natural History, 
Luxembourg\t\t\t\t\t\t\t\t\t\t\t\t\t\t\thttps://ror.org/05natt857\tMnhnL\t\t\tMNHNL-HERB-LUX\tHerbarium\t\tPRESERVED_SPECIMEN\t\t\tTaxon
 status for Luxembourg: [Least concern - IUCN 
(2001)]\tDSS00439000014FB\t20471\t\tLéopold 
Reichling\t\t\t\t\t\t\t\t\t\t\t\t\tPRESENT\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t1953-08-06T00:00:00\t\t\t\t1953\t8\t6\t1953-8-6/1953-8-6\t\tUnknown\t\t\t\t\t\t\t\t\tEUROPE\t\t\t\tLU\t\t\t\tGarnich\tEntre
 Garnich et Windhof, chemin longeant la lisière du bois dit \"Lange Rés\" sur 
marnes liasiques"
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to