eitsupi commented on issue #34291:
URL: https://github.com/apache/arrow/issues/34291#issuecomment-1439988827
@elgabbas Thank you for uploading this file.
Unfortunately, however, it seems that the first row is failing to load on my
end.
Could it be possible for you to show me the complete log at yours?
```r
> arrow::read_tsv_arrow("Arrow_parse_Example.txt")
Error:
! Invalid: CSV parse error: Expected 259 columns, got 322: 2417931730
DSS004390000131N
CC0_1_0
National Museum of Nat ...
Run `rlang::last_error()` to see where the error occurred.
```
Since `"` is at the end of the first row, it seems that if I read by
`readr::read_tsv`, the last column (`eventType`) of the first row will contain
the contents of the second row.
```r
> readr::read_tsv("Arrow_parse_Example.txt", show_col_types = FALSE)
# A tibble: 1 × 259
gbifID abstract accessR…¹ accru…² accru…³ accru…⁴ alter…⁵ audie…⁶
avail…⁷ bibli…⁸ confo…⁹ contr…˟ cover…˟ created
<dbl> <lgl> <lgl> <lgl> <lgl> <lgl> <lgl> <lgl>
<lgl> <lgl> <lgl> <lgl> <lgl> <lgl>
1 2417931730 NA NA NA NA NA NA NA NA
NA NA NA NA NA
# … with 245 more variables: creator <lgl>, date <lgl>, dateAccepted <lgl>,
dateCopyrighted <lgl>,
# dateSubmitted <lgl>, description <lgl>, educationLevel <lgl>, extent
<lgl>, format <lgl>, hasFormat <lgl>,
# hasPart <lgl>, hasVersion <lgl>, identifier <chr>, instructionalMethod
<lgl>, isFormatOf <lgl>, isPartOf <lgl>,
# isReferencedBy <lgl>, isReplacedBy <lgl>, isRequiredBy <lgl>,
isVersionOf <lgl>, issued <lgl>, language <lgl>,
# license <chr>, mediator <lgl>, medium <lgl>, modified <lgl>, provenance
<lgl>, publisher <chr>, references <lgl>,
# relation <lgl>, replaces <lgl>, requires <lgl>, rights <lgl>,
rightsHolder <lgl>, source <lgl>, spatial <lgl>,
# subject <lgl>, tableOfContents <lgl>, temporal <lgl>, title <lgl>, type
<lgl>, valid <lgl>, institutionID <chr>, …
# ℹ Use `colnames()` to see all variable names
> readr::read_tsv("Arrow_parse_Example.txt", show_col_types =
FALSE)$eventType
[1]
"\n2417934775\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tDSS00439000014FB\t\t\t\t\t\t\t\t\t\tCC0_1_0\t\t\t\t\tNational
Museum of Natural History,
Luxembourg\t\t\t\t\t\t\t\t\t\t\t\t\t\t\thttps://ror.org/05natt857\tMnhnL\t\t\tMNHNL-HERB-LUX\tHerbarium\t\tPRESERVED_SPECIMEN\t\t\tTaxon
status for Luxembourg: [Least concern - IUCN
(2001)]\tDSS00439000014FB\t20471\t\tLéopold
Reichling\t\t\t\t\t\t\t\t\t\t\t\t\tPRESENT\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t1953-08-06T00:00:00\t\t\t\t1953\t8\t6\t1953-8-6/1953-8-6\t\tUnknown\t\t\t\t\t\t\t\t\tEUROPE\t\t\t\tLU\t\t\t\tGarnich\tEntre
Garnich et Windhof, chemin longeant la lisière du bois dit \"Lange Rés\" sur
marnes liasiques"
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]