[
https://issues.apache.org/jira/browse/JENA-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Alexiev updated JENA-1628:
-----------------------------------
Description:
We're working with some FP7 project data using tarql, and the column
"Participant Internet Homepage" sometimes has multiple values, separated by
- comma, semicolon, " / ", " or "
It would be nice if apf:strSplit took a regex separator (like perl's split),
then I could use it like this:
{noformat}
?websites apf:strSplit (?Participant_Internet_Homepage, "(,|;| / | or )")
{noformat}
If the change is considered breaking, a new function apf:strSplitByRegex can be
introduced.
For the time being, I'll try with this ugliness (should work, right?):
{noformat}
{?websites apf:strSplit (?Participant_Internet_Homepage, ",")} union
{?websites apf:strSplit (?Participant_Internet_Homepage, ";")} union
{?websites apf:strSplit (?Participant_Internet_Homepage, " / ")} union
{?websites apf:strSplit (?Participant_Internet_Homepage, " or ")}
{noformat}
was:
We're working with some FP7 project data using tarql, and the column
"Participant Internet Homepage" sometimes as multiple values, separated by
- comma, semicolon, " / ", " or "
It would be nice if apf:strSplit took a regex delimiter (like perl's split),
then I could use it like this:
{noformat}
?websites apf:strSplit (?Participant_Internet_Homepage, "(,|;| / | or )")
{noformat}
If the change is considered breaking, a new function apf:strSplitByRegex can be
introduced.
For the time being, I'll try with this ugliness (should work, right?):
{noformat}
{?websites apf:strSplit (?Participant_Internet_Homepage, ",")} union
{?websites apf:strSplit (?Participant_Internet_Homepage, ";")} union
{?websites apf:strSplit (?Participant_Internet_Homepage, " / ")} union
{?websites apf:strSplit (?Participant_Internet_Homepage, " or ")}
{noformat}
> apf:strSplit should take several possible separators
> ----------------------------------------------------
>
> Key: JENA-1628
> URL: https://issues.apache.org/jira/browse/JENA-1628
> Project: Apache Jena
> Issue Type: Improvement
> Reporter: Vladimir Alexiev
> Priority: Major
>
> We're working with some FP7 project data using tarql, and the column
> "Participant Internet Homepage" sometimes has multiple values, separated by
> - comma, semicolon, " / ", " or "
> It would be nice if apf:strSplit took a regex separator (like perl's split),
> then I could use it like this:
> {noformat}
> ?websites apf:strSplit (?Participant_Internet_Homepage, "(,|;| / | or )")
> {noformat}
> If the change is considered breaking, a new function apf:strSplitByRegex can
> be introduced.
> For the time being, I'll try with this ugliness (should work, right?):
> {noformat}
> {?websites apf:strSplit (?Participant_Internet_Homepage, ",")} union
> {?websites apf:strSplit (?Participant_Internet_Homepage, ";")} union
> {?websites apf:strSplit (?Participant_Internet_Homepage, " / ")} union
> {?websites apf:strSplit (?Participant_Internet_Homepage, " or ")}
> {noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)