Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DATAIO_NULL string in CSV mapping output using 7.0.0 #238

Closed
namedgraph opened this issue Jun 10, 2024 · 6 comments
Closed

DATAIO_NULL string in CSV mapping output using 7.0.0 #238

namedgraph opened this issue Jun 10, 2024 · 6 comments

Comments

@namedgraph
Copy link

With the previous version the output was:

<https://data.beamery.com/smth/9#this> <http://www.w3.org/2004/02/skos/core#definition> "VP STRATEGIC ACCOUNTS, NA\n\nBeamery \n
***truncated***"@en .

With version 7.0.0 the output is:

<https://data.beamery.com/smth/9#this> <http://www.w3.org/2004/02/skos/core#definition> "VP STRATEGIC ACCOUNTS, NA\nDATAIO_NULL\nBeamery \n
***truncated***"@en .

CSV row looks like this (multiline string value):

9,"Vice President, N/A Strategic Accounts",9,"VP STRATEGIC ACCOUNTS, NA

Beamery 
***truncated***"

YARRML mapping:

  Concept:
    sources: smth
    s: smth:$(id)#this
    po:
      ...
      - [ skos:definition, $(role_description), en~lang ]
@DylanVanAssche
Copy link
Contributor

Hi!

Can you please add a reproducible example for this issue?
We're trying to solve this, but the mapping's source access description is missing and the CSV header as well.
Would it be possible to make a toy example that reproduces the problem? Thanks!

@namedgraph
Copy link
Author

Attaching the files (zipped): rmlmapper_test.zip

The command (ran within the rmlmapper_test folder):

docker run --rm \
  -v "$PWD":"/data" \
  rmlio/rmlmapper-java:7.0.0 \
  -m /data/mapping.ttl \
  -o /data/output.nt \
  -s ntriples \
  -d

@wna-se
Copy link

wna-se commented Jul 31, 2024

I have the same issue with any CSV that has multi-line values containing empty lines (e.g. \n\n). Perhaps related to the 1.1.0 release of the dataio library and the NewCSVNullInjector that was introduced?

ghsnd pushed a commit to RMLio/dataio that referenced this issue Aug 12, 2024
When multiple new lines are occuring in a quoted CSV string we have
to remember that we are inside a quoted string which can occur over
multiple lines if the CSV file has cell values with new line characters.

See RMLio/rmlmapper-java#238
@namedgraph
Copy link
Author

@DylanVanAssche any updates on this?

@DylanVanAssche
Copy link
Contributor

@namedgraph This is fixed in dataio: RMLio/dataio@6ba79ab available in dataio 1.3.1.

A new release of RMLMapper needs to be made with dataio updated to 1.3.1

@DylanVanAssche
Copy link
Contributor

Fixed in v7.1.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants