Imported ontologies should be queryable as separate graphs for robot query #158

cmungall · 2017-04-12T21:26:26Z

Robot converts an OWL ontology to Turtle for presentation to Jena for SPARQL queries. It appears the ontology goes into an (unnamed?) graph.

Would it not make sense to make the whole import chain queryable, and to preserve each ontology as its own graph?

As an aside, it would be great to have more published standards here. For example, OntoBee puts each ontology in its own graph, which is nice, but it does its own renaming of the URI for the graph.

I thought to implement this in Robot it would be a straightforward switch of this line to use Trig...

...unfortunately saving Trig from the OWLAPI does not have the effect I would expect. It only seems to save the parent ontology, not the closure... ...and it seems to place each class in its own unnamed graph, hmm.

It would be relatively straightforward to iterate through the imports closure and add each separately, though there may be a cleaner way.

And finally (this may deserve a separate ticket): it is common to store inferences in a separate named graph (NG). It would be straightforward to do this stepwise in robot (reason, save results, and then combine this into the source ontology as an import). There may be a more elegant way to do this?

cc @balhoff @dougli1sqrd


jamesaoverton · 2017-04-12T21:46:53Z

Separate named graphs for imports and inferences would be very nice, as long as it's still easy to query everything at once. Many (but not all?) systems make the default graph the union of the named graphs, and I think that would be good behaviour in this case. I don't remember what Fuseki does off-hand.

The graph for each import can use the import IRI as its name. The graph for the uninferred ontology can use the ontology IRI as its name. The graph with inferences would need a new IRI, which could be ROBOT-specific.

Off the top of my head, I can't think of a better method than iterating through the imports, converting to Turtle, and inserting into a named graph.
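
To make the approach above concrete, here is a minimal Python sketch of the data flow only. It does not use the OWLAPI or Jena: ontologies are modelled as plain dictionaries, and the names `ONTOLOGIES`, `imports_closure`, `build_dataset`, and `union_graph`, along with the example IRIs, are hypothetical. It walks the imports closure, stores each ontology's triples in a graph named by its IRI, and computes a union so that everything can still be queried at once.

```python
from typing import Dict, List, Set, Tuple

Triple = Tuple[str, str, str]

# Hypothetical stand-in for loaded ontologies: IRI -> direct imports and triples.
ONTOLOGIES: Dict[str, dict] = {
    "http://example.org/main.owl": {
        "imports": ["http://example.org/a.owl"],
        "triples": {("ex:Main", "rdfs:subClassOf", "ex:A")},
    },
    "http://example.org/a.owl": {
        "imports": ["http://example.org/b.owl"],
        "triples": {("ex:A", "rdfs:subClassOf", "ex:B")},
    },
    "http://example.org/b.owl": {
        "imports": [],
        "triples": {("ex:B", "rdfs:label", "B")},
    },
}


def imports_closure(root: str) -> List[str]:
    """Return the root IRI followed by every directly or indirectly imported IRI."""
    seen: List[str] = []
    stack = [root]
    while stack:
        iri = stack.pop()
        if iri in seen:
            continue  # guard against import cycles and repeated imports
        seen.append(iri)
        stack.extend(ONTOLOGIES[iri]["imports"])
    return seen


def build_dataset(root: str) -> Dict[str, Set[Triple]]:
    """Place each ontology in the closure into its own graph, named by its IRI."""
    return {iri: set(ONTOLOGIES[iri]["triples"]) for iri in imports_closure(root)}


def union_graph(dataset: Dict[str, Set[Triple]]) -> Set[Triple]:
    """Union of all named graphs, for querying everything at once."""
    merged: Set[Triple] = set()
    for triples in dataset.values():
        merged |= triples
    return merged


dataset = build_dataset("http://example.org/main.owl")
print(sorted(dataset))            # one graph name per ontology in the closure
print(len(union_graph(dataset)))  # number of triples visible in the union
```

In a real implementation the dictionary lookups would be replaced by the OWLAPI imports closure and the sets by Jena graphs; the sketch only illustrates the naming and union behaviour being discussed.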

Maybe the query command can accept a --reasoner option to indicate that reasoning should be done.

cmungall · 2017-04-12T22:10:17Z

On 12 Apr 2017, at 14:46, James A. Overton wrote: Maybe the query command can accept a `--reasoner` option to indicate that reasoning should be done.

We may end up replicating a lot of options; or options may be different. The standard use case for the reason command is to make a new ontology with direct inferred links materialized. For this use case, we may want indirect too. We may also want to do things like shadow the TBox in the ABox (i.e. make triples from subclass-of-some-values-from axioms). But then maybe this is something that belongs outside robot. Not quite sure what my main point is here, other than that it's complex and it may be best to give it some thought before putting too much of the kitchen sink in.
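
One possible reading of "shadowing the TBox in the ABox" is sketched below in Python, purely as an illustration. It assumes each axiom of the form `A SubClassOf (R some B)` is turned into a direct triple `A R B` that a plain SPARQL query could match. The `SomeValuesFrom` type, the `shadow_existentials` function, and the example terms are hypothetical and are not part of ROBOT or the OWLAPI.

```python
from typing import Iterable, List, NamedTuple, Tuple

Triple = Tuple[str, str, str]


class SomeValuesFrom(NamedTuple):
    """An axiom of the form: subclass SubClassOf (prop some filler)."""
    subclass: str
    prop: str
    filler: str


def shadow_existentials(axioms: Iterable[SomeValuesFrom]) -> List[Triple]:
    """Emit one direct triple (subclass, prop, filler) per existential axiom."""
    return [(a.subclass, a.prop, a.filler) for a in axioms]


axioms = [SomeValuesFrom("ex:Finger", "ex:part_of", "ex:Hand")]
print(shadow_existentials(axioms))
```

The resulting triples are a convenience for querying rather than valid OWL semantics, which is one reason it may belong in a separate graph or a separate tool.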

dougli1sqrd · 2017-05-03T21:37:14Z

It looks like the way we load data from the OWL API into a Jena DatasetGraph might be part of the difficulty here. I tried to manually add some named graphs from the OWLOntology object, but the SPARQL queries that talk to named graphs turn up nothing. I think it's possible Jena wants us to use Dataset as our primary way to run SPARQL, based on this article: https://www.ibm.com/developerworks/community/blogs/nlp/entry/an_introduction_to_the_jena_api?lang=en. I haven't had a chance to completely explore this yet, but this is what I've run into in my research.

jamesaoverton · 2017-05-03T21:41:25Z

I was working on something similar lately, so I think I know the solution. In order to make the default graph be (or just include?) the union of all named graphs, I switched to TDB for managing the dataset, with the settings described here: https://jena.apache.org/documentation/tdb/datasets.html

dougli1sqrd · 2017-05-03T21:44:20Z

Ah yeah. It looks like that's what they're using in the IBM article, too. Do you want to be assigned to this ticket then instead of (or in addition to) me?

jamesaoverton · 2017-05-03T21:46:09Z

No thanks, I have too many deadlines right now.

dougli1sqrd · 2017-05-03T21:49:46Z

Oh sorry, I guess I misunderstood. You were just saying you know how to do it in the robot case because of a different project, not that you have done it here already? ha, my mistake.

zhengj2007 · 2018-07-17T16:59:39Z

This is a very useful feature. We have several ontologies, built on OBO Foundry ontologies, that are used for data loading and search. If this feature is implemented in ROBOT, we can easily check whether OBO Foundry ontology terms are used consistently in those ontologies.

Looking forward to seeing the feature in ROBOT.

jamesaoverton · 2018-07-17T17:09:07Z

This should not be hard to implement. The biggest questions in my mind are:

what to name each import: the version IRI, the import IRI, the ontology IRI? I can imagine any of these choices causing some confusion. @cmungall @balhoff does the recent discussion of names for ontology parts/variants help clarify this?
what to do with the default graph; it would be convenient to make it the union of all named graphs, but that would break backwards compatibility; if not, we need a name for the union

cmungall · 2018-07-17T17:36:32Z

Backwards compatibility is important. We could have a command line switch with 3+ possibilities

core (default) - use only main ontology
union - all in one graph
stratify - each ontology in a named graph named by ontology IRI (the versionIRI will be associated with that graph)

Need to think how this interacts with reason command

zhengj2007 · 2018-07-23T12:19:33Z

@cmungall @jamesaoverton For our use case, we don't need to reason on the ontology. Any expected date on its implementation in robot? Thanks!

jamesaoverton · 2018-07-24T12:01:42Z

@zhengj2007: @rctauber is working on this. We have a lot to do before ICBO, so I'm not sure when it will be ready.

zhengj2007 · 2018-07-24T19:33:03Z

@jamesaoverton Thanks for the update.
@rctauber Thanks for working on it.

beckyjackson · 2018-07-25T16:42:55Z

I made some progress on this here: https://github.com/rctauber/robot/tree/graphs
This branch includes unit and integration tests to make sure the new features work and to support backwards compatibility.

It adds in a new --imports option to query:

--imports ignore: default behavior, does not load imports
--imports union: loads imports as named graphs and queries over the union of the graphs
--imports graphs: loads imports as named graphs and queries on the named graphs

This option is just a suggestion, if anybody has another idea on how to implement this I'd love to hear it!

balhoff · 2018-07-25T17:45:07Z

@rctauber looking at @jamesaoverton and @cmungall 's descriptions, I'm not sure what the difference would be in your union and graphs options. I think union according to @cmungall would just put everything into the default graph instead of loading as named graphs. It seems to me you may just need ignore and graphs. And then specify that the default graph queries the union of the named graphs in the graphs case (I believe you need to set this up on purpose in Jena, although it is the default behavior for many triplestores like Blazegraph). If the default graph works this way, I don't see why we need an option for loading all imports but not putting them into named graphs.

beckyjackson · 2018-07-25T18:42:03Z

True, that probably makes more sense. Should it be --imports ignore and --imports graphs or maybe --use-graphs with true and false?

beckyjackson · 2018-07-27T15:41:25Z

I pushed a new update with the option --use-graphs true and --use-graphs false (default: false). If you set it to true, the default graph is the union of all imports, and you can also query a specific import as a named graph by its IRI.

A problem that @jamesaoverton pointed out is that the actual ontology IRIs of the import documents may collide (or be null). The import IRI may be different than the actual ontology IRI. Right now, the graph name is the ontology IRI from the ontology ID for an OWLOntology object. If that IRI is null, it will fail. If that IRI is the same as another import's ontology IRI, there will be a name collision.

As far as I know, OWLAPI doesn't provide a method for mapping the import IRIs to the actual OWLOntology objects that are returned when you run ontology.getImports(). The benefit of using --imports union is that you could load all the imports without worrying about their IRIs. --imports graphs would still run into the same problem as --use-graphs true, but at least users would have an alternative with the union option.

That said, the --use-graphs option may be a bit more user-friendly.

balhoff · 2018-07-27T16:48:52Z

> If that IRI is the same as another import's ontology IRI, there will be a name collision.

I think this should cause an exception in the OWLOntologyManager anyway—it won't load two ontologies with the same ontology IRI. For anonymous ontologies, I would suggest autogenerating a graph IRI (something like urn:uuid:EF2F72A6-79DC-40C7-A5D2-0D00B9120F65). It won't matter that the user doesn't know what it is.
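
The naming rule discussed here can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `graph_name` function and the `used` set are hypothetical, a missing ontology IRI is represented by `None`, and a collision is reported with a `ValueError` rather than whatever the OWLOntologyManager would actually raise.

```python
import uuid
from typing import Optional, Set


def graph_name(ontology_iri: Optional[str], used: Set[str]) -> str:
    """Pick a graph name for an ontology, given the names already in use."""
    if ontology_iri is None:
        # Anonymous ontology: generate a fresh name the user never needs to know.
        name = uuid.uuid4().urn  # a string starting with "urn:uuid:"
    else:
        name = ontology_iri
    if name in used:
        raise ValueError(f"graph name collision: {name}")
    used.add(name)
    return name


used: Set[str] = set()
print(graph_name("http://example.org/a.owl", used))
print(graph_name(None, used).startswith("urn:uuid:"))
```

Generating the name at load time means anonymous imports are still reachable through the union graph even though their graph names are not predictable.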

cmungall · 2018-08-02T00:47:45Z

New commits look good, instructions in the markdown seem clear

zhengj2007 · 2018-08-03T14:41:35Z

@rctauber Thanks for implementing the feature. When will it be available in a release version of ROBOT? Is it possible to include the feature in release 1.1.0, @jamesaoverton? Thanks!

jamesaoverton · 2018-08-05T12:23:24Z

@zhengj2007 I merged this yesterday, and it's included in the 1.2.0-alpha-1 release: https://github.com/ontodev/robot/releases/tag/v1.2.0-alpha-1

zhengj2007 · 2018-08-06T14:17:05Z

@jamesaoverton Thanks a lot!

beckyjackson · 2018-08-14T15:24:53Z

Implemented by 882a517 - please re-open if this requires more discussion.

zhengj2007 · 2018-08-20T17:35:50Z

@rctauber I downloaded the robot.jar that contains the feature from: https://github.com/ontodev/robot/releases/tag/v1.2.0-alpha-1

I tried the '--use-graphs true' option in query. I ran the query like this:
robot query --use-graphs true --input gates.owl --query QC_termWithMultipleLabels.rq output.csv
But got IllegalArgumentException error: Unknown command or option: --use-graphs

How should I use this option? Thanks!

beckyjackson · 2018-08-21T13:27:53Z

Hi @zhengj2007 - I just tried to replicate your problem with the jar from the pre-release, but I was able to use the --use-graphs option.

When you downloaded the jar, did you replace the jar in your system PATH?

zhengj2007 · 2018-08-21T14:28:30Z

@rctauber I replaced the old jar file by the newly downloaded one. So, it should be in my system PATH, right?

beckyjackson · 2018-08-21T14:34:49Z

Yes - it should be. Can you confirm that your PATH points to where you replaced that jar? If you're on MacOS, it should be in ~/.bash_profile. For Windows, go to System -> Advanced system settings -> Environment Variables.

jamesaoverton · 2018-08-21T14:37:38Z

And you can run robot version to check which version is actually being run.

zhengj2007 · 2018-08-21T14:47:01Z

@rctauber Thanks! I will check it.

@jamesaoverton I ran the command and got "ROBOT version null" message.

jamesaoverton · 2018-08-21T14:48:24Z

ROBOT version null means an old version, without --use-graphs. It should say "ROBOT version 1.2.0-alpha-1".

zhengj2007 · 2018-08-21T14:48:51Z

@jamesaoverton got it. Will check what's wrong.

zhengj2007 · 2018-08-21T15:07:32Z

@rctauber @jamesaoverton I found the issue. I forgot that I had installed robot under usr/local/bin, but I updated the robot.jar in my download folder. Now I am using robot version 1.2.0-alpha-1. Thanks for your help.
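
For anyone hitting the same problem, the following Python sketch shows one way to see which launcher the shell would resolve. It assumes ROBOT is started through a command named `robot` on the PATH, as in this thread; the `find_launcher` helper is hypothetical and only wraps the standard library's `shutil.which`.

```python
import shutil
from typing import Optional


def find_launcher(command: str = "robot") -> Optional[str]:
    """Return the full path of the first `command` found on PATH, or None."""
    return shutil.which(command)


path = find_launcher()
if path is None:
    print("no 'robot' launcher found on PATH")
else:
    print(f"'robot' resolves to: {path}")
```

If the printed directory is not where the new robot.jar was saved, the old jar is probably still the one being run.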

beckyjackson · 2018-08-21T15:08:26Z

Great!

zhengj2007 · 2018-09-11T19:51:20Z

@rctauber The queries that treat imported ontologies as separate graphs worked well (using --use-graphs true). Thanks for your efforts.
However, when I query the same ontology, which imports multiple OWL files, and want to treat them as a single union graph, it does not work (using --use-graphs false). It always returns 0 rows. I need to run the merge command on the OWL files and then run the query. Did I miss anything? Thanks!

beckyjackson · 2018-09-11T23:45:26Z

The behavior for `--use-graphs false` is to not load any imports, only the main ontology. When you set this to `true`, the default graph is the union of all imports, but you can also query using named graphs.
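
The difference between the two settings can be illustrated with a small Python model. This is a sketch of the behaviour described above, not ROBOT's implementation: `load_for_query`, `labels`, and the example IRIs are hypothetical, and the sketch assumes the main ontology is part of the union when the option is `true`, which the description above does not state explicitly.

```python
from typing import Dict, List, Set, Tuple

Triple = Tuple[str, str, str]


def load_for_query(
    main_iri: str, ontologies: Dict[str, Set[Triple]], use_graphs: bool
) -> Tuple[Set[Triple], Dict[str, Set[Triple]]]:
    """Return (default_graph, named_graphs) for the given setting."""
    if not use_graphs:
        # Imports are not loaded at all; only the main ontology is queryable.
        return set(ontologies[main_iri]), {}
    named = {iri: set(triples) for iri, triples in ontologies.items()}
    # Assumption: the default graph is the union of every loaded graph.
    default = set().union(*named.values())
    return default, named


def labels(graph: Set[Triple]) -> List[str]:
    """Collect the rdfs:label values present in a graph."""
    return sorted(o for (_, p, o) in graph if p == "rdfs:label")


main = "http://example.org/main.owl"
ontologies = {
    main: {("ex:Main", "rdfs:label", "main")},
    "http://example.org/import.owl": {("ex:Imported", "rdfs:label", "imported")},
}

default_false, named_false = load_for_query(main, ontologies, use_graphs=False)
default_true, named_true = load_for_query(main, ontologies, use_graphs=True)
print(labels(default_false))  # labels visible with imports not loaded
print(labels(default_true))   # labels visible in the union default graph
print(sorted(named_true))     # graph names available for named-graph queries
```

Under this model, a query that only matches terms defined in an import finds nothing when the option is `false`, which is consistent with the zero rows reported above.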


zhengj2007 · 2018-09-12T14:29:37Z

@rctauber Thanks for your explanation. It's very helpful. Now everything works fine.

dougli1sqrd self-assigned this May 1, 2017

cmungall mentioned this issue Oct 18, 2017

robot verify does not use the catalog-xml #195

Closed

cmungall mentioned this issue Feb 28, 2018

allow SPARQL queries over saturated inferred graph #258

Open

jamesaoverton assigned beckyjackson Jul 24, 2018

beckyjackson mentioned this issue Jul 26, 2018

Update Jena dependency #314

Closed

beckyjackson mentioned this issue Aug 1, 2018

Query imports as named graphs #328

Merged

beckyjackson closed this as completed Aug 14, 2018

cmungall mentioned this issue Oct 30, 2018

Improvements to report command #391

Closed
