Merging two Nblast results tables #466

dosumis · 2019-12-11T12:03:39Z

Merging two NBLAST results doesn't fit the current paradigm of results table merging.

Merge on ID won't work - score columns will differ.
To make results interpretable to users, we'd need a column for query image (result X has a similarity score n when compared to (segmented) image Y) .

MVP: Block combination of any two results where both have a score column => user message about this not being supported.

Sketch of spec for supporting in future:

Extend NBLAST results to include query image column (the comparator against which the score makes sense)
Combine based on compound key of ID + comparator ID

ddelpiano · 2019-12-11T12:14:00Z

@Robbie1977 do we want to tackle this from the xmi or from the datasource bundle?

Robbie1977 · 2019-12-11T14:50:46Z

@dosumis @ddelpiano I think to take the last query results values as the overriding values will give us a workable solution for the moment!?

dosumis · 2019-12-11T15:25:11Z

to take the last query results values as the overriding values will give us a workable solution for the moment!?

Disagree - really confusing. Doing this in a non-confusing way is challenging.

As stated in #464 (comment), I think we just block for now: Any two results tables where both have a score column => user message about this combination not being supported. One of those two results table could already be compound.

Robbie1977 · 2019-12-12T13:17:02Z

@dosumis @ddelpiano As a workable solution I think the first query determines the results content and additional queries only remove non-matches but don't change the data displayed (in remaining rows)?

ddelpiano · 2019-12-12T15:18:34Z

@dosumis @ddelpiano As a workable solution I think the first query determines the results content and additional queries only remove non-matches but don't change the data displayed (in remaining rows)?

@Robbie1977 that is pretty much what @jrmartin implemented with the last PR he opened today (first query drives the results, if the same column is found in the second query results that have to be merged we skip the columns already present and add to the record with the same id only the columns that it is missing), if that suitable we are good with that (sounds good to me), differently if @dosumis considers this still not enough to preserve the validity of the data we need to think about this a bit and allocate some development.

dosumis · 2019-12-12T15:41:44Z

@jrmartin PR is fine for everything except the 2 NBLAST case - which I think is just hard. It will not be clear to users which NBLAST the score refers to. Merged results could have a low similarity score in second NBLAST query. Is it really hard to just block compound queries when both results tables to combine have a score column?

ddelpiano · 2019-12-12T18:44:56Z

@jrmartin PR is fine for everything except the 2 NBLAST case - which I think is just hard. It will not be clear to users which NBLAST the score refers to. Merged results could have a low similarity score in second NBLAST query. Is it really hard to just block compound queries when both results tables to combine have a score column?

@dosumis it will require some development for sure, we cannot hardcode the header field in the datasource bundle since if this bundle is used in other applications and it's not feasible.
A possible approach could be to define in the datasource bundle in app-config.xml a bean for the class in charge of merging the results, in this bean we provide a list of strings that if matching the headers fields we don't want to merge we jump to the error message you suggested.
I spent literally 10 minutes on this and might be refined after further discussion on the implementation but that is just an approximate explanation of how we can implement this.

Pseudo code below, not real code and not real geppetto classes, just an example:

<bean id="datasourceMergeListWhatever" class="org.geppetto.datasources.mergeResultsClass">
    <property name="myListOfHeadersStopper">
        <list value-type="com.somePackage.TypeForList">
            <ref bean="score"/>
            <ref bean="blabla"/>
            <ref bean="etcetc"/>
        </list>
    </property>
</bean>

class org.geppetto.datasources.mergeResultsClass {
    private List<TypeForList> myListOfHeadersStoppers;
    @Required
    public void setMyList(List<TypeForList> myList) {
        this.myListOfHeadersStoppers = myListOfHeadersStoppers;
    }
}

dosumis mentioned this issue Dec 11, 2019

Merge query results on refined query when sub-results contains different headers #464

Closed

ddelpiano assigned Robbie1977 and ddelpiano Dec 11, 2019

Robbie1977 mentioned this issue Nov 4, 2022

[Snyk] Fix for 1 vulnerabilities #1381

Closed

Robbie1977 mentioned this issue Dec 25, 2022

[Snyk] Fix for 1 vulnerabilities #1398

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merging two Nblast results tables #466

Merging two Nblast results tables #466

dosumis commented Dec 11, 2019

ddelpiano commented Dec 11, 2019

Robbie1977 commented Dec 11, 2019

dosumis commented Dec 11, 2019 •

edited

Loading

Robbie1977 commented Dec 12, 2019 •

edited

Loading

ddelpiano commented Dec 12, 2019

dosumis commented Dec 12, 2019

ddelpiano commented Dec 12, 2019

Merging two Nblast results tables #466

Merging two Nblast results tables #466

Comments

dosumis commented Dec 11, 2019

ddelpiano commented Dec 11, 2019

Robbie1977 commented Dec 11, 2019

dosumis commented Dec 11, 2019 • edited Loading

Robbie1977 commented Dec 12, 2019 • edited Loading

ddelpiano commented Dec 12, 2019

dosumis commented Dec 12, 2019

ddelpiano commented Dec 12, 2019

dosumis commented Dec 11, 2019 •

edited

Loading

Robbie1977 commented Dec 12, 2019 •

edited

Loading