Performance testing triplestores for Islandora community recommendations #30

ruebot · 2015-03-24T16:44:31Z

Identify triplestores
Identify benchmark tests

DiegoPino · 2015-03-24T17:02:50Z

Nice @ruebot

I would like to add also some requirements for the possible candidates if possible:

100% Sparql 1.1 compliant
Opensource =)
Good/active community+developers
Nice if capable of making distributed/cross queries (my future needs)
Horizontal Scaling and Clustering
Multiple storage choices

Quick list from google

Identify triplestores
- Apache Jena Fuseki (2.0?)
- BigOWLIM
- D2R Server
- Sesame
- Open Link Virtuoso
- 4store
- ~~AllegroGraph~~ (not opensource)
- BlazeGraph (Thanks @daniel-dgi )

Some existing work on benchmarks

ruebot · 2015-03-24T17:13:33Z

@DiegoPino++

daniel-dgi · 2015-04-14T14:25:43Z

Wanna throw BlazeGraph into the mix: http://www.blazegraph.com/ . It's what wikipedia is using.

DiegoPino · 2015-04-14T15:49:46Z

Nice addition @daniel-dgi. BlazeGraph Looks really good. ++ for testing that one first.

ruebot · 2015-04-23T13:12:38Z

Shall we identify benchmarks, and datasets from this RdfStoreBenchmarking list? Maybe we can coordinate with the Fedora community? Get some input there as well?

looks at @awoods

awoods · 2015-04-23T13:59:40Z

It would be good to identify usage characteristics and expectations of the community in order to ensure that we are looking at the right metrics. As a side note, I believe @no-reply at DPLA is also planning on such an analysis. Maybe we can extend the coordination.

DiegoPino · 2015-04-23T14:28:07Z

Hi, do we have some stats on how many triples do we will get for every FF object?

awoods · 2015-04-23T14:35:20Z

No, but that should be easy to determine. My guess is 20.

DiegoPino · 2015-04-23T14:37:52Z

Ok, that's less than what we got now in Fedora 3. A simple object with RELS-EXT + full DC document gives me about 30.

awoods · 2015-04-23T14:57:24Z

You will want to check what the F4 triples look like from your specific data, of course. I was just throwing out a guess. 30 may be closer to the truth.

DiegoPino · 2015-04-23T15:06:28Z

Thanks @awoods! , i just wan't to try to infer what will be the reality for the largest (and ever growing) islandora implementations we have on the community. @ruebot , do you think we could make a quick and dirty poll about this on the google group? Like "how many objects are you handling right now, and how fast are you growing every year"?. I have read in the group of repos with over 250000 objects. That's 7.500.000 triples. To have this as basis to identify usage "characteristics and expectations" as @awoods correctly stated.

DiegoPino · 2015-04-23T15:21:00Z

Looks like LUBM: http://swat.cse.lehigh.edu/projects/lubm/ is a standard test sets and tools used on benchmarking triple stores. At least Oracle thinks so!
http://download.oracle.com/otndocs/tech/semantic_web/pdf/OracleSpatialGraph_RDFgraph_1_trillion_Benchmark.pdf

dmoses · 2015-04-24T19:36:08Z

fyi ... Open Link Virtuoso (i believe) is also used by the OSF for Drupal project

DiegoPino · 2015-04-24T19:48:40Z

Nice Donald! OSF for drupal looks like a nice addition, reading quickly through the documentation i see there is a lot of things we could do without having to write custom code, even importing whole ontologies. Also 3.2 version does not require Virtuoso anymore, you can use any Triple store, even better. Thanks a lot, this could make the bridge and bring Linked data to Drupal.

ruebot · 2016-04-23T15:56:13Z

This could be done as Fedora community Performance Scaling & Testing; relevant agenda item from this meeting.

ruebot · 2016-06-29T02:08:20Z

Because sometimes we have a conversation on Twitter a year or so later:
https://twitter.com/ruebot/status/747955866385539072

...and a document now thanks to @cmh2166
https://docs.google.com/document/d/1EoD-JD4OxF9M-pfifQxF_0U7CLGThd8cMzjFH0DwKgU/edit#heading=h.84vdault4l0g

no-reply · 2016-06-29T03:02:05Z

For Ruby users, I've done some initial work on a benchmark suite for ruby-rdf at: https://github.com/ruby-rdf/rdf-benchmark

My hope is that this will become a general purpose benchmark for RDF.rb, using the Berlin Benchmark data generator. It's early days, still, but the work might have more general usefulness.

bradspry · 2016-07-13T16:04:43Z

Blazegraph GPU on AWS EC2 G2 Family :-)

ruebot · 2016-07-18T14:50:56Z

Add Stardog to the list. h/t @ajs6f

http://sparqlscore.com/ too

ajs6f · 2016-07-18T14:59:34Z

Stardog is not open source, although in my experience @kendall at @Complexible is approachable and very willing to have discussions about favorable licensing terms. I had that experience in the context of work I did for @ddavis at @Smithsonian, so YMMV.

DiegoPino mentioned this issue Apr 24, 2015

Investigate XML editors to replace XML Forms #28

Closed

whikloj mentioned this issue May 27, 2015

Triplestore #36

Merged

ruebot added help wanted Seeking a volunteer or co-worker question labels Apr 7, 2016

whikloj mentioned this issue Jan 8, 2019

Future of Blazegraph #1000

Closed

dflitner mentioned this issue Feb 10, 2020

Large video file can't be moved to Fedora (test case #23) #1436

Closed

kstapelfeldt added Type: question asks for support (asks a question) and removed architecture labels Sep 25, 2021

kstapelfeldt added the Subject: Linked Data related to linked data. Consider also using metadata or modelling tags. label Sep 30, 2021

kstapelfeldt mentioned this issue Oct 6, 2021

Meta-Issue: Performance Testing #936

Open

kstapelfeldt added this to Islandora Issues Queue Feb 1, 2022

kstapelfeldt moved this to Todo in Islandora Issues Queue Feb 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance testing triplestores for Islandora community recommendations #30

Performance testing triplestores for Islandora community recommendations #30

ruebot commented Mar 24, 2015

DiegoPino commented Mar 24, 2015

ruebot commented Mar 24, 2015

daniel-dgi commented Apr 14, 2015

DiegoPino commented Apr 14, 2015

ruebot commented Apr 23, 2015

awoods commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

awoods commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

awoods commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

dmoses commented Apr 24, 2015

DiegoPino commented Apr 24, 2015

ruebot commented Apr 23, 2016

ruebot commented Jun 29, 2016

no-reply commented Jun 29, 2016

bradspry commented Jul 13, 2016 •

edited

Loading

ruebot commented Jul 18, 2016 •

edited

Loading

ajs6f commented Jul 18, 2016

Performance testing triplestores for Islandora community recommendations #30

Performance testing triplestores for Islandora community recommendations #30

Comments

ruebot commented Mar 24, 2015

DiegoPino commented Mar 24, 2015

ruebot commented Mar 24, 2015

daniel-dgi commented Apr 14, 2015

DiegoPino commented Apr 14, 2015

ruebot commented Apr 23, 2015

awoods commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

awoods commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

awoods commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

DiegoPino commented Apr 23, 2015

dmoses commented Apr 24, 2015

DiegoPino commented Apr 24, 2015

ruebot commented Apr 23, 2016

ruebot commented Jun 29, 2016

no-reply commented Jun 29, 2016

bradspry commented Jul 13, 2016 • edited Loading

ruebot commented Jul 18, 2016 • edited Loading

ajs6f commented Jul 18, 2016

bradspry commented Jul 13, 2016 •

edited

Loading

ruebot commented Jul 18, 2016 •

edited

Loading