Schema Admin via HTTP [JIRA: RIAK-1715] #10

rzezeski · 2012-10-16T15:51:29Z

Yokozuna should have the ability to create and modify schemas
remotely. This issue covers doing so over HTTP specifically.

There are still a lot of questions regarding this issue. The
fundamental ones are:

- Can a running system cope with schema changes? If so, how can it be
  done safely?
- Can schemas be modified piecemeal or must it be all-or-nothing?
- Can JSON be used to read/modify/write the schema?
- Should concurrent writers/siblings be accounted for? I'm a little
  less worried about concurrent writers in a healthy cluster and more
  worried about partitioned writes. Would PW/PR/DW/W/R=N be good
  enough?

Specification

The resource: <host-port>/yokozuna/schema/<schema-name>

GET

Return the schema with content type of application/xml.
TODO: allow picking out a subset of the schema to return, e.g. a list
of fields?
~~TODO: allow to return in JSON format?~~

PUT

Accepts text/xml or application/xml.
The body is a properly formed Solr schema. See the
example schema.
If the schema name already exists then don't replace the current
one. Instead return an error to the user stating that it already
exists. That said, there needs to be a way to overwrite a schema in
case a bad schema is uploaded.
TODO: Think about adding param overwrite=true to bypass the
previous check allowing the user to overwrite the current schema
definition. This has to be thought about carefully because changing
schemas could cause issues.
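
A minimal sketch of how a client might build the GET and PUT requests described above, using only Python's standard library. The `http://` scheme, the `localhost:8098` host-port in the usage note, and the helper names are assumptions for illustration; only the resource path and the content types come from the specification. The helpers build requests without sending them.

```python
from urllib.parse import quote
from urllib.request import Request


def schema_url(host_port: str, schema_name: str) -> str:
    # Resource from the spec: <host-port>/yokozuna/schema/<schema-name>.
    # The http:// scheme is an assumption.
    return f"http://{host_port}/yokozuna/schema/{quote(schema_name, safe='')}"


def build_get(host_port: str, schema_name: str) -> Request:
    # GET returns the schema as application/xml.
    return Request(
        schema_url(host_port, schema_name),
        method="GET",
        headers={"Accept": "application/xml"},
    )


def build_put(host_port: str, schema_name: str, schema_xml: str) -> Request:
    # PUT accepts text/xml or application/xml; the body is a Solr schema.
    return Request(
        schema_url(host_port, schema_name),
        data=schema_xml.encode("utf-8"),
        method="PUT",
        headers={"Content-Type": "application/xml"},
    )
```

To actually send a request, pass it to `urllib.request.urlopen`, e.g. `urlopen(build_get("localhost:8098", "my_schema"))`. A PUT against an existing schema name would be expected to fail with an HTTP error per the rule above, which `urlopen` surfaces as `urllib.error.HTTPError`.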

POST

TODO: Think about allowing POSTs to add to or modify a subset of a
schema. E.g. adding a new field without a read/modify/write of the
entire schema.
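
A sketch of the client-side read/modify/write that such a POST would replace: parse the full schema, append one field, and serialize it again for a PUT. It assumes the schema keeps its `<field>` elements under a `<fields>` wrapper, as in Solr 4.x example schemas; `add_field` is a hypothetical helper, not part of Yokozuna.

```python
import xml.etree.ElementTree as ET


def add_field(schema_xml: str, name: str, field_type: str,
              indexed: bool = True, stored: bool = False) -> str:
    """Return a copy of schema_xml with one extra <field> element."""
    root = ET.fromstring(schema_xml)
    fields = root.find("fields")
    if fields is None:
        # Assumption: create the wrapper if the schema lacks one.
        fields = ET.SubElement(root, "fields")
    for existing in fields.findall("field"):
        if existing.get("name") == name:
            raise ValueError(f"field {name!r} already exists")
    ET.SubElement(fields, "field", {
        "name": name,
        "type": field_type,
        "indexed": str(indexed).lower(),
        "stored": str(stored).lower(),
    })
    return ET.tostring(root, encoding="unicode")
```

Note that the default ElementTree parser drops XML comments, so a round trip through this helper loses any comments in the original schema. That is one more reason a server-side partial update could be attractive.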

DELETE

TODO: Do we allow deletes of schemas?


abhinavsingh · 2012-10-17T06:27:12Z

@rzezeski My 2 cents on the above concerns:

- Yokozuna should explore the idea of adapting to schema changes, i.e. a running system will eventually cope with schema changes all by itself.
- There can also be a flag with which the developer decides whether Yokozuna should adapt to schema changes for old data by itself. If this flag is turned on, Yokozuna can do the relevant reindexing job in the background.
- Currently the schema version tag is not utilised by Riak Search. Yokozuna can make use of these version strings and make the relevant indexes available via different URL paths, e.g. /solr/1.0/select?q= and /solr/1.1/select?q=. Developers can keep consuming the older indexes from /solr/1.0/select?q=, and the new indexes will be available via /solr/1.1/select?q= while they are being rebuilt.
- Similarly, there can be a way for the developer to cancel/revert the schema changes being made. This will also stop/pause the background reindexing job.
- Finally, if the developer is happy with schema version 1.1, they can run the garbage collection job which cleans up the indexes for version 1.0.

That said, these functionalities are bound to put some load on Yokozuna clusters while a reindexing job is running for a large number of documents in the db.
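
A small sketch of the versioned query paths proposed above. The `/solr/<version>/select?q=` layout is taken from the comment; the helper name and the example query are made up for illustration, and no such endpoint is known to exist in Yokozuna.

```python
from urllib.parse import urlencode


def versioned_select_path(schema_version: str, query: str) -> str:
    # Builds /solr/<version>/select?q=<query>, URL-encoding the query.
    return f"/solr/{schema_version}/select?{urlencode({'q': query})}"


old_path = versioned_select_path("1.0", "title:riak")
new_path = versioned_select_path("1.1", "title:riak")
```

A client could keep querying `old_path` until the 1.1 index finishes rebuilding, then switch to `new_path`.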

rzezeski · 2012-10-17T14:28:33Z

@abhinavsingh Very interesting ideas. Given the fact that Solr cores can be copied/swapped and the active anti-entropy sub-system will repair missing data, perhaps this is doable. There is much to think about here. I think the main issue can be done without considering your points, i.e. they are additions. I'll give it some thought and perhaps create some additional issues. At minimum, Yokozuna should strive to make schema migration not a pain in the ass.

dreverri · 2012-10-26T00:07:36Z

This may not be acceptable to everyone but it seems immutable schemas might be acceptable if Yokozuna allowed for a bucket to have many indexes. If a new schema is needed, create a new index with a new immutable schema. AAE will take care of indexing old data. Developers can switch over to the new schema when AAE is done and drop the old index when appropriate.

rzezeski · 2013-01-23T22:40:10Z

PR #42 addressed the basic concerns in this issue but there are still things that must be addressed. Pushing this issue back another release so it can continue to be iterated upon.

rzezeski · 2013-03-14T19:31:31Z

It appears that Solr has been doing some work related to this issue.

SOLR-4503 allows fetching schema properties via HTTP.

SOLR-3251 would allow dynamic adding of fields.

SOLR-1147 would allow configuration of solrconfig.xml, which is a bit tangential to the schema but I thought I'd add it here anyways.

SOLR-791 would allow setting the schema during core creation. This doesn't really change anything from a Yokozuna user's perspective but would help with Yokozuna code. Wouldn't need to do direct file copying anymore.

coderoshi · 2013-03-29T00:04:52Z

#58 is related to this issue, namely, the verification of an uploaded schema

coderoshi · 2013-04-11T22:56:06Z

Should this still be labeled as a "must"? Many of the important items are either done or not doable. +1 for closing.

rzezeski · 2013-04-16T21:23:07Z

I agree the basics are there. I'd like to see how Solr upstream deals with adding fields on the fly and then revisit this topic.

I'm still concerned about modifying a schema for an index that already has data. I'm not sure what effects changing field names, field types, or analyzer chains might have. I'm sure it's often not good. In the future I think it would be good if Yokozuna kept track of schema versions via hash + datetime. It might be feasible to design graceful migrations by way of Solr cores. But that is a change that requires some thought and probably a fair amount of code. For now I'd like to punt on it and leave the behavior undefined. Essentially, modifying schemas with existing data should be done with great care for now.
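
A minimal sketch of the "hash + datetime" idea, assuming SHA-256 over the raw schema bytes and a UTC timestamp. The comment above does not specify a hash function or storage format, so the names and choices here are illustrative only.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass(frozen=True)
class SchemaVersion:
    digest: str           # hex SHA-256 of the schema bytes (assumed hash)
    created_at: datetime  # when this version was recorded


def schema_version(schema_xml: bytes,
                   now: Optional[datetime] = None) -> SchemaVersion:
    # Hash the bytes as uploaded; no XML normalization is attempted.
    digest = hashlib.sha256(schema_xml).hexdigest()
    return SchemaVersion(digest, now or datetime.now(timezone.utc))


def schema_changed(current: SchemaVersion, new_xml: bytes) -> bool:
    # True when the uploaded schema differs from the recorded version.
    return hashlib.sha256(new_xml).hexdigest() != current.digest
```

Because the hash covers raw bytes, a whitespace-only edit counts as a new version. Canonicalizing the XML before hashing would avoid that, at the cost of more code.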

Since this is an umbrella issue I'm going to remove the 'must' tag but leave it open as a reminder to revisit later.

DSomogyi · 2015-04-14T20:40:57Z

Comment for Jira.

rzezeski · 2021-10-07T21:58:23Z

I wear the wolf shirt.

jaredmorrow · 2021-10-07T22:02:19Z

> I wear the wolf shirt.

Obviously not if you haven't fixed this in the past 9 years.

rzezeski · 2021-10-07T22:03:40Z

Problem?

jaredmorrow · 2021-10-07T22:04:17Z

Shouldn't you be debugging a printer or something?

andrewjstone · 2021-10-08T05:06:35Z

> Shouldn't you be debugging a printer or something?

He doesn't have time. He's still on projector duty.

rzezeski · 2021-10-08T15:27:04Z

As @vinoski would say, due tomorrow means do tomorrow.

coderoshi mentioned this issue Jan 8, 2013

Timeouts during bulk data load, possible tie in to yz_events crash and a bad state on core create [JIRA: RIAK-1585] #42

Merged

rzezeski mentioned this issue Oct 24, 2013

Schemaless Mode [JIRA: RIAK-1707] #219

Open

rzezeski mentioned this issue Mar 13, 2014

Add schema update + index reloading [JIRA: RIAK-1591] #130

Open

rzezeski added this to the 2.0.1 milestone Mar 22, 2014

zeeshanlakhani mentioned this issue Dec 17, 2014

add schema create add remove to cli-wip #439

Open

Basho-JIRA changed the title ~~Schema Admin via HTTP~~ Schema Admin via HTTP [JIRA: RIAK-1715] Apr 14, 2015

Basho-JIRA added the JIRA: To Do label Apr 14, 2015
