[accumulo] README feedback from @busbey
* Clarify supported Accumulo versions for table setup instructions
* Use long-form arg names for non-obvious shell commands
joshelser committed Mar 27, 2017
1 parent 144e9f6 commit 5026a87
Showing 1 changed file with 8 additions and 4 deletions.
12 changes: 8 additions & 4 deletions accumulo/README.md
@@ -41,7 +41,8 @@ Git clone YCSB and compile:
By default, YCSB uses a table with the name "usertable". Users must create this table before loading
data into Accumulo. For maximum Accumulo performance, the Accumulo table must be pre-split. A simple
Ruby script, based on the HBase README, can generate adequate split points. Tens of tablets per
- TabletServer is a good starting point.
+ TabletServer is a good starting point. Unless otherwise specified, the following commands should run
+ on any version of Accumulo.

$ echo 'num_splits = 20; puts (1..num_splits).map {|i| "user#{1000+i*(9999-1000)/num_splits}"}' | ruby > /tmp/splits.txt
$ accumulo shell -u <user> -p <password> -e "createtable usertable"
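
The split-point one-liner above is dense, so here is a minimal sketch of the same arithmetic as a standalone Ruby script. The method name `split_points` and its `low`/`high` keyword arguments are hypothetical, introduced only for illustration; the README itself only provides the one-liner. It assumes the same key range (1000 to 9999) and the same integer division as the original.

```ruby
# Hypothetical helper (not part of YCSB or Accumulo): spells out the
# arithmetic of the README one-liner so the split count is easy to change.
def split_points(num_splits, low: 1000, high: 9999)
  (1..num_splits).map do |i|
    # Integer division, exactly as in the one-liner, spreads the split
    # points evenly across the "user<low>".."user<high>" key range.
    "user#{low + i * (high - low) / num_splits}"
  end
end

# When run directly, print one split point per line.
puts split_points(20) if __FILE__ == $PROGRAM_NAME
```

Saved under an assumed filename such as `splits.rb`, it could be run as `ruby splits.rb > /tmp/splits.txt` to produce the same file as the one-liner. Raising `num_splits` gives more tablets per TabletServer.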
@@ -56,16 +57,19 @@ is a faster and similarly-efficient option. The mutation queue property controls
that Accumulo will buffer in memory before performing a flush; this property should be set relative
to the amount of JVM heap the TabletServers are given.

+ Please note that the `table.durability` and `tserver.total.mutation.queue.max` properties only
+ exist in Accumulo 1.7 and later. There are no concise replacements for these properties in earlier versions.

accumulo> config -s table.durability=flush
accumulo> config -s tserver.total.mutation.queue.max=256M
accumulo> config -t usertable -s table.file.compress.type=snappy

On repeated data loads, the following commands may be helpful to re-set the state of the table quickly.

- accumulo> createtable tmp -cs usertable -cc usertable
- accumulo> deletetable -f usertable
+ accumulo> createtable tmp --copy-splits usertable --copy-config usertable
+ accumulo> deletetable --force usertable
accumulo> renametable tmp usertable
- accumulo> compact -w -t accumulo.metadata
+ accumulo> compact --wait -t accumulo.metadata

### 4. Load Data and Run Tests
