diff --git a/docs/source/_images/CedaArchive0824.png b/docs/source/_images/CedaArchive0824.png new file mode 100644 index 0000000..facf566 Binary files /dev/null and b/docs/source/_images/CedaArchive0824.png differ diff --git a/docs/source/_images/DataDistributed.png b/docs/source/_images/DataDistributed.png new file mode 100644 index 0000000..688a7f2 Binary files /dev/null and b/docs/source/_images/DataDistributed.png differ diff --git a/docs/source/allocation.rst b/docs/source/allocation.rst deleted file mode 100644 index 3c8cb97..0000000 --- a/docs/source/allocation.rst +++ /dev/null @@ -1,6 +0,0 @@ -================= -Allocation Module -================= - -.. automodule:: pipeline.allocate - :members: \ No newline at end of file diff --git a/docs/source/assess-overview.rst b/docs/source/assess-overview.rst deleted file mode 100644 index 9d8368f..0000000 --- a/docs/source/assess-overview.rst +++ /dev/null @@ -1,136 +0,0 @@ -Assessor Tool -============= - -The assessor script ```assess.py``` is an all-purpose pipeline checking tool which can be used to assess: - - The current status of all datasets within a given group in the pipeline (which phase each dataset currently sits in) - - The errors/outputs associated with previous job runs. - - Specific logs from datasets which are presenting a specific type of error. - -An example command to run the assessor tool can be found below: -:: - - python assess.py - -Where the operation can be one of the below options: - - Progress: Get a general overview of the pipeline; how many datasets have completed or are stuck on each phase. - - Display: Display a specific type of information about the pipeline (blacklisted codes, datasets with virtual dimensions or using parquet) - - Match: Match specific attributes within the ``detail-cfg.json`` file (and save to a new ID). - - Summarise: Get an assessment of the data processed for this group - - Upgrade: Update the version of a set of kerchunk files (includes internal metadata standard updates (timestamped, reason provided). - - Cleanup: Remove cached files as part of group runs (errs, outs, repeat_ids etc.) - -1. Progress of a group ----------------------- - -To see the general status of the pipeline for a given group: -:: - - python assess.py progress - -An example output from this command can be seen below: -:: - - Group: cci_group_v1 - Total Codes: 361 - - scan : 1 [0.3 %] (Variety: 1) - - Complete : 1 - - complete : 185 [51.2%] (Variety: 1) - - complete : 185 - - unknown : 21 [5.8 %] (Variety: 1) - - no data : 21 - - blacklist : 162 [44.9%] (Variety: 7) - - NonKerchunkable : 50 - - PartialDriver : 3 - - PartialDriverFail : 5 - - ExhaustedMemoryLimit : 64 - - ExhaustedTimeLimit : 18 - - ExhaustedTimeLimit* : 1 - - ValidationMemoryLimit : 21 - -In this case there are 185 datasets that have completed the pipeline with 1 left to be scanned. The 21 unknowns have no log file so there is no information on these. This will be resolved in later versions where a `seek` function will automatically run when checking the progress, to fix gaps in the logs for missing datasets. - - -An example use case is to write out all datasets that require scanning to a new label (repeat_label): -:: - - python assess.py progress -p scan -r -W - - -The last flag ```-W``` is required when writing an output file from this program, otherwise the program will dryrun and produce no files. - -1.1. 
Checking errors --------------------- -Check what repeat labels are available already using: -:: - - python assess.py display -s labels - -For listing the status of all datasets from a previous repeat idL -:: - - python assess.py progress -r - - -For selecting a specific type of error (-e) and examine the full log for each example (-E) -:: - - python assess.py progress -r -e "type_of_error" -p scan -E - -Following from this, you may want to rerun the pipeline for just one type of error previously found: -:: - - python assess.py progress -r -e "type_of_error" -p scan -n -W - -.. Note:: - - If you are looking at a specific repeat ID, you can forego the phase (-p) flag, since it is expected this set would appear in the same phase anyway. - The (-W) write flag is also required for any commands that would output data to a file. If the file already exists, you will need to specify an override - level (-O or -OO) for merging or overwriting existing data (project code lists) respectively. - -2. Display options --------------------------- - -Check how many of the datasets in a group have virtual dimensions -:: - - python assess.py display -s virtuals - -3. Match Special Attributes ---------------------------- - -Find the project codes where a specific attribute in ``detail-cfg.json`` matches some given value -:: - - python assess.py match -c "links_added:False" - -4. Summarise data ------------------ - -Summarise the Native/Kerchunk data generated (thus far) for an existing group. -:: - - python assess.py summarise - -5. Upgrade Kerchunk version ---------------------------- - -Upgrade all kerchunk files (compute-validate stages) to a new version for a given reason. This is the 'formal' way of updating the version. -:: - - python assess.py upgrade -r -R "Reason for upgrade" -W -U "krX.X" # New version id - -6. Cleanup ----------- - -"Clean" or remove specific types of files: - - Errors/Outputs in the correct places - - "labels" i.e repeat_ids (including allocations and bands under that repeat_id) - -In the below example we will remove every created ``repeat_id`` (equivalent terminology to 'label') except for ``main``. -:: - - python assess.py cleanup -c labels diff --git a/docs/source/assess.rst b/docs/source/assess.rst deleted file mode 100644 index c508e61..0000000 --- a/docs/source/assess.rst +++ /dev/null @@ -1,5 +0,0 @@ -Assess Module -============= - -.. automodule:: assess - :members: \ No newline at end of file diff --git a/docs/source/cci_water.rst b/docs/source/cci_water.rst index 357f279..94ad2bd 100644 --- a/docs/source/cci_water.rst +++ b/docs/source/cci_water.rst @@ -8,9 +8,10 @@ A new *group* is created within the pipeline using the ``init`` operation as fol :: - python group_run.py init -i extensions/example_water_vapour/water_vapour.csv -v + padocc init -G -i extensions/example_water_vapour/water_vapour.csv -v .. note:: + Multiple flag options are available throughout the pipeline for more specific operations and methods. In the above case we have used the (-v) *verbose* flag to indicate we want to see the ``[INFO]`` messages put out by the pipeline. Adding a second (v) would also show ``[DEBUG]`` messages. Also the ``init`` phase is always run as a serial process since it just involves creating the directories and config files required by the pipeline. @@ -56,28 +57,6 @@ Ok great, we've initialised the pipeline for our new group! 
Here's a summary dia - validate.log - status_log.csv -For peace of mind and to check you understand the pipeline assessor tool we would suggest running this command next: - -:: - - python assess.py progress my_new_group - -Upon which your output should look something like this: - -.. code-block:: console - - Group: my_new_group - Total Codes: 4 - - Pipeline Current: - - init : 4 [100.%] (Variety: 1) - - complete : 4 - - Pipeline Complete: - - complete : 0 [0.0 %] - All 4 of our datasets were initialised successfully, no datasets are complete through the pipeline yet. The next steps are to ``scan``, ``compute``, and ``validate`` the datasets which would complete the pipeline. @@ -88,52 +67,8 @@ The next steps are to ``scan``, ``compute``, and ``validate`` the datasets which .. code-block:: console - python group_run.py scan my_new_group - python group_run.py compute my_new_group - python group_run.py validate my_new_group - -An more complex example of what you might see while running the pipeline in terms of errors encountered can be found below: - -.. code-block:: console - - Group: cci_group_v1 - Total Codes: 361 - - Pipeline Current: - - compute : 21 [5.8 %] (Variety: 2) - - complete : 20 - - KeyError 'refs' : 1 - - Pipeline Complete: - - complete : 185 [51.2%] - - blacklist : 155 [42.9%] (Variety: 8) - - NonKerchunkable : 50 - - PartialDriver : 3 - - PartialDriverFail : 5 - - ExhaustedMemoryLimit : 56 - - ExhaustedTimeLimit : 18 - - ExhaustedTimeLimit* : 1 - - ValidationMemoryLimit : 21 - - ScipyDimIssue : 1 - -In this example ``cci_group_v1`` group, 185 of the datasets have completed the pipeline, while 155 have been excluded (See blacklisting in the Assessor Tool section). -Of the remaining 21 datasets, 20 of them have completed the ``compute`` phase and now need to be run through ``validate``, but one encountered a KeyError which needs to be inspected. To view the log for this dataset we can use the command below: - -.. code-block:: console - - python assess.py progress cci_group_v1 -e "KeyError 'refs'" -p compute -E - -This will match with our ``compute``-phase error with that message, and the (-E) flag will give us the whole error log from that run. This may be enough to assess and fix the issue but otherwise, to rerun just this dataset a rerun command will be suggested by the assessor: - -.. code-block:: console - - Project Code: 201601-201612-ESACCI-L4_FIRE-BA-MSI-fv1.1 - 'refs' - Rerun suggested command: python single_run.py compute 218 -G cci_group_v1 -vv -d - -This rerun command has several flags included, the most importand here is the (-G) group flag, since we need to use the ``single_run`` script so now need to specify the group. The (-d) dryrun flag will simply mean we are not producing any output files since we may need to test and rerun several times. - - + padocc scan -G my_new_group + padocc compute -G my_new_group + padocc validate -G my_new_group +This section will be updated for the full release of v1.3 with additional content relating to the assessor tool. \ No newline at end of file diff --git a/docs/source/compute.rst b/docs/source/compute.rst deleted file mode 100644 index d5c6fc3..0000000 --- a/docs/source/compute.rst +++ /dev/null @@ -1,7 +0,0 @@ -============== -Compute Module -============== - -.. 
automodule:: pipeline.compute - :members: - :show-inheritance: \ No newline at end of file diff --git a/docs/source/deep_dive.rst b/docs/source/deep_dive.rst new file mode 100644 index 0000000..5ea05bf --- /dev/null +++ b/docs/source/deep_dive.rst @@ -0,0 +1,65 @@ +=================================== +A Deeper Dive into PADOCC Mechanics +=================================== + +Revision Numbers +---------------- + +The PADOCC revision numbers for each product are auto-generated using the following rules. + + * All projects begin with the revision number ``1.1``. + * The first number denotes major updates to the product, for instance where a data source file has been replaced. + * The second number denotes minor changes like alterations to attributes and metadata. + * The letters prefixed to the revision numbers identify the file type for the product. For example a zarr store has the letter ``z`` applied, while a Kerchunk (parquet) store has ``kp``. + +The Validation Report +--------------------- + +The ``ValidateDatasets`` class produces a validation report for both data and metadata validations. +This is designed to be fairly simple to interpret, while still being machine-readable. +The following headings which may be found in the report have the following meanings: + +1. Metadata Report (with Examples) +These are considered non-fatal errors that will need either a minor correction or can be ignored. + +* ``variables.time: {'type':'missing'...}`` - The time variable is missing from the specified product. +* ``dims.all_dims: {'type':'order'}`` - The ordering of dimensions is not consistent across products. +* ``attributes {'type':'ignore'...}`` - Attributes that have been ignored. These may have already been edited. +* ``attributes {'type':'missing'...}`` - Attributes that are missing from the specified product file. +* ``attributes {'type':'not_equal'...}`` - Attributes that are not equal across products. + +2. Data Report +These are considered **fatal** errors that need a major correction or possibly a fix to the pipeline itself. + +* ``size_errors`` - The size of the array is not consistent between products. +* ``dim_errors`` - Arrays have inconsistent dimensions (where not ignored). +* ``dim_size_errors`` - The dimensions are consistent for a variable but their sizes are not. +* ``data_errors`` - The data arrays do not match across products, this is the most fatal of all validation errors. +The validator should give an idea of which array comparisons failed. +* ``data_errors: {'type':'growbox_exceeded'...}`` - The variable in question could not be validated as no area could be identified that is not empty of values. + +BypassSwitch Options +-------------------- + +Certain non-fatal errors may be bypassed using the Bypass flag: +:: + + Format: -b "D" + + Default: "D" # Highlighted by a '*' + + "D" - * Skip driver failures - Pipeline tries different options for NetCDF (default). + - Only need to turn this skip off if all drivers fail (KerchunkDriverFatalError). + "F" - Skip scanning (fasttrack) and go straight to compute. Required if running compute before scan + is attempted. + "L" - Skip adding links in compute (download links) - this will be required on ingest. + "S" - Skip errors when running a subset within a group. Record the error then move onto the next dataset. + +Custom Pipeline Errors +---------------------- + +**A summary of the custom errors that are experienced through running the pipeline.** + +.. 
automodule:: padocc.core.errors + :members: + :show-inheritance: \ No newline at end of file diff --git a/docs/source/errors.rst b/docs/source/errors.rst deleted file mode 100644 index 1a379fe..0000000 --- a/docs/source/errors.rst +++ /dev/null @@ -1,8 +0,0 @@ -Custom Pipeline Errors -====================== - -**A summary of the custom errors that are experienced through running the pipeline.** - -.. automodule:: pipeline.errors - :members: - :show-inheritance: \ No newline at end of file diff --git a/docs/source/execution-source.rst b/docs/source/execution-source.rst deleted file mode 100644 index 399113e..0000000 --- a/docs/source/execution-source.rst +++ /dev/null @@ -1,8 +0,0 @@ -Pipeline Execution -================== - -.. automodule:: group_run - :members: - -.. automodule:: single_run - :members: \ No newline at end of file diff --git a/docs/source/execution.rst b/docs/source/execution.rst deleted file mode 100644 index b5d396c..0000000 --- a/docs/source/execution.rst +++ /dev/null @@ -1,192 +0,0 @@ -Pipeline Flags -============== - -==================== -BypassSwitch Options -==================== - -Certain non-fatal errors may be bypassed using the Bypass flag: -:: - - Format: -b "DBSCR" - - Default: "DBSCR" # Highlighted by a '*' - - "D" - * Skip driver failures - Pipeline tries different options for NetCDF (default). - - Only need to turn this skip off if all drivers fail (KerchunkFatalDriverError). - "B" - * Skip Box compute errors. - "S" - * Skip Soft fails (NaN-only boxes in validation) (default). - "C" - * Skip calculation (data sum) errors (time array typically cannot be summed) (default). - "X" - Skip initial shape errors, by attempting XKShape tolerance method (special case.) - "R" - * Skip reporting to status_log which becomes visible with assessor. Reporting is skipped - by default in single_run.py but overridden when using group_run.py so any serial - testing does not by default report the error experienced to the status log for that project. - "F" - Skip scanning (fasttrack) and go straight to compute. Required if running compute before scan - is attempted. - -======================== -Single Dataset Operation -======================== - -Run all single-dataset processes with the ``single-run.py`` script. - -.. code-block:: python - - usage: single_run.py [-h] [-f] [-v] [-d] [-Q] [-B] [-A] [-w WORKDIR] [-g GROUPDIR] [-p PROJ_DIR] - [-t TIME_ALLOWED] [-G GROUPID] [-M MEMORY] [-s SUBSET] - [-r REPEAT_ID] [-b BYPASS] [-n NEW_VERSION] [-m MODE] [-O OVERRIDE_TYPE] - phase proj_code - - Run a pipeline step for a single dataset - - positional arguments: - phase Phase of the pipeline to initiate - proj_code Project identifier code - - options: - -h, --help show this help message and exit - -f, --forceful Force overwrite of steps if previously done - -v, --verbose Print helpful statements while running - -d, --dryrun Perform dry-run (i.e no new files/dirs created) - -Q, --quality Create refs from scratch (no loading), use all NetCDF files in validation - -B, --backtrack Backtrack to previous position, remove files that would be created in this job. 
- -A, --alloc-bins Use binpacking for allocations (otherwise will use banding) - - -w WORKDIR, --workdir WORKDIR - Working directory for pipeline - -g GROUPDIR, --groupdir GROUPDIR - Group directory for pipeline - -p PROJ_DIR, --proj_dir PROJ_DIR - Project directory for pipeline - -t TIME_ALLOWED, --time-allowed TIME_ALLOWED - Time limit for this job - -G GROUPID, --groupID GROUPID - Group identifier label - -M MEMORY, --memory MEMORY - Memory allocation for this job (i.e "2G" for 2GB) - -s SUBSET, --subset SUBSET - Size of subset within group - -r REPEAT_ID, --repeat_id REPEAT_ID - Repeat id (1 if first time running, _ otherwise) - -b BYPASS, --bypass-errs BYPASS - Bypass switch options: See Above - - -n NEW_VERSION, --new_version NEW_VERSION - If present, create a new version - -m MODE, --mode MODE Print or record information (log or std) - -O OVERRIDE_TYPE, --override_type OVERRIDE_TYPE - Specify cloud-format output type, overrides any determination by pipeline. - -============================= -Multi-Dataset Group Operation -============================= - -Run all multi-dataset group processes within the pipeline using the ``group_run.py`` script. - -.. code-block:: python - - usage: group_run.py [-h] [-S SOURCE] [-e VENVPATH] [-i INPUT] [-A] [--allow-band-increase] [-f] [-v] [-d] [-Q] [-b BYPASS] [-B] [-w WORKDIR] [-g GROUPDIR] - [-p PROJ_DIR] [-G GROUPID] [-t TIME_ALLOWED] [-M MEMORY] [-s SUBSET] [-r REPEAT_ID] [-n NEW_VERSION] [-m MODE] - phase groupID - - Run a pipeline step for a group of datasets - - positional arguments: - phase Phase of the pipeline to initiate - groupID Group identifier code - - options: - -h, --help show this help message and exit - -S SOURCE, --source SOURCE - Path to directory containing master scripts (this one) - -e VENVPATH, --environ VENVPATH - Path to virtual (e)nvironment (excludes /bin/activate) - -i INPUT, --input INPUT - input file (for init phase) - -A, --alloc-bins input file (for init phase) - - --allow-band-increase - Allow automatic banding increase relative to previous runs. - - -f, --forceful Force overwrite of steps if previously done - -v, --verbose Print helpful statements while running - -d, --dryrun Perform dry-run (i.e no new files/dirs created) - -Q, --quality Quality assured checks - thorough run - - -b BYPASS, --bypass-errs BYPASS - Bypass switch options: See Above - - -B, --backtrack Backtrack to previous position, remove files that would be created in this job. - -w WORKDIR, --workdir WORKDIR - Working directory for pipeline - -g GROUPDIR, --groupdir GROUPDIR - Group directory for pipeline - -p PROJ_DIR, --proj_dir PROJ_DIR - Project directory for pipeline - -G GROUPID, --groupID GROUPID - Group identifier label - -t TIME_ALLOWED, --time-allowed TIME_ALLOWED - Time limit for this job - -M MEMORY, --memory MEMORY - Memory allocation for this job (i.e "2G" for 2GB) - -s SUBSET, --subset SUBSET - Size of subset within group - -r REPEAT_ID, --repeat_id REPEAT_ID - Repeat id (main if first time running, _ otherwise) - -n NEW_VERSION, --new_version NEW_VERSION - If present, create a new version - -m MODE, --mode MODE Print or record information (log or std) - -======================= -Assessor Tool Operation -======================= - -Perform assessments of groups within the pipeline using the ``assess.py`` script. - -.. 
code-block:: python - - usage: assess.py [-h] [-B] [-R REASON] [-s OPTION] [-c CLEANUP] [-U UPGRADE] [-l] [-j JOBID] [-p PHASE] [-r REPEAT_ID] [-n NEW_ID] [-N NUMBERS] [-e ERROR] [-E] [-W] - [-O] [-w WORKDIR] [-g GROUPDIR] [-v] [-m MODE] - operation groupID - - Run a pipeline step for a single dataset - - positional arguments: - operation Operation to perform - choose from ['progress', 'blacklist', 'upgrade', 'summarise', 'display', 'cleanup', 'match', - 'status_log'] - groupID Group identifier code for the group on which to operate. - - options: - -h, --help show this help message and exit - -B, --blacklist Use when saving project codes to the blacklist - - -R REASON, --reason REASON - Provide the reason for handling project codes when saving to the blacklist or upgrading - -s OPTION, --show-opts OPTION - Show options for jobids, labels, also used in matching and status_log. - -c CLEANUP, --clean-up CLEANUP - Clean up group directory of errors/outputs/labels - -U UPGRADE, --upgrade UPGRADE - Upgrade to new version - -l, --long Show long error message (no concatenation) - -j JOBID, --jobid JOBID - Identifier of job to inspect - -p PHASE, --phase PHASE - Pipeline phase to inspect - -r REPEAT_ID, --repeat_id REPEAT_ID - Inspect an existing ID for errors - -n NEW_ID, --new_id NEW_ID - Create a new repeat ID, specify selection of codes by phase, error etc. - -N NUMBERS, --numbers NUMBERS - Show project code IDs for lists of codes less than the N value specified here. - -e ERROR, --error ERROR - Inspect error of a specific type - -E, --examine Examine log outputs individually. - -W, --write Write outputs to files - -O, --overwrite Force overwrite of steps if previously done - -w WORKDIR, --workdir WORKDIR - Working directory for pipeline - -g GROUPDIR, --groupdir GROUPDIR - Group directory for pipeline - -v, --verbose Print helpful statements while running - -m MODE, --mode MODE Print or record information (log or std) \ No newline at end of file diff --git a/docs/source/extras.rst b/docs/source/extras.rst deleted file mode 100644 index de6e1dc..0000000 --- a/docs/source/extras.rst +++ /dev/null @@ -1,18 +0,0 @@ -Padocc Utility Scripts -====================== - -========= -Utilities -========= - -.. automodule:: pipeline.utils - :members: - :show-inheritance: - -======= -Logging -======= - -.. automodule:: pipeline.logs - :members: - :show-inheritance: \ No newline at end of file diff --git a/docs/source/group_source.rst b/docs/source/group_source.rst new file mode 100644 index 0000000..7f493e6 --- /dev/null +++ b/docs/source/group_source.rst @@ -0,0 +1,11 @@ +======================================== +GroupOperation Core and Mixin Behaviours +======================================== + +Source code for group operations and mixin behaviours. + +.. automodule:: padocc.operations.group + :members: + +.. automodule:: padocc.operations.mixins + :members: \ No newline at end of file diff --git a/docs/source/groups.rst b/docs/source/groups.rst new file mode 100644 index 0000000..8163057 --- /dev/null +++ b/docs/source/groups.rst @@ -0,0 +1,87 @@ +Groups in PADOCC +================ + +The advantage of using PADOCC over other tools for creating cloud-format files is the scalability built-in, with parallelisation and deployment in mind. +PADOCC allows the creation of groups of datasets, each with N source files, that can be operated upon as a single entity. +The operation can be applied to all or a subset of the datasets within the group with relative ease. 
Here we outline some basic functionality of the ``GroupOperation``. +See the source documentation page for more detail. + +Instantiating a Group +--------------------- + +A group is most easily created using a python terminal or Jupyter notebook, with a similar form to the below. + +.. code-block:: python + + from padocc.operations import GroupOperation + + my_group = GroupOperation( + 'mygroup', + workdir='path/to/dir', + verbose=1 + ) + +At the point of defining the group, all required files and folders are created on the file system with default +or initial values for some parameters. Further processing steps which incur changes to parameters will only be saved +upon completion of an operation. If in doubt, all files can be saved with current values using ``.save_files()`` +for the group. + +This is a blank group with no attached parameters, so the initial values in all created files will be blank or templated +with default values. To fill the group with actual data, we need to initialise from an input file. + +.. note:: + + In the future it will be possible to instantiate from other file types or records (e.g STAC) but for now the accepted + format is a csv file, where each entry fits the format: + ``project_code, /file/pattern/**/*.nc, /path/to/updates.json or empty, /path/to/removals.json or empty`` + +Initialisation from a File +-------------------------- + +A group can be initialised from a CSV file using: + +.. code-block:: python + + my_group.init_from_file('/path/to/csv.csv') + +Substitutions can be provided here if necessary, of the format: + +.. code-block:: python + + substitutions = { + 'init_file': { + 'swap/this/for':'this' + }, + 'dataset_file': { + 'swap/that/for':'that' + }, + 'datasets': { + 'swap/that/for':'these' + }, + } + +Where the respective sections relate to the following: + - Init file: Substitutions to the path to the provided CSV file + - Dataset file: Substitutions in the CSV file, specifically with the paths to ``.txt`` files or patterns. + - Datasets: Substitutions in the ``.txt`` file that lists each individual file in the dataset. + +Applying an operation +--------------------- + +Now we have an initialised group, in the same group instance we can apply an operation. + +.. code-block:: python + + mygroup.run('scan', mode='kerchunk') + +The operation/phase being applied is a positional argument and must be one of ``scan``, ``compute`` or ``validate``. +(``ingest/catalog`` may be added with the full version 1.3). There are also several keyword arguments that can be applied here: + - mode: The format to use for the operation (default is Kerchunk) + - repeat_id: If subsets have been produced for this group, use the subset ID, otherwise this defaults to ``main``. + - proj_code: For running a single project code within the group instead of all groups. + - subset: Used in combination with project code, if both are set they must be integers where the group is divided into ``subset`` sections, and this operation is concerned with the nth one given by ``proj_code`` which is now an integer. 
+ - bypass: BypassSwitch object for bypassing certain errors (see the Deep Dive section for more details) + +Merging or Unmerging +-------------------- +**currently in development - alpha release** \ No newline at end of file diff --git a/docs/source/index.rst b/docs/source/index.rst index ad097fc..f384603 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -6,7 +6,7 @@ PADOCC - User Documentation ============================ -**padocc** (Pipeline to Aggregate Data for Optimised Cloud Capabilites) is a Python package (formerly **kerchunk-builder**) for aggregating data to enable methods of access for cloud-based applications. +**padocc** (Pipeline to Aggregate Data for Optimised Cloud Capabilites) is a Python package for aggregating data to enable methods of access for cloud-based applications. The pipeline makes it easy to generate data-aggregated access patterns in the form of Reference Files or Cloud Formats across different datasets simultaneously with validation steps to ensure the outputs are correct. @@ -14,49 +14,50 @@ Vast amounts of archival data in a variety of formats can be processed using the Currently supported input file formats: - NetCDF/HDF - - GeoTiff (**coming soon**) - - GRIB (**coming soon**) + - GeoTiff + - GRIB - MetOffice (**future**) -*padocc* is capable of generating both reference files with Kerchunk (JSON or Parquet) and cloud formats like Zarr. +*padocc* is capable of generating both reference files with Kerchunk (JSON or Parquet) and cloud formats like Zarr. +Additionally, PADOCC creates CF-compliant aggregation files as part of the standard workflow, which means you get CFA-netCDF files as standard! +You can find out more about Climate Forecast Aggregations `here `_, these files are denoted with the extension ``.nca`` and can be opened using xarray with ``engine="CFA"`` if you have the ``CFAPyX`` package installed. -The pipeline consists of four central phases, with an additional phase for ingesting/cataloging the produced Kerchunk files. This is not part of the code-base of the pipeline currently but could be added in a future update. +The pipeline consists of three central phases, with an additional phase for ingesting/cataloging the produced Kerchunk files. +These phases represent operations that can be applied across groups of datasets in parallel, depending on the architecture of your system. +For further information around configuring PADOCC for parallel deployment please contact `daniel.westwood@stfc.ac.uk `_. + +The ingestion/cataloging phase is not currently implemented for public use but may be added in a future update. .. image:: _images/pipeline.png - :alt: Stages of the Kerchunk Pipeline + :alt: Stages of the PADOCC workflow .. toctree:: :maxdepth: 1 :caption: Contents: - Introduction + Inspiration + Steps to Run Padocc Getting Started - Example CCI Water Vapour - Padocc Flags/Options - Assessor Tool Overview - Error Codes + Example Operation + A Deep Dive Developer's Guide .. toctree:: :maxdepth: 1 - :caption: CLI Tool Source: + :caption: Operations: - Assessor Source - Control Scripts Source + The Project Operator + The Group Operator + SHEPARD .. toctree:: :maxdepth: 1 - :caption: Pipeline Source: + :caption: PADOCC Source: + + Projects + Groups + Filehandlers, Logs, and Utilities - Initialisation - Scanning - Compute - Validate - Allocations - Utils - - - Indices and Tables ================== @@ -72,9 +73,7 @@ PADOCC was developed at the Centre for Environmental Data Analysis, supported by .. 
image:: _images/ceda.png :width: 300 :alt: CEDA Logo - :width: 300 .. image:: _images/esa.png :width: 300 :alt: ESA Logo - :width: 300 diff --git a/docs/source/init.rst b/docs/source/init.rst deleted file mode 100644 index 1b3bdbc..0000000 --- a/docs/source/init.rst +++ /dev/null @@ -1,6 +0,0 @@ -===================== -Initialisation Module -===================== - -.. automodule:: pipeline.init - :members: \ No newline at end of file diff --git a/docs/source/inspiration.rst b/docs/source/inspiration.rst new file mode 100644 index 0000000..dd31799 --- /dev/null +++ b/docs/source/inspiration.rst @@ -0,0 +1,52 @@ +Inspiration for Cloud Formats and Aggregations +============================================== + +Data Archives +------------- + +The need for cloud-accessible analysis-ready data is increasing due to high demand for cloud-native applications and wider usability of data. +Current archival formats and access methods are insufficient for an increasing number of user needs, especially given the volume of data being +produced by various projects globally. + +.. image:: _images/CedaArchive0824.png + :alt: Contents of the CEDA Archive circa August 2024. + :align: center + +The CEDA-operated JASMIN data analysis facility has a current (2024) data archive of more than 30 Petabytes, with more datasets being ingested +daily. Around 25% of all datasets are in NetCDF/HDF formats which are well-optimised for HPC architecture, but do not typically perform as well +and are not as accessible for cloud-based applications. The standard NetCDF/HDF python readers for example require direct access to the source +files, so are not able to open files stored either in Object Storage (S3) or served via a download service, without first downloading the whole file. + +Distributed Data +---------------- + +The aim of distributed data aggregations is to make the access of data more effective when dealing with these vast libraries of data. +Directly accessing the platforms, like JASMIN, where the data is stored is not necessarily possible for all users, and we would like to avoid the dependence +on download services where GB/TBs of data is copied across multiple sites. Instead, the data may be accessed via a **reference/aggregation file** which provides +the instructions to fetch portions of the data, and applications reading the file are able to load data as needed rather than all at once (Lazy Loading). + +.. image:: _images/DataDistributed.png + :alt: A diagram of how the typcial Distributed Data methods operate. + :align: center + +Formats which provide effective remote data access are typically referred to as **Cloud Optimised Formats** (COFs) like `Zarr `_ and `Kerchunk `_, as in the diagram above. +Zarr stores contain individual **binary-encoded** files for each chunk of data in memory. Opening a Zarr store means accessing the top-level metadata which +informs the application reader how the data is structured. Subsequent calls to load the data will then only load the appropriate memory chunks. Kerchunk +functions similarly as a pointer to chunks of data in another location, however Kerchunk only references the existing chunk structure within NetCDF files, +rather than having each chunk as a separate file. + +PADOCC supports an additional format called CFA, which takes elements of both of these methods. CFA files store references to portions of the array, rather than ranges of bytes of compressed/uncompressed data like with Kerchunk. 
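For example, a CFA-netCDF aggregation file (extension ``.nca``) produced by PADOCC can be opened lazily with xarray via the ``CFA`` engine, provided the ``CFAPyX`` package is installed. The filename below is purely illustrative:

.. code-block:: python

    import xarray as xr

    # Open a CFA-netCDF aggregation produced by PADOCC.
    # Only the aggregation metadata is read at this point; the
    # underlying data chunks are fetched lazily when accessed.
    ds = xr.open_dataset('example_aggregation.nca', engine='CFA')

    print(ds)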
+These references are stored in NetCDF instead of JSON metadata files, which has the advantage of lazily-loaded references from a single file. Read more about CF Aggregations `here `_. + +A workflow for data conversion +------------------------------ + +PADOCC is a tool being actively developed at CEDA to enable large-scale conversion of archival data to some of these new cloud formats, to address the issues above. +Originally created as part of the ESA Climate Change Initiative project, PADOCC is steadily growing into an essential part of the CEDA ingestion pipeline. +New datasets deposited into the CEDA archive will soon be automatically converted by PADOCC and represented as part of the growing STAC catalog collection at CEDA. +Use of the catalogs is facilitated by the `CEDA DataPoint `_ package, which auto-configures for multiple different file types. + +The result of this new data architecture will be that users of CEDA data can discover and access data through our packages much faster and more efficiently than before, +without the need to learn to use many new formats. All the nuances of each dataset are handled by DataPoint, and use products created by PADOCC to facilitate fast search/access +to the data. + diff --git a/docs/source/misc_source.rst b/docs/source/misc_source.rst new file mode 100644 index 0000000..23e6bd9 --- /dev/null +++ b/docs/source/misc_source.rst @@ -0,0 +1,31 @@ +Padocc Filehandlers +====================== + +Filehandlers are an integral component of PADOCC on the filesystem. The filehandlers +connect directly to files within the pipeline directories for different groups and projects +and provide a seamless environment for fetching and saving values to these files. + +Filehandlers act like their respective data-types in most or all methods. +For example the ``JSONFileHandler`` acts like a dictionary, but with extra methods to close and save +the loaded data. Filehandlers can also be easily migrated or removed from the filesystem as part of other +processes. + +.. automodule:: padocc.core.filehandlers + :members: + :show-inheritance: + +========= +Utilities +========= + +.. automodule:: padocc.core.utils + :members: + :show-inheritance: + +======= +Logging +======= + +.. automodule:: padocc.core.logs + :members: + :show-inheritance: \ No newline at end of file diff --git a/docs/source/phases.rst b/docs/source/phases.rst new file mode 100644 index 0000000..d165111 --- /dev/null +++ b/docs/source/phases.rst @@ -0,0 +1,118 @@ +============================= +Phases of the PADOCC Pipeline +============================= + +.. image:: _images/padocc.png + :alt: Stages of the PADOCC workflow + +**Initialisation of a Group of Datasets** + + +The pipeline takes a CSV (or similar) input file from which to instantiate a ``GroupOperation``, which includes: + - creating subdirectories for all associated datasets (projects) + - creating multiple group files with information regarding this group. + +Scan +---- + +The first main phase of the pipeline involves scanning a subset of the native source files to determine certain parameters: + +* Ensure source files are compatible with one of the available converters for Kerchunk/Zarr etc.: +* Calculate expected memory (for job allocation later.) +* Calculate estimated chunk sizes and other values. +* Determine suggested file type, including whether to use JSON or Parquet for Kerchunk references. +* Identify Identical/Concat dims for use in **Compute** phase. 
+* Determine any other specific parameters for the dataset on creation and concatenation. + +A scan operation is performed across a group of datasets/projects to determine specific +properties of each project and some estimates of time/memory allocations that will be +required in later phases. + +The scan phase can be activated with the following: + +.. code:: python + + mygroup = GroupOperation( + 'my-group', + workdir='path/to/pipeline/directory' + ) + # Assuming this group has already been initialised from a file. + + mygroup.run('scan',mode='kerchunk') + +.. automodule:: padocc.phases.scan + :members: + +Compute +------- + +Building the Cloud/reference product for a dataset requires a multi-step process: + +Example for Kerchunk: + +* Create Kerchunk references for each archive-type file. +* Save cache of references for each file prior to concatenation. +* Perform concatenation (abort if concatenation fails, can load cache on second attempt). +* Perform metadata corrections (based on updates and removals specified at the start) +* Add Kerchunk history global attributes (creation time, pipeline version etc.) +* Reconfigure each chunk for remote access (replace local path with https:// download path) + +Computation will either refer to outright data conversion to a new format, +or referencing using one of the Kerchunk drivers to create a reference file. +In either case the computation may be extensive and require processing in the background +or deployment and parallelisation across the group of projects. + +Computation can be executed in serial for a group with the following: + +.. code:: python + + mygroup = GroupOperation( + 'my-group', + workdir='path/to/pipeline/directory' + ) + # Assuming this group has already been initialised and scanned + + mygroup.run('compute',mode='kerchunk') + +.. automodule:: padocc.phases.compute + :members: + :show-inheritance: + +Validate +-------- + +Cloud products must be validated against equivalent Xarray objects from CF Aggregations (CFA) where possible, or otherwise using the original NetCDF as separate Xarray Datasets. + +* Ensure all variables present in original files are present in the cloud products (barring exceptions where metadata has been altered/corrected) +* Ensure array shapes are consistent across the products. +* Ensure data representations are consistent (values in array subsets) + +The validation step produced a two-sectioned report that outlines validation warnings and errors with the data or metadata +around the project. See the documentation on the validation report for more details. + +It is advised to run the validator for all projects in a group to determine any issues +with the conversion process. Some file types or specific arrangements may produce unwanted effects +that result in differences between the original and new representations. This can be identified with the +validator which checks the Xarray representations and identifies differences in both data and metadata. + +.. code:: python + + mygroup = GroupOperation( + 'my-group', + workdir='path/to/pipeline/directory' + ) + # Assuming this group has already been initialised, scanned and computed + + mygroup.run('validate') + + # The validation reports will be saved to the filesystem for each project in this group + # as 'data_report.json' and 'metadata_report.json' + +.. 
automodule:: padocc.phases.validate + :members: + +Next Steps +---------- + +Cloud products that have been validated are moved to a ``complete`` directory with the project code as the name, plus the revision identifier `abX.X` - learn more about this in the deep dive section. +These can then be linked to a catalog or ingested into the CEDA archive where appropriate. diff --git a/docs/source/pipeline-overview.rst b/docs/source/pipeline-overview.rst deleted file mode 100644 index 3d1ddc1..0000000 --- a/docs/source/pipeline-overview.rst +++ /dev/null @@ -1,45 +0,0 @@ -Overview of Pipeline Phases -=========================== - -.. image:: _images/pipeline.png - :alt: Stages of the Kerchunk Pipeline - -**Init (Initialisation) Phase** - -The pipeline takes a CSV (or similar) input file and creates the necessary directories and config files for the pipeline to being running. - -**Scan Phase** - -Second phase of the pipeline involves scanning a subset of the NetCDF/HDF/Tiff files to determine certain parameters: - -* Ensure NetCDF/HDF/Tiff files can be converted successfully using one of the available drivers: -* Calculate expected memory (for job allocation later.) -* Calculate estimated chunk sizes and other values. -* Determine file-type (JSON or Parquet) for final Kerchunk file. -* Identify Identical/Concat dims for use in **Compute** phase. -* Determine any other specific parameters for the dataset on creation and concatenation. - -**Compute Phase** - -Building the Kerchunk file for a dataset requires a multi*step process: - -* Create Kerchunk references for each archive-type file. -* Save cache of references for each file prior to concatenation. -* Perform concatenation (abort if concatenation fails, can load cache on second attempt). -* Perform metadata corrections (based on updates and removals specified at the start) -* Add Kerchunk history global attributes (creation time, pipeline version etc.) -* Reconfigure each chunk for remote access (replace local path with https:// download path) - -**Validation Phase** - -Kerchunk files must be validated against equivalent Xarray objects from the original NetCDF: - -* Ensure all variables present in original files are present in Kerchunk (barring exceptions) -* Ensure array shapes are consistent across Kerchunk/NetCDF -* Ensure data representations are consistent (values in array subsets) - -Several options and switches can be configured for the validation step, see the BypassSwitch class. - -**Next Steps** - -Kerchunk files that have been validated are moved to a ``complete`` directory with the project code as the name, plus the kerchunk revision `krX.X`. These can then be linked to a catalog or ingested into the CEDA archive where appropriate. diff --git a/docs/source/project_source.rst b/docs/source/project_source.rst new file mode 100644 index 0000000..2420811 --- /dev/null +++ b/docs/source/project_source.rst @@ -0,0 +1,11 @@ +========================================== +ProjectOperation Core and Mixin Behaviours +========================================== + +Source code for individual project operations and mixin behaviours. + +.. automodule:: padocc.core.project + :members: + +.. 
automodule:: padocc.core.mixins + :members: \ No newline at end of file diff --git a/docs/source/projects.rst b/docs/source/projects.rst new file mode 100644 index 0000000..6edf7cd --- /dev/null +++ b/docs/source/projects.rst @@ -0,0 +1,61 @@ +Projects in PADOCC +================== + +To differentiate syntax of datasets/datafiles with other packages that have varying definitions of those terms, +PADOCC uses the term ``Project`` to refer to a set of files to be aggregated into a single 'Cloud Product'. + +The ``ProjectOperation`` class within PADOCC allows us to access all information about a specific dataset, including +fetching data from files within the pipeline directory. This class also inherits from several Mixin classes which +act as containers for specific behaviours for easier organisation and future debugging. + +Directory Mixin +--------------- + +The directory mixin class contains all behaviours relating to creating directories within a project (or group) in PADOCC. +This includes the inherited ability for any project to create its parent working directory and group directory if needed, as well +as a subdirectory for cached data files. The switch values ``forceful`` and ``dryrun`` are also closely tied to this +container class, as the creation of new directories may be bypassed/forced if they exist already, or bypassed completely in a dry run. + +Evaluations Mixin +----------------- + +Previously, all evaluations were handled by an assessor module (pre 1.3), but this has now been reorganised +into a mixin class for the projects themselves, meaning any project instance has the capacity for self-evaluation. The routines +grouped into this container class relate to the self analysis of details and parameters of the project and various +files: + - get last run: Determine the parameters used in the most recent operation for a project. + - get last status: Get the status of the most recent (completed) operation. + - get log contents: Examine the log contents for a specific project. + +This list will be expanded in the full release version 1.3 to include many more useful evaluators including +statistics that can be averaged across a group. + +Properties Mixin +---------------- + +A collection of dynamic properties about a specific project. The Properties Mixin class abstracts any +complications or calculations with retrieving specific parameters; some may come from multiple files, are worked out on-the-fly +or may be based on an external request. Properties currently included are: + - Outpath: The output path to a 'product', which could be a zarr store, kerchunk file etc. + - Outproduct: The name of the output product which includes the cloud format and version number. + - Revision/Version: Abstracts the construction of revision and version numbers for the project. + - Cloud Format: Kerchunk/Zarr etc. - value stored in the base config file and can be set manually for further processing. + - File Type: Extension applied to the output product, can be one of 'json' or 'parquet' for Kerchunk products. + - Source Format: Format(s) detected during scan - retrieved from the detail config file after scanning. + +The properties mixin also enables a manual adjustment of some properties, like cloud format or file type, but also enables +minor and major version increments. This will later be wrapped into an ``Updater`` module to enable easier updates to +Cloud Product data/metadata. 
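As a rough usage sketch, the properties above might be inspected and adjusted as follows. Note that the import path, constructor arguments and attribute names here are illustrative assumptions based on the descriptions in this section, not a definitive API reference:

.. code-block:: python

    from padocc.core.project import ProjectOperation

    # Assumed constructor arguments: a project code plus the pipeline
    # working directory and parent group (names are illustrative).
    project = ProjectOperation(
        'my_project_code',
        workdir='path/to/pipeline/directory',
        groupID='my-group',
    )

    # Read some dynamic properties (assumed attribute names).
    print(project.cloud_format)   # e.g. 'kerchunk'
    print(project.revision)       # e.g. 'kr1.1'

    # Manually switch the intended output format, then persist the
    # change to the project's files.
    project.cloud_format = 'zarr'
    project.save_files()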
+ +The Project Operator class +-------------------------- + +The 'core' behaviour of all classes is contained in the ``ProjectOperation`` class. +This class has public UI methods like ``info`` and ``help`` that give general information about a project, +and list some of the other public methods available respectively. + +Key Functions: + - Acts as an access point to all information and data about a project (dataset). + - Can adjust values within key files (abstracted) by setting specific parameters of the project instance and then using ``save_files``. + - Enables quick stats gathering for use with group statistics calculations. + - Can run any process on a project from the Project Operator. \ No newline at end of file diff --git a/docs/source/scan.rst b/docs/source/scan.rst deleted file mode 100644 index db892f7..0000000 --- a/docs/source/scan.rst +++ /dev/null @@ -1,6 +0,0 @@ -============== -Scanner Module -============== - -.. automodule:: pipeline.scan - :members: \ No newline at end of file diff --git a/docs/source/shepard.rst b/docs/source/shepard.rst new file mode 100644 index 0000000..4f93322 --- /dev/null +++ b/docs/source/shepard.rst @@ -0,0 +1,16 @@ +The SHEPARD Module +================== + +The latest development in the PADOCC package is the SHEPARD Module (coming in 2025). + +SHEPARD (Serial Handler for Enabling PADOCC Aggregations via Recurrent Deployment) is a component +designed as an entrypoint script within the PADOCC environment to automate the operation of the pipeline. +Groups of datasets (called ``flocks``) can be created by producing input files and placing them in a persistent +directory accessible to a deployment of PADOCC. The deployment operates an hourly check in this directory, +and picks up any added files or any changes to existing files. The groups specified can then be automatically +run through all sections of the pipeline. + +The deployment of SHEPARD at CEDA involves a Kubernetes Pod that has access to the JASMIN filesystem as well as +the capability to deploy to JASMIN's LOTUS cluster for job submissions. The idea will be for SHEPARD to run +continuously, slowly processing large sections of the CEDA archive and creating cloud formats that can be utilised +by other packages like DataPoint (see the Inspiration tab) that provide fast access to data. \ No newline at end of file diff --git a/docs/source/start.rst b/docs/source/start.rst index b801f3e..bcff1bf 100644 --- a/docs/source/start.rst +++ b/docs/source/start.rst @@ -13,6 +13,10 @@ If you need to clone the repository, either simply clone the main branch of the git clone git@github.com:cedadev/padocc.git +.. note:: + + The instructions below are specific to version 1.3 and later. To obtain documentation for pre-1.3, please contact `daniel.westwood@stfc.ac.uk `_. + Step 1: Set up Virtual Environment ---------------------------------- @@ -22,7 +26,7 @@ Step 1 is to create a virtual environment and install the necessary packages wit python -m venv name_of_venv; source name_of_venv/bin/activate; - pip install -r requirements.txt; + pip install ./; Step 2: Environment configuration @@ -32,11 +36,10 @@ Create a config file to set necessary environment variables. (Suggested to place .. code-block:: console export WORKDIR = /path/to/kerchunk-pipeline - export SRCDIR = /gws/nopw/j04/cedaproc/kerchunk_builder/kerchunk-builder - export KVENV = $SRCDIR/kvenv + export KVENV = /path/to/virtual/environment/venv -Now you should be set up to run the pipeline properly. For any of the pipeline scripts, running ```python