Skip to content

senzing-garage/validate

Repository files navigation

validate

If you are beginning your journey with Senzing, please start with Senzing Quick Start guides.

You are in the Senzing Garage where projects are "tinkered" on. Although this GitHub repository may help you understand an approach to using Senzing, it's not considered to be "production ready" and is not considered to be part of the Senzing product. Heck, it may not even be appropriate for your application of Senzing!

Synopsis

validate is a command in the senzing-tools suite of tools. This command validates that a JSONL file is properly formatted and each line contains sufficient key-value pairs for Senzing to each as a record. It is highly recommend that this code be taken and extended to validate JSONL records to meet your needs.

Go Reference Badge Go Report Card Badge License Badge go-test-linux.yaml Badge go-test-darwin.yaml Badge go-test-windows.yaml Badge

golangci-lint.yaml Badge

Overview

validate tests each line of a give JSONL file to ensure that it is valid JSON and contains two necessary key-value pairs: RECORD_ID and DATA_SOURCE.

The file is given to validate with the command-line parameter input-url or as the environment variable SENZING_TOOLS_INPUT_URL. Note this is a URL so local files will need file:// and remote files http:// or https://. If the given file has the .gz extension, it will be treated as a compressed file JSONL file. If the file has a .jsonl extension it will be treated accordingly. If the file has another extension it will be rejected, unless the input-file-type or SENZING_TOOLS_INPUT_FILE_TYPE is set to JSONL.

validate is intended as a starting point for other validation needs. It should be fairly straight forward to extend it to test other JSON objects or extend it to other file types.

Install

  1. The validate command is installed with the senzing-tools suite of tools. See senzing-tools install.

Use

senzing-tools validate [flags]
  1. For options and flags:

    1. Online documentation

    2. Runtime documentation:

      senzing-tools validate --help
  2. In addition to the following simple usage examples, there are additional Examples.

Using command line options

  1. ✏️ Specify file URL using command line option. Example:

    senzing-tools validate \
        --input-url https://public-read-access.s3.amazonaws.com/TestDataSets/SenzingTruthSet/truth-set-3.0.0.jsonl
  2. See Parameters for additional parameters.

Using environment variables

  1. ✏️ Specify file URL using environment variable. Example:

    export SENZING_TOOLS_INPUT_URL=https://public-read-access.s3.amazonaws.com/TestDataSets/SenzingTruthSet/truth-set-3.0.0.jsonl
    senzing-tools validate
  2. See Parameters for additional parameters.

Using Docker

This usage shows how to validate a file with a Docker container.

  1. ✏️ Run senzing/senzing-tools. Example:

    docker run \
        --env SENZING_TOOLS_COMMAND=validate \
        --env SENZING_TOOLS_INPUT_URL=https://public-read-access.s3.amazonaws.com/TestDataSets/SenzingTruthSet/truth-set-3.0.0.jsonl \
        --rm \
        senzing/senzing-tools
  2. See Parameters for additional parameters.

Parameters

References

  1. SDK documentation
  2. Development
  3. Errors
  4. Examples
  5. Package reference
  6. Related artifacts:
    1. DockerHub