# A Spark metrics sink that pushes to InfluxDB
Collecting diagnostic metrics from Apache Spark can be difficult because of Spark's distributed nature. Polling Spark executor processes or scraping their logs becomes tedious when executors run on an arbitrary number of remote hosts. This package instead "pushes" metrics to a central host running InfluxDB, where they can be analyzed in one place.
- Run `./gradlew build`
- Copy the JAR that is output to a path where Spark can read it, and add it to Spark's `extraClassPath`, along with `izettle/metrics-influxdb` (available on Maven)
- Add your new sink to Spark's `conf/metrics.properties`
Example `metrics.properties` snippet:

```properties
*.sink.influx.class=org.apache.spark.metrics.sink.InfluxDbSink
*.sink.influx.protocol=https
*.sink.influx.host=localhost
*.sink.influx.port=8086
*.sink.influx.database=my_metrics
*.sink.influx.auth=metric_client:PASSWORD
*.sink.influx.tags=product:my_product,parent:my_service
```
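The setup steps above can be sketched end-to-end. The JAR locations and `spark-defaults.conf` entries below are illustrative assumptions, not paths this project prescribes:

```shell
# Build the sink JAR (output lands under build/libs/ with Gradle defaults).
./gradlew build

# Copy the sink JAR, plus the izettle metrics-influxdb JAR from Maven,
# to a directory readable by every Spark driver and executor host.
cp build/libs/*.jar /opt/spark/extra-jars/            # hypothetical location
cp metrics-influxdb-*.jar /opt/spark/extra-jars/      # hypothetical location

# Then, in conf/spark-defaults.conf, add both JARs to the classpath:
#   spark.driver.extraClassPath    /opt/spark/extra-jars/*
#   spark.executor.extraClassPath  /opt/spark/extra-jars/*
```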
- This takes a dependency on the Apache2-licensed `com.izettle.dropwizard-metrics-influxdb` library, an improved version of Dropwizard's upstream InfluxDB support, which exists only in the Dropwizard Metrics 4.0 branch.
- This code lives in the `org.apache.spark.metrics.sink` package, which is necessary because Spark makes its `Sink` interface package-private.
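Because `Sink` is package-private, a custom sink has to declare itself inside `org.apache.spark.metrics.sink`. A minimal Scala sketch of the shape such a sink takes follows; it is an illustration under assumptions, not the exact code in this repository, and the constructor arity Spark invokes reflectively varies across Spark versions (some versions also pass a `SecurityManager`):

```scala
package org.apache.spark.metrics.sink

import java.util.Properties
import com.codahale.metrics.MetricRegistry

// Illustrative sketch: Spark's MetricsSystem constructs sinks reflectively,
// passing the properties parsed from metrics.properties and a MetricRegistry.
private[spark] class InfluxDbSink(
    val property: Properties,
    val registry: MetricRegistry)
  extends Sink {

  // Settings supplied via metrics.properties, e.g. *.sink.influx.host
  private val host = property.getProperty("host", "localhost")
  private val port = property.getProperty("port", "8086").toInt

  // In the real implementation, an izettle InfluxDB reporter would be
  // built from these settings and started/stopped here.
  override def start(): Unit = { /* reporter.start(...) */ }
  override def stop(): Unit = { /* reporter.stop() */ }
  override def report(): Unit = { /* reporter.report() */ }
}
```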
This project is made available under the Apache 2.0 License.