Skip to content

v0.2

Compare
Choose a tag to compare
@Atinoda Atinoda released this 13 Feb 22:15
d21346e

Refactor project to reflect upstream development.

  • Introduce variants: base, default
  • Introduce platforms: nvidia, cpu, rocm, arc
  • Change base image from nvidia/cuda to ubuntu:22.04 to slim docker images
  • Deprecate variants: cuda, llama-cpu, triton, monkey-patch
    • Use default-cpu instead of llama-cpu for new deployments
    • Use default-nvidia instead of cuda, triton
    • Please raise an issue if new version has missing functionality