v0.1.0

JaehongCS20 released this 03 Jan 06:14

Performance model update for LLMServingSim

New features

  • GPU support through a performance model
  • Auto config generator (network and memory)
  • Verbose option for more detailed logs
  • Additional metrics (queuing_delay, TTFT for time to first token, TPOT for time per output token)
  • Refactored code structure for readability
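
For readers unfamiliar with the new metrics, the sketch below shows how queuing_delay, TTFT, and TPOT are commonly derived from per-request timestamps. It is illustrative only and is not taken from the LLMServingSim code: the `RequestTrace` class, its field names, and the `latency_metrics` function are hypothetical, and the simulator may define these metrics slightly differently (for example, measuring TTFT from scheduling rather than from arrival).

```python
from dataclasses import dataclass


@dataclass
class RequestTrace:
    """Hypothetical per-request timestamps (seconds); not LLMServingSim's own type."""
    arrival: float       # request entered the queue
    start: float         # scheduler first picked the request up
    first_token: float   # first output token was produced
    end: float           # last output token was produced
    output_tokens: int   # number of generated tokens


def latency_metrics(t: RequestTrace) -> dict:
    """Compute common serving latency metrics under the assumptions above."""
    # Time spent waiting before any work was done on the request.
    queuing_delay = t.start - t.arrival
    # Time to first token, measured here from arrival (an assumption).
    ttft = t.first_token - t.arrival
    # Time per output token: decode time averaged over tokens after the first.
    decode_tokens = t.output_tokens - 1
    tpot = (t.end - t.first_token) / decode_tokens if decode_tokens > 0 else 0.0
    return {"queuing_delay": queuing_delay, "TTFT": ttft, "TPOT": tpot}


if __name__ == "__main__":
    trace = RequestTrace(arrival=0.0, start=0.5, first_token=1.5, end=3.5, output_tokens=5)
    print(latency_metrics(trace))
```

Averaging TPOT over the tokens after the first keeps prefill time out of the per-token figure, which is the usual reason TTFT and TPOT are reported separately.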