Skip to content

Inference serving 压测时 随着请求数累计,推理性能会下降 3 倍多,并且最终稳定在一个值 #1018

@songyaheng

Description

@songyaheng

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 20.04):
  • DeepRec version 最新:
  • Python version: 3.8
  • Bazel version (if compiling from source): 4.5
  • GCC/Compiler version (if compiling from source): 12
  • CUDA/cuDNN version: no

Describe the current behavior

Describe the expected behavior

Code to reproduce the issue

Provide a reproducible test case that is the bare minimum necessary to generate the problem.

Other info / logs

Include any logs or source code that would be helpful to diagnose the problem. If including tracebacks, please include the full traceback. Large logs and files should be attached.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions