Merged
10 changes: 7 additions & 3 deletions README.md
@@ -39,7 +39,8 @@ For detailed customization examples and best practices, refer to the [Neuron DLC

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
|-----------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------|--------------------|------------------------------|------------------------|--------------------------------------------------------------------------------------------|-------------------|
-| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/pytorch/inference/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.9.0-neuronx-py312-sdk2.28.0-ubuntu24.04 | torchserve 0.11.0 |
+| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.29.0/pytorch/inference/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, nki | Neuron 2.29.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.9.0-neuronx-py312-sdk2.29.0-ubuntu24.04 | torchserve 0.11.0 |
+| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/pytorch/inference/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, nki | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.9.0-neuronx-py312-sdk2.28.0-ubuntu24.04 | torchserve 0.11.0 |
| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.27.1/pytorch/inference/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx | Neuron 2.27.1 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.9.0-neuronx-py312-sdk2.27.1-ubuntu24.04 | torchserve 0.11.0 |
| [PyTorch 2.8.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.26.1/pytorch/inference/2.8.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx | Neuron 2.26.1 | trn1,trn2,inf2 | 3.11 (py311) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.8.0-neuronx-py311-sdk2.26.1-ubuntu22.04 | torchserve 0.11.0 |
| [PyTorch 2.7.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.25.0/docker/pytorch/inference/2.7.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, transformers-neuronx | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.7.0-neuronx-py310-sdk2.25.0-ubuntu22.04 | torchserve 0.11.0 |
@@ -52,7 +53,8 @@ For detailed customization examples and best practices, refer to the [Neuron DLC

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL |
|----------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|--------------------|------------------------------|------------------------|-------------------------------------------------------------------------------------------|
-| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/pytorch/training/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-training-neuronx:2.9.0-neuronx-py312-sdk2.28.0-ubuntu24.04 |
+| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.29.0/pytorch/training/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx, nki | Neuron 2.29.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-training-neuronx:2.9.0-neuronx-py312-sdk2.29.0-ubuntu24.04 |
+| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/pytorch/training/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx, nki | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-training-neuronx:2.9.0-neuronx-py312-sdk2.28.0-ubuntu24.04 |
| [PyTorch 2.9.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.27.1/pytorch/training/2.9.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.27.1 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-training-neuronx:2.9.0-neuronx-py312-sdk2.27.1-ubuntu24.04 |
| [PyTorch 2.8.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.26.1/pytorch/training/2.8.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.26.1 | trn1,trn2,inf2 | 3.11 (py311) | public.ecr.aws/neuron/pytorch-training-neuronx:2.8.0-neuronx-py311-sdk2.26.1-ubuntu22.04 |
| [PyTorch 2.7.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.25.0/docker/pytorch/training/2.7.0/Dockerfile.neuronx) | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.7.0-neuronx-py310-sdk2.25.0-ubuntu22.04 |
@@ -65,7 +67,8 @@ For detailed customization examples and best practices, refer to the [Neuron DLC

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
|----------------------------------------------------------------------------------------------------------------------------------------|---------------------------------|--------------------|------------------------------|------------------------|------------------------------------------------------------------------------------------|-------------------|
-| [JAX 0.7](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/jax/training/0.7/Dockerfile.neuronx) | jax-neuronx, libneuronxla | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/jax-training-neuronx:0.7-neuronx-py312-sdk2.28.0-ubuntu24.04 | jaxlib 0.7 |
+| [JAX 0.7](https://github.com/aws-neuron/deep-learning-containers/blob/2.29.0/jax/training/0.7/Dockerfile.neuronx) | jax-neuronx, libneuronxla, nki | Neuron 2.29.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/jax-training-neuronx:0.7-neuronx-py312-sdk2.29.0-ubuntu24.04 | jaxlib 0.7 |
+| [JAX 0.7](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/jax/training/0.7/Dockerfile.neuronx) | jax-neuronx, libneuronxla, nki | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/jax-training-neuronx:0.7-neuronx-py312-sdk2.28.0-ubuntu24.04 | jaxlib 0.7 |
| [JAX 0.7](https://github.com/aws-neuron/deep-learning-containers/blob/2.27.1/jax/training/0.7/Dockerfile.neuronx) | jax-neuronx, libneuronxla | Neuron 2.27.1 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/jax-training-neuronx:0.7-neuronx-py312-sdk2.27.1-ubuntu24.04 | jaxlib 0.7 |
| [JAX 0.6](https://github.com/aws-neuron/deep-learning-containers/blob/2.26.1/jax/training/0.6/Dockerfile.neuronx) | jax-neuronx, libneuronxla | Neuron 2.26.1 | trn1,trn2,inf2 | 3.11 (py311) | public.ecr.aws/neuron/jax-training-neuronx:0.6-neuronx-py311-sdk2.26.1-ubuntu22.04 | jaxlib 0.6 |
| [JAX 0.6](https://github.com/aws-neuron/deep-learning-containers/blob/2.25.0/docker/jax/training/0.6/Dockerfile.neuronx) | jax-neuronx, libneuronxla | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/jax-training-neuronx:0.6-neuronx-py310-sdk2.25.0-ubuntu22.04 | jaxlib 0.6 |
@@ -75,6 +78,7 @@ For detailed customization examples and best practices, refer to the [Neuron DLC

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL |
|----------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|--------------------|------------------------------|------------------------|-------------------------------------------------------------------------------------------|
+| [VLLM 0.16.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.29.0/vllm/inference/0.16.0/Dockerfile.neuronx) | vllm-neuronx, libneuronxla, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, aws-neuronx-tools, nki | Neuron 2.29.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-vllm-neuronx:0.16.0-neuronx-py312-sdk2.29.0-ubuntu24.04 |
| [VLLM 0.13.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.28.0/vllm/inference/0.13.0/Dockerfile.neuronx) | vllm-neuronx, libneuronxla, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, aws-neuronx-tools, nki | Neuron 2.28.0 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-vllm-neuronx:0.13.0-neuronx-py312-sdk2.28.0-ubuntu24.04 |
| [VLLM 0.13.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.27.1/vllm/inference/0.13.0/Dockerfile.neuronx) | vllm-neuronx, libneuronxla, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, aws-neuronx-tools, nki | Neuron 2.27.1 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-vllm-neuronx:0.13.0-neuronx-py312-sdk2.27.1-ubuntu24.04 |
| [VLLM 0.11.0](https://github.com/aws-neuron/deep-learning-containers/blob/2.27.1/vllm/inference/0.11.0/Dockerfile.neuronx) | vllm-neuronx, libneuronxla, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, aws-neuronx-tools | Neuron 2.27.1 | trn1,trn2,trn3,inf2 | 3.12 (py312) | public.ecr.aws/neuron/pytorch-inference-vllm-neuronx:0.11.0-neuronx-py312-sdk2.27.1-ubuntu24.04 |
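
The ECR Public URLs in the README tables above appear to follow one pattern: repository name, framework version, `neuronx`, Python tag, `sdk` plus the Neuron SDK version, and the Ubuntu tag. The sketch below assembles a URI from those pieces so the pattern is easier to check against new rows. `neuron_image_uri` is a hypothetical helper introduced for illustration only; it is not part of the repository, and the pattern is inferred from the rows shown here rather than from a published naming rule.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not in the repository): build an image URI from the
# README table columns. The tag layout is inferred from the rows in the diff.
neuron_image_uri() {
  local repo="$1" fw_version="$2" py_tag="$3" sdk_version="$4" os_tag="$5"
  printf 'public.ecr.aws/neuron/%s:%s-neuronx-%s-sdk%s-%s\n' \
    "$repo" "$fw_version" "$py_tag" "$sdk_version" "$os_tag"
}

# Reproduces the URI of the new vLLM 0.16.0 row.
neuron_image_uri pytorch-inference-vllm-neuronx 0.16.0 py312 2.29.0 ubuntu24.04
```

The resulting string could then be passed to `docker pull`, assuming the tag has actually been published.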
27 changes: 19 additions & 8 deletions jax/training/0.7/Dockerfile.neuronx
@@ -20,7 +20,7 @@
ENV PYTHONIOENCODING=UTF-8
ENV LANG=C.UTF-8
ENV LC_ALL=C.UTF-8
ENV LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/opt/aws/neuron/lib"

Check warning on line 23 in jax/training/0.7/Dockerfile.neuronx (GitHub Actions / build (0.7, jax/training/0.7)): Variables should be defined before their use. UndefinedVar: Usage of undefined variable '$LD_LIBRARY_PATH'. More info: https://docs.docker.com/go/dockerfile/rule/undefined-var/

ENV LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/opt/amazon/efa/lib"
ENV LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/opt/amazon/efa/lib64"
ENV LD_LIBRARY_PATH="${LD_LIBRARY_PATH}:/opt/amazon/openmpi/lib64"
@@ -102,13 +102,18 @@
&& ln -s /usr/local/bin/pip3 /usr/bin/pip \
&& ln -s /usr/local/bin/$PYTHON /usr/local/bin/python \
&& ${PIP} --no-cache-dir install --upgrade \
-"awscli<2" \
pip \
requests \
setuptools \
uv \
&& rm -rf ~/.cache/pip/*

+# Install AWS CLI
+RUN curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip" \
+&& unzip awscliv2.zip \
+&& ./aws/install \
+&& rm -rf awscliv2.zip aws
+
# U24 will not allow installation of pip packages outside of venv without this flag
# This is because U24 ships with Python 3.12 by default and installation into the Python
# interpreter’s directory are disabled outside of a virtual environment.
@@ -157,22 +162,22 @@

# Setting up APT and PIP repo for neuron artifacts
ARG NEURON_APT_REPO=apt.repos.neuron.amazonaws.com
ARG NEURON_APT_REPO_KEY

Check warning on line 165 in jax/training/0.7/Dockerfile.neuronx (GitHub Actions / build (0.7, jax/training/0.7)): Sensitive data should not be used in the ARG or ENV commands. SecretsUsedInArgOrEnv: Do not use ARG or ENV instructions for sensitive data (ARG "NEURON_APT_REPO_KEY"). More info: https://docs.docker.com/go/dockerfile/rule/secrets-used-in-arg-or-env/

ARG NEURON_PIP_REPO=pip.repos.neuron.amazonaws.com
ARG NEURON_PIP_REPO_KEY

Check warning on line 167 in jax/training/0.7/Dockerfile.neuronx (GitHub Actions / build (0.7, jax/training/0.7)): Sensitive data should not be used in the ARG or ENV commands. SecretsUsedInArgOrEnv: Do not use ARG or ENV instructions for sensitive data (ARG "NEURON_PIP_REPO_KEY"). More info: https://docs.docker.com/go/dockerfile/rule/secrets-used-in-arg-or-env/

RUN mkdir -p /etc/apt/keyrings \
&& APT_REPO_PREFIX=$([ -n "${NEURON_APT_REPO_KEY}" ] && echo "${NEURON_APT_REPO_KEY}@" || echo "") \
&& echo "deb [signed-by=/etc/apt/keyrings/neuron.gpg] https://${APT_REPO_PREFIX}${NEURON_APT_REPO} jammy main" > /etc/apt/sources.list.d/neuron.list \
&& curl $([ -n "${NEURON_APT_REPO_KEY}" ] && echo "-u ${NEURON_APT_REPO_KEY}") --retry 3 --retry-delay 1 --retry-all-errors -fSL "https://${NEURON_APT_REPO}/GPG-PUB-KEY-AMAZON-AWS-NEURON.PUB" | gpg --dearmor > /etc/apt/keyrings/neuron.gpg

# Neuron SDK components version numbers
-ARG NEURONX_RUNTIME_LIB_VERSION=2.30.51.0-faafe26f0
-ARG NEURONX_COLLECTIVES_LIB_VERSION=2.30.59.0-f5cdefb39
-ARG NEURONX_TOOLS_VERSION=2.28.23.0-f1c114a9d
+ARG NEURONX_RUNTIME_LIB_VERSION=2.31.24.0-0b044f4ce
+ARG NEURONX_COLLECTIVES_LIB_VERSION=2.31.24.0-1a31ba186
+ARG NEURONX_TOOLS_VERSION=2.29.18.0-d5fe7ba42

-ARG NEURONX_CC_VERSION=2.23.6484.0+3b612583
-ARG NEURONX_JAX_TRAINING_VERSION=0.7.0.1.0.7584+5c8f4c3c
-ARG NKI_VERSION=0.2.0+g82fdb402
+ARG NEURONX_CC_VERSION=2.24.5133.0+58f8de22
+ARG NEURONX_JAX_TRAINING_VERSION=0.7.0.1.0.8181+1e892be0
+ARG NKI_VERSION=0.3.0+23928721754.g18aa1271

FROM base AS repo

@@ -191,7 +196,7 @@
--index-url ${PIP_REPO_URL} \
--extra-index-url ${PYPI_SIMPLE_URL} \
--trusted-host ${NEURON_PIP_REPO} \
-"neuronx-cc>=2.0" \
+neuronx-cc \
jax-neuronx \
nki \
&& rm -rf ~/.cache/pip/*
@@ -222,6 +227,12 @@

FROM ${BUILD_STAGE} AS final

+# Upgrade OS packages to latest versions
+RUN apt-get update \
+&& apt-get upgrade -y \
+&& rm -rf /var/lib/apt/lists/* \
+&& apt-get clean
+
# Starts framework
ENTRYPOINT ["bash", "-m", "start_with_right_hostname.sh"]
CMD ["/bin/bash"]
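
The repo-setup step in this Dockerfile builds an optional credential prefix with `$([ -n "${NEURON_APT_REPO_KEY}" ] && echo "${NEURON_APT_REPO_KEY}@" || echo "")`. The shell sketch below isolates that idiom so both branches are easy to see. `apt_repo_url` is a hypothetical helper that does not exist in the repository, and `user:token` is a placeholder value, not a real credential format taken from the source.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not in the repository): mirrors the APT_REPO_PREFIX
# logic from the Dockerfile's repo-setup RUN step.
apt_repo_url() {
  local key="$1" repo="$2" prefix
  # Non-empty key -> "key@"; empty key -> no prefix at all.
  prefix=$([ -n "${key}" ] && echo "${key}@" || echo "")
  echo "https://${prefix}${repo}"
}

# Without a key the public host is used unchanged.
apt_repo_url "" apt.repos.neuron.amazonaws.com
# With a key (placeholder value) the credentials are embedded in the URL.
apt_repo_url "user:token" apt.repos.neuron.amazonaws.com
```

Because the key ends up in the URL written to `neuron.list`, this is presumably part of what the `SecretsUsedInArgOrEnv` check flags for the `ARG` declarations.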
39 changes: 25 additions & 14 deletions pytorch/inference/2.9.0/Dockerfile.neuronx
@@ -16,7 +16,7 @@

# See http://bugs.python.org/issue19846
ENV LANG=C.UTF-8
ENV LD_LIBRARY_PATH=/opt/aws/neuron/lib:/lib/x86_64-linux-gnu:/opt/conda/lib/:$LD_LIBRARY_PATH

Check warning on line 19 in pytorch/inference/2.9.0/Dockerfile.neuronx (GitHub Actions / build (2.9.0, pytorch/inference/2.9.0)): Variables should be defined before their use. UndefinedVar: Usage of undefined variable '$LD_LIBRARY_PATH'. More info: https://docs.docker.com/go/dockerfile/rule/undefined-var/

ENV PATH=/opt/conda/bin:/opt/aws/neuron/bin:$PATH

RUN apt-get update \
@@ -117,17 +117,22 @@
&& rm -rf /tmp/tmp* \
&& apt-get clean

+# Install AWS CLI
+RUN curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip" \
+&& unzip awscliv2.zip \
+&& ./aws/install \
+&& rm -rf awscliv2.zip aws
+
RUN ${PIP} install --upgrade pip --trusted-host pypi.org --trusted-host files.pythonhosted.org \
&& ${PIP} install --no-cache-dir -U \
-"opencv-python>=4.8.1.78" \
-"scipy>=1.8.0" \
+opencv-python \
+scipy \
six \
-"awscli<2" \
pandas \
boto3 \
uv \
cryptography \
-"protobuf>=3.18.3,<4" \
+protobuf \
torchserve==${TORCHSERVE_VERSION} \
torch-model-archiver==${TORCHSERVE_VERSION} \
&& rm -rf ~/.cache/pip/*
@@ -167,24 +172,24 @@

# Setting up APT and PIP repo for neuron artifacts
ARG NEURON_APT_REPO=apt.repos.neuron.amazonaws.com
ARG NEURON_APT_REPO_KEY

Check warning on line 175 in pytorch/inference/2.9.0/Dockerfile.neuronx (GitHub Actions / build (2.9.0, pytorch/inference/2.9.0)): Sensitive data should not be used in the ARG or ENV commands. SecretsUsedInArgOrEnv: Do not use ARG or ENV instructions for sensitive data (ARG "NEURON_APT_REPO_KEY"). More info: https://docs.docker.com/go/dockerfile/rule/secrets-used-in-arg-or-env/

ARG NEURON_PIP_REPO=pip.repos.neuron.amazonaws.com
ARG NEURON_PIP_REPO_KEY

Check warning on line 177 in pytorch/inference/2.9.0/Dockerfile.neuronx (GitHub Actions / build (2.9.0, pytorch/inference/2.9.0)): Sensitive data should not be used in the ARG or ENV commands. SecretsUsedInArgOrEnv: Do not use ARG or ENV instructions for sensitive data (ARG "NEURON_PIP_REPO_KEY"). More info: https://docs.docker.com/go/dockerfile/rule/secrets-used-in-arg-or-env/

RUN mkdir -p /etc/apt/keyrings \
&& APT_REPO_PREFIX=$([ -n "${NEURON_APT_REPO_KEY}" ] && echo "${NEURON_APT_REPO_KEY}@" || echo "") \
&& echo "deb [signed-by=/etc/apt/keyrings/neuron.gpg] https://${APT_REPO_PREFIX}${NEURON_APT_REPO} jammy main" > /etc/apt/sources.list.d/neuron.list \
&& curl $([ -n "${NEURON_APT_REPO_KEY}" ] && echo "-u ${NEURON_APT_REPO_KEY}") --retry 3 --retry-delay 1 --retry-all-errors -fSL "https://${NEURON_APT_REPO}/GPG-PUB-KEY-AMAZON-AWS-NEURON.PUB" | gpg --dearmor > /etc/apt/keyrings/neuron.gpg

# Neuron SDK components version numbers
-ARG NEURONX_COLLECTIVES_LIB_VERSION=2.30.59.0-f5cdefb39
-ARG NEURONX_RUNTIME_LIB_VERSION=2.30.51.0-faafe26f0
-ARG NEURONX_TOOLS_VERSION=2.28.23.0-f1c114a9d
+ARG NEURONX_COLLECTIVES_LIB_VERSION=2.31.24.0-1a31ba186
+ARG NEURONX_RUNTIME_LIB_VERSION=2.31.24.0-0b044f4ce
+ARG NEURONX_TOOLS_VERSION=2.29.18.0-d5fe7ba42

-ARG NEURONX_CC_VERSION=2.23.6484.0+3b612583
-ARG NEURONX_FRAMEWORK_VERSION=2.9.0.2.12.22436+0f1dac25
-ARG NEURONX_DISTRIBUTED_VERSION=0.17.26814+4b18de63
-ARG NEURONX_DISTRIBUTED_INFERENCE_VERSION=0.8.16251+f3ca5575
-ARG NKI_VERSION=0.2.0+g82fdb402
+ARG NEURONX_CC_VERSION=2.24.5133.0+58f8de22
+ARG NEURONX_FRAMEWORK_VERSION=2.9.0.2.13.24727+8e870898
+ARG NEURONX_DISTRIBUTED_VERSION=0.18.27753+1cafd54f
+ARG NEURONX_DISTRIBUTED_INFERENCE_VERSION=0.9.17334+ced6ae4e
+ARG NKI_VERSION=0.3.0+23928721754.g18aa1271

FROM base AS repo

@@ -203,8 +208,8 @@
--index-url ${PIP_REPO_URL} \
--trusted-host ${NEURON_PIP_REPO} \
--extra-index-url ${PYPI_SIMPLE_URL} \
-"neuronx-cc>=2.0" \
-"torch-neuronx==2.9.*" \
+neuronx-cc \
+torch-neuronx \
neuronx_distributed \
neuronx_distributed_inference \
nki \
@@ -236,6 +241,12 @@

FROM ${BUILD_STAGE} AS final

+# Upgrade OS packages to latest versions
+RUN apt-get update \
+&& apt-get upgrade -y \
+&& rm -rf /var/lib/apt/lists/* \
+&& apt-get clean
+
EXPOSE 8080 8081

ENTRYPOINT ["python", "/usr/local/bin/dockerd-entrypoint.py"]
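
The `UndefinedVar` warning on line 19 of this Dockerfile concerns `$LD_LIBRARY_PATH` being referenced before it has a value, which leaves a trailing `:` in the resulting path. One possible way to avoid the empty entry is the `${VAR:+...}` parameter expansion, sketched below in plain shell. `prepend_path` is a hypothetical helper for illustration; the PR itself does not change this line, and this is not presented as the maintainers' intended fix.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not in the repository): prepend a directory to a
# colon-separated search path without leaving an empty entry when the
# current value is unset or empty.
prepend_path() {
  local extra="$1" current="${2:-}"
  # ${current:+:${current}} expands to ":<current>" only if current is non-empty.
  echo "${extra}${current:+:${current}}"
}

# Empty starting value: no trailing colon is produced.
prepend_path /opt/aws/neuron/lib ""
# Existing value: the separator is inserted as usual.
prepend_path /opt/aws/neuron/lib /usr/lib
```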