Skip to main content

Use Dedicated Endpoint with NIM

Precondition

  1. Retrieve API key
  2. Create docker registry secret
kubectl create secret docker-registry ngc-secret --docker-server=nvcr.io --docker-username='$oauthtoken' --docker-password=xxxx -n space
  1. Create NGC_API_KEY secret
kubectl apply -f- <<EOF
apiVersion: v1
kind: Secret
metadata:
name: nvidia-nim-secrets
namespace: space
stringData:
NGC_API_KEY: xxxx
type: Opaque
EOF
  1. Configuration for model inference, refer to docs

Playground

In Playground module, user can testing the model inference without coding. playground

API

Click API Tab in Playgound module. The platform show the code samples to call the dedicated instance. API

Log

Check the log of each replicate instance in log Tab. Log

Billing

Click Billing Tab to check the billing information of each instance. Billing