Deploy a Model¶

You can deploy models from the Model Store or Model Services. Below are the details for configuring the deployment:

Select the Model
Choose the model to deploy (e.g., Qwen2-0.5B-Instruct), ensuring it aligns with your business needs and task requirements.
Model Service Name Requirements
The name of the model service must meet the following criteria:
- Length: 2–64 characters
- Allowed characters: Lowercase letters, numbers, and hyphens (-). The name must start and end with a lowercase letter or number.
- Example: text-gen-service or model-01
Region

Select the region for service deployment (e.g., "Shanghai Area 2"). Choose a region based on your business coverage and latency needs.

Number of Instances
- Configure the number of instances to deploy. Default value: 1.
- More instances enhance service concurrency, but they will also increase costs.
Billing Method

d.run offers two billing options:
1. Pay-as-you-go:
  - Real-time billing based on usage, ideal for short-term or variable-demand users.
  - Cost formula: Number of instances × hourly rate.
  - Example: 1 instance × 3 Yuan/hour = 3 Yuan/hour.
2. Annual or Monthly Subscription (currently not supported):
  - Prepay for service instances at a discounted rate, suitable for long-term users.
  - Upon selecting this option, the system will display the corresponding annual or monthly fee.
View Configuration Costs

At the bottom of the page, the cost calculation formula and estimated costs will be displayed automatically.
- Configuration cost: 3 Yuan/hour
- Calculation formula: 1 instance × 3 Yuan/hour = 3 Yuan/hour