Deploy a Model¶
You can deploy models from the Model Store or Model Services. Below are the details for configuring the deployment:
-
Select the Model
Choose the model to deploy (e.g., Qwen2-0.5B-Instruct), ensuring it aligns with your business needs and task requirements. -
Model Service Name Requirements
The name of the model service must meet the following criteria:- Length: 2–64 characters
- Allowed characters: Lowercase letters, numbers, and hyphens (-). The name must start and end with a lowercase letter or number.
- Example:
text-gen-service
ormodel-01
-
Region
Select the region for service deployment (e.g., "Shanghai Area 2"). Choose a region based on your business coverage and latency needs.
-
Number of Instances
- Configure the number of instances to deploy. Default value: 1.
- More instances enhance service concurrency, but they will also increase costs.
-
Billing Method
d.run offers two billing options:
-
Pay-as-you-go:
- Real-time billing based on usage, ideal for short-term or variable-demand users.
- Cost formula: Number of instances × hourly rate.
- Example: 1 instance × 3 Yuan/hour = 3 Yuan/hour.
-
Annual or Monthly Subscription (currently not supported):
- Prepay for service instances at a discounted rate, suitable for long-term users.
- Upon selecting this option, the system will display the corresponding annual or monthly fee.
-
-
View Configuration Costs
At the bottom of the page, the cost calculation formula and estimated costs will be displayed automatically.
- Configuration cost: 3 Yuan/hour
- Calculation formula: 1 instance × 3 Yuan/hour = 3 Yuan/hour