AWS SageMaker 推理实践:部署 Hugging Face 模型到 Serverless 端点 (2025 指南)
· 2 min read
准备
- 环境: AWS SageMarker Notebook (ml.t3.medium)
- 服务: AWS SageMarker Inference(Models, Endpoint configurations, Endpoints)
- Models Repo: huggingface models (other like S3 bucket, fine-tuning models)
Abbreviation
- tgi: Text Generate Interface