10.4 Model Selection and Cost Trade-offs
10.4.1 The Model Hierarchy
Tier | Model | Strengths | Cost | Latency
10.4.2 Static Routing
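def route_request(task_type, prompt):
    # call_sonnet, call_haiku, and call_opus are assumed to be thin wrappers
    # around the corresponding model APIs, defined elsewhere.
    if task_type == "coding":
        return call_sonnet(prompt)
    elif task_type == "summarization":
        return call_haiku(prompt)  # Haiku is cheap for reading long documents
    elif task_type == "creative_writing":
        return call_opus(prompt)
    # Fail loudly instead of silently returning None for an unknown task type.
    raise ValueError(f"unknown task_type: {task_type!r}")

10.4.3 Dynamic Routing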
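A minimal sketch of what dynamic routing might look like, assuming the router picks a tier at request time from properties of the prompt instead of a fixed task label. The heuristic (`estimate_complexity`), its keyword list, its length threshold, and the stub model functions are all hypothetical and only illustrate the idea; a production router could just as well use a small classifier model.

```python
from typing import Callable, Dict

# Hypothetical stand-ins for real model clients (assumptions, not a real API).
def call_haiku(prompt: str) -> str:
    return "[haiku] " + prompt[:40]

def call_sonnet(prompt: str) -> str:
    return "[sonnet] " + prompt[:40]

def call_opus(prompt: str) -> str:
    return "[opus] " + prompt[:40]

# Illustrative keywords that hint at multi-step reasoning (assumed list).
REASONING_HINTS = ("prove", "step by step", "architecture", "trade-off")

def estimate_complexity(prompt: str) -> int:
    """Crude score in {0, 1, 2}: +1 for a long prompt, +1 for a reasoning hint."""
    score = 0
    if len(prompt) > 2000:  # assumed threshold, in characters
        score += 1
    lowered = prompt.lower()
    if any(hint in lowered for hint in REASONING_HINTS):
        score += 1
    return score

def pick_model(prompt: str) -> str:
    """Map the complexity score to a tier name."""
    score = estimate_complexity(prompt)
    if score == 0:
        return "haiku"
    if score == 1:
        return "sonnet"
    return "opus"

MODELS: Dict[str, Callable[[str], str]] = {
    "haiku": call_haiku,
    "sonnet": call_sonnet,
    "opus": call_opus,
}

def route_dynamically(prompt: str) -> str:
    """Pick a tier from the prompt itself, then call that tier."""
    return MODELS[pick_model(prompt)](prompt)
```

Keeping `pick_model` separate from the call itself makes the routing decision easy to log and to unit-test without spending tokens.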
10.4.4 Cascading / Fallback
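One common reading of cascading is to try the cheapest tier first and escalate only when the answer fails a quality check or the call errors out. The sketch below assumes that reading; `cascade`, `is_acceptable`, and the stub tiers are hypothetical names introduced for illustration.

```python
from typing import Callable, List, Optional, Tuple

ModelFn = Callable[[str], str]

def cascade(
    prompt: str,
    tiers: List[Tuple[str, ModelFn]],
    is_acceptable: Callable[[str], bool],
) -> Tuple[str, str]:
    """Try tiers in order (cheapest first); return (tier_name, answer).

    A tier is skipped if it raises or if its answer fails is_acceptable.
    """
    last_error: Optional[Exception] = None
    for name, call in tiers:
        try:
            answer = call(prompt)
        except Exception as exc:  # e.g. timeout or rate limit
            last_error = exc
            continue
        if is_acceptable(answer):
            return name, answer
    raise RuntimeError("no tier produced an acceptable answer") from last_error

# Hypothetical stub tiers used only to exercise the control flow.
def haiku_stub(prompt: str) -> str:
    return ""  # simulates an empty, unacceptable answer

def sonnet_stub(prompt: str) -> str:
    raise TimeoutError("simulated timeout")

def opus_stub(prompt: str) -> str:
    return "a complete answer"

def non_empty(answer: str) -> bool:
    """Simplest possible quality gate: the answer is not blank."""
    return bool(answer.strip())

if __name__ == "__main__":
    tiers = [("haiku", haiku_stub), ("sonnet", sonnet_stub), ("opus", opus_stub)]
    print(cascade("What is the plan?", tiers, non_empty))
```

The quality gate decides how much the cascade actually saves: a gate that is too strict escalates nearly every request, so the cheap calls become pure overhead.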
10.4.5 A/B Testing and Evals
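A rough sketch of how an A/B comparison between two model configurations could be wired up, assuming each user is deterministically assigned to a variant and each response is later given an eval score and a cost. The function names, the record layout, and the 50/50 split are assumptions made for illustration.

```python
import hashlib
from statistics import mean
from typing import Dict, Iterable, List, Tuple

def assign_variant(user_id: str, treatment_share: float = 0.5) -> str:
    """Deterministically map a user to "A" (control) or "B" (treatment).

    Hashing the id keeps a user in the same variant across requests.
    """
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % 10_000  # 0..9999
    return "B" if bucket < treatment_share * 10_000 else "A"

# One record per evaluated response: (variant, eval_score, cost).
Record = Tuple[str, float, float]

def summarize(records: Iterable[Record]) -> Dict[str, Dict[str, float]]:
    """Aggregate sample count, mean eval score, and mean cost per variant."""
    by_variant: Dict[str, List[Tuple[float, float]]] = {}
    for variant, score, cost in records:
        by_variant.setdefault(variant, []).append((score, cost))
    return {
        variant: {
            "n": len(rows),
            "mean_score": mean(score for score, _ in rows),
            "mean_cost": mean(cost for _, cost in rows),
        }
        for variant, rows in by_variant.items()
    }
```

Comparing mean score next to mean cost per variant shows whether a cheaper tier gives up quality on the eval set. With small samples the difference should be checked for statistical significance before switching traffic.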
