Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
p-e-w
on Jan 6, 2024
|
parent
|
context
|
favorite
| on:
Nitro: A fast, lightweight inference server with O...
Because if the client specifically requests GPT-3.5, but is silently being served something else instead, the client will rely on having GPT-3.5 capabilities without them actually being available, which is a recipe for breakage.
brigadier132
on Jan 6, 2024
[–]
You do understand that the client will be written by the same people setting up the inference server?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: