You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Below thing is working: client = InferenceClient(
provider="sambanova",
api_key="my_token",
)
completion = client.chat.completions.create(
model="meta-llama/Llama-3.1-8B-Instruct",
messages=[
{
"role": "user",
"content": "What is the capital of France?"
}
],
max_tokens=512,
)
Describe the bug
client.text_generation() throws 503 error every time I use it, however, chat.completion works without any issue
To Reproduce
os.environ["HF_TOKEN"]="my_token"
client = InferenceClient("meta-llama/Llama-3.2-3B-Instruct")
output = client.text_generation(
"The capital of France is",
max_new_tokens=100,
)
print(output)
The text was updated successfully, but these errors were encountered: