[HANDS-ON BUG] #441

Open
ss1034 opened this issue Apr 21, 2025 · 2 comments

Comments


ss1034 commented Apr 21, 2025

Describe the bug
`client.text_generation()` throws a 503 error every time I use it; however, `client.chat.completions.create()` works without any issue.

To Reproduce

```python
import os
from huggingface_hub import InferenceClient

os.environ["HF_TOKEN"] = "my_token"

client = InferenceClient("meta-llama/Llama-3.2-3B-Instruct")

output = client.text_generation(
    "The capital of France is",
    max_new_tokens=100,
)

print(output)
```


ss1034 commented Apr 21, 2025

The following works, though:

```python
client = InferenceClient(
    provider="sambanova",
    api_key="my_token",
)

completion = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
    max_tokens=512,
)

print(completion.choices[0].message.content)
```


Oorgien commented Apr 24, 2025

```
TypeError: InferenceClient.__init__() got an unexpected keyword argument 'provider'
```

Are you sure that works?
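
That `TypeError` usually points to an older `huggingface_hub` being installed: the `provider` argument was only added to `InferenceClient` in a relatively recent release (that cutoff is an assumption on my part; the release notes would confirm it). A quick way to check the installed version without importing the library itself:

```python
# Report the installed huggingface_hub version, if any. If it predates the
# release that introduced InferenceClient's `provider` argument (an assumption
# here -- see the huggingface_hub release notes), passing provider=... will
# raise the TypeError shown above.
from importlib.metadata import PackageNotFoundError, version

try:
    print("huggingface_hub", version("huggingface_hub"))
except PackageNotFoundError:
    print("huggingface_hub is not installed")
```

If the version is old, `pip install -U huggingface_hub` should make the `provider` example runnable.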
