Not so good by estimating high confidence for random text/gibberish #113
Unanswered
GabrielKesler
asked this question in
Q&A
Replies: 1 comment 2 replies
-
Hi @GabrielKesler, thanks for your question. Have you read the documentation about the confidence metric? It is a relative metric, i.e. the most likely language always gets the value What is the point of feeding the language detector with gibberish text anyway? This is a very contrived example. I don't think that the texts you want to classify are of this sort. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I have this code snippet:
Resulting in this:
Seems that this library is giving very high confidence values for gibberish/random words, which is unacceptable.
Any suggestions ?
Beta Was this translation helpful? Give feedback.
All reactions