-
-
Notifications
You must be signed in to change notification settings - Fork 596
Fuzzy search return wrong results #3355
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I can't reproduce your second issue (empty results with
Could you please share your table files or adjust this example so the issue shows up? We'll look into the first issue. |
@MohammedOmar123 what does this return for you?
Do you mean |
@donhardman another interesting case (when
|
Hello @sanikolaev Thanks for reviewing the issue, Yes I used manticoresearch/manticore:latest |
You can send us the physical tables files via our write-only S3 storage - https://manual.manticoresearch.com/Reporting_bugs#Uploading-your-data |
I can't provide the entire table, but I can share the table settings with you. Just let me know which files you need. |
Hi @MohammedOmar123, and thanks for reporting the issue! We eliminated it and merged to master. Please give a try on the fresh If you notice any incorrect behavior that you think is an issue, please let me know, and we can take a look into it. @PavelShilin89, let's add tests for this case to make sure it works, you can use @sanikolaev MRE. |
Also, there is a
You can try to play with it and see the differences on your own. |
Thank you @donhardman , This "preserve" option resolved the issue as I see. For cybersecurity, yes it is presented in many rows in my dataset as you see in the this query |
Hello @donhardman , I am facing another issue, could you please check if I am doing something wrong, I tried dev version and the latest version (manticoresearch/manticore:latest or manticoresearch/manticore:dev )
|
@MohammedOmar123 please try disabling
This can alleviate the issue and you should consider control the layouts in general, but still the result seems to be not perfect. Perhaps there's a bug:
@donhardman please look into this. |
I found one issue and already fixed it, but anyway, what I found will not fix the root core of the problem. I still recommend for now to disable layouts and also use the The core issue is lying deep in the core methods we use.
So when we are preparing fuzzy, the flow goes through multiple stages, and one of them is trying to use suggest to guess. As I found, it looks like it does not return anything. I will continue to investigate further. But in the meantime, |
Uh oh!
There was an error while loading. Please reload this page.
Bug Description:
I tried the following query with the following options, I expect that all results to be have both searched terms, but as you see I got very different results
And when I increased the distance to 2 like this
SELECT title FROM occupations WHERE MATCH('cybersecurity advisor') OPTION fuzzy=1, distance=2;
I didn't get any results
My index config
id | bigint | |
| title | text | indexed stored |
| title_str | string | |
| occupation_id | bigint | |
| type | string | |
| language | string | |
| vector | float_vector | knn |
| title_str_attr | string | |
+----------------+--------------+-----------
Variable_name | Value |
+---------------+----------------------------------------------------------------------------+
| settings | min_infix_len = 2
index_exact_words = 1
morphology = lemmatize_en, stem_ar |
I use RT index.
Manticore Search Version:
I pulled the latest version using docker
Operating System Version:
I used Docker
Have you tried the latest development version?
Yes
Internal Checklist:
To be completed by the assignee. Check off tasks that have been completed or are not applicable.
The text was updated successfully, but these errors were encountered: