Replies: 5 comments, 1 reply
-
I found a similar resolved issue that might be relevant to your situation. The issue discussed how to handle text block segmentation when special characters are used as delimiters in RAGFlow. The suggested solution was to:
You might want to try these steps to see if they resolve the issue with the block identifier not working as expected in the general mode.
-
@chaijl Thanks for the report. I will double-check and get back to you.
-
A delimiter (separator) can consist of one or more special characters. If it consists of multiple characters, make sure they are enclosed in backticks (``).
-
After testing multiple times, I found the problem, though I don't know whether it is a bug in the system. As long as my text contains a character that also appears in the delimiter I entered between single quotes (''), the delimiter does not take effect, and that character is also stripped from the normal text. In my case, the text contains the symbol #, and my delimiter is 4 # characters. Thank you all for your replies.
-
Note: These should be backticks (`), not single quotes (').
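To illustrate why the quoting matters, here is a minimal Python sketch. It is not RAGFlow's actual code: `parse_delimiters` and `split_text` are hypothetical helpers, written under the assumption that a backtick-wrapped run is treated as one multi-character delimiter while every other character in the setting is treated as a separate single-character delimiter. Under that assumption, the behavior reported above (a lone # being removed from normal text) is what you would expect when single quotes are used.

```python
import re


def parse_delimiters(spec: str) -> list[str]:
    """Hypothetical parser for a delimiter setting.

    Assumption: text wrapped in backticks is ONE delimiter; every other
    character in the setting is its own single-character delimiter.
    """
    delimiters: list[str] = []
    pos = 0
    for match in re.finditer(r"`([^`]+)`", spec):
        delimiters.extend(spec[pos:match.start()])  # loose characters, one by one
        delimiters.append(match.group(1))           # the backticked run, as a whole
        pos = match.end()
    delimiters.extend(spec[pos:])                   # loose characters after the last run
    # Deduplicate; longest first so multi-character delimiters are tried first.
    return sorted(set(delimiters), key=lambda d: (-len(d), d))


def split_text(text: str, spec: str) -> list[str]:
    """Split text on the parsed delimiters, dropping empty pieces."""
    delimiters = parse_delimiters(spec)
    if not delimiters:
        return [text]
    pattern = "|".join(re.escape(d) for d in delimiters)
    return [part.strip() for part in re.split(pattern, text) if part.strip()]


text = "Price list #1\n####\nSecond paragraph"

# Backticks: '####' is one delimiter, so the lone '#' in the text survives.
print(split_text(text, "`####`"))  # ['Price list #1', 'Second paragraph']

# Single quotes: "'" and "#" become separate delimiters, so every '#' splits
# the text and is removed from it.
print(split_text(text, "'####'"))  # ['Price list', '1', 'Second paragraph']
```

With the single-quoted setting, the text is cut at every # rather than only at the run of four, which matches the symptom described in this thread.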
-
In the general mode of the slicing method, I uploaded a very simple txt file with 4 # characters between each paragraph, and I configured '#######' as the segment identifier in the block settings. I tried text block sizes from 500 to 1000, but the text is not segmented at this identifier. In addition, any # character in the text is deleted by default after document parsing. My configuration screenshots and test file are attached.

测试数据.txt