-
-
Notifications
You must be signed in to change notification settings - Fork 11.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] 分块失败 #4975
Comments
Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible. |
@wangxiaodong1021 你好!我是Dosu,一个可以帮助你解决 bug、回答问题,甚至成为贡献者的机器人。我会在这里帮助你,等待人类维护者的到来。期待与你一起解决问题!🤖 The error message "invalid byte sequence for encoding 'UTF8': 0x00" suggests that there might be an issue with the character encoding of the PDF file when attempting to process it. This could be due to non-UTF8 characters or null bytes in the file. Here are a few steps you can take to troubleshoot and potentially resolve this issue:
If the issue persists, you might need to delve deeper into the specific PDF processing logic in the To continue talking to Dosu, mention @dosu. Help Dosu learn! Give it feedback: Great Response | Irrelevant Answer | Incorrect Sources | Too Verbose | Hallucination | Other |
附议,我也遇到了同样的问题同样的报错。Window11,Chrome浏览器 |
Seconded, I also encountered the same problem and the same error. Window11, Chrome browser |
📦 Environment
Zeabur
📌 Version
v1.36.6
💻 Operating System
Ubuntu
🌐 Browser
Edge
🐛 Bug Description
将一个网页保存为 pdf,分块失败,pdf 在知识库里可以正常打开查阅。且之前的别的 pdf 都正常,我将 pdfzuo'wei 作为附件提供了
Diffusion Meets Flow Matching.pdf
📷 Recurrence Steps
No response
🚦 Expected Behavior
No response
📝 Additional Information
No response
The text was updated successfully, but these errors were encountered: