[Question]: When parsing a docx file using the Book parsing method, to_page is always -1, resulting in a block count of 0 even if parsing is successful
#3230
Closed
kuschzzp opened this issue
Nov 6, 2024
· 2 comments
…page is always -1, resulting in a block count of 0 even if parsing is successful (#3249)
### What problem does this PR solve?
When parsing a docx file using the Book parsing method, to_page is
always -1, resulting in a block count of 0 even if parsing is successful
Fix:#3230
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
Describe your problem
When a docx file is parsed with the Book method, to_page is always -1.
I don't know which code should be changed to make to_page correct.
ragflow/deepdoc/parser/docx_parser.py
Lines 113 to 136 in af74bf0
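For illustration, here is a minimal sketch of why a to_page of -1 can produce zero blocks. It assumes the parser keeps a paragraph only when its page index satisfies `from_page <= page < to_page`; the names `select_paragraphs`, `normalize_to_page`, and `NO_LIMIT` are hypothetical and are not taken from ragflow. Treating a negative to_page as "no upper limit" is one possible workaround, not necessarily the fix that was merged.

```python
from typing import List, Tuple

NO_LIMIT = 100_000  # hypothetical "parse everything" upper bound


def select_paragraphs(
    paragraphs: List[Tuple[int, str]],
    from_page: int = 0,
    to_page: int = NO_LIMIT,
) -> List[str]:
    """Keep non-empty paragraph texts whose page index is in [from_page, to_page)."""
    return [
        text
        for page, text in paragraphs
        if from_page <= page < to_page and text.strip()
    ]


def normalize_to_page(to_page: int) -> int:
    """Treat a negative to_page as 'no upper limit' (an assumed convention)."""
    return NO_LIMIT if to_page < 0 else to_page


# (page index, paragraph text) pairs standing in for a parsed docx file
paragraphs = [(0, "Chapter 1"), (0, "Some text."), (1, "Chapter 2")]

# With to_page = -1 no page index can satisfy 0 <= page < -1, so nothing is kept.
print(select_paragraphs(paragraphs, 0, -1))

# After normalizing, every paragraph falls inside the range.
print(select_paragraphs(paragraphs, 0, normalize_to_page(-1)))
```

If the range check really has this shape, parsing can finish without raising any error and still return no content, which would match the symptom of a successful parse with a block count of 0.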
To fix a separate error message, I changed the line below
ragflow/deepdoc/parser/docx_parser.py
Line 133 in af74bf0
to
secs.append(("".join(runs_within_single_paragraph), p.style.name if hasattr(p.style, 'name') else ''))
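The same guard can be written with `getattr`, which falls back to an empty string when the style object is missing or has no name. This is only a sketch using stand-in objects built with `SimpleNamespace` rather than real python-docx paragraphs, and `style_name` is a hypothetical helper name.

```python
from types import SimpleNamespace


def style_name(paragraph) -> str:
    """Return the paragraph's style name, or '' if the style or its name is missing."""
    style = getattr(paragraph, "style", None)
    return getattr(style, "name", "") or ""


# Stand-ins for paragraphs: one with a named style, one with no style at all.
styled = SimpleNamespace(style=SimpleNamespace(name="Heading 1"))
unstyled = SimpleNamespace(style=None)

print(style_name(styled))
print(repr(style_name(unstyled)))
```

With a helper like this, the append call would pass `style_name(p)` as the second tuple element instead of repeating the inline `hasattr` check.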
I found that the default value is set here:
ragflow/api/db/db_models.py
Line 843 in af74bf0