You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"""
Traceback (most recent call last):
...
pipe_result = (infer_result.pipe_ocr_mode(image_writer)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/model/operators.py", line 180, in pipe_ocr_mode
res = self.apply(
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/model/operators.py", line 72, in apply
return proc(copy.deepcopy(self._infer_res), *args, **kwargs)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/model/operators.py", line 173, in proc
res = pdf_parse_union(*args, **kwargs)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/pdf_parse_union_core_v2.py", line 820, in pdf_parse_union
para_split(pdf_info_dict)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/para/para_split_v3.py", line 378, in para_split
__para_merge_page(all_blocks)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/para/para_split_v3.py", line 355, in __para_merge_page
__merge_2_text_blocks(current_block, prev_block)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/para/para_split_v3.py", line 288, in __merge_2_text_blocks
and not last_span['content'].endswith(LINE_STOP_FLAG)
KeyError: 'content'
"""
Description of the bug | 错误描述
"""
Traceback (most recent call last):
...
pipe_result = (infer_result.pipe_ocr_mode(image_writer)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/model/operators.py", line 180, in pipe_ocr_mode
res = self.apply(
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/model/operators.py", line 72, in apply
return proc(copy.deepcopy(self._infer_res), *args, **kwargs)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/model/operators.py", line 173, in proc
res = pdf_parse_union(*args, **kwargs)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/pdf_parse_union_core_v2.py", line 820, in pdf_parse_union
para_split(pdf_info_dict)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/para/para_split_v3.py", line 378, in para_split
__para_merge_page(all_blocks)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/para/para_split_v3.py", line 355, in __para_merge_page
__merge_2_text_blocks(current_block, prev_block)
File "/home/.../conda/envs/MinerU2/lib/python3.10/site-packages/magic_pdf/para/para_split_v3.py", line 288, in __merge_2_text_blocks
and not last_span['content'].endswith(LINE_STOP_FLAG)
KeyError: 'content'
"""
How to reproduce the bug | 如何复现
Operating system | 操作系统
Linux
Python version | Python 版本
3.10
Software version | 软件版本 (magic-pdf --version)
0.10.x
Device mode | 设备模式
cuda
The text was updated successfully, but these errors were encountered: