How to recognize the soft return and hard return? #971
-
Hi @fitexmage, and thank you for your interest in […] However, it is possible that, for your specific PDF, there is enough information to distinguish between these different transitions between lines. The first thing I'd suggest is examining the results of […]
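To make the idea of "enough information to distinguish between transitions" concrete, here is a minimal sketch of one possible heuristic, not a feature of any particular library. It assumes you have already obtained the lines in reading order as dicts with a `text` key and an `x1` key (the right edge of the line); that structure, the function name `merge_soft_returns`, and the `tolerance` and `joiner` parameters are all hypothetical. The assumption is that a line reaching the right margin was wrapped (soft return), while a line ending well short of it was ended deliberately (hard return). That may not hold for every PDF.

```python
def merge_soft_returns(lines, tolerance=5.0, joiner=""):
    """Join lines that look wrapped; keep a newline after lines that end early.

    `lines` is an assumed structure: a list of dicts in reading order,
    each with "text" (str) and "x1" (right edge, in page units).
    """
    if not lines:
        return ""

    # Treat the right-most edge seen among the lines as the right margin.
    right_margin = max(line["x1"] for line in lines)

    parts = []
    for index, line in enumerate(lines):
        parts.append(line["text"])
        if index == len(lines) - 1:
            break  # nothing to join after the final line
        reaches_margin = right_margin - line["x1"] <= tolerance
        # Full-width line: assume a soft return and join with `joiner`.
        # Short line: assume a hard return and keep the line break.
        parts.append(joiner if reaches_margin else "\n")
    return "".join(parts)


# Hypothetical sample data: a short heading followed by a wrapped paragraph.
sample = [
    {"text": "Extra parameters:", "x1": 120.0},
    {"text": "-b : extra_post_body, request content that wraps", "x1": 540.0},
    {"text": "onto a second line.", "x1": 200.0},
]
print(merge_soft_returns(sample, joiner=" "))
```

For Chinese text the default `joiner=""` is probably what you want; for space-delimited languages pass `joiner=" "`. If the right margin varies across the page (columns, indented blocks), this heuristic would need to be applied per block rather than per page.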
-
I would like to join lines that are separated by soft returns but keep the hard returns.
For example, in this PDF, I want the extracted text to be:
额外参数:\n-b : extra_post_body 额外增加的请求内容,如-b "isp:001#002",抓包会用到,平时不会用到,如果value值为数组值,则用 # 进行分隔元素,也可以使用 # 强制生成数组类型如:001#
This keeps the line break between lines 1 and 2 but drops the line break between lines 2 and 3.
Is there any way to handle this?