You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I downloaded a few books but I see that often the sentence is splitted in few lines.
So I was wondering if it is a way to rebuild the original sentence in a way that there is a sentence with an uppercase letter and a dot (or a question dot etc).
Example:
Savio di terraferma alla scrittura e le magistrature
Le armi nel loro complesso, il governo ed
il riparto difensivo e territoriale.
to
Savio di terraferma alla scrittura e le magistrature
Le armi nel loro complesso, il governo ed il riparto difensivo e territoriale.
The text was updated successfully, but these errors were encountered:
I had forked this repo to change that along with one other thing I needed for my use case (making replacing deletions with "[deleted]" optional. I have just made a pull request with both changes. My fork is presently available here: github.com/coreybobco/gutenberg_cleaner
I saw the pr but I think that will generate a text like:
Savio di terraferma alla scrittura e le magistrature Le armi nel loro complesso, il governo ed il riparto difensivo e territoriale.
Instead sentences with an uppercase letter should stay in a specific line. I was trying to do an algorithm for that but I think that with a regex is possible to it.
I downloaded a few books but I see that often the sentence is splitted in few lines.
So I was wondering if it is a way to rebuild the original sentence in a way that there is a sentence with an uppercase letter and a dot (or a question dot etc).
Example:
to
The text was updated successfully, but these errors were encountered: