Microsoft Office Forums

  #1  
05-20-2020, 09:58 AM
kikola (Novice; Windows 10, Office 2013)
Join Date: May 2020
Posts: 23
Indention below paragraph mark...

Hello friends,
I am running OCR on a scanned document, and the indentation and paragraphs must match the original document, but I am having trouble achieving this.
After OCR, wherever a paragraph's last line is shorter than the standard (full) line length, I need an indent on the line below it, but I cannot work out how to do that. Could you help with VBA or some other way of accomplishing this?
Attached Files
File Type: docx for layout.docx (13.3 KB, 8 views)
  #2  
05-20-2020, 04:07 PM
macropod (Administrator; Windows 7 64bit, Office 2010 32bit)
Join Date: Dec 2010
Location: Canberra, Australia
Posts: 21,956

See the Cleaning up Text Pasted from Websites, E-mails, PDFs etc. 'Sticky' thread at the top of the Word forum: https://www.msofficeforums.com/word/...-pdfs-etc.html
__________________
Cheers,
Paul Edstein
[Fmr MS MVP - Word]
  #3  
05-21-2020, 03:43 AM
kikola

Thanks, but it does not work, as my Word document is not from a PDF or a web site. It is from Tesseract OCR. This OCR is quite good at recognizing characters, but it makes some errors.
So, I need a new layout (indent) at the start of a paragraph wherever the previous paragraph's last line is not full length.
Also, I need to delete paragraph marks only for the last paragraph on each page (where one page ends and a new page starts).

Thanks in advance...
  #4  
05-21-2020, 06:03 AM
macropod

The source is irrelevant. In your attachment, every line has been rendered as a separate paragraph - which is exactly the same as sometimes happens with data extracted from PDFs via OCR. As long as your document remains formatted that way, you will not be able to get the layout you desire.
  #5  
05-24-2020, 10:54 PM
kikola

Could this be done by character counting, so that if a line has fewer than, for example, 55 characters, the next paragraph gets an indent?
  #6  
05-24-2020, 11:58 PM
macropod

You could use a wildcard Find/Replace to do something like that, inserting an empty paragraph after every line containing less than 56 characters. For example:
Find = ^13[!^13]{1,55}^13
Replace = ^&^p
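
For readers who want to see what that wildcard pattern does outside Word, here is a minimal Python sketch. It assumes the text uses character 13 (`\r`) as the paragraph mark, which is what `^13` denotes. The function name `add_blank_after_short_lines` and the `max_len` parameter are hypothetical, and Python's `re` module stands in for Word's wildcard engine, so treat it as an approximation rather than the exact behavior of Find/Replace.

```python
import re

def add_blank_after_short_lines(text: str, max_len: int = 55) -> str:
    """Approximate Find = ^13[!^13]{1,55}^13 with Replace = ^&^p."""
    # \r is the paragraph mark (^13); [^\r] mirrors [!^13].
    pattern = re.compile(rf"\r[^\r]{{1,{max_len}}}\r")
    # m.group(0) plays the role of ^& (the found text); the extra \r is ^p.
    return pattern.sub(lambda m: m.group(0) + "\r", text)

sample = "\rA short last line.\r" + "x" * 60 + "\r"
result = add_blank_after_short_lines(sample)
```

In this sketch the match consumes the paragraph mark that closes the short line, so when two short lines follow each other directly, a single pass only adds the empty paragraph after the first one. The first line of the text is also skipped unless a paragraph mark precedes it.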
  #7  
05-26-2020, 02:11 AM
kikola

And how would I implement "fewer than 56 characters and ends with "."?
Is that possible?
  #8  
05-26-2020, 03:22 AM
macropod

How would that be relevant? Paragraphs never end with '.' - they only ever end with paragraph breaks.
  #9  
05-26-2020, 05:37 AM
kikola

Not the paragraph, but the last word on the line (that is, the line's final character, at position 55 or earlier, is ".").
  #10  
05-26-2020, 05:48 AM
macropod

And what about sentences that have less than 56 characters? Or sentences with abbreviations ending in '.' but those abbreviations don't end the sentence? You really haven't explained what you're trying to achieve.
  #11  
05-26-2020, 05:58 AM
kikola

What I need: if the sentence has fewer than 56 characters and its last character is ".", then insert an empty paragraph below it. If not, I have another Find/Replace (([a-z])^13; \1) that I will use.
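
A rough Python equivalent of that second Find/Replace, as a sketch only. It assumes the replacement is `\1` followed by a space (the post does not show the space explicitly), that paragraph marks are character 13 (`\r`), and that the goal is to rejoin lines broken in mid-sentence. The function name `join_soft_breaks` is hypothetical.

```python
import re

def join_soft_breaks(text: str) -> str:
    # Find = ([a-z])^13 : a lowercase letter directly before a paragraph mark.
    # Replace = the captured letter plus a space (the space is an assumption).
    return re.sub(r"([a-z])\r", r"\1 ", text)

joined = join_soft_breaks("the quick\rbrown fox.\rNext paragraph")
# joined == "the quick brown fox.\rNext paragraph"
```

Lines ending in a full stop keep their paragraph mark, because "." is not in the `[a-z]` class. Lines ending in a comma, digit, or capital letter are also left alone, which may or may not be what is wanted.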
  #12  
05-26-2020, 06:04 AM
macropod

There is no reliable way for Find/Replace (or VBA) to know how long a sentence is.
  #13  
05-26-2020, 06:15 AM
kikola

When I OCR some PDFs and the source is of good quality, each new paragraph (indent, layout) is preceded by an empty paragraph, and for those cases I know how to use Find and Replace. But sometimes the paragraphs are not separated from the paragraphs above or below, so the only way I can tell them apart is by the lines that are not full length (fewer than 56 characters and ending with "."). That is why I need some Find/Replace code for this purpose. Is it possible?
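
The heuristic described here (a line that is not full length and ends with ".") can be sketched in Python as follows. This is an illustration, not a Word feature: it assumes the OCR output has already been split into one string per line, and the names `mark_paragraph_ends` and `max_len` are hypothetical. As noted earlier in the thread, short sentences and abbreviations ending in "." can fool a rule like this.

```python
def mark_paragraph_ends(lines, max_len=55):
    """Insert an empty line after each short line that ends with a full stop."""
    out = []
    for line in lines:
        out.append(line)
        stripped = line.rstrip()
        # Short (1..max_len characters) and ending in "." is taken as a paragraph end.
        if 0 < len(stripped) <= max_len and stripped.endswith("."):
            out.append("")
    return out

ocr_lines = ["x" * 60, "ends here.", "y" * 58 + ".", "Short heading"]
marked = mark_paragraph_ends(ocr_lines)
```

Here only "ends here." is followed by an empty line: the first line is full length, the third ends with "." but is longer than 55 characters, and the last is short but has no full stop.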
  #14  
05-26-2020, 06:21 AM
macropod

I already gave you the Find/Replace code for lines less than 56 characters. Use it.