User Tools

Site Tools


en:infra-convert:user:stamppdf

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
en:infra-convert:user:stamppdf [2022/10/12 13:59]
me
en:infra-convert:user:stamppdf [2022/10/12 14:03] (current)
me
Line 46: Line 46:
 A computer can only interpret a text character as such if it is linked to a meaning by so-called character encoding (→[[en:​infra-convert:​user:​terms#​font|Font]]). You can check whether character encoding is applied using a PDF reader, for example Adobe Acrobat Reader. Activate the selection tool (**Selection tool for text and images** in Acrobat Reader) and try to select or highlight the text. If elements are highlighted,​ usually in blue color, coded text was detected. You can copy it (right-click > **Copy** or **Ctrl** + **C**) and paste it in a text editor, for example. A computer can only interpret a text character as such if it is linked to a meaning by so-called character encoding (→[[en:​infra-convert:​user:​terms#​font|Font]]). You can check whether character encoding is applied using a PDF reader, for example Adobe Acrobat Reader. Activate the selection tool (**Selection tool for text and images** in Acrobat Reader) and try to select or highlight the text. If elements are highlighted,​ usually in blue color, coded text was detected. You can copy it (right-click > **Copy** or **Ctrl** + **C**) and paste it in a text editor, for example.
  
-In the first example belowthere is no character-coded text, in the secondthe number "​20"​ is character-coded, ​and in the third the diameter character is also encoded. CAD systems usually export graphical symbols not character coded. Thus, PDFs without encodedwith partially encoded or fully encoded ​text can be named.+Four typical cases can be distinguishedas shown in the following examples: 1) no character-coded text is present; 2) textsexcept graphic symbolsare character-coded; 3) as 2, but invisible, character-coded text overlays the visiblenon-character-coded ​text; 4) all texts are character-coded.
  
-{{ :de:​infra-convert:​user:​terms:​pdf_zeichencodierung_erkennen.png?​nolink |}} +{{ :en:​infra-convert:​user:​terms:​pdf_zeichencodierung_erkennen_2.png?nolink |}}
- +
-\\  +
- +
-A special feature in PDF is overlaid text. Non-visible coded text characters overlay the content – the content can be rasterized or vectorized. A document becomes "​searchable"​ with this information. This export variant is available in some CAD systems. Often, optical character recognition (OCR) is used to add overlaid text during scanning. It should be noted, however, that the recognition rate of OCR is (significantly) below one hundred percent. +
- +
-Overlaid text can be marked as described above. Often the presence of overlaid text can be recognized by the fact that the marking does not exactly match the visible text characters. +
- +
-{{ :​de:​infra-convert:​user:​terms:​pdf_zeichencodierung_erkennen_ueberlagert.png?nolink |}}+
  
 \\  \\ 
Line 103: Line 95:
 == PDF with partially character coded text == == PDF with partially character coded text ==
  
-Create characteristics one by one. The text fragments belonging to a characteristic can be evaluated automatically if you stamp them as a group (press ​**Ctrl** key, hold it down and draw a frame over the elements to be stamped or click the elements individually).\\  **See** Functions > Characteristics>​ Automatics stamping > [[https://​wiki.elias-gmbh.de/​doku.php?​id=en:​infra-convert:​user:​functions:​ballooning#​workflow|Workflow]] > Step 4c +The **Stamp with preset** function has been specially developed for this use case (available from program version 3.3.0)Follow ​the instructions on this page: 
- +\\  **See** Functions > Characteristics > [[de:​infra-convert:​user:​functions:​ballooning_preset|Stamp with preset]]
-If a drawing entry contains a graphic symbol at the beginning of the text, for example the diameter sign in "⌀ 20", stamp the text as described previously. You then correct the property that was not interpreted or was interpreted incorrectly due to the missing character in the feature properties. For example, the class "​Length"​ is recognized from "​20"​. You then change ​this to "​Diameter"​. +
- +
-Drawing entries with non-character-coded symbols in the middle of the dimension text, for example "20 ±0.1" with uncoded "​±",​ as well as specifications that are complex in the description with characteristic properties, such as form and position tolerances, are best stamped directly manually.\\  **See** Functions > Characteristics > [[en:​infra-convert:​user:​functions:​ballooning_man|Manually stamp]] +
- +
-Example of a PDF with partially coded text: +
- +
-{{ :​en:​infra-convert:​user:​terms:​pdf_stempeln_teilweise_codiert.png?​nolink |}} +
- +
-Example of a PDF with partially encoded, overlaid text (rasterized content): +
- +
-{{ :​en:​infra-convert:​user:​terms:​pdf_stempeln_teilweise_codiert_raster.png?​nolink |}} +
- +
-Example of a PDF with partially encoded, overlaid text (vectorized content):+
  
-{{ :​en:​infra-convert:​user:​terms:​pdf_stempeln_teilweise_codiert_ueberlagert_vektor.png?​nolink |}}+> **Recommended procedure up to program version 3.2.7:**\\  
 +> Create characteristics one by one. The text fragments belonging to a characteristic can be evaluated automatically if you stamp them as a group (press **Ctrl** key, hold it down and draw a frame over the elements to be stamped or click the elements individually).\\  **See** Functions > Characteristics>​ Automatics stamping > [[https://​wiki.elias-gmbh.de/​doku.php?​id=en:​infra-convert:​user:​functions:​ballooning#​workflow|Workflow]] > Step 4c\\  
 +> If a drawing entry contains a graphic symbol at the beginning of the text, for example the diameter sign in "⌀ 20", stamp the text as described previously. You then correct the property that was not interpreted or was interpreted incorrectly due to the missing character in the feature properties. For example, the class "​Length"​ is recognized from "​20"​. You then change this to "​Diameter"​.\\  
 +> Drawing entries with non-character-coded symbols in the middle of the dimension text, for example "20 ±0.1" with uncoded "​±",​ as well as specifications that are complex in the description with characteristic properties, such as form and position tolerances, are best stamped directly manually.\\  **See** Functions > Characteristics > [[en:​infra-convert:​user:​functions:​ballooning_man|Manually stamp]]\\  
 +> Example of a PDF with partially coded text:\\  
 +> {{ :​en:​infra-convert:​user:​terms:​pdf_stempeln_teilweise_codiert.png?​nolink |}} 
 +> Example of a PDF with partially encoded, overlaid text (rasterized content):\\  
 +> {{ :​en:​infra-convert:​user:​terms:​pdf_stempeln_teilweise_codiert_raster.png?​nolink |}} 
 +> Example of a PDF with partially encoded, overlaid text (vectorized content):\\  
 +{{ :​en:​infra-convert:​user:​terms:​pdf_stempeln_teilweise_codiert_ueberlagert_vektor.png?​nolink |}}
  
 </​WRAP>​ </​WRAP>​
en/infra-convert/user/stamppdf.1665575945.txt.gz · Last modified: 2022/10/12 13:59 by me