This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
en:infra-convert:user:stamppdf [2022/10/12 13:59] me |
en:infra-convert:user:stamppdf [2022/10/12 14:03] (current) me |
||
---|---|---|---|
Line 46: | Line 46: | ||
A computer can only interpret a text character as such if it is linked to a meaning by so-called character encoding (→[[en:infra-convert:user:terms#font|Font]]). You can check whether character encoding is applied using a PDF reader, for example Adobe Acrobat Reader. Activate the selection tool (**Selection tool for text and images** in Acrobat Reader) and try to select or highlight the text. If elements are highlighted, usually in blue color, coded text was detected. You can copy it (right-click > **Copy** or **Ctrl** + **C**) and paste it in a text editor, for example. | A computer can only interpret a text character as such if it is linked to a meaning by so-called character encoding (→[[en:infra-convert:user:terms#font|Font]]). You can check whether character encoding is applied using a PDF reader, for example Adobe Acrobat Reader. Activate the selection tool (**Selection tool for text and images** in Acrobat Reader) and try to select or highlight the text. If elements are highlighted, usually in blue color, coded text was detected. You can copy it (right-click > **Copy** or **Ctrl** + **C**) and paste it in a text editor, for example. | ||
- | In the first example below, there is no character-coded text, in the second, the number "20" is character-coded, and in the third the diameter character is also encoded. CAD systems usually export graphical symbols not character coded. Thus, PDFs without encoded, with partially encoded or fully encoded text can be named. | + | Four typical cases can be distinguished, as shown in the following examples: 1) no character-coded text is present; 2) texts, except graphic symbols, are character-coded; 3) as 2, but invisible, character-coded text overlays the visible, non-character-coded text; 4) all texts are character-coded. |
- | {{ :de:infra-convert:user:terms:pdf_zeichencodierung_erkennen.png?nolink |}} | + | {{ :en:infra-convert:user:terms:pdf_zeichencodierung_erkennen_2.png?nolink |}} |
- | + | ||
- | \\ | + | |
- | + | ||
- | A special feature in PDF is overlaid text. Non-visible coded text characters overlay the content – the content can be rasterized or vectorized. A document becomes "searchable" with this information. This export variant is available in some CAD systems. Often, optical character recognition (OCR) is used to add overlaid text during scanning. It should be noted, however, that the recognition rate of OCR is (significantly) below one hundred percent. | + | |
- | + | ||
- | Overlaid text can be marked as described above. Often the presence of overlaid text can be recognized by the fact that the marking does not exactly match the visible text characters. | + | |
- | + | ||
- | {{ :de:infra-convert:user:terms:pdf_zeichencodierung_erkennen_ueberlagert.png?nolink |}} | + | |
\\ | \\ | ||
Line 103: | Line 95: | ||
== PDF with partially character coded text == | == PDF with partially character coded text == | ||
- | Create characteristics one by one. The text fragments belonging to a characteristic can be evaluated automatically if you stamp them as a group (press **Ctrl** key, hold it down and draw a frame over the elements to be stamped or click the elements individually).\\ **See** Functions > Characteristics> Automatics stamping > [[https://wiki.elias-gmbh.de/doku.php?id=en:infra-convert:user:functions:ballooning#workflow|Workflow]] > Step 4c | + | The **Stamp with preset** function has been specially developed for this use case (available from program version 3.3.0). Follow the instructions on this page: |
- | + | \\ **See** Functions > Characteristics > [[de:infra-convert:user:functions:ballooning_preset|Stamp with preset]] | |
- | If a drawing entry contains a graphic symbol at the beginning of the text, for example the diameter sign in "⌀ 20", stamp the text as described previously. You then correct the property that was not interpreted or was interpreted incorrectly due to the missing character in the feature properties. For example, the class "Length" is recognized from "20". You then change this to "Diameter". | + | |
- | + | ||
- | Drawing entries with non-character-coded symbols in the middle of the dimension text, for example "20 ±0.1" with uncoded "±", as well as specifications that are complex in the description with characteristic properties, such as form and position tolerances, are best stamped directly manually.\\ **See** Functions > Characteristics > [[en:infra-convert:user:functions:ballooning_man|Manually stamp]] | + | |
- | + | ||
- | Example of a PDF with partially coded text: | + | |
- | + | ||
- | {{ :en:infra-convert:user:terms:pdf_stempeln_teilweise_codiert.png?nolink |}} | + | |
- | + | ||
- | Example of a PDF with partially encoded, overlaid text (rasterized content): | + | |
- | + | ||
- | {{ :en:infra-convert:user:terms:pdf_stempeln_teilweise_codiert_raster.png?nolink |}} | + | |
- | + | ||
- | Example of a PDF with partially encoded, overlaid text (vectorized content): | + | |
- | {{ :en:infra-convert:user:terms:pdf_stempeln_teilweise_codiert_ueberlagert_vektor.png?nolink |}} | + | > **Recommended procedure up to program version 3.2.7:**\\ |
+ | > Create characteristics one by one. The text fragments belonging to a characteristic can be evaluated automatically if you stamp them as a group (press **Ctrl** key, hold it down and draw a frame over the elements to be stamped or click the elements individually).\\ **See** Functions > Characteristics> Automatics stamping > [[https://wiki.elias-gmbh.de/doku.php?id=en:infra-convert:user:functions:ballooning#workflow|Workflow]] > Step 4c\\ | ||
+ | > If a drawing entry contains a graphic symbol at the beginning of the text, for example the diameter sign in "⌀ 20", stamp the text as described previously. You then correct the property that was not interpreted or was interpreted incorrectly due to the missing character in the feature properties. For example, the class "Length" is recognized from "20". You then change this to "Diameter".\\ | ||
+ | > Drawing entries with non-character-coded symbols in the middle of the dimension text, for example "20 ±0.1" with uncoded "±", as well as specifications that are complex in the description with characteristic properties, such as form and position tolerances, are best stamped directly manually.\\ **See** Functions > Characteristics > [[en:infra-convert:user:functions:ballooning_man|Manually stamp]]\\ | ||
+ | > Example of a PDF with partially coded text:\\ | ||
+ | > {{ :en:infra-convert:user:terms:pdf_stempeln_teilweise_codiert.png?nolink |}} | ||
+ | > Example of a PDF with partially encoded, overlaid text (rasterized content):\\ | ||
+ | > {{ :en:infra-convert:user:terms:pdf_stempeln_teilweise_codiert_raster.png?nolink |}} | ||
+ | > Example of a PDF with partially encoded, overlaid text (vectorized content):\\ | ||
+ | > {{ :en:infra-convert:user:terms:pdf_stempeln_teilweise_codiert_ueberlagert_vektor.png?nolink |}} | ||
</WRAP> | </WRAP> |