{"id":5361,"date":"2011-06-24T17:34:02","date_gmt":"2011-06-24T12:27:17","guid":{"rendered":""},"modified":"2011-06-24T18:27:17","modified_gmt":"2011-06-24T12:27:17","slug":"moving-text-fro","status":"publish","type":"post","link":"https:\/\/kk.org\/screenpublishing\/moving-text-fro\/","title":{"rendered":"Moving text from PDF to InDesign"},"content":{"rendered":"<p>We&#8217;re working on another EPUB conversion now. For this one, we&#8217;re moving from PDF to EPUB via InDesign. Currently, the whole book exists as a single PDF. The text in the PDF includes a small but not insignificant amount of character styling (bold, italic, etc). The best workflow I&#8217;ve found so far seems to be:<\/p>\n<p><strong>1. Break the single PDF into multiple PDFs by chapter.<\/strong> I&#8217;ll be using a Book organization in InDesign, which means creating a separate InDesign document for each chapter. For that reason, I&#8217;d like to import the text chapter-by-chapter.<\/p>\n<p><strong>2. Copy-Paste the text for each chapter into a separate Word document. Remove extraneous carriage returns.<\/strong> There are two main methods for getting text into InDesign. You <em>can<\/em> copy-paste directly into a text frame, but for big sections of text, this is not recommended. The better method (I think) is to use InDesign&#8217;s Place command (ctrl+D). To do this, text needs to be in a place-able format like TXT, DOC, or RTF. Because I&#8217;m working with formatted text, DOC or RTF are my best options. After a few rounds of experimentation, I find Word to offer the best tools for maintaining character formatting and eliminating extraneous line breaks. Depending on what happens upon paste (or paste special &#8211; I&#8217;ve experimented with all of the options and it seems like I get a different result each time), I may use AutoFormat to fix extra carriage returns while maintaining paragraph breaks. <\/p>\n<p><strong>3. &#8216;Place&#8217; text in InDesign.<\/strong> When you place text, you can &#8220;Show options&#8221;. The only adjustment to the defaults I&#8217;ve made is to omit page breaks. I&#8217;ve left several defaults that don&#8217;t apply to my document and it doesn&#8217;t seem to break anything. This is what my Place options looks like:<\/p>\n<p><img loading=\"lazy\" alt=\"ind-place-text.jpg\" src=\"https:\/\/kk.org\/screenpublishing\/ind-place-text.jpg\" width=\"450\" height=\"413\" class=\"mt-image-none\" style=\"\" \/><\/p>\n<p>So far, these three steps seem to work pretty well. It seems like there&#8217;s a fair amount of variation in the way my pasted text lands in Word (especially in terms of the presence or absence of extra carriage returns). Also, although the first chapter I worked on seemed to transmit fairly clean text, in the second and third chapters I&#8217;m finding many instances of words running together (&#8220;inhalesthem&#8221; instead of &#8220;inhales them&#8221;). I suspect this is related to the carriage return issue. Although I&#8217;m not always seeing extra carriage returns, it seems like their absence is sometimes accompanied by a missing space.<\/p>\n<p>If anyone has a better workflow for getting styled text from PDF to InDesign, please share in the comments!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We&#8217;re working on another EPUB conversion now. For this one, we&#8217;re moving from PDF to EPUB via InDesign. Currently, the whole book exists as a single PDF. The text in the PDF includes a small but not insignificant amount of &hellip; <a href=\"https:\/\/kk.org\/screenpublishing\/moving-text-fro\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"open","ping_status":"0","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/posts\/5361"}],"collection":[{"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/comments?post=5361"}],"version-history":[{"count":1,"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/posts\/5361\/revisions"}],"predecessor-version":[{"id":6288,"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/posts\/5361\/revisions\/6288"}],"wp:attachment":[{"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/media?parent=5361"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/categories?post=5361"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kk.org\/screenpublishing\/wp-json\/wp\/v2\/tags?post=5361"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}