A PDF Is Not an Ebook or The Incredible, Moveable Ebook

CeciNestPasUnEbookThere are two common misconceptions in the blogosphere these days:

  1. that a PDF is an ebook;
  2. that a PDF can seamlessly become an ebook (MOBI for Kindle or EPUB for all other eReaders).

Let’s start with the first one: a PDF is not an ebook.

You might say, “Well, when I look at a PDF on my computer, it looks exactly like a book. It has page numbers and page headings and everything. I can even read it on my iPad.”

But the fact that you can read it on your computer or your tablet does not make it an ebook. It does indeed look like a print book. And that’s where the problem lies.

How an Ebook Redefines Moveable Type

An ebook is a text that can adjust itself to the device you use to read it and to settings you choose.

A PDF (Portable Document Format) cannot adjust itself. It is an image of a document that is always meant to look the same — the page always starts with a certain word and ends with a certain word. It has a hidden layer of code speaking to the electronic PDF reader and telling it how to display a static page — imagine an invisible grid with horizontal and vertical coordinates to place each element in a precise location.

These hidden instructions are why a PDF is not an ebook and why a PDF cannot be seamlessly changed into either of the standard ebook formats: MOBI and EPUB.

Ebooks are about flexibility — about reflow. An ebook morphs depending on what your settings are and what device you are reading on. You can change the font size, read a book in one-column view or in two columns, read the same copy of the book on your iPhone, iPad, computer, or even on your Kindle. The book needs to adapt easily to each of those devices and to how you want to read it.

When we make an ebook, we work on creating something that resembles a print book but that is malleable, transforming itself to meet the needs of the reader and the reading device.

We have to think about all the different platforms that people read on (Kindle, Nook, iBook, etc.) and the different ways readers customize their screens. I know people, for example, who have failing eyesight and who thought they would have to give up reading. Ebooks — true ebooks! — have changed their lives, because they can make the font as big as they need or even adjust the text color and contrast.

But what happens when you enlarge the font on a PDF? You can’t: you just enlarge the PDF and if you enlarge it too much, you can’t see the whole page on your screen.

You may want to say, “Well, yes, that’s true. But if I want to play with font size and read a PDF on a Kindle, I can put it through a free converter that will make an ebook for Kindle.” (I have in fact seen this as part of the sales pitch on websites selling PDFs as “ebooks.”)

That is indeed true and, occasionally, it might turn out all right. But usually it needs some help. And often a lot of help.

Why a PDF Can’t Magically Become an Ebook

This brings us to point number 2. Remember: the PDF has a hidden layer of information that gives precise instructions on how to make a page look like the printed page of the book. These instructions are useless for the eReaders because an eReader does not want to make a static page.

In fact, the program you use to make an ebook out of a PDF will not know what to do with most of those instructions. Computer programs are useful tools, but it is very hard to create one that can catch all the complexities involved in translating a PDF into an ebook.

Here are some examples of conversion oddities that we regularly encounter.

(1) Mysterious Text

The other day I started creating an ebook out of a PDF. I had put my document through the first step of the process and was going through the text to see just how much needed to be fixed when I came upon what seemed to be a random list of geographical places:

English Channel

England

France

Spain

Atlantic

Mediterranean

Africa

Having no idea why the list was where it was, I looked at the PDF as a reference. It turns out that the list had originally been words labeling different parts of a map. The PDF maker had layered the names of places on top of the image of a map. When I extracted the text from the PDF to create the ebook (and thus temporarily removed images), the names that had been layered onto the map were left all on their lonesome, referring to nothing and not knowing how to position themselves on the screen.

(2) The Ebook vs. the Typesetter

Think about what typesetters do: they do not let words fall randomly on a page to create a book. They are, after all, setting the type. Typesetters have to think about things like widows (the last word or last few words of a paragraph printed alone at the beginning of a new page or column) and orphans (the first line of a paragraph starting at the very end of a page or column), and when they are confronted with widows and orphans, they manipulate the page to get rid of them. And this becomes part of the PDF’s code.

When translated into ebooks, the mechanisms used to avoid widows and orphans turn into strange spaces between paragraphs or into paragraphs that start on the same line the previous paragraph ended:

Be good enough, honourable deputies, to take our request seriously, and do not reject it without at least hearing the reasons that we have to advance in its support.                                                                                     First, if you shut off as much as possible all access to natural light, and thereby create a need for artificial light, what industry in France will not ultimately be encouraged?

(3) Ran-dom Hyphens

Ever read an ebook and found a word with a ran-dom hyphen in the middle? It’s not just a typo.

Sometimes a word at the end of a line of text is too long and is continued on the next line. To let you know that the word is interrupted, the typesetter (or word processor) adds a soft hyphen at the end of the line — or sometimes the typesetter (or program) makes a mistake and adds a regular (hard) hyphen instead.

A soft hyphen is meant to stay in a word only if it is interrupted at the end of a line. Systems converting PDFs into an ebooks don’t always know what to do with these and will often just turn them into hard hyphens. Thus, even if the word is not shared between two lines, it still has a hyphen in it.

For more on the when and why of ligatures, check out I Love Typography's "The Decline and fall of the ligature."

For more on the when and why of ligatures, check out I Love Typography’s “The Decline and fall of the ligature.”


(4) Ligatures

Printed books and their PDFs often have something called ligatures. A ligature is actually a combination of two letters that would otherwise look awkward next to each other. For example, f and i are often slurred together to avoid the awkward juxtaposition of the tip of the f and the dot of the i.

But an ebook converter won’t necessarily know what to do with ligatures and instead you’ll get a hyphen in the middle of a word, or a strange vertical line, or even a question mark to replace the ligature that once represented two letters.

These are only a few examples of why many ebook makers prefer to manually go through a text that is being converted from PDF to ebook: there are too many little details for a converting program to catch, and so the human eye is still vital in the process.

Think about how far we are from Gutenberg’s moveable type: a PDF may remind us of the beautiful, static pages Gutenberg offered the Western world, but the ebook redefines the notion of “moveable” type — with a well-made ebook (and often “handmade” ebook), each reader can move the type to create his own version of the text.

Leave a Comment

Filed under Ebooks, Publishing

Humane and Pro-Growth #1 Emigration & Immigration Law on Amazon

HumaneAndPro-GrowthBestsellerThis past Monday, Reason released Humane and Pro-Growth: A Reason Guide to Immigration Reform (prepared for electronic publication by Invisible Order). And today it is the #1 bestseller sold in the Amazon category “Emigration and Immigration Law.”

Congratulations to editor Shikha Dalmia and the Reason writers!

Leave a Comment

Filed under Ebooks, Publishing

Four Versions of Bourbon for Breakfast!

9781621290759_frontcoverInvisible Order is pleased and honored to have been involved in the production of all four versions of LFB’s new edition of Jeffrey A. Tucker’s Bourbon for Breakfast.

Four versions, you say?

Yes, four versions! This great collection of essays is available not only in a paperback edition but also as an eBook, a multimedia eBook (with videos of the author), and an audiobook narrated by Steven Ng.

And all four versions are in the top-five LFB bestsellers!

Invisible Order worked on all three text versions (eBook, paper, and multimedia) and also helped produce Steven Ng’s audio version (using our “editing” ears instead of pens).

Leave a Comment

Filed under Austrian Economics, Ebooks, Economics, Libertarian Letters, Libertarian Theory, Publishing

Prohibition Only Makes Weed Stronger

RhettButler4IOBK Marcus is up in the Freeman this morning, explaining why it’s so hard to get weak weed these days.

Check out “Why Rhett Butler’s Weed is So Strong.”

Prohibition has driven the development of ever-stronger drugs, where a free market would see a proliferation of lighter options.

Leave a Comment

Filed under Austrian Economics

New Release: Humane and Pro-Growth

Invisible Order is excited to announce the release of our latest project. We had the thrill of working on Reason‘s new ebook, Humane and Pro-Growth: A Reason Guide to Immigration Reform (edited by Shikha Dalmia). This collection of articles is an essential read for those thinking about —and rethinking — the current immigration debate.

We have a choice between raising the barricades further and ejecting people already here or moving toward a more open system that allows people to move about freely. This collection from Reason features the work of journalists, policy experts, and academics. It advances an informed discussion about realistic and workable reforms appropriate to America’s self-understanding as a Land of Immigrants.

Humane and Pro-Growth is available for Kindle, iPad, and Nook.

2 Comments

Filed under Ebooks, Publishing

Em Dashes and Ellipses in Ebooks

Although we love Chicago Style, just following it slavishly on every detail can produce some ugly results in your digital text.

In general, style guides are stuck in the print era. We have to adapt them for the realities of digital text in general and ebooks in particular. This need to adapt is most obvious in how Chicago treats two commonly overlooked characters: the ellipsis (…) and the em dash (—).

Continue reading

Leave a Comment

Filed under Ebooks

Lincoln and Booth in Hell

On the anniversary of the shooting of Abraham Lincoln, BK Marcus muses on the assassins of Caesar and their place at the very bottom of Dante’s Inferno:

If a modern Dante — someone with the same religious and political beliefs, but with an updated knowledge of history — were to write the Inferno today, he would replace Cassius in satanic mouth #3 with John Wilkes Booth, the man who shot Abraham Lincoln 148 years ago today, April 14, 1865.

Lincoln may not have shared Dante’s faith, but he knew how to use the rhetoric of the Bible to stoke the fervor of 19th-century Christian pietists; and after his assassination, the pietists returned the favor by enshrining Father Abraham in the language of Christian martyrdom.

As John Wilkes Booth shot the Union’s president, he shouted, “Sic semper tyrannis!”thus always to tyrants, a phrase attributed to Brutus as he stabbed the dying Caesar. It’s also the state motto of Virginia, where I live.

Today’s civic religion requires us to canonize Lincoln, to see his life as a crusade for ever-greater freedom and his violent death as the American equivalent of crucifixion.

Lincoln-Uncensored-CoverOne of the many things we’ve lost track of in this sanctioned interpretation is an understanding of the tradition that produced John Wilkes Booth. We are only allowed to perceive the evil of chattel slavery at the center of the Civil War; we are censured if we ever emphasize an older historical struggle between liberty and power.

But the liberals of that era did see that there was more than one liberal principle at stake and that not all the angels aligned themselves with only one side in that bloody struggle.

We don’t have to support political violence (or any violence) to recognize that the same ideological tradition that informed Booth’s name, his most famous words, and his most infamous act is the same tradition that produced Western liberalism and the American Revolution. Sic semper…

For the real story of the tyrant himself, in his own words, check out Joseph Fallon’s ebook Lincoln Uncensored.

3 Comments

Filed under Ebooks, History