Monday, February 23, 2015

Issues in HTML formatting

A number of questions have come up in trying to create an HTML version of the printed book.  This is just a list of some of them.

  • On page 21, we have a footnote -- the only footnote in the whole book.  It is not clear how to do footnotes for the Kindle, or for HTML generally.  Wikipedia makes each footnote mark a link to the actual footnote at the end of the web page.  Randall Munroe in his What If web site does some as links and others as text which shows up in a tiny window on top of the reference number, when you click on the reference number -- sort of like "alt text" for images.  My interpretation of this footnote is that is is an aside -- extra information that you may already know, and can sort of skip.  I would think this is effectively a parenthetical expression (sort of like this), and figure it is easiest to re-format it as such.
  • Where do we put the figures?  Most of the figures are roughly related to the text that it is near, but have no specific place where they need to be.  We will try to find the first paragraph break where the figure would help the reader, and put it there.
  • Several of the full-page images, are sideways.  That makes sense in the printed book, where the pages are longer than they are wide.  In standard HTML, the display window width is under user control, so it probably makes more sense to have them be right-side up.  But the Kindle is more like a printed page -- fixed size display window that is longer than it is wide, so let's leave the images sideways.
  • On page 27, the last two lines of the opening quote are partially indented, but I can see no reason for it.  Searching the web, this is Sonnet 81 by William Shakespeare. Of the copies I find on the web, some just center all the lines; some make them all left-justified.  But quite a few indent the last two lines.  So I guess we will do that too, to keep with the original document.
  • One thing we want to do is to use italics correctly, matching what was in the printed book.  If we compare our HTML version with the scanned bit-map version, we find a couple of places, mainly on pages 33 to 35, where we have to decide if some characters are (or are not) italic.  Specifically, there are (apparently) italic dashes and an italic colon, not to mention the periods. (Are they italic or regular periods?)

No comments:

Post a Comment