For shorter documents it is useful to read the lines backwards. It breaks the automatic pattern recognition up a little so you assume less about what is on the page. A