I agree that the number of data points needed for study and the number of variables involved are both key factors. The one area that comes to mind where a approach might be easiest is closed platforms like the hardware Kindle reader -- for example, measuring read times by a large population of the same text, with varying fonts, on the same hardware. Improved typography might reduce read times some modest percentage over baseline due to improved readability.