On using @Microformats and parsing HTML pages directly :

http://aaronparecki.com/2012/281/article/1/providing-apis-for-content-driven-websites