Microformat

A microformat (sometimes abbreviated μF) is a web-based approach to semantic markup which seeks to re-use existing HTML/XHTML tags to convey metadata[1] and other attributes in web pages and other contexts that support (X)HTML, such as RSS. This approach allows software to process information intended for end-users (such as contact information, geographic coordinates, calendar events, and the like) automatically.

Although the content of web pages is technically already capable of "automated processing", and has been since the inception of the web, such processing is difficult because the traditional markup tags used to display information on the web do not describe what the information means.[2] Microformats can bridge this gap by attaching semantics, and thereby obviate other, more complicated, methods of automated processing, such as natural language processing or screen scraping. The use, adoption and processing of microformats enables data items to be indexed, searched for, saved or cross-referenced, so that information can be reused or combined.[2]

As of 2010, microformats allow the encoding and extraction of events, contact information, social relationships and similar data, and more are being developed.

Background

Microformats emerged as part of a grassroots movement to make recognizable data items (such as events, contact details or geographical locations) capable of automated processing by software, as well as directly readable by end-users.[2][3] Link-based microformats emerged first. These include vote links that express opinions of the linked page, which search engines can tally into instant polls.[4]

As the microformats community grew, CommerceNet, a nonprofit organization that promotes electronic commerce on the Internet, helped sponsor and promote the technology and supported the microformats community in various ways.[4] CommerceNet also helped found the Microformats.org community site.[4]

Neither CommerceNet nor Microformats.org operates as a standards body. The microformats community functions through an open wiki, a mailing list, and an Internet Relay Chat (IRC) channel.[4] Most of the existing microformats were created at the Microformats.org wiki and the associated mailing list, by a process of gathering examples of web publishing behaviour and then codifying them. Some other microformats (such as rel=nofollow and unAPI) have been proposed, or developed, elsewhere.

Technical overview

XHTML and HTML standards allow for the embedding and encoding of semantics within the attributes of markup tags. Microformats take advantage of these standards by indicating the presence of metadata using the following attributes:

  • class
  • rel
  • rev (in one case, otherwise deprecated in microformats[5])

For example, the text "The birds roosted at 52.48, -1.89" contains a pair of numbers which may be understood, from their context, to be a set of geographic coordinates. When these numbers are wrapped in spans (or other HTML elements) with specific class names (in this case geo, latitude and longitude, all part of the geo microformat specification):

The birds roosted at
   <span class="geo">
     <span class="latitude">52.48</span>,
     <span class="longitude">-1.89</span>
   </span>

software agents can recognize exactly what each value represents and can then perform a variety of tasks, such as indexing the location, showing it on a map and exporting it to a GPS device.
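The recognition step described above can be sketched in a few lines of Python. This is only an illustration using the standard library's html.parser module, not the algorithm of any particular microformat parser; the names GeoParser and extract_geo are made up for this example, and the sketch assumes well-formed markup in which each latitude or longitude element contains nothing but its number.

```python
from html.parser import HTMLParser

GEO_PROPS = ("latitude", "longitude")


class GeoParser(HTMLParser):
    """Illustrative only: read latitude/longitude inside a class="geo" element."""

    def __init__(self):
        super().__init__()
        self.depth = 0       # nesting depth inside the geo root (0 means outside)
        self.current = None  # geo property of the element being read, if any
        self.coords = {}

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if self.depth or "geo" in classes:
            self.depth += 1
        if self.depth:
            # Only class names found inside the geo root count as properties.
            self.current = next((c for c in classes if c in GEO_PROPS), None)

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1
        self.current = None

    def handle_data(self, data):
        if self.current and data.strip():
            self.coords[self.current] = float(data)


def extract_geo(html):
    """Return a dict such as {'latitude': ..., 'longitude': ...}."""
    parser = GeoParser()
    parser.feed(html)
    parser.close()
    return parser.coords


markup = (
    'The birds roosted at <span class="geo">'
    '<span class="latitude">52.48</span>, '
    '<span class="longitude">-1.89</span></span>'
)
print(extract_geo(markup))
```

The same text without the class names yields an empty result, which is the point of the markup: the numbers only become coordinates once they are labelled.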

Example

In this example, the contact information is presented as follows:

 <div>
   <div>Joe Doe</div>
   <div>The Example Company</div>
   <div>604-555-1234</div>
   <a href="http://example.com/">http://example.com/</a>
 </div>

With hCard microformat markup, that becomes:

 <div class="vcard">
   <div class="fn">Joe Doe</div>
   <div class="org">The Example Company</div>
   <div class="tel">604-555-1234</div>
   <a class="url" href="http://example.com/">http://example.com/</a>
 </div>

Here, the formatted name (fn), organisation (org), telephone number (tel) and web address (url) have been identified using specific class names and the whole thing is wrapped in class="vcard", which indicates that the other classes form an hCard (short for "HTML vCard") and are not merely coincidentally named. Other, optional, hCard classes also exist. Software, such as browser plug-ins, can now extract the information, and transfer it to other applications, such as an address book.
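How such software might read the markup above can be sketched as follows. This is a minimal illustration in Python, assuming well-formed markup and only the four properties used in the example; HCardParser and extract_hcard are hypothetical names, and real hCard parsers handle many more properties and edge cases.

```python
from html.parser import HTMLParser

TEXT_PROPS = ("fn", "org", "tel")  # properties whose value is the element text


class HCardParser(HTMLParser):
    """Illustrative only: collect fn, org, tel and url from one class="vcard" block."""

    def __init__(self):
        super().__init__()
        self.depth = 0       # nesting depth inside the vcard root (0 means outside)
        self.current = None  # text property of the element being read, if any
        self.card = {}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if self.depth or "vcard" in classes:
            self.depth += 1
        if not self.depth:
            return  # class names outside a vcard are treated as coincidental
        if tag == "a" and "url" in classes and attrs.get("href"):
            self.card["url"] = attrs["href"]  # url comes from href, not the text
        self.current = next((c for c in classes if c in TEXT_PROPS), None)

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1
        self.current = None

    def handle_data(self, data):
        if self.current and data.strip():
            self.card[self.current] = data.strip()


def extract_hcard(html):
    parser = HCardParser()
    parser.feed(html)
    parser.close()
    return parser.card


hcard_markup = """
<div class="vcard">
  <div class="fn">Joe Doe</div>
  <div class="org">The Example Company</div>
  <div class="tel">604-555-1234</div>
  <a class="url" href="http://example.com/">http://example.com/</a>
</div>
"""
print(extract_hcard(hcard_markup))
```

Because the sketch ignores class names that appear outside a vcard wrapper, an element that merely happens to be classed "org" or "tel" elsewhere on the page is not mistaken for contact data.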

In-context examples

For annotated examples of microformats on live pages, see the "Live example" section of the hCard article and the "Three classes" section of the Geo (microformat) article.

Specific microformats

Several microformats have been developed to enable semantic markup of particular types of information.

  • hAtom – for marking up Atom feeds from within standard HTML
  • hCalendar – for events
  • hCard – for contact information
  • hMedia – for audio/video content[6][7]
  • hNews – for news content
  • hProduct – for products
  • hRecipe – for recipes and foodstuffs
  • hResume – for resumes or CVs
  • hReview – for reviews
  • rel-directory – for distributed directory creation and inclusion[8]
  • rel-enclosure – for multimedia attachments to web pages[9]
  • rel-license – specification of copyright license[10]
  • rel-nofollow – an attempt to discourage third-party content spam (e.g. spam in blogs)
  • rel-tag – for decentralized tagging (Folksonomy)[11]
  • xFolk – for tagged links
  • XHTML Friends Network (XFN) – for social relationships
  • XOXO – for lists and outlines
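The link-based formats in this list (the rel-* family and XFN) carry their meaning in the rel attribute of ordinary links, so a consumer only needs to group link targets by rel value. The following Python sketch illustrates that idea; the class and function names, and all URLs in the sample, are hypothetical.

```python
from collections import defaultdict
from html.parser import HTMLParser


class RelParser(HTMLParser):
    """Illustrative only: group link targets by each value of their rel attribute."""

    def __init__(self):
        super().__init__()
        self.links = defaultdict(list)

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("a", "link") and attrs.get("href"):
            # rel holds a space-separated list, e.g. rel="friend met" in XFN.
            for value in (attrs.get("rel") or "").lower().split():
                self.links[value].append(attrs["href"])


def extract_rel_links(html):
    parser = RelParser()
    parser.feed(html)
    parser.close()
    return dict(parser.links)


rel_sample = (
    '<a rel="tag" href="http://example.com/tags/birds">birds</a> '
    '<a rel="license" href="http://example.com/license">licence</a> '
    '<a rel="friend met" href="http://example.org/jane">Jane</a> '
    '<a href="http://example.net/">plain link</a>'
)
print(extract_rel_links(rel_sample))
```

Links without a rel attribute are simply skipped, so ordinary navigation does not pollute the result.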

Microformats under development

Among the many proposed microformats,[12] the following are undergoing active development:

  • hAudio – for audio files and references to released recordings
  • citation – for citing references
  • currency – for amounts of money
  • figure – for associating captions with images[13]
  • geo extensions – for places on Mars, the Moon, and other such bodies; for altitude; and for collections of waypoints marking routes or boundaries
  • species – for the names of living things (already used by Wikipedia[14] and the BBC Wildlife Finder)
  • measure – for physical quantities, structured data-values[15]

Uses of microformats

Using microformats within HTML code provides additional formatting and semantic data that applications can use. For example, applications such as web crawlers can collect data about online resources, and desktop applications such as e-mail clients or scheduling software can compile details. The use of microformats can also facilitate "mashups", such as exporting all of the geographical locations on a web page into (for example) Google Maps to visualize them spatially.

Several browser extensions, such as Operator for Firefox and Oomph for Internet Explorer, can detect microformats within an HTML document. When hCard or hCalendar is involved, such extensions allow the data to be exported into formats compatible with contact management and calendar utilities, such as Microsoft Outlook. When dealing with geographical coordinates, they allow the location to be sent to mapping applications such as Google Maps. Yahoo! Query Language can be used to extract microformats from web pages.[16]

On 12 May 2009, Google announced that it would parse the hCard, hReview and hProduct microformats and use them to populate search result pages.[17] It has since extended this to use hCalendar for events[18] and hRecipe for cookery recipes.[18] Microformats are also consumed by Bing[19] and Yahoo!.[20] As of late 2010, these were the world's top three search engines.[21]
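As a rough illustration of the export to contact-management formats mentioned above, the following Python sketch turns a dictionary of hCard properties into vCard text. It is not how Operator or Oomph work internally. The function names are hypothetical, the choice of vCard version 4.0 is an assumption made for brevity (hCard itself is modelled on the older vCard of RFC 2426), and the sketch skips details such as line folding.

```python
def escape_text(value):
    """Escape the characters that are special in vCard text values."""
    return (
        value.replace("\\", "\\\\")
        .replace(",", "\\,")
        .replace(";", "\\;")
        .replace("\n", "\\n")
    )


def hcard_to_vcard(card):
    """Build vCard text from a dict with optional fn, org, tel and url keys."""
    lines = ["BEGIN:VCARD", "VERSION:4.0"]
    for prop in ("fn", "org", "tel"):
        if prop in card:
            lines.append(f"{prop.upper()}:{escape_text(card[prop])}")
    if "url" in card:
        lines.append(f"URL:{card['url']}")  # a URI value, left unescaped here
    lines.append("END:VCARD")
    return "\r\n".join(lines) + "\r\n"  # vCard lines end with CRLF


example_card = {
    "fn": "Joe Doe",
    "org": "The Example Company",
    "tel": "604-555-1234",
    "url": "http://example.com/",
}
print(hcard_to_vcard(example_card))
```

The input dictionary uses the same property names as the hCard classes (fn, org, tel, url), which is what makes the mapping almost mechanical.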

Microsoft has expressed a desire to incorporate microformats into upcoming projects,[22] as have other software companies.

Alex Faaborg summarizes the arguments for putting the responsibility for microformat user interfaces in the web browser rather than making more complicated HTML:[23]

  • Only the web browser knows what applications are accessible to the user and what the user's preferences are
  • It lowers the barrier to entry for web site developers if they only need to do the markup and not handle "appearance" or "action" issues
  • It retains backwards compatibility with web browsers that don't support microformats
  • The web browser presents a single point of entry from the web to the user's computer, which simplifies security issues

Evaluation of microformats

Various commentators have reviewed and discussed the design principles and practical aspects of microformats. Microformats have also been compared to other approaches that seek to serve the same or a similar purpose.[24] Individual microformats, and microformats as a whole, have at times been criticized.[24] There are also documented efforts to advocate the spread and use of microformats.[25][26] Opera Software CTO and CSS creator Håkon Wium Lie said in 2005, "We will also see a bunch of microformats being developed, and that’s how the semantic web will be built, I believe."[27] However, in August 2008, Toby Inkster, author of the "Swignition" (formerly "Cognition") microformat parsing service, pointed out that no new microformat specifications had been published for over three years.[28]

Design principles

Computer scientist and entrepreneur Rohit Khare stated that "reduce, reuse, and recycle" is "shorthand for several design principles" that motivated the development and practices behind microformats.[4]:71–72 These principles can be summarized as follows:

  • Reduce: favor the simplest solutions and focus attention on specific problems;
  • Reuse: work from experience and favor examples of current practice;
  • Recycle: encourage modularity and the ability to embed; valid XHTML can be reused in blog posts, RSS feeds, and anywhere else you can access the web.[4]

Accessibility

Because some microformats use the title attribute of HTML's abbr element to conceal machine-readable data (particularly date-times and geographical coordinates) in the "abbr design pattern", the plain text content of the element is inaccessible to those screen readers that expand abbreviations.[29] In June 2008, the BBC announced that it would be dropping use of microformats using the abbr design pattern because of accessibility concerns.[30]
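The pattern can be made concrete with a short Python sketch. The markup below is a made-up example in the style of hCalendar's dtstart property, and the function name is hypothetical; the sketch simply shows that the element carries two values, one for people and one for software.

```python
from html.parser import HTMLParser


class AbbrPatternParser(HTMLParser):
    """Illustrative only: pair each abbr element's title with its visible text."""

    def __init__(self):
        super().__init__()
        self.open_abbr = None  # record for the abbr currently being read
        self.found = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "abbr" and attrs.get("class") and attrs.get("title"):
            self.open_abbr = {
                "class": attrs["class"],
                "machine": attrs["title"],  # value read by microformat parsers
                "human": "",                # text shown to sighted readers
            }

    def handle_data(self, data):
        if self.open_abbr is not None:
            self.open_abbr["human"] += data

    def handle_endtag(self, tag):
        if tag == "abbr" and self.open_abbr is not None:
            self.found.append(self.open_abbr)
            self.open_abbr = None


def extract_abbr_values(html):
    parser = AbbrPatternParser()
    parser.feed(html)
    parser.close()
    return parser.found


abbr_markup = 'Starts <abbr class="dtstart" title="2007-10-05">October 5th</abbr>'
print(extract_abbr_values(abbr_markup))
```

A screen reader that expands abbreviations announces the title value (here an ISO date string) in place of "October 5th", which is the accessibility problem described above.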

Comparison with alternative approaches

Microformats are not the only solution for providing "more intelligent data" on the web. Alternative approaches exist and are under development as well. For example, the use of XML markup and standards of the Semantic Web are cited as alternative approaches.[4] Some contrast these with microformats in that they do not necessarily coincide with the design principles of "reduce, reuse, and recycle", at least not to the same extent.[4]

One advocate of microformats, Tantek Çelik, characterized a problem with alternative approaches:

Here's a new language we want you to learn, and now you need to output these additional files on your server. It's a hassle. (Microformats) lower the barrier to entry.[2]

For some applications, other approaches may be more appropriate. If one wishes to use microformat-style embedding but the type of data to be embedded does not map to an existing microformat, one can use RDFa to embed arbitrary vocabularies into HTML. An example is domain-specific scientific data on the Web, such as zoological or chemical data, for which no microformat exists. Furthermore, standards such as W3C's GRDDL allow microformats to be converted into data compatible with the Semantic Web.[31]

Another advocate of microformats, Ryan King, put the compatibility of microformats with other approaches this way:

Microformats provide an easy way for many people to contribute semantic data to the web. With GRDDL all of that data is made available for RDF Semantic Web tools. Microformats and GRDDL can work together to build a better web.[31]

Notes

  1. ^ "Class Names Across All Microformats". Microformats.org. 2007-09-23. http://microformats.org/wiki/existing-classes. Retrieved 2008-09-06. 
  2. ^ a b c d "What’s the Next Big Thing on the Web? It May Be a Small, Simple Thing -- Microformats". Knowledge@Wharton. Wharton School of the University of Pennsylvania. 2005-07-27. http://knowledge.wharton.upenn.edu/index.cfm?fa=printArticle&ID=1247. 
  3. ^ In this context, the definition of "end-user" includes a person reading a web page on a computer screen or mobile device, or an assistive technology software program such as a screen reader.
  4. ^ a b c d e f g h Khare, Rohit (January/February 2006). "Microformats: The Next (Small) Thing on the Semantic Web?". IEEE Internet Computing (IEEE Computer Society) 10 (1): 68–75. doi:10.1109/MIC.2006.13. http://csdl2.computer.org/persagen/DLAbsToc.jsp?resourcePath=/dl/mags/ic/&toc=comp/mags/ic/2006/01/w1toc.xml&DOI=10.1109/MIC.2006.13. Retrieved 2008-09-06. 
  5. ^ ""rel" attribute frequently asked questions". Microformats.org. 2008-08-06. http://microformats.org/wiki/rel-faq. Retrieved 2008-09-06. 
  6. ^ http://microformats.org/wiki/hmedia
  7. ^ http://sixrevisions.com/web-development/ultimate-guide-to-microformats-reference-and-examples/
  8. ^ http://microformats.org/wiki/rel-directory
  9. ^ http://microformats.org/wiki/rel-enclosure
  10. ^ http://microformats.org/wiki/rel-license
  11. ^ http://microformats.org/wiki/rel-tag
  12. ^ "Exploratory Discussions". Microformats.org. 2008-08-15. http://microformats.org/wiki/exploratory-discussions. Retrieved 2008-09-06. 
  13. ^ http://microformats.org/wiki/figure
  14. ^ http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Microformats/classes#Species
  15. ^ http://microformats.org/wiki/measure
  16. ^ Heilman, Chris (2009-01-19). "Retrieving and displaying data from Wikipedia with YQL". Yahoo Developer Network. Yahoo. http://developer.yahoo.net/blog/archives/2009/01/wikipedia_w_yql.html. Retrieved 2009-01-19. 
  17. ^ Goel, Kavi; Ramanathan V. Guha, Othar Hansson (2009-05-12). "Introducing Rich Snippets". Google Webmaster Central Blog. Google. http://googlewebmastercentral.blogspot.com/2009/05/introducing-rich-snippets.html. Retrieved 2009-05-25. 
  18. ^ a b Gong, Jun; Kosuke Suzuki, Yu Watanabe (2010-04-13). "Better recipes on the web: Introducing recipe rich snippets". Google. http://googlewebmastercentral.blogspot.com/2010/04/better-recipes-on-web-introducing.html. Retrieved 17 March 2011. 
  19. ^ "Bing Introducing Schema.org: Bing, Google and Yahoo Unite to Build the Web of Objects - Search Blog - Site Blogs - Bing Community". Bing. 2011-06-02. http://www.bing.com/community/site_blogs/b/search/archive/2011/06/02/bing-google-and-yahoo-unite-to-build-the-web-of-objects.aspx. Retrieved 2 June 2011. 
  20. ^ "Introducing schema.org: A Collaboration on Structured Data". 2011-06-02. http://www.ysearchblog.com/2011/06/02/introducing-schema-org-a-collaboration-on-structured-data. Retrieved 2 June 2011. 
  21. ^ "Top 5 Search Engines from Oct to Dec 10 | StatCounter Global Stats". StatCounter. http://gs.statcounter.com/#search_engine-ww-monthly-201010-201012. Retrieved 17 January 2011. 
  22. ^ "Bill Gates at Mix06 – "We need microformats"". 2006-03-20. http://microformats.org/blog/2006/03/20/bill-gates-at-mix06-we-need-microformats. Retrieved 2008-09-06. "We need microformats and to get people to agree on them. It is going to bootstrap exchanging data on the Web… …we need them for things like contact cards, events, directions…" 
  23. ^ http://blog.mozilla.com/faaborg/2007/02/04/microformats-part-4-the-user-interface-of-microformat-detection/
  24. ^ a b "Criticism". Microformats.org. 2007-03-24. http://microformats.org/wiki?title=criticism&oldid=18478. Retrieved 2007-08-15. 
  25. ^ "Advocacy". Microformats.org. 2008-08-27. http://microformats.org/wiki/advocacy. Retrieved 2007-08-15. 
  26. ^ "Spread Microformats". Microformats.org. 2008-08-29. http://microformats.org/wiki/spread-microformats. Retrieved 2007-08-15.  This includes community resources for marketing microformats such as buttons, banners, wallpaper / desktop screens, logo graphics, etc.
  27. ^ Holzschlag, Molly E. (2005-03-31). "Interview with Håkon Wium Lie". Molly.com. http://www.molly.com/2005/03/31/interview-with-hkon-wium-lie/. Retrieved 2007-11-18. 
  28. ^ Inkster, Toby A. (2008-04-22). "More than three years". Microformats.org. http://microformats.org/discuss/mail/microformats-discuss/2008-August/012402.html. Retrieved 2008-08-24. 
  29. ^ Craig, James (2007-04-27). "hAccessibility". Web Standards Project. http://www.webstandards.org/2007/04/27/haccessibility/. Retrieved 2007-08-16. 
  30. ^ Smethurst, Michael (2008-06-23). "Removing Microformats from bbc.co.uk/programmes". BBC. http://www.bbc.co.uk/blogs/radiolabs/2008/06/removing_microformats_from_bbc.shtml. Retrieved 2008-08-24. 
  31. ^ a b "W3C GRDDL Recommendation Bridges HTML/Microformats and the Semantic Web". XML Coverpages. OASIS. 2007-09-13. http://xml.coverpages.org/ni2007-09-13-a.html. Retrieved 2007-11-23. 

Wikimedia Foundation. 2010.
